Automated Supraclavicular Brown Adipose Tissue Segmentation in Computed Tomography Using nnU-Net: Integration with TotalSegmentator

Jørgensen, Kasper; Høi-Hansen, Frederikke Engel; Loos, Ruth J. F.; Hinge, Christian; Andersen, Flemming Littrup

doi:10.3390/diagnostics14242786

Open AccessArticle

Automated Supraclavicular Brown Adipose Tissue Segmentation in Computed Tomography Using nnU-Net: Integration with TotalSegmentator

by

Kasper Jørgensen

^1,*,†,

Frederikke Engel Høi-Hansen

^1,*,†,

Ruth J. F. Loos

²,

Christian Hinge

^1,‡ and

Flemming Littrup Andersen

^1,3,‡

¹

Department of Clinical Physiology and Nuclear Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark

²

Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen, Denmark

³

Department of Clinical Medicine, University of Copenhagen, 2200 Copenhagen, Denmark

^*

Authors to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

^‡

These authors also contributed equally to this work.

Diagnostics 2024, 14(24), 2786; https://doi.org/10.3390/diagnostics14242786

Submission received: 12 November 2024 / Revised: 5 December 2024 / Accepted: 8 December 2024 / Published: 11 December 2024

(This article belongs to the Special Issue AI as a Tool to Improve Hybrid Imaging in Cancer—2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

Background/Objectives: Brown adipose tissue (BAT) plays a crucial role in energy expenditure and thermoregulation and has thus garnered interest in the context of metabolic diseases. Segmentation in medical imaging is time-consuming and prone to inter- and intra-operator variability. This study aims to develop an automated BAT segmentation method using the nnU-Net deep learning framework, integrated into the TotalSegmentator software, and to evaluate its performance in a large cohort of patients with lymphoma. Methods: A 3D nnU-Net model was trained on the manually annotated BAT regions from 159 lymphoma patients’ CT scans, employing a 5-fold cross-validation approach. An ensemble model was created using these folds to enhance segmentation performance. The model was tested on an independent cohort of 30 patients. The evaluation metrics included the DICE score and Hausdorff Distance (HD). Additionally, the mean standardized uptake value (SUV) in the BAT regions was analyzed in 7107 FDG PET/CT lymphoma studies to identify patterns in the BAT SUVs. Results: The ensemble model achieved a state-of-the-art average DICE score of 0.780 ± 0.077 and an HD of 29.0 ± 14.6 mm in the test set, outperforming the individual fold models. Automated BAT segmentation revealed significant differences in the BAT SUVs between the sexes, with higher values in women. The morning scans showed a higher BAT SUV compared to the afternoon scans, and seasonal variations were observed, with an increased uptake during the winter. The BAT SUVs decreased with age. Conclusions: The proposed automated BAT segmentation tool demonstrates robust performance, reducing the need for manual annotation. The analysis of a large patient cohort confirms the known patterns of BAT SUVs, highlighting the method’s potential for broader clinical and research applications.

Keywords:

brown adipose tissue; BAT segmentation; deep learning; nn-UNet; TotalSegmentator; lymphoma; PET/CT; automated segmentation

1. Introduction

Brown adipose tissue (BAT) is an important endocrine tissue primarily involved in energy expenditure and non-shivering thermoregulation [1,2]. Unlike white adipose tissue (WAT), which stores energy, BAT burns energy to produce heat. This function is especially crucial in newborns to maintain body temperature. The primary regions of interest for BAT are the supraclavicular and neck areas, as well as the perirenal and paravertebral regions [3].

While BAT is abundant in newborns, it is believed to regress with age. However, studies using positron emission tomography (PET) have identified metabolically active BAT in some adults, highlighting its potential role in adult metabolism [4]. The activation of BAT is mediated by the tissue-specific uncoupling protein 1 (UCP1), which uncouples oxidative phosphorylation in mitochondria, leading to an increased energy expenditure. This process has been shown to lower the plasma glucose and lipid levels in the blood, thereby improving metabolic homeostasis. Consequently, BAT has garnered significant attention for its potential therapeutic effects on various metabolic diseases, including diabetes [5,6,7,8,9]. Therefore, BAT has become an increasingly important area of research, as its unique properties open possibilities for future diagnostic opportunities and therapies, such as the transplantation or stimulation of BAT regeneration.

However, the accurate and consistent identification and quantification of BAT in imaging studies remains challenging. The manual segmentation of BAT is time-consuming, operator-dependent, and prone to variability, which motivates the use of standardized and automated segmentation methods [10]. Early efforts to segment BAT predominantly utilize simple thresholding techniques to define BAT based on a range of Hounsfield units (HUs) from computed tomography (CT) scans in combination with standardized uptake values (SUVs) from PET scans [11]. Typically, BAT is characterized as tissue with HU values ranging from −190 to −10 and exhibiting an elevated FDG uptake (SUV > 1.2) [12]. Although these methods are relatively straightforward, they are limited by the inherent subjectivity in selecting HU and SUV cutoff values [13], the latter influenced by a variety of biologic and technologic factors [14,15].

Deep learning methods have become invaluable for medical segmentation tasks, as they can directly map image data to segmentation masks without the need for handcrafted features. Among these methods, deep convolutional neural networks (CNNs) have shown exceptional performance in medical image analysis [16,17]. One of the most successful architectures is the U-Net, which has proven to be highly effective in a range of medical imaging applications, including image segmentation and synthesis [18]. Building on this, the nnU-Net extends the U-Net framework to create a fully automated and adaptive segmentation tool. Unlike traditional models, nnU-Net eliminates the need for the manual tuning of hyperparameters or architecture adjustments for each specific task. Instead, it automatically configures itself based on the dataset’s characteristics, making it highly adaptable across diverse medical imaging tasks [19]. An example of its application is TotalSegmentator, a popular open-source tool capable of segmenting over 100 different body parts from CT images. TotalSegmentator leverages nnU-Net models to perform these segmentation tasks [20]. However, most research on BAT segmentation combines imaging modalities like a PET/CT or a PET/MRI. PET imaging with a glucose tracer, such as 18F-FDG, is especially effective at highlighting metabolically active BAT, which can then be co-registered with CT anatomical data [21]. In contrast, methods that use PET for segmentation risk biasing the SUV analysis, as the segmentation depends on the very SUV activity being analyzed. While segmenting BAT from CT images alone is challenging due to the lack of specific contrast distinguishing it from other adipose tissues, CT images still provide sufficient information for BAT identification by physicians.

In this study, we propose a supraclavicular BAT segmentation model based on a 3D nnU-Net ensemble that utilizes only the anatomical information from CT images for the segmentation. The method integrates seamlessly with the existing TotalSegmentator software (version: TotalSegmentator 2.4.0), facilitating integration into the established workflows. This software already segments a wide range of anatomical regions from a CT alone, including subcutaneous and visceral fat. This suggests that BAT should also be segmentable without relying on PET data.

To demonstrate the practicality of our method, we applied it to segment a large cohort of over 7000 PET/CT scans from patients diagnosed with lymphoma. Our automated segmentation procedure efficiently generated BAT segmentation masks, which enabled the extraction of mean BAT SUVs from the corresponding PET scans. This facilitated the identification of demographic trends in the BAT SUVs with high confidence, highlighting the potential of our segmentation tool for use in BAT research and related studies.

2. Materials and Methods

2.1. Patient Cohorts

This retrospective study included 7296 whole-body FDG-PET/CT scans from 2752 patients with lymphoma undergoing staging, an interim treatment assessment, and an end-of-treatment evaluation, acquired during clinical routine at Rigshospitalet, Copenhagen, Denmark. All scans were performed between September 2017 and September 2022, and the 189 most recent scans from different patients were used for developing a segmentation model. Of these, 30 studies were kept for testing, and the remaining 159 were used for training, utilizing a 5-fold cross-validation split. The remaining 7107 scans were used for subsequent descriptive analyses of BAT. We will refer to this latter cohort as LymphBAT-7107. Patient demographics for the training and test cohorts are presented in Table 1. All patient-specific data were annomymized and managed in accordance with the Danish Data Protection Agency Act No. 502. The project was approved by the National Ethics Comité (Reference No.: 2213953).

2.2. PET/CT Acquisition Parameters

Whole-body (WB) CT scans, 89% of which were contrast-enhanced, were acquired using various scanners. The majority of the scans (~82%) were obtained from a Siemens Biograph 64 Vision 600 (Siemens Healthineers, Erlangen, Germany). However, multiple scans were acquired using a Biograph 128 Vision 600 Edge (~10%) and a Biograph 64 mCT Flow (~8%). Most of the scans covered the region from the mid-thigh to the top of the head, with some scans extending to include the lower extremities. Images were reconstructed with a spacing of 2.0 × 0.98 × 0.98 mm, corresponding to a median CT volume resolution of 471 × 512 × 512 voxels. All PET scans were reconstructed using point spread function (PSF) technology.

2.3. Manual BAT Annotation Procedure

The manual segmentation of supraclavicular BAT was performed on 189 CT images by a single reader using Mirada Medical DBx software version 01 R2 (Mirada Medical Ltd., Oxford, UK) on axial slices. The use of a single reader eliminated inter-observer variability. An experienced physician supervised the quality of the segmentations to ensure accuracy. These three-dimensional segmentation masks were used as ground truth for both the training and testing of the segmentation model.

2.4. Automated BAT Segmentation Using nnU-Net

The CT and BAT segmentation pairs were employed in a supervised training scheme using the nnU-Net v2 model (version: nnunetv2==2.5.1) [19], integrated within the TotalSegmentator framework (version: TotalSegmentator==2.4.0) [20]. A 5-fold cross-validation was conducted, resulting in five distinct models. Optimal preprocessing steps were automatically determined by the nnU-Net framework, utilizing the 3D full-resolution configuration. Preprocessing included intensity normalization through z-score standardization [19]. Each model was trained for 1000 epochs with a batch size of 2 using a joint DICE and cross-entropy loss function with deep supervision enabled, operating on patches of 112 × 128 × 128 voxels. Data augmentation included elastic deformations, random rotations, scaling, gamma adjustments, and intensity shifts as per the default nnU-Net settings; however, mirroring augmentations were disabled to maintain consistency with the TotalSegmentator framework settings. Model checkpoints were selected based on performance on the corresponding validation set, which may have introduced optimistic bias into the cross-validated validation metrics. This underscores the importance of the acquired independent test set.

The five models from the cross-validation were subsequently combined into an ensemble model by averaging their logits. This final ensemble model was then used for inference on the independent test set.

2.5. Evaluation

2.5.1. BAT Segmentation Quality

The predicted BAT masks were evaluated against the ground truth BAT masks using three metrics: DICE, Intersection over Union (IoU), and Hausdorff Distance (HD) [21,22,23]. The DICE score, ranging from 0 to 1, measures the overlap between the predicted and actual BAT regions. Higher values indicate better prediction accuracy. IoU, which also ranges from 0 to 1, is defined as the ratio of the intersection to the union of the predicted and ground truth masks:

D I C E = \frac{2 \times |A \cap B|}{|A| + |B|} = \frac{2 \times T P}{2 \times T P + F N + F P}, I o U = \frac{|A \cap B|}{|A \cup B|} = \frac{T P}{T P + F N + F P}

(1)

Here,

T P

denotes true positive pixel predictions,

F N

represents false negatives, and

F P

indicates false positives. While DICE provides a measure of overlap, IoU focuses on the ratio of correctly predicted pixels to all pixels in the combined regions. The two metrics are complementary: DICE is particularly sensitive to small regions and balanced segmentation, while IoU emphasizes overall pixel-level accuracy. The Hausdorff Distance, ranging from 0 to infinity, measures the greatest distance from any point in a set

A

(predicted mask) to the closest point in another set

B

(ground truth mask) with smaller values indicating better prediction accuracy [21]. It is calculated as follows:

H (A, B) = \max (h (A, B), h (B, A)), where h (A, B) = \max_{a \in A} \min_{b \in B} {‖a - b‖}_{2} .

(2)

HD is particularly useful for identifying cases where false positive predictions occur far from the expected neck regions. For example, a small false positive prediction in the abdominal area will significantly affect HD but may have a negligible impact on DICE or IoU. These metrics were chosen due to their complementary strengths in evaluating segmentation accuracy. DICE and IoU assess the pixel-level overlap and accuracy, while HD captures spatial errors, providing a comprehensive evaluation of BAT segmentation performance. This combination ensures robustness in detecting both the precise alignment and outlier predictions.

2.5.2. Descriptive Analyses

A statistical analysis of the standardized uptake value (SUV) signal in BAT was conducted on the LymphBAT-7107 cohort described in Section 2.1. Using the trained segmentation model, BAT masks were inferred from the CT scans, and mean BAT SUVs were extracted from the corresponding PET scans. The SUV is calculated using the following formula:

S U V = \frac{c_{i m g}}{I D} \times B W .

(3)

In this equation, ID represents the injected dose in Becquerels (Bqs), BW refers to the patient’s body weight in kilograms, and

c_{i m g}

is the activity concentration in the image measured in Bq/mL. Our goal was to compare the mean SUV in BAT across four different patients and the following study features: patient sex (M/F), patient age (Young Adults (0–39)/Middle-Aged Adults (40–59)/Older Adults (60–79)/Elderly (80+)), scan time (morning [AM]/afternoon [PM]), and season (winter/spring/summer/fall). We calculated the mean and standard error of the mean (SEM) for each subgroup and used Welch’s T-Test to determine significant general patterns in BAT SUVs across the demographic and temporal factors. We applied three thresholds for statistical significance: p < 0.05, p < 0.01, and p < 0.001.

3. Results

3.1. Assessment of BAT Segmentation Performance

The evaluation metrics, DICE, IoU, and HD, for both the cross-validation (CV) models and the final ensemble model are summarized in Table 2. The DICE and HD metrics are visualized as violin plots in Figure 1. When using individual fold models, some patients obtained poor DICE and HD scores due to false positive BAT predictions far from the neck region. This issue is not observed in the combined ensemble model, which brings a drastic reduction in HD from 60.7 to 29.0 and an improvement in the DICE score from 0.749 to 0.780. The IoU also improved from 0.613 to 0.646 in the tabulated results, reflecting the enhanced pixel-level accuracy achieved by the ensemble.

Figure 2 presents a visual comparison of the model-predicted BAT segmentations against the ground truth annotations across the four representative test patients, which were randomly selected. In general, there is a strong visual correspondence between the predicted and ground truth segmentations, particularly in the regions with well-defined BAT structures. However, some inconsistencies are observed, notably in the areas where the ground truth annotations appear ambiguous or less defined. In these cases, the overlap analysis highlights the discrepancies, with true positive pixels shown in green, while the false negative (red) and false positive (blue) pixels reveal the regions where the model either missed the BAT or over-segmented, respectively.

Figure 3 provides a detailed example of a single representative test patient, where we display the contours of the manually annotated BAT region (blue) alongside the predicted region (red). Additionally, the segmented regions are visualized as 3D structures to better grasp the entire volume, rather than just the individual slices. For this test patient, we observed that the predicted BAT volume aligns well with the ground truth, but there are some predicted regions near the shoulders (indicated by green arrows) that were not annotated as BAT in the manual segmentation. From the 3D structure, we can see a relatively large, predicted BAT region that is disconnected from the rest of the remaining BAT volume.

3.2. Findings from the Descriptive Analyses in Patients with Lymphoma

The automated BAT segmentation in the LymphBAT-7107 cohort of lymphoma patients predicted an average BAT volume of 73.3 mL with a standard deviation of 47.4 mL. Figure 4 and Table 3 contain the mean SUV in BAT across the different sub-cohorts. SUV uptake is significantly higher in the females than in the males and is greater in the morning than in the afternoon. Seasonal variations are evident, with the highest uptake observed in the winter, followed by a decline in the spring, reaching its lowest point in the summer, and rising again in the fall. Additionally, the mean SUV in BAT generally decreases with age, except in the 80+ cohort, where a noticeable increase in SUV uptake is observed.

Figure 5 depicts the predicted segmentations and PET images for two example patients with metabolically active BAT. Note that the areas of increased PET activity correspond to the segmented BAT.

4. Discussion

This study demonstrates the feasibility and effectiveness of an automated supraclavicular brown adipose tissue (BAT) segmentation model using the nnU-Net framework, integrated within the TotalSegmentator tool. By leveraging only CT images, we have developed a robust method that bypasses the need for PET data. To the best of our knowledge, this is the first CT-only BAT segmentation model. It achieved a mean Dice score of 0.780 and a Hausdorff Distance (HD) of 29.0 mm on the independent test set, underscoring its potential for reliable and large-scale BAT segmentation.

While the reported DICE and HD scores may not seem immediately impressive compared to other segmentation tasks involving more delineated organs, which often achieve DICE scores above 0.9, it is important to consider the challenges unique to BAT. In particular, distinguishing BAT tissue from other adjacent adipose tissue is inherently difficult due to the similarity in HU values. Zhao et al. proposed BAT-Net for BAT segmentation on multi-modal magnetic resonance imaging (MRI) scans, achieving a DICE score of approximately 0.88 [24]. However, this is not directly comparable to our results, as an MRI inherently offers better differentiation between soft tissues, making the segmentation task easier. Yet, an MRI is often unavailable in many clinical settings, especially in combination with a PET scan, where PET/CT scans remain more commonly used. Another model from Wang et al., ICA-UNet which is a 2D convolutional model that takes both a CT and the corresponding PET slices as inputs, reports a DICE score of 0.91 and a HD of 7.3. However, since their model uses both a PET/CT and is evaluated on 2D slices and not the entire 3D volume, these metrics are also not directly comparable to our results. To address the lack of comparative studies in CT-only BAT segmentation, we propose our model as a baseline for future research.

The visual inspection of the predicted BAT regions across several test cases generally showed good correspondence with the ground truth. In the areas where inconsistencies were observed, it was often challenging to determine whether the ground truth or the predicted regions best represented the actual BAT. Some cases showed disconnected BAT areas in the shoulder regions that were absent in the ground truth annotations. This suggests that applying post-processing steps to retain only the largest BAT region on each side could potentially improve the segmentation accuracy. Despite the challenges of segmenting BAT from a CT, we consider the obtained performance sufficient for practical use, and we view our model’s simplicity and potential integration as a plug-and-play addition to TotalSegmentator as a significant advantage over the more complex multi-modal U-Net variations seen in related research. Furthermore, by excluding the PET as input we also eliminate the risk of biasing the SUV analysis.

The performance across the individual fold models was consistent, demonstrating that the training process is robust and does not heavily depend on the choice of seed or specific training data. Building on this consistency, the real performance gain comes from the ensemble model, which combines the predictions from all the fold models to mitigate the outlier predictions and significantly improve the overall segmentation metrics. However, this performance gain comes at the cost of approximately five times the inference time compared to individual models. Despite this, the ensemble inference time remains short (~1–2 min per scan), making it unlikely to pose a problem in most clinical or research workflows. Our model is freely available on https://github.com/depict-rh/bat-seg (accessed on 11 November 2024).

This study also presents, to our knowledge, the largest retrospective analysis of SUVs in BAT, providing valuable insights into the demographic and temporal factors influencing BAT SUVs. Our findings corroborate the existing literature, showing that BAT uptake was higher in women than in men [23,25]. This difference may be attributed to factors like a higher sensitivity to cold and body composition, where women typically have more subcutaneous fat, and hormonal influences, such as estrogen, and a greater sensitivity to insulin, which may enhance thermogenesis [26,27,28,29]. Additionally, women may rely more on non-shivering thermogenesis to regulate body temperature due to differences in the thermoneutral zones [30]. Secondly, the BAT SUV was higher in the individuals scanned in the morning, aligning with the research on circadian rhythms and metabolic processes [31]. The increased BAT SUVs in the morning may be driven by higher metabolic demands after waking and the fasting state, along with the cooler ambient temperatures that stimulate non-shivering thermogenesis. We also observed a higher BAT uptake during the colder months, particularly in the winter, reflecting BAT’s role in generating heat to maintain body temperature [32]. Cold exposure in the winter triggers an increased SUV uptake, consistent with the well-established link between the temperature and BAT SUV. Lastly, we found that BAT SUVs decreases with age, which is well documented [25]. This has been attributed to factors like a reduced thermogenic capacity, a decrease in BAT mass, and a reduced metabolic demand in older adults. However, an unexpected increase in BAT SUVs was observed in the elderly cohort (80+), which may be due to survivor bias, as older individuals with a higher BAT SUV likely represent a healthier subgroup [33]. These observations provide validation for our approach to quantifying BAT SUVs, as they align with the established patterns of BAT SUVs related to temperature, age, and metabolic demand.

This study has some limitations. Firstly, the automated BAT segmentation model was trained and evaluated exclusively on the CT images from lymphoma patients. While the presence of lymphoma is not expected to significantly influence BAT anatomy or physiology, which suggests the model may generalize well to other patient populations, further validation is necessary to confirm this. Additionally, the lack of PET images in the model input may limit the segmentation performance, since metabolic activity is usually a key indicator for BAT. This limitation confines our model to anatomical segmentation. However, by segmenting BAT independently of its PET activity, we avoid the risk of biasing the SUV analysis by potentially excluding the BAT regions with lower SUVs. This ensures a more objective anatomical assessment, free from metabolic influence. Furthermore, due to the time-consuming process of obtaining manual BAT segmentations, the test set was limited to 30 patients, which may be insufficient for a robust model evaluation and the identification of potential issues. Future work should include a larger test set and ground truth BAT delineations from multiple experienced clinicians. To obtain such a dataset efficiently, one could employ a human-in-the-loop approach where clinicians refine the model-predicted segmentations of new CT images [34]. Importantly, this study focused solely on the supraclavicular region for BAT segmentation, as it is a primary site for BAT deposits in adults and represents a practical starting point for developing automated methods. However, BAT can be distributed across multiple anatomical regions, such as the perirenal and paravertebral areas, which were not included in our segmentation. Future efforts could extend the model to include these regions, providing a more comprehensive BAT analysis.

Compared to the studies focused on BAT segmentation using an MRI, our study is limited to CT images. Manual BAT segmentation on CT images is inherently challenging, as certain regions can be difficult to classify as BAT. In contrast, an MRI offers better soft tissue differentiation, which may make manual segmentation on an MRI slightly more precise and potentially a more reliable gold standard for BAT segmentation. A future validation approach could involve comparing manual BAT segmentations on an MRI with the co-registered segmentations generated by our CT-based nnU-Net model. However, a PET/CT is more commonly used than a PET/MRI, which strengthens the applicability of our model. Furthermore, while currently limited to CT images, our model has the potential to serve as a foundation for transfer learning in developing a BAT segmentation model for an MRI. This would expand its applicability and contribute to the TotalSegmentator software, which also supports an MRI [35].

5. Conclusions

In this study, we present an automated BAT segmentation tool that can be seamlessly integrated into the TotalSegmentator software, providing a robust alternative to cumbersome manual segmentation. Our statistical analysis of the mean BAT SUV across different demographic and temporal groups of a large patient cohort identifies the key factors influencing BAT SUVs, aligning with the trends observed in the existing literature. The introduction of this segmentation tool represents a significant advancement in the standardization of a BAT analysis, facilitating more efficient investigations into BAT SUVs and promoting further research in this field.

Author Contributions

Conceptualization, C.H., R.J.F.L. and F.L.A.; methodology, K.J., C.H. and F.L.A.; software, K.J.; validation, K.J., F.E.H.-H., C.H., R.J.F.L. and F.L.A.; formal analysis, K.J. and C.H.; investigation, K.J., F.E.H.-H. and C.H.; resources, F.L.A.; data curation, K.J., F.E.H.-H. and C.H.; writing—original draft preparation, K.J. and F.E.H.-H.; writing—review and editing, K.J., F.E.H.-H., R.J.F.L., C.H. and F.L.A.; visualization, K.J.; supervision, C.H. and F.L.A.; project administration, F.L.A.; funding acquisition, F.L.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Use of the retrospective patient data was approved by the National Ethics Comité on 5 January 2023 (Reference No.: 2213953).

Informed Consent Statement

Patient consent was waived due to the retrospective study design and in agreement with the regional ethics committee. All the subject data were anonymized.

Data Availability Statement

Data supporting the reported results can be obtained via contact with the corresponding author upon reasonable request and legal approval. The data are not publicly available due to no public data sharing agreement. The inference model is available on GitHub: https://github.com/depict-rh/bat-seg (accessed on 11 November 2024).

Acknowledgments

We acknowledge the use of OpenAI’s ChatGPT for assistance in improving the grammar and sentence structure of this manuscript. Ruth J.F. Loos is an employee at the Novo Nordisk Foundation Center for Basic Metabolic Research, which is an independent research center at the University of Copenhagen, partially funded by an unrestricted donation from the Novo Nordisk Foundation (NNF18CC0034900, NNF23SA0084103).

Conflicts of Interest

Siemens Healthineers has granted Rigshospitalet a fund of DKK 2.000.000 for the PhD salary of Christian Hinge. The funders had no role in the design of this study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

Cannon, B.; Nedergaard, J. Brown Adipose Tissue: Function and Physiological Significance. Physiol. Rev. 2004, 84, 277–359. [Google Scholar] [CrossRef]
Harms, M.; Seale, P. Brown and beige fat: Development, function and therapeutic potential. Nat. Med. 2013, 19, 1252–1263. [Google Scholar] [CrossRef]
Coulier, B.; Montfort, L.; Richelle, F.; Brichant, C. Brown Adipose Tissue (BAT) causing unusual cervical and scapular uptake of 18F-FDG in a young patient with Hodgkin’s lymphoma. J. Belg. Soc. Radiol. 2015, 99, 105. [Google Scholar] [CrossRef][Green Version]
Gunawardana, S.C. Therapeutic value of brown adipose tissue: Correcting metabolic disease through generating healthy fat. Adipocyte 2012, 1, 250–255. [Google Scholar] [CrossRef][Green Version]
Whitehead, A.; Krause, F.N.; Moran, A.; MacCannell, A.D.; Scragg, J.L.; McNally, B.D.; Boateng, E.; Murfitt, S.A.; Virtue, S.; Wright, J.; et al. Brown and beige adipose tissue regulate systemic metabolism through a metabolite interorgan signaling axis. Nat. Commun. 2021, 12, 1905. [Google Scholar] [CrossRef]
Wang, G.X.; Zhao, X.Y.; Lin, J.D. The brown fat secretome: Metabolic functions beyond thermogenesis. Trends Endocrinol. Metab. 2015, 26, 231–237. [Google Scholar] [CrossRef] [PubMed]
Lee, J.; Ustione, A.; Wilkerson, E.M.; Balakrishnan, R.; Thurmond, D.C.; Goldfarb, D.; Piston, D.W. Insulin-Independent Regulation of Type 1 Diabetes via Brown Adipocyte-Secreted Proteins and the Novel Glucagon Regulator Nidogen-2. bioRxiv, 2024; preprint. [Google Scholar] [CrossRef]
Abdelhafez, Y.G.; Wang, G.; Li, S.; Pellegrinelli, V.; Chaudhari, A.J.; Ramirez, A.; Sen, F.; Vidal-Puig, A.; Sidossis, L.S.; Klein, S.; et al. The Role of Brown Adipose Tissue in Branched-chain Amino Acid Clearance in People. iScience 2024, 27, 110559. [Google Scholar] [CrossRef] [PubMed]
Wang, B.; Hu, Z.; Cui, L.; Zhao, M.; Su, Z.; Jiang, Y.; Liu, J.; Zhao, Y.; Hou, Y.; Yang, X.; et al. βAR-mTOR-lipin1 pathway mediates PKA-RIIβ deficiency-induced adipose browning. Theranostics 2024, 14, 5316–5335. [Google Scholar] [CrossRef] [PubMed]
Wilder-Smith, A.J.; Yang, S.; Weikert, T.; Bremerich, J.; Haaf, P.; Segeroth, M.; Ebert, L.C.; Sauter, A.; Sexauer, R. Automated Detection, Segmentation, and Classification of Pericardial Effusions on Chest CT Using a Deep Convolutional Neural Network. Diagnostics 2022, 12, 1045. [Google Scholar] [CrossRef] [PubMed]
Lee, M.Y.; Crandall, J.; Kasal, K.; Wahl, R. A Comparison of Semi-Automated Brown Adipose Tissue Segmentation Methods. J. Nucl. Med. 2020, 61, 1400. [Google Scholar]
Chen, K.Y.; Cypess, A.M.; Laughlin, M.R.; Haft, C.R.; Hu, H.H.; Bredella, M.A.; Enerbäck, S.; Kinahan, P.E.; Lichtenbelt, W.M.; Lin, F.I.; et al. Brown Adipose Reporting Criteria in Imaging STudies (BARCIST 1.0): Recommendations for Standardized FDG-PET/CT Experiments in Humans. Cell Metab. 2016, 24, 210–222. [Google Scholar] [CrossRef]
Martinez-Tellez, B.; Nahon, K.J.; Sanchez-Delgado, G.; Abreu-Vieira, G.; Llamas-Elvira, J.M.; Van Velden, F.H.P.; Arias-Bouda, L.M.P.; Rensen, P.C.N.; Boon, M.R.; Ruiz, J.R.; et al. The impact of using BARCIST 1.0 criteria on quantification of BAT volume and activity in three independent cohorts of adults. Sci. Rep. 2018, 8, 8567. [Google Scholar] [CrossRef] [PubMed]
Adams, M.C.; Turkington, T.G.; Wilson, J.M.; Wong, T.Z. A systematic review of the factors affecting accuracy of SUV measurements. Am. J. Roentgenol. 2010, 195, 310–320. [Google Scholar] [CrossRef] [PubMed]
Lundström, E.; Strand, R.; Forslund, A.; Bergsten, P.; Weghuber, D.; Ahlström, H.; Kullberg, J. Automated segmentation of human cervical-supraclavicular adipose tissue in magnetic resonance images. Sci. Rep. 2017, 7, 3064. [Google Scholar] [CrossRef]
Li, L.; Qin, L.; Xu, Z.; Yin, Y.; Wang, X.; Kong, B.; Bai, J.; Lu, Y.; Fang, Z.; Song, Q.; et al. Using Artificial Intelligence to Detect COVID-19 and Community-acquired Pneumonia Based on Pulmonary CT: Evaluation of the Diagnostic Accuracy. Radiology 2020, 296, E65–E71. [Google Scholar] [CrossRef] [PubMed]
Ronneberger, O.F.P.; Brox, T. U-Net: Convolutional networks for biomedical image segmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015, Munich, Germany, 5–9 October 2015; Lecture Notes in Computer Science. Springer International Publishing: Cham, Switzerland; pp. 234–241. [Google Scholar] [CrossRef]
Siddique, N.; Paheding, S.; Elkin, C.P.; Devabhaktuni, V. U-Net and its variants for medical image segmentation: Theory and applications. arXiv, 2020; arXiv:2011.01118. [Google Scholar] [CrossRef]
Isensee, F.; Jaeger, P.F.; Kohl, S.A.A.; Petersen, J.; Maier-Hein, K.H. nnU-Net: A self-configuring method for deep learning-based biomedical image segmentation. Nat. Methods 2021, 18, 203–211. [Google Scholar] [CrossRef]
Wasserthal, J.; Breit, H.-C.; Meyer, M.T.; Pradella, M.; Hinck, D.; Sauter, A.W.; Heye, T.; Boll, D.T.; Cyriac, J.; Yang, S.; et al. TotalSegmentator: Robust Segmentation of 104 Anatomic Structures in CT Images. Radiol. Artif. Intell. 2023, 5, e230024. [Google Scholar] [CrossRef]
Yeghiazaryan, V.; Voiculescu, I. Family of boundary overlap metrics for the evaluation of medical image segmentation. J. Med. Imaging 2018, 5, 015006. [Google Scholar] [CrossRef] [PubMed]
Dice, L.R. Measures of the amount of ecologic association between species. Ecology 1945, 26, 297–302. [Google Scholar] [CrossRef]
Gerig, G.; Jomier, M.; Chakos, M. Valmet: A new validation tool for assessing and improving 3D object segmentation. Lect. Notes Comput. Sci. 2001, 2208, 516–523. [Google Scholar]
Zhao, Y.; Zhao, Y.; Tang, C.; Tang, C.; Cui, B.; Cui, B.; Somasundaram, A.; Somasundaram, A.; Raspe, J.; Raspe, J.; et al. Automated segmentation of the human supraclavicular fat depot via deep neural network in water-fat separated magnetic resonance images. Quant. Imaging Med. Surg. 2023, 13, 4699–4715. [Google Scholar] [CrossRef] [PubMed]
Singh, R.; Barrios, A.; Dirakvand, G.; Pervin, S. Human brown adipose tissue and metabolic health: Potential for therapeutic avenues. Cells 2021, 10, 3030. [Google Scholar] [CrossRef] [PubMed]
Symonds, M.E.; Aldiss, P.; Pope, M.; Budge, H. Recent advances in our understanding of brown and beige adipose tissue: The good fat that keeps you healthy. F1000Research 2018, 7, 1129. [Google Scholar] [CrossRef]
Virtanen, K.A.; Lidell, M.E.; Orava, J.; Heglind, M.; Westergren, R.; Niemi, T.; Taittonen, M.; Laine, J.; Savisto, N.-J.; Enerbäck, S.; et al. Functional Brown Adipose Tissue in Healthy Adults. N. Engl. J. Med. 2009, 360, 1518–1525. [Google Scholar] [CrossRef] [PubMed]
Winn, N.C.; Grunewald, Z.I.; Gastecki, M.L.; Woodford, M.L.; Welly, R.J.; Clookey, S.L.; Ball, J.R.; Gaines, T.L.; Karasseva, N.G.; Kanaley, J.A.; et al. Deletion of UCP1 enhances ex vivo aortic vasomotor function in female but not male mice despite similar susceptibility to metabolic dysfunction. Am. J. Physiol. Metab. 2017, 313, E402–E412. [Google Scholar] [CrossRef]
Cypess, A.M.; Lehman, S.; Williams, G.; Tal, I.; Rodman, D.; Goldfine, A.B.; Kuo, F.C.; Palmer, E.L.; Tseng, Y.H.; Doria, A.; et al. Identification and Importance of Brown Adipose Tissue in Adult Humans. N. Engl. J. Med. 2009, 360, 1509–1517. [Google Scholar] [CrossRef]
Cannon, B.; Nedergaard, J. Nonshivering thermogenesis and its adequate measurement in metabolic studies. J. Exp. Biol. 2011, 214, 242–253. [Google Scholar] [CrossRef]
Peng, X.; Chen, Y. The emerging role of circadian rhythms in the development and function of thermogenic fat. Front. Endocrinol. 2023, 14, 1175845. [Google Scholar] [CrossRef]
Au-Yong, I.T.H.; Thorn, N.; Ganatra, R.; Perkins, A.C.; Symonds, M.E. Brown adipose tissue and seasonal variation in humans. Diabetes 2009, 58, 2583–2587. [Google Scholar] [CrossRef]
Mattson, M.P. Perspective: Does brown fat protect against diseases of aging? Ageing Res. Rev. 2009, 9, 69–76. [Google Scholar] [CrossRef] [PubMed]
Kerdvibulvech, C.; Li, Q.; Duffy, V.G. Empowering Zero-Shot Object Detection: A Human-in-the-Loop Strategy for Unveiling Unseen Realms in Visual Data. In Digital Human Modeling and Applications in Health, Safety, Ergonomics and Risk Management; Lecture Notes in Computer Science; Springer Nature: Cham, Switzerland, 2024; pp. 235–244. [Google Scholar] [CrossRef]
D’Antonoli, T.A.; Berger, L.K.; Indrakanti, A.K.; Vishwanathan, N.; Weiß, J.; Jung, M.; Berkarda, Z.; Rau, A.; Reisert, M.; Küstner, T.; et al. TotalSegmentator MRI: Sequence-Independent Segmentation of 59 Anatomical Structures in MR images. arXiv, 2024; arXiv:2405.19492. [Google Scholar] [CrossRef]

Figure 1. Violin plots show the distribution of DICE and HD metrics for each individual fold model in the CV, with a combined plot for full CV results. The last violin plots represent the ensemble model’s performance on the independent test set.

Figure 2. Comparison of BAT segmentation across four test patients. Each column shows the CT images, ground truth BAT annotations, model-predicted BAT segmentations, and agreement analysis showing true positive pixels (green), false negative pixels (red), and false positive pixels (blue).

Figure 3. Example of manual (blue) and predicted (red) BAT regions with 3D views showing discrepancies, including unannotated predictions near the shoulders (green arrows).

Figure 4. Mean values across different subject groups in the LymphBAT–7107 cohort with 95% con–fidence intervals (mean

\pm

1.96 SEM) shown.

Figure 4. Mean values across different subject groups in the LymphBAT–7107 cohort with 95% con–fidence intervals (mean

\pm

1.96 SEM) shown.

Figure 5. Application of the BAT segmentation model for SUV PET analysis on two test patients from the LymphBAT-7007 cohort, for which ground truth BAT annotations are unavailable. First column displays the model-inferred BAT regions, while the second shows the CT with PET overlay.

Table 1. Patient demographics (mean

\pm

SD) for training and testing cohorts. BAT Volume refers to the manually annotated BAT volume. We do not have manually annotated BAT volumes for the LymphBAT-7107 cohort used for descriptive analysis.

Table 1. Patient demographics (mean

\pm

SD) for training and testing cohorts. BAT Volume refers to the manually annotated BAT volume. We do not have manually annotated BAT volumes for the LymphBAT-7107 cohort used for descriptive analysis.

Patient Cohort	Age [Years]	Weight [kg]	Height [m]	BMI	BAT Vol. [mL]
Train (n = 159)
Men (n = 75 (47%))	$63.5 \pm$ 15.3	$83.1 \pm$ 14.4	$1.80 \pm$ 0.08	$25.3 \pm$ 4.7	$117.0 \pm$ 66.3
Women (n = 84 (53%))	$63.4 \pm$ 16.0	$77.5 \pm$ 16.2	$1.64 \pm$ 0.06	$24.9 \pm$ 5.4	$86.0 \pm$ 72.1
Test (n = 30)
Men (n = 16 (53%))	$57.3 \pm$ 20.3	$74.1 \pm$ 11.7	$1.79 \pm$ 0.07	$22.6 \pm$ 3.7	$97.0 \pm$ 47.9
Women (n = 14 (47%))	$55.3 \pm$ 20.2	$74.7 \pm$ 21.6	$1.66 \pm$ 0.06	$27.8 \pm$ 6.8	$101.4 \pm$ 66.7
LymphBAT-7107 (n = 7107)
Men (n = 4011 (56%))	$62.2 \pm$ 16.2	$81.9 \pm$ 15.8	$1.79 \pm$ 0.07	$25.4 \pm$ 4.5	-
Women (n = 3096 (44%))	$63.2 \pm$ 17.0	$68.4 \pm$ 16.0	$1.66 \pm$ 0.6	$24.9 \pm$ 5.5	-

Table 2. Evaluation metrics based on validation splits and the ensemble model in the independent test set. Metrics are reported as mean

\pm

SEM.

Table 2. Evaluation metrics based on validation splits and the ensemble model in the independent test set. Metrics are reported as mean

\pm

SEM.

Model	Evaluation Set	$DICE ↑$	$IoU ↑$	$HD ↓$ [mm]
Single fold model	Validation Fold 0 (n = 32)	$0.750 \pm$ 0.022	$0.613 \pm$ 0.025	$61.2 \pm$ 13.0
	Validation Fold 1 (n = 32)	$0.749 \pm$ 0.021	$0.611 \pm$ 0.024	$48.4 \pm$ 6.6
	Validation Fold 2 (n = 32)	$0.732 \pm$ 0.028	$0.598 \pm$ 0.030	$82.0 \pm$ 24.4
	Validation Fold 3 (n = 32)	$0.764 \pm$ 0.017	$0.627 \pm$ 0.021	$72.1 \pm$ 20.6
	Validation Fold 4 (n = 31)	$0.749 \pm$ 0.023	$0.614 \pm$ 0.026	$38.9 \pm$ 4.7
	Combined Validation set (n = 159)	$0.749 \pm$ 0.010	$0.613 \pm$ 0.011	$60.7 \pm$ 7.2
Ensemble model	Test set (n = 30)	$0.780 \pm$ 0.014	$0.646 \pm$ 0.019	$29.0 \pm$ 2.7

Table 3. Comparison of mean SUV in BAT across different subject groups in the large LymphBAT-7107 (n = 7107) cohort. Statistically significant differences, p < 0.05, are marked with *, p < 0.01 are marked with **, and p < 0.001 are marked with ***.

Grouping	#	Mean	SEM	Welch’s t-Test (p-Values)
Sex				F
M	4011	0.623	0.004	*** 1.47 × 10⁻²³
F	3096	0.703	0.007	–
Time of day				PM
AM	2884	0.676	0.006	*** 9.44 × 10⁻⁵
PM	4223	0.645	0.004	–
Season				Spring	Summer	Fall
Winter	1734	0.689	0.009	* 0.0132	*** 8.54 × 10⁻⁷	* 1.44 × 10⁻³
Spring	1740	0.660	0.007	–	** 5.93 × 10⁻³	0.4880
Summer	1896	0.633	0.007	–	–	* 0.0270
Fall	1737	0.653	0.006	–	–	–
Age group				40–59	60–79	80+
0–39	876	0.841	0.021	*** 7.08 × 10⁻¹⁹	* 2.49 × 10⁻²³	*** 1.18 × 10⁻¹⁷
40–59	1570	0.642	0.007	–	* 0.0448	0.4352
60–79	3938	0.625	0.003	–	–	** 2.20 × 10⁻³
80+	723	0.650	0.007	–	–	–

Note: No correction for multiple t-tests has been applied in this analysis.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jørgensen, K.; Høi-Hansen, F.E.; Loos, R.J.F.; Hinge, C.; Andersen, F.L. Automated Supraclavicular Brown Adipose Tissue Segmentation in Computed Tomography Using nnU-Net: Integration with TotalSegmentator. Diagnostics 2024, 14, 2786. https://doi.org/10.3390/diagnostics14242786

AMA Style

Jørgensen K, Høi-Hansen FE, Loos RJF, Hinge C, Andersen FL. Automated Supraclavicular Brown Adipose Tissue Segmentation in Computed Tomography Using nnU-Net: Integration with TotalSegmentator. Diagnostics. 2024; 14(24):2786. https://doi.org/10.3390/diagnostics14242786

Chicago/Turabian Style

Jørgensen, Kasper, Frederikke Engel Høi-Hansen, Ruth J. F. Loos, Christian Hinge, and Flemming Littrup Andersen. 2024. "Automated Supraclavicular Brown Adipose Tissue Segmentation in Computed Tomography Using nnU-Net: Integration with TotalSegmentator" Diagnostics 14, no. 24: 2786. https://doi.org/10.3390/diagnostics14242786

APA Style

Jørgensen, K., Høi-Hansen, F. E., Loos, R. J. F., Hinge, C., & Andersen, F. L. (2024). Automated Supraclavicular Brown Adipose Tissue Segmentation in Computed Tomography Using nnU-Net: Integration with TotalSegmentator. Diagnostics, 14(24), 2786. https://doi.org/10.3390/diagnostics14242786

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Automated Supraclavicular Brown Adipose Tissue Segmentation in Computed Tomography Using nnU-Net: Integration with TotalSegmentator

Abstract

1. Introduction

2. Materials and Methods

2.1. Patient Cohorts

2.2. PET/CT Acquisition Parameters

2.3. Manual BAT Annotation Procedure

2.4. Automated BAT Segmentation Using nnU-Net

2.5. Evaluation

2.5.1. BAT Segmentation Quality

2.5.2. Descriptive Analyses

3. Results

3.1. Assessment of BAT Segmentation Performance

3.2. Findings from the Descriptive Analyses in Patients with Lymphoma

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI