Next Article in Journal
NJN: A Dataset for the Normal and Jaundiced Newborns
Previous Article in Journal
Using Machine Learning to Expand the Ann Arbor Staging System for Hodgkin and Non-Hodgkin Lymphoma
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Improvement in Disease Diagnosis in Computed Tomography Images by Correlating Organ Volumes with Disease Occurrences in Humans

1
German Cancer Research Center (DKFZ), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
2
Institute for AI in Medicine (IKIM), University Medicine Essen (AöR), Girardetstraße 2, 45131 Essen, Germany
3
Institute for Bioinformatics and Chemoinformatics, Westphalia University of Applied Sciences, August-Schmidt-Ring 10, 45665 Recklinghausen, Germany
4
Cancer Research Center Cologne Essen, University Medicine Essen, 45147 Essen, Germany
5
German Cancer Consortium (DKTK), Partner Site Essen, Hufelandstraße 55, 45147 Essen, Germany
*
Author to whom correspondence should be addressed.
BioMedInformatics 2023, 3(3), 526-542; https://doi.org/10.3390/biomedinformatics3030036
Submission received: 19 May 2023 / Revised: 21 June 2023 / Accepted: 26 June 2023 / Published: 5 July 2023
(This article belongs to the Section Imaging Informatics)

Abstract

:
Some diseases are known to cause or coincide with volume changes of certain structures in the body. Since these changes can be used to identify diseases, in this paper, we aimed to discover such new correlations. To this end, we trained a machine learning model based on the TotalSegmentator model on computed tomography (CT) image data, to segment 104 anatomical structures, while trying to improve the accuracy of the model. We then used the model to segment CT scans of decedents who had at least one of 18 diseases. After correlating the structure volumes with disease occurrences, a possible new correlation between chronic artery failure and iliac artery volume was found and others were confirmed. However, due to the limitations of the model and the underlying data, further research is required.

Graphical Abstract

1. Introduction

The human body is an intricate network of cooperating structures and organs, which has not yet been fully explored. Diseases affecting one part of the body can thus have an effect on different, possibly otherwise unrelated parts of the body and may cause it to change. A simple and well-known example of this is spleen enlargement in patients with liver cirrhosis. This is due to the fact that when damaged tissue inside the liver blocks the blood flow coming from the portal vein, organs along the portal system, such as the spleen, experience higher blood pressure and increase in size and stiffness [1]. The coincidence of such changes with certain diseases may tell us more about the human body and may lead to improved diagnoses and treatment plans. To make further progress in this area, we looked at the distribution of volumes of anatomical structures, including bones, organs, large vessels, and large muscle groups, in patients with different diseases using computed tomography (CT) scans from patients who have a medical history of chronic or acute disease, and by correlating those disease markers with the volume of the aforementioned structures. During this work, we employed the TotalSegmentator model, which is an artificial neural network (ANN) [2], used for the segmentation of 104 anatomical structures [3], in order to support radiologists in evaluating radiological scan data. Additionally, we also attempted to improve the accuracy of the model by exploring different training parameters, such as the training duration and learning rate schedule. The 349 CT scans on which statistical analysis was performed were taken from the New Mexico Decedent Image Database (NMDID) [4], a database of postmortem CT scans routinely performed on decedents from New Mexico. The most common diseases among the patients are hypertension and diabetes type II. This arises from the way the data were chosen, since hypertension and diabetes type II are common among elderly people, which themselves are common in the NMDID, due to its nature as a database of scans of deceased individuals.
This kind of statistical analysis of organ sizes has previously not been conducted on this scale to our knowledge. New correlations can improve diagnostics by improving the diagnostic value of a CT scan and making it possible to recognize signs of diseases not originally checked for.

2. Materials and Methods

2.1. Model

The model used is a modified nnU-Net [5], a model and training framework developed by the MIC-DKFZ team, who aimed to develop a specialized model architecture for most medical segmentation tasks by automating decisions such as determining the patch size or number of layers for a given dataset. Its architecture is based on the U-Net [6]. The framework around the model was slightly modified into the TotalSegmentator model by the Department of Research and Analysis at University Hospital Basel, by disabling the benchmarking performed by the CUDA back end, which would have used a large amount of hardware resources when running inference [3]. Additionally, some training parameters such as training length and certain data augmentations were modified to improve the model accuracy. We built upon those changes and further explored different training lengths as well as learning rate schedules. The model uses the Nesterov-SGD learning rule, as this is the default optimizer for the nnU-Net [5].

2.2. Dataset for Model Training

The dataset used for training the TotalSegmentator model was a collection of 1204 CT scans of various body parts collected in several hospitals and clinics [3]. From the images, 104 anatomical structures, if present, were segmented, including bones, organs, muscle groups, and large blood vessels. A full list is included in Table A1. The segmentation was performed using a combination of existing models and board-certified radiologists, with radiologists correcting any mistakes in the segmentations produced by the models. Structures for which no model was available were segmented with a nnU-Net trained on the existing data. We modified a copy of the dataset by combining segmentations of the left and right versions of a body part, where those body parts would be mostly mirror images of each other. This was done to help the model recognize structures when applying data augmentation such as mirroring during the training. The 1204 images were split between a training dataset, a validation dataset, and a test dataset. The training dataset contained 1085 images, while the validation dataset contained 57 images. The test dataset contained 62 images. These were also the splits used in the original TotalSegmentator article, which is why they were used here. An example slice from the dataset can be found in Figure 1.

2.3. Dataset Used for Correlation

Since the dataset used for the model training had to be anonymized to be made public, no information regarding any potential diseases was included. Therefore, the data used in the second part of this paper were taken from the NMDID cases. The data from the NMDID consisted of 349 patient entries, with each entry consisting of two torso CT volumes, one optimized for soft tissue and the other for bone visualization. Example slices of a soft tissue-optimized scan can be found in Figure 2. The exact parameters used for the CT scan are listed in Figure A1. Prominent diagnoses in this dataset are hypertension (189 cases), diabetes type II (81 cases), and coronary artery disease (62 cases). A full list of diagnoses can be found in Table A2.

2.4. Training

In all following equations, l r is the learning rate for the current epoch e p o c h , while l r m a x is the maximal learning rate for the training, which occurs over m a x _ e p o c h s epochs. p c t represents the training progress and is defined as e p o c h / m a x _ e p o c h s , while p e a k refers to the proportion of m a x _ e p o c h s after which l r m a x is reached.
The model was trained on two NVIDIA RTX A6000 graphics cards using the data parallel method for 1000 epochs. The learning rate was scheduled with one of the following algorithms:
  • Built-In: The built-in algorithm of the nnU-Net framework. The learning rate starts at the maximum learning rate and decreases polynomially using the formula
    l r = l r m a x × ( 1 e p o c h m a x _ e p o c h s ) 0.9 .
  • Linear: The learning rate is varied cyclically over the course of training, rising linearly from zero to a maximal learning rate for the first part of a cycle, then decreasing linearly to zero during the second half of the cycle. The entire training consists of two cycles. When using this scheduler, the learning rate is cycled twice. The maximum learning rate is scheduled to be reached after half of the training. The practice of using multiple cycles was inspired by [7]. The formula for calculating the learning rate is
    l r = l r m a x × p c t 0.5 i f p c t 0.5 l r m a x × ( 1 p c t 0.5 ) i f p c t > 0.5 .
  • Linear to Exponential: Modification of linear learning rate scheduling, which replaces the linear decrease in the second part with an exponential decrease, while only using one cycle. The maximum learning rate is scheduled to be reached after one quarter of the training. Here, the formula is
    l r = l r m a x × p c t p e a k i f p c t p e a k l r m a x × e 1 p c t p e a k i f p c t > p e a k .
  • Modified Linear to Exponential: To increase training efficiency in the early epochs and maintain greater training momentum during the second part of the training, we modified the learning rate schedule by taking the square root of the learning rate in the first part of training and decreasing the base from e to two for the exponential decay in the second half of the training. The modified formula is
    l r = l r m a x × p c t p e a k i f p c t p e a k l r m a x × 2 1 p c t p e a k i f p c t > p e a k .

3. Results

3.1. Network Training

The models were evaluated by calculating the Dice coefficient [5,8] between the predicted segmentations and the actual segmentations for the test part of the TotalSegmentator dataset. The Dice coefficients are listed in Table 1. It is apparent that, while the training results did not surpass the results of the TotalSegmentator model, they were mostly close to the given lower bound. The Dice coefficient for the training with the modified linear to exponential learning rate scheduler is the lowest listed here.
Training times, including preprocessing and postprocessing for all images can be found in Table 2.

3.2. Statistical Analysis

The volumes of the segmented structures were calculated by obtaining the voxel count for each segmented class and multiplying it by the volume per voxel in liters (dm 3 ). Related structures (e.g., left and right kidney, ribs) were combined into a single structure for this. The sacrum was included in the vertebrae structure, as it is, for the purposes of this paper, an extension of the spine and not an independent structure. Some structures that did not fully appear in any image but still appeared as mistakes in the segmentation were removed from the data. Those structures include the brain, the face, and the vertebrae C1 through C6. A graph containing an overview of the volumes can be found in Figure A2. The average volumes and standard deviations for each structure are listed in Table 3. After extracting the volumes, the Pearson correlation coefficient between the volumes of all pairs of structures was calculated (see Table A3). Correlation coefficients were also calculated between each disease marker and the volume of each structure. These can also be found in Table A3. Correlation coefficients between volumes exceeding 0.5 are listed in Table 4. Correlation coefficients between structure volumes and diseases mostly did not exceed 0.2. The only exceptions to this were between chronic heart failure and iliac artery volume (r = 0.238, p < 0.001) and between aorta volume and chronic obstructive pulmonary disease (COPD), with r = 0.228.
The partial correlation coefficients also mostly did not exceed an absolute value of 0.2. The only exceptions to this were between spleen volume and liver cirrhosis (r = −0.267, p < 0.001), kidney volume and liver cirrhosis (r = 0.257, p < 0.001), and kidney volume and diabetes type II (r = 0.205, p < 0.001).

4. Discussion

4.1. Network Training

Training losses were mostly slightly below those reported by [3], indicating no significant improvements in segmentation quality. One exception was training with the modified linear to exponential learning rate scheduler (see Section 2.4 and Table 1). Here, the final Dice coefficient at the end of training was 0.88496, indicating a poor suitability for training this network compared with the other schedulers. Another exception was the model trained with the Linear learning rate scheduler, which was the only model to reach a final Dice coefficient of more than 0.96, as the TotalSegmentator model. This was the only scheduler to use more than two cycles, and it was also the scheduler with the longest run time. This may indicate that the overfitting point of the training may not have been reached and that a longer training may further improve results. This also indicates that multiple cycles might help the net generalize better.
Regarding the inference and training times of the trained models, despite the high computational cost needed for segmentation, the Dice score was around 0.95 for most models, indicating that it performs well while ideally introducing only limited inaccuracies. Even when taking into account the time needed to correct these errors, evaluating the model still saves time compared to manual segmentation. It may be possible to reduce the hardware footprint (including inference times) of the algorithm by rerunning the configuration and training on a less powerful device, as the nnU-Net configurator by default tries to use as much of the available processing power as possible, leading to a strong hardware dependency if trained on very powerful hardware. By using less powerful hardware, not only would the memory footprint of the model decrease, but so would the inference time, due to a smaller and simpler model.

4.2. Statistical Analysis-Simple Correlation Coefficients

The highest correlations between volumes, as seen in Table 4, were between certain bones and muscle groups, which is to be expected, given that the human body tends to grow proportionally.
The correlation between chronic heart failure and iliac artery volume has not previously been described in the literature to the best of our knowledge. This may have been caused by an overrepresentation of hypertension in the underlying data, a known cause of heart failure, as well as of enlarged blood vessels. This overrepresentation could have led to an increased number of cases with enlarged vessels, as well as heart failures. Additionally, the patients were scanned after rigor mortis had subsided, allowing the muscles in the vessel walls to relax and possibly distend. Another explanation for this correlation, however, is that the segmentation of the iliac arteries was qualitatively inferior, causing random variations in the volume data to appear as correlations. This hypothesis is supported by the fact that the TotalSegmentator also had problems segmenting the iliac artery and vein [3]. In addition, the data from the NMDID had a significantly different contrast level compared to the data the model was trained on (compare Figure 1 and Figure 2). This difference put the NMDID data outside of the data distribution on which the model was trained, which also decreased the segmentation quality. These factors may have compounded, to result in the irregular segmentation shape also seen in Figure A2, where the volumes for the iliac artery and vein do not seem to be normally distributed. The variation coefficient was also rather high, with values of 0.709 and 0.690 for the iliac artery and vein, respectively. The p-value of this correlation was however very small (p < 0.001), indicating only a very small probability that this correlation was not found in the obtained data. Further research, including better volume data, is needed.
The second notable correlation, between occurrences of COPD and aorta volume, has previously been described by [9], who made no claims about causality or temporality, due to the limitations of their study. Since this paper is also limited by the fact that the used CT scans were postmortem, no claims regarding causality or temporality can be made here either. There may, however, be a common cause for these two conditions, since endothelial damage and a decrease in elastin and collagen is a cause of aortic wall degradation, as well as emphysema, which is a major part of COPD [9,10].

4.3. Statistical Analysis-Partial Correlation Coefficients

The positive correlation between kidney size and diabetes type II fits with current medical knowledge about kidney hypertrophy in patients with diabetes, and more specifically diabetic nephropathy [11]. With diabetes type I, this connection was also somewhat observed in the data, with a slightly positive coefficient (r = 0.112); it was, however, weaker than the correlation with diabetes type II. This might have simply been due to the inferior quality of the predicted volumes or statistical variation due to a smaller sample size of patients with diabetes type I (n = 23).
The slight positive correlation between kidney size and liver cirrhosis also fits with current medical knowledge. Both the kidneys and the liver are responsible for regulating blood composition. In cases of liver failure, the kidneys may compensate by increasing filtering activity, which is accompanied by an increase in volume [12].
The slightly negative correlation between spleen size and liver cirrhosis is contrary to established medical knowledge. Usually, in cases of liver cirrhosis, the spleen should show an increase in size [1]; however, this correlation suggests that, in our cases, the spleen tended to have a smaller volume in cases with liver cirrhosis. This inaccuracy in our data may have again been caused by inaccurate volume data. However, the p-value was again very small (p < 0.0001), hinting that there might have been a problem with the model when segmenting enlarged spleens. Possibly, this came from the training dataset not including cases with spleen enlargement, which would put enlarged spleens outside of the patterns the model had learned to recognize, causing the model to segment the spleen incorrectly.

4.4. Conclusions

The neural network training produced models that were reasonably accurate but that require powerful hardware to run. The statistical analysis of organ volumes yielded a previously undescribed correlation between chronic heart failure and iliac artery size, and confirmed a correlation between liver cirrhosis and kidney size, as well as a correlation between aorta volume and COPD. Due to the low correlation coefficients and difficulties in segmenting the NMDID data, further research is required to confirm the results.

4.5. Outlook

Due to the long training times, only a small part of the possible nnU-Net configurations could be explored. Future work may, thus, be dedicated to improving the model by exploring more model configurations. Currently, however, the segmentations produced by the model show inaccuracies when segmenting small soft tissue structures, such as the spleen, smaller vessels, or the heart (apparent in the high variation coefficients in Table 3).
The data correlation yielded mixed results. No new strong correlations were found, and the weak correlations that were found require further research. Future work may focus on trying to replicate the correlation between iliac vessel size and chronic heart failure with higher quality segmentations or, since perfect segmentation results are highly unlikely to ever be achieved, higher quality volume data from other sources. Another problem with the dataset was its limited size, as most diseases occurred in less than 40 cases. Additionally, about half of the cases had two or more disease markers. This leads to problems isolating possible effects of diseases. This could be mitigated by using a larger dataset with more patients and a better disease distribution, as well as by including healthy patients for comparison.

Author Contributions

Conceptualization, J.K. and J.E.; Data curation, T.v.M.; Formal analysis, T.v.M. and S.P.; Investigation, T.v.M. and S.P.; Project administration, J.K. and J.E.; Supervision, J.K. and J.E.; Visualization, T.v.M.; Writing—original draft, T.v.M.; Writing—review & editing, J.K., J.E. and S.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The CT images used in this study to train the neural network are openly available in Zenodo at https://doi.org/10.5281/zenodo.6802614 (accessed on 18 May 2023) (in addition, only the surface meshes of the segmentations can be found under MedShapeNet at https://medshapenet.ikim.nrw/ (accessed on 18 May 2023)). Restrictions apply to the availability of the CT images from which the structure volumes were calculated. These data were obtained from the New Mexico Decedent Image Database, the Free Access Decedent Database funded by the National Institute of Justice grant number 2016-DN-BX-0144. Information on how to request access can be found at https://nmdid.unm.edu/ (accessed on 18 May 2023).

Acknowledgments

This work was supported by the REACT-EU project KITE (Plattform für KI-Translation Essen, https://kite.ikim.nrw/ (accessed on 18 May 2023), EFRE-0801977). Furthermore, the Austrian Science Fund (FWF) KLI 1044: ‘enFaced 2.0-Instant AR Tool for Maxillofacial Surgery’, https://enfaced2.ikim.nrw/ (accessed on 18 May 2023).

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
CTComputed Tomography
NMDID New Mexico Decedent Image Database

Appendix A

Table A1. Classes segmented by the TotalSegmentator model.
Table A1. Classes segmented by the TotalSegmentator model.
adrenal_gland_leftadrenal_gland_rightaorta
autochthon_leftautochthon_rightbrain
clavicula_leftclavicula_rightcolon
duodenumesophagusface
femur_leftfemur_rightgallbladder
gluteus_maximus_leftgluteus_maximus_rightgluteus_medius_left
gluteus_medius_rightgluteus_minimus_leftgluteus_minimus_right
heart_atrium_leftheart_atrium_rightheart_myocardium
heart_ventricle_leftheart_ventricle_righthip_left
hip_righthumerus_lefthumerus_right
iliac_artery_leftiliac_artery_rightiliac_vena_left
iliac_vena_rightiliopsoas_leftiliopsoas_right
inferior_vena_cavakidney_leftkidney_right
liverlung_lower_lobe_leftlung_lower_lobe_right
lung_middle_lobe_rightlung_upper_lobe_leftlung_upper_lobe_right
pancreasportal_vein_and_splenic_veinpulmonary_artery
rib_left_1rib_left_10rib_left_11
rib_left_12rib_left_2rib_left_3
rib_left_4rib_left_5rib_left_6
rib_left_7rib_left_8rib_left_9
rib_right_1rib_right_10rib_right_11
rib_right_12rib_right_2rib_right_3
rib_right_4rib_right_5rib_right_6
rib_right_7rib_right_8rib_right_9
sacrumscapula_leftscapula_right
small_bowelspleenstomach
tracheaurinary_bladdervertebrae_C1
vertebrae_C2vertebrae_C3vertebrae_C4
vertebrae_C5vertebrae_C6vertebrae_C7
vertebrae_L1vertebrae_L2vertebrae_L3
vertebrae_L4vertebrae_L5vertebrae_T1
vertebrae_T10vertebrae_T11vertebrae_T12
vertebrae_T2vertebrae_T3vertebrae_T4
vertebrae_T5vertebrae_T6vertebrae_T7
vertebrae_T8vertebrae_T9

Appendix B

Figure A1. Excerpt of the CT scan protocol of the Office of the Medical Investigator of New Mexico [13,14,15] if the decedent is older than 8 years and has not decomposed. Only scans c and d (1 × 0.5 mm soft tissue and 1 × 0.5 mm bone) have been used in this paper.
Figure A1. Excerpt of the CT scan protocol of the Office of the Medical Investigator of New Mexico [13,14,15] if the decedent is older than 8 years and has not decomposed. Only scans c and d (1 × 0.5 mm soft tissue and 1 × 0.5 mm bone) have been used in this paper.
Biomedinformatics 03 00036 g0a1
Table A2. List of disease markers and their occurrences in the NMDID dataset.
Table A2. List of disease markers and their occurrences in the NMDID dataset.
DiseaseOccurrences
Hypertension181
Diabetes type II73
Coronary artery disease56
Hyperlipidemia43
COPD36
Asthma33
Cirrhosis of the liver31
Hepatitis C28
Stroke26
Diabetes type I22
Arthritis19
Chronic heart failure18
Non-epileptic seizures14
Autoimmune diseases12
Myocardial infarction12
Staphylococcus aureus7
Osteoporosis3
HIV/AIDS3
Figure A2. Histograms of structure volumes, as segmented by the model after combining mirrored and related structures.
Figure A2. Histograms of structure volumes, as segmented by the model after combining mirrored and related structures.
Biomedinformatics 03 00036 g0a2
Table A3. Partial Pearson correlation coefficients between volumes of segmented and postprocessed structures and diseases of the NMDID dataset after accounting for all other data.
Table A3. Partial Pearson correlation coefficients between volumes of segmented and postprocessed structures and diseases of the NMDID dataset after accounting for all other data.
Adrenal GlandAortaAutochtonClaviculaColonDuodenum
Adrenal gland0.000−0.0490.075−0.054−0.085−0.018
Aorta−0.0490.0000.053−0.054−0.0340.059
Autochton0.0750.0530.000−0.0570.139−0.041
Clavicula−0.054−0.054−0.0570.0000.082−0.045
Colon−0.085−0.0340.1390.0820.0000.154
Duodenum−0.0180.059−0.041−0.0450.1540.000
Esophagus−0.047−0.2320.049−0.0680.215−0.040
Femur0.0370.109−0.261−0.1310.083−0.085
Gluteal muscles−0.010−0.0800.358−0.0310.1200.038
Heart0.1300.029−0.170−0.043−0.0660.015
Hip−0.014−0.022−0.1420.018−0.0370.088
Humerus0.029−0.0360.0250.466−0.0300.087
Iliac artery0.0530.332−0.016−0.053−0.0300.106
Iliac vena−0.1080.257−0.0650.104−0.0020.059
Iliopsoas−0.077−0.0860.417−0.027−0.118−0.040
Inferior vena cava−0.024−0.050−0.054−0.0460.1180.052
Kidney0.1920.0900.1030.164−0.142−0.048
Liver0.067−0.0450.004−0.0610.0370.017
Lung−0.0900.1680.100−0.062−0.0850.000
Pancreas0.1100.066−0.0400.035−0.3430.201
Portal and splenic vein0.1230.0100.000−0.0490.1260.097
Pulmonary artery0.0650.1170.1080.076−0.0140.017
Ribs−0.050−0.1030.1450.045−0.0280.111
Scapula0.0930.0890.1160.428−0.0740.020
Small bowel0.0880.042−0.004−0.0210.351−0.090
Spleen0.085−0.0140.1200.0150.0200.053
Stomach−0.0570.0840.097−0.0300.0790.050
Trachea−0.0170.031−0.0600.0040.083−0.133
Urinary bladder−0.088−0.0520.064−0.000−0.0260.031
Vertebrae0.0170.1050.1030.0440.092−0.116
Diabetes type I−0.0240.0030.003−0.0200.062−0.034
Diabetes type II0.121−0.029−0.0390.1030.0190.020
COPD−0.0740.1390.0280.048−0.1160.016
Non−epileptic seizures−0.0120.0710.058−0.008−0.027−0.046
Asthma−0.048−0.0230.021−0.0880.081−0.027
Hypertension0.0970.066−0.0470.090−0.024−0.025
Arthritis−0.0120.021−0.0170.0990.082−0.085
Chronic heart failure−0.024−0.0400.063−0.0580.091−0.022
Stroke0.0390.040−0.0230.027−0.014−0.086
Myocardial infarction0.030−0.0390.0890.0240.063−0.012
Hyperlipidemia−0.046−0.033−0.0340.024−0.0660.019
HIV/AIDS0.0240.0010.0590.004−0.028−0.016
Hepatitis C0.013−0.012−0.001−0.012−0.045−0.056
Osteoporosis0.006−0.032−0.035−0.0170.0130.005
Cirrhosis of the liver0.0690.052−0.0040.085−0.0540.024
Coronary artery disease−0.120−0.095−0.0110.025−0.0090.033
Staphylococcus aureus−0.0060.092−0.0120.0480.074−0.185
Autoimmune diseases0.064−0.031−0.0070.065−0.0440.059
EsophagusFemurGluteal
Muscles
HeartHip
Adrenal gland−0.0470.037−0.0100.130−0.014
Aorta−0.2320.109−0.0800.029−0.022
Autochton0.049−0.2610.358−0.170−0.142
Clavicula−0.068−0.131−0.031−0.0430.018
Colon0.2150.0830.120−0.066−0.037
Duodenum−0.040−0.0850.0380.0150.088
Esophagus0.0000.019−0.0030.0740.038
Femur0.0190.0000.1480.0140.296
Gluteal muscles−0.0030.1480.0000.0140.163
Heart0.0740.0140.0140.000−0.086
Hip0.0380.2960.163−0.0860.000
Humerus0.0080.083−0.055−0.065−0.020
Iliac artery−0.0380.103−0.0710.0670.124
Iliac vena0.0960.0130.0600.1990.125
Iliopsoas−0.0240.0470.4800.0640.080
Inferior vena cava−0.210−0.1800.010−0.031−0.061
Kidney0.0500.0010.0790.065−0.017
Liver−0.007−0.0940.0160.0920.019
Lung0.206−0.053−0.2020.0750.063
Pancreas0.0860.0330.030−0.074−0.090
Portal and splenic vein0.107−0.1490.0960.088−0.047
Pulmonary artery0.1880.0260.1000.388−0.055
Ribs−0.0890.167−0.0680.0030.004
Scapula0.1080.083−0.0000.1190.058
Small bowel0.0050.0160.024−0.060−0.018
Spleen0.176−0.0750.0150.138−0.021
Stomach0.114−0.0650.1250.116−0.011
Trachea0.109−0.072−0.0870.021−0.015
Urinary bladder0.0660.0400.0430.1000.017
Vertebrae−0.025−0.0760.0130.0080.536
Diabetes type I0.185−0.031−0.080−0.0650.050
Diabetes type II0.077−0.040−0.009−0.0160.070
COPD0.0180.1310.101−0.053−0.074
Non−epileptic seizures0.0220.000−0.0250.0790.026
Asthma−0.0490.0010.006−0.0010.026
Hypertension0.0400.0430.0940.075−0.117
Arthritis0.0320.0540.0190.0060.040
Chronic heart failure−0.092−0.000−0.0050.154−0.045
Stroke0.091−0.050−0.110−0.0120.000
Myocardial infarction0.0420.0710.0110.129−0.042
Hyperlipidemia0.041−0.0230.038−0.0440.003
HIV/AIDS0.0120.1210.0180.114−0.028
Hepatitis C−0.0260.0290.0290.091−0.051
Osteoporosis−0.014−0.152−0.041−0.0530.051
Cirrhosis of the liver0.066−0.004−0.0120.0220.031
Coronary artery disease−0.0140.0700.0700.111−0.104
Staphylococcus aureus0.105−0.055−0.017−0.0200.025
Autoimmune diseases0.0550.1020.0000.028−0.055
HumerusIliac ArteryIliac VenaIliopsoasInferior Vena
Cava
Adrenal gland0.0290.053−0.108−0.077−0.024
Aorta−0.0360.3320.257−0.086−0.050
Autochton0.025−0.016−0.0650.417−0.054
Clavicula0.466−0.0530.104−0.027−0.046
Colon−0.030−0.030−0.002−0.1180.118
Duodenum0.0870.1060.059−0.0400.052
Esophagus0.008−0.0380.096−0.024−0.210
Femur0.0830.1030.0130.047−0.180
Gluteal muscles−0.055−0.0710.0600.4800.010
Heart−0.0650.0670.1990.064−0.031
Hip−0.0200.1240.1250.080−0.061
Humerus0.000−0.036−0.022−0.0190.056
Iliac artery−0.0360.0000.173−0.0130.119
Iliac vena−0.0220.1730.0000.1080.131
Iliopsoas−0.019−0.0130.1080.000−0.013
Inferior vena cava0.0560.1190.131−0.0130.000
Kidney−0.023−0.0960.209−0.0680.154
Liver−0.053−0.1350.127−0.1000.034
Lung−0.028−0.0440.0020.100−0.048
Pancreas−0.004−0.0180.1270.1040.079
Portal and splenic vein0.0970.0660.030−0.078−0.104
Pulmonary artery0.0660.052−0.065−0.1150.089
Ribs−0.327−0.052−0.035−0.0320.066
Scapula0.2990.039−0.0130.133−0.009
Small bowel−0.0510.178−0.1860.002−0.016
Spleen0.0160.0370.072−0.079−0.008
Stomach−0.017−0.1000.100−0.0710.084
Trachea0.0340.040−0.0180.066−0.063
Urinary bladder−0.0530.037−0.149−0.110−0.015
Vertebrae−0.0630.044−0.159−0.1260.180
Diabetes type I0.0230.021−0.0410.0360.050
Diabetes type II−0.0370.049−0.125−0.0410.025
COPD0.0520.065−0.038−0.107−0.025
Non−epileptic seizures0.053−0.032−0.075−0.0020.075
Asthma−0.0260.044−0.0160.0030.002
Hypertension−0.0630.103−0.061−0.0060.011
Arthritis0.039−0.0420.0010.0120.094
Chronic heart failure−0.0660.1370.007−0.0990.032
Stroke0.1040.0180.0820.055−0.003
Myocardial infarction−0.0380.1220.003−0.0620.040
Hyperlipidemia−0.019−0.0320.026−0.061−0.015
HIV/AIDS−0.026−0.041−0.052−0.0160.022
Hepatitis C0.059−0.0280.039−0.003−0.003
Osteoporosis−0.0220.0190.0520.062−0.062
Cirrhosis of the liver−0.006−0.045−0.028−0.071−0.064
Coronary artery disease0.0040.170−0.0380.0200.088
Staphylococcus aureus−0.0030.1150.0170.031−0.027
Autoimmune diseases0.0280.0090.0160.015−0.051
Adrenal gland0.1920.067−0.0900.1100.123
Aorta0.090−0.0450.1680.0660.010
Autochton0.1030.0040.100−0.0400.000
Clavicula0.164−0.061−0.0620.035−0.049
KidneyLiverLungPancreasPortal and
Splenic Vein
Colon−0.1420.037−0.085−0.3430.126
Duodenum−0.0480.0170.0000.2010.097
Esophagus0.050−0.0070.2060.0860.107
Femur0.001−0.094−0.0530.033−0.149
Gluteal muscles0.0790.016−0.2020.0300.096
Heart0.0650.0920.075−0.0740.088
Hip−0.0170.0190.063−0.090−0.047
Humerus−0.023−0.053−0.028−0.0040.097
Iliac artery−0.096−0.135−0.044−0.0180.066
Iliac vena0.2090.1270.0020.1270.030
Iliopsoas−0.068−0.1000.1000.104−0.078
Inferior vena cava0.1540.034−0.0480.079−0.104
Kidney0.0000.091−0.042−0.0310.018
Liver0.0910.0000.1090.168−0.125
Lung−0.0420.1090.000−0.1930.057
Pancreas−0.0310.168−0.1930.0000.239
Portal and splenic vein0.018−0.1250.0570.2390.000
Pulmonary artery−0.026−0.060−0.0640.037−0.087
Ribs0.203−0.0760.1980.0910.028
Scapula−0.1420.1090.038−0.077−0.064
Small bowel0.2090.247−0.0820.1500.029
Spleen0.1210.173−0.0940.1130.067
Stomach0.084−0.096−0.0300.147−0.057
Trachea−0.044−0.1060.0780.042−0.048
Urinary bladder0.0590.1320.0520.0680.029
Vertebrae−0.032−0.0080.1220.0750.040
Diabetes type I0.1120.051−0.013−0.060−0.063
Diabetes type II0.2050.020−0.0060.055−0.093
COPD−0.017−0.0410.174−0.0290.018
Non−epileptic seizures−0.014−0.023−0.115−0.0280.060
Asthma0.003−0.0700.0530.0650.044
Hypertension−0.148−0.008−0.075−0.0480.107
Arthritis−0.0490.0440.003−0.0330.071
Chronic heart failure−0.117−0.035−0.060−0.0460.015
Stroke0.037−0.0030.0200.120−0.058
Myocardial infarction0.001−0.0140.049−0.003−0.007
Hyperlipidemia0.077−0.0940.0370.013−0.004
HIV/AIDS−0.0480.1140.0000.042−0.017
Hepatitis C−0.0970.0470.0570.098−0.014
Osteoporosis0.0670.0160.080−0.005−0.050
Cirrhosis of the liver0.2580.0970.0310.096−0.148
Coronary artery disease−0.011−0.059−0.0270.010−0.003
Staphylococcus aureus−0.0920.1100.044−0.0700.031
Autoimmune diseases−0.0360.028−0.034−0.028−0.029
Pulmonary
Artery
RibsScapulaSmall BowelSpleen
Adrenal gland0.065−0.0500.0930.0880.085
Aorta0.117−0.1030.0890.042−0.014
Autochton0.1080.1450.116−0.0040.120
Clavicula0.0760.0450.428−0.0210.015
Colon−0.014−0.028−0.0740.3510.020
Duodenum0.0170.1110.020−0.0900.053
Esophagus0.188−0.0890.1080.0050.176
Femur0.0260.1670.0830.016−0.075
Gluteal muscles0.100−0.068−0.0000.0240.015
Heart0.3880.0030.119−0.0600.138
Hip−0.0550.0040.058−0.018−0.021
Humerus0.066−0.3270.299−0.0510.016
Iliac artery0.052−0.0520.0390.1780.037
Iliac vena−0.065−0.035−0.013−0.1860.072
Iliopsoas−0.115−0.0320.1330.002−0.079
Inferior vena cava0.0890.066−0.009−0.016−0.008
Kidney−0.0260.203−0.1420.2090.121
Liver−0.060−0.0760.1090.2470.173
Lung−0.0640.1980.038−0.082−0.094
Pancreas0.0370.091−0.0770.1500.113
Portal and splenic vein−0.0870.028−0.0640.0290.067
Pulmonary artery0.000−0.051−0.131−0.010−0.013
Ribs−0.0510.0000.4080.0590.007
Scapula−0.1310.4080.0000.113−0.038
Small bowel−0.0100.0590.1130.000−0.017
Spleen−0.0130.007−0.038−0.0170.000
Stomach0.1620.0260.0010.155−0.364
Trachea0.037−0.0180.049−0.100−0.025
Urinary bladder−0.0420.0420.011−0.1140.026
Vertebrae0.1450.2120.183−0.0440.079
Diabetes type I−0.037−0.089−0.0130.059−0.013
Diabetes type II−0.0550.011−0.0810.076−0.060
COPD0.043−0.061−0.1190.0550.040
Non−epileptic seizures−0.0420.091−0.0730.0090.025
Asthma0.075−0.0320.112−0.0920.029
Hypertension−0.031−0.002−0.035−0.0110.040
Arthritis0.0020.014−0.1720.099−0.026
Chronic heart failure−0.0380.0290.167−0.0720.111
Stroke−0.0410.136−0.1540.014−0.156
Myocardial infarction−0.052−0.0450.0210.013−0.049
Hyperlipidemia−0.045−0.0120.0100.0650.004
HIV/AIDS−0.007−0.042−0.063−0.023−0.053
Hepatitis C−0.0900.049−0.0220.081−0.035
Osteoporosis0.050−0.0420.035−0.010−0.038
Cirrhosis of the liver−0.078−0.022−0.0740.135−0.266
Coronary artery disease−0.0190.033−0.058−0.0850.001
Staphylococcus aureus−0.0580.012−0.0070.0470.005
Autoimmune diseases−0.096−0.052−0.1360.004−0.043
StomachTracheaUrinary BladderVertebrae
Adrenal gland−0.057−0.017−0.0880.017
Aorta0.0840.031−0.0520.105
Autochton0.097−0.0600.0640.103
Clavicula−0.0300.004−0.0000.044
Colon0.0790.083−0.0260.092
Duodenum0.050−0.1330.031−0.116
Esophagus0.1140.1090.066−0.025
Femur−0.065−0.0720.040−0.076
Gluteal muscles0.125−0.0870.0430.013
Heart0.1160.0210.1000.008
Hip−0.011−0.0150.0170.536
Humerus−0.0170.034−0.053−0.063
Iliac artery−0.1000.0400.0370.044
Iliac vena0.100−0.018−0.149−0.159
Iliopsoas−0.0710.066−0.110−0.126
Inferior vena cava0.084−0.063−0.0150.180
Kidney0.084−0.0440.059−0.032
Liver−0.096−0.1060.132−0.008
Lung−0.0300.0780.0520.122
Pancreas0.1470.0420.0680.075
Portal and splenic vein−0.057−0.0480.0290.040
Pulmonary artery0.1620.037−0.0420.145
Ribs0.026−0.0180.0420.212
Scapula0.0010.0490.0110.183
Small bowel0.155−0.100−0.114−0.044
Spleen−0.364−0.0250.0260.079
Stomach0.000−0.0740.023−0.063
Trachea−0.0740.0000.1560.133
Urinary bladder0.0230.1560.000−0.039
Vertebrae−0.0630.133−0.0390.000
Diabetes type I−0.034−0.0290.0560.017
Diabetes type II−0.0540.0240.099−0.047
COPD0.004−0.111−0.1240.102
Non−epileptic seizures0.0240.0630.116−0.085
Asthma0.021−0.077−0.037−0.135
Hypertension0.034−0.086−0.0220.121
Arthritis−0.0870.036−0.0010.009
Chronic heart failure0.0550.0180.072−0.070
Stroke−0.105−0.0680.0570.053
Myocardial infarction−0.150−0.028−0.051−0.035
Hyperlipidemia0.010−0.054−0.030−0.004
HIV/AIDS−0.0470.094−0.0140.064
Hepatitis C−0.071−0.034−0.0250.020
Osteoporosis−0.0400.050−0.013−0.027
Cirrhosis of the liver−0.128−0.0000.108−0.048
Coronary artery disease−0.011−0.062−0.0520.048
Staphylococcus aureus−0.002−0.1350.025−0.033
Autoimmune diseases0.112−0.090−0.0010.160

References

  1. Takuma, Y.; Nouso, K.; Morimoto, Y.; Tomokuni, J.; Sahara, A.; Takabatake, H.; Matsueda, K.; Yamamoto, H. Portal Hypertension in Patients with Liver Cirrhosis: Diagnostic Accuracy of Spleen Stiffness. Radiology 2016, 279, 609–619. [Google Scholar] [CrossRef] [PubMed]
  2. Egger, J.; Gsaxner, C.; Pepe, A.; Pomykala, K.L.; Jonske, F.; Kurz, M.; Li, J.; Kleesiek, J. Medical deep learning—A systematic meta-review. Comput. Methods Programs Biomed. 2022, 221, 106874. [Google Scholar] [CrossRef] [PubMed]
  3. Wasserthal, J.; Meyer, M.; Breit, H.C.; Cyriac, J.; Yang, S.; Segeroth, M. TotalSegmentator: Robust segmentation of 104 anatomical structures in CT images. arXiv 2022, arXiv:2208.05868. [Google Scholar] [CrossRef]
  4. Edgar, H.; Daneshvari Berry, S.; Moes, E.; Adolphi, N.; Bridges, P.; Nolte, K. New Mexico Decedent Image Database; Office of the Medical Investigator, University of New Mexico: Albuquerque, NM, USA, 2020. [Google Scholar] [CrossRef]
  5. Isensee, F.; Jaeger, P.F.; Kohl, S.A.A.; Petersen, J.; Maier-Hein, K.H. nnU-Net: A self-configuring method for deep learning-based biomedical image Segmentation. Nat. Methods 2020, 18, 203–211. [Google Scholar] [CrossRef] [PubMed]
  6. Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention (MICCAI), Munich, Germany, 5–9 October 2015; LNCS; Springer: Berlin/Heidelberg, Germany, 2015; Volume 9351, pp. 234–241. [Google Scholar] [CrossRef] [Green Version]
  7. Smith, L.N. Cyclical Learning Rates for Training Neural Networks. In Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), Santa Rosa, CA, USA, 24–31 March 2017. [Google Scholar] [CrossRef]
  8. Egger, J.; Kapur, T.; Fedorov, A.; Pieper, S.; Miller, J.V.; Veeraraghavan, H.; Freisleben, B.; Golby, A.J.; Nimsky, C.; Kikinis, R. GBM volumetry using the 3D Slicer medical image computing platform. Sci. Rep. 2013, 3, 1–7. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Fujikura, K.; Albini, A.; Barr, R.G.; Parikh, M.; Kern, J.; Hoffman, E.; Hiura, G.T.; Bluemke, D.A.; Carr, J.; Lima, J.A.; et al. Aortic enlargement in chronic obstructive pulmonary disease (COPD) and emphysema: The Multi-Ethnic Study of Atherosclerosis (MESA) COPD study. Int. J. Cardiol. 2021, 331, 214–220. [Google Scholar] [CrossRef] [PubMed]
  10. Snider, G.L.; Kleinerman, J.; Thurlbeck, W.M.; Bengali, Z.H. The Definition of Emphysema. Am. Rev. Respir. Dis. 1985, 132, 182–185. [Google Scholar]
  11. Habib, S.L. Kidney atrophy vs hypertrophy in diabetes: Which cells are involved? Cell Cycle 2018, 17, 1683–1687. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Saad, M.; Ellis, C.L.; O’Neill, W.C. Renal Hypertrophy in Liver Failure. Kidney Int. Rep. 2018, 3, 1464–1467. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Office of the Medical Investigator-UNM. Available online: https://hsc.unm.edu/omi/ (accessed on 18 May 2023).
  14. CT Scan Protocol. Available online: https://nmdid.unm.edu/resources/data-information (accessed on 18 May 2023).
  15. Torso Protocol. Available online: https://nmdid.unm.edu/docs/CT_OMI_protocol.pdf (accessed on 18 May 2023).
Figure 1. Example slices of a computed tomography (CT) scan used for measuring the performance of the model. The scan is part of the TotalSegmentator dataset [3]. The slices are taken from the same scan, the left slice is in the frontal plane, while the right slice is in the sagittal plane.
Figure 1. Example slices of a computed tomography (CT) scan used for measuring the performance of the model. The scan is part of the TotalSegmentator dataset [3]. The slices are taken from the same scan, the left slice is in the frontal plane, while the right slice is in the sagittal plane.
Biomedinformatics 03 00036 g001
Figure 2. Example slices of a CT scan used for generating organ volume data. The scan is part of the New Mexico Decedent Image Database (NMDID) cases [4]. The slices are taken from the same scan, the left slice is in the frontal plane, while the right slice is in the sagittal plane.
Figure 2. Example slices of a CT scan used for generating organ volume data. The scan is part of the New Mexico Decedent Image Database (NMDID) cases [4]. The slices are taken from the same scan, the left slice is in the frontal plane, while the right slice is in the sagittal plane.
Biomedinformatics 03 00036 g002
Table 1. Dice coefficients of the different trained models on the test data of the TotalSegmentator dataset. The aggregate score is calculated by taking the average Dice coefficient for each class at the end of the training. Also listed is the mean Dice coefficient for all classes achieved by [5]. The learning rate schedulers are as given in Section 2.4. The “Opposing Labels” column states whether labels pertaining to similar versions of the same structure (e.g., left and right kidney) have been left separate or have been combined into one label containing all instances of the structure.
Table 1. Dice coefficients of the different trained models on the test data of the TotalSegmentator dataset. The aggregate score is calculated by taking the average Dice coefficient for each class at the end of the training. Also listed is the mean Dice coefficient for all classes achieved by [5]. The learning rate schedulers are as given in Section 2.4. The “Opposing Labels” column states whether labels pertaining to similar versions of the same structure (e.g., left and right kidney) have been left separate or have been combined into one label containing all instances of the structure.
Opposing LabelsLearning Rate SchedulingEpochsAggregate Score
SeparateBuilt-in10000.9512
Linear40000.9611
Linear to Exponential10000.9491
Quick Linear to Exponential10000.9592
Modified Linear to Exponential10000.8850
CombinedLinear to Exponential10000.9412
TotalSegmentator model4000greater than 0.96
Table 2. Average time elapsed per epoch for each of the models during training, in seconds. The first epoch was not included because of certain setup processes of the training script.
Table 2. Average time elapsed per epoch for each of the models during training, in seconds. The first epoch was not included because of certain setup processes of the training script.
Opposing LabelsLearning Rate SchedulingEpochsSeconds per Epoch
SeparateLinear to Exponential1000332.2
Linear4000712.1
Quick Linear to Exponential1000668.7
Modified Linear to Exponential1000649.4
Built-in1000283.9
CombinedModified1000624.8
Table 3. Mean volume and standard deviation for each processed structure in liters (dm 3 ).
Table 3. Mean volume and standard deviation for each processed structure in liters (dm 3 ).
StructureMean VolumeStandard DeviationVariation Coefficient
Adrenal gland0.00540.00280.5179
Aorta0.08440.05020.5946
Autochton1.05260.30780.2924
Clavicula0.05370.01700.3166
Colon1.51310.70310.4647
Duodenum0.03360.01690.5029
Esophagus0.04030.01450.3604
Femur0.47470.11200.2358
Gluteal muscles1.69340.57100.3372
Heart0.32390.12510.3861
Hip0.74450.16290.2189
Humerus0.08770.05310.6056
Iliac artery0.01190.00840.7093
Iliac vena0.02120.01460.6903
Iliopsoas0.53180.20420.3839
Inferior vena cava0.01100.01080.9834
Kidney0.26860.10130.3769
Liver0.86130.42210.4901
Lung2.62320.75470.2877
Pancreas0.02140.01950.9088
Portal and splenic vein0.00470.00430.9149
Pulmonary artery0.02910.01670.5718
Ribs0.41370.11270.2723
Scapula0.21550.05200.2412
Small bowel0.99910.42530.4257
Spleen0.76380.28660.3752
Stomach0.40960.29610.7229
Trachea0.02100.01290.6145
Urinary bladder0.19020.18320.9630
Vertebrae0.91090.16840.1848
Table 4. Pearson correlation coefficients (r) exceeding 0.5 between structure volumes.
Table 4. Pearson correlation coefficients (r) exceeding 0.5 between structure volumes.
Structuresrp-Value
HipVertebrae0.7353<0.05
AutochthonGluteal muscles0.7116<0.05
IliopsoasGluteal muscles0.7102<0.05
AutochthonIliopsoas0.6829<0.05
ScapulaRibs0.6481<0.05
ScapulaVertebrae0.6076<0.05
ClaviculaScapula0.5979<0.05
HipScapula0.5974<0.05
RibsVertebrae0.5954<0.05
HipRibs0.5917<0.05
ScapulaAutochthon0.5044<0.05
ClaviculaHumerus0.5003<0.05
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

van Meegdenburg, T.; Kleesiek, J.; Egger, J.; Perrey, S. Improvement in Disease Diagnosis in Computed Tomography Images by Correlating Organ Volumes with Disease Occurrences in Humans. BioMedInformatics 2023, 3, 526-542. https://doi.org/10.3390/biomedinformatics3030036

AMA Style

van Meegdenburg T, Kleesiek J, Egger J, Perrey S. Improvement in Disease Diagnosis in Computed Tomography Images by Correlating Organ Volumes with Disease Occurrences in Humans. BioMedInformatics. 2023; 3(3):526-542. https://doi.org/10.3390/biomedinformatics3030036

Chicago/Turabian Style

van Meegdenburg, Timo, Jens Kleesiek, Jan Egger, and Sören Perrey. 2023. "Improvement in Disease Diagnosis in Computed Tomography Images by Correlating Organ Volumes with Disease Occurrences in Humans" BioMedInformatics 3, no. 3: 526-542. https://doi.org/10.3390/biomedinformatics3030036

Article Metrics

Back to TopTop