External Validation of an Open-Source Model for Automated Muscle Segmentation in CT Imaging of Cancer Patients
Abstract
1. Introduction
2. Materials and Methods
2.1. Model and Training Data
2.2. Reference Standard for External Validation
2.3. Inference Processing

2.4. Analysis
3. Results
3.1. Visual Inspection of Segmentation Errors
3.2. Subject Characteristic Analysis
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| BMI | Body Mass Index |
| CT | Computed Tomography |
| DBSCAN | Density-Based Clustering Algorithm |
| DSC | Dice Similarity Coefficient |
| HU | Hounsfield Unit |
| IV contrast | Intravenous Contrast |
| L3 | Lumbar Vertebra 3 |
| nnU-Net | Open-Source AI Workflow |
| PET-CT | Positron Emission Tomography—Computed Tomography |
| SAROS | Sparsely Annotated Region and Organ Segmentation, a publicly available dataset |
References
- Cruz-Jentoft, A.J.; Bahat, G.; Bauer, J.; Boirie, Y.; Bruyère, O.; Cederholm, T.; Cooper, C.; Landi, F.; Rolland, Y.; Sayer, A.A.; et al. Sarcopenia: Revised European consensus on definition and diagnosis. Age Ageing 2019, 48, 601. [Google Scholar] [CrossRef] [PubMed]
- Beaudart, C.; Zaaria, M.; Pasleau, F.; Reginster, J.; Bruyère, O. Health Outcomes of Sarcopenia: A Systematic Review and Meta-Analysis. PLoS ONE 2017, 12, e0169548. [Google Scholar] [CrossRef] [PubMed]
- Koh, F.H.; Loh, C.H.; Tan, W.J.; Ho, L.M.L.; Yen, D.; Chua, J.M.W.; Kok, S.S.; Sivarajah, S.S.; Chew, M.; Foo, F. Structured presurgery prehabilitation for aged patients undergoing elective surgery significantly improves surgical outcomes and reduces cost: A nonrandomized sequential comparative prospective cohort study. Nutr. Clin. Pract. 2022, 37, 645–653. [Google Scholar] [CrossRef] [PubMed]
- Srivastava, S.; Pandey, V.K.; Singh, A.; Dar, A.H. Exploring the Potential of Treating Sarcopenia through Dietary Interventions. J. Food Biochem. 2024, 2024, 1–19. [Google Scholar] [CrossRef]
- Van den Broeck, J.; Sealy, M.J.; Brussaard, C.; Jager-Wittenaar, H.; Scafoglieri, A. Correlation of skeletal muscle area and muscle attenuation between l3, c3, and t4 level in patients with cancer: Results from the body-convert study group. Clin. Nutr. ESPEN 2021, 46, S576–S577. [Google Scholar] [CrossRef]
- Sealy, M.J.; Van den Broeck, J.; Brussaard, C.; Kunstman, B.; Scafoglieri, A.; Jager-Wittenaar, H. Variations in vertebral muscle mass and muscle quality in adult patients with different types of cancer. Nutrition 2024, 128, 112553. [Google Scholar] [CrossRef]
- Sealy, M.J.; Dechaphunkul, T.; van der Schans, C.P.; Krijnen, W.P.; Roodenburg, J.L.N.; Walker, J.; Jager-Wittenaar, H.; Baracos, V.E. Low muscle mass is associated with early termination of chemotherapy related to toxicity in patients with head and neck cancer. Clin. Nutr. 2020, 39, 501–509. [Google Scholar] [CrossRef]
- Mitsiopoulos, N.; Baumgartner, R.N.; Heymsfield, S.B.; Lyons, W.; Gallagher, D.; Ross, R. Cadaver validation of skeletal muscle measurement by magnetic resonance imaging and computerized tomography. J. Appl. Physiol. 1998, 85, 115–122. [Google Scholar] [CrossRef]
- Sanada, K.; Kearns, C.F.; Midorikawa, T.; Abe, T. Prediction and validation of total and regional skeletal muscle mass by ultrasound in Japanese adults. Eur. J. Appl. Physiol. 2006, 96, 24–31. [Google Scholar] [CrossRef]
- Mourtzakis, M.; Prado, C.M.M.; Lieffers, J.R.; Reiman, T.; McCargar, L.J.; Baracos, V.E. A practical and precise approach to quantification of body composition in cancer patients using computed tomography images acquired during routine care. Appl. Physiol. Nutr. Metab. 2008, 33, 997–1006. [Google Scholar] [CrossRef]
- Anderson, D.E.; D’Agostino, J.M.; Bruno, A.G.; Demissie, S.; Kiel, D.P.; Bouxsein, M.L. Variations of CT-Based Trunk Muscle Attenuation by Age, Sex, and Specific Muscle. J. Gerontol. Ser. A Biol. Sci. Med. Sci. 2013, 68, 317–323. [Google Scholar] [CrossRef]
- Tolonen, A.; Pakarinen, T.; Sassi, A.; Kyttä, J.; Cancino, W.; Rinta-Kiikka, I.; Pertuz, S.; Arponen, O. Methodology, clinical applications, and future directions of body composition analysis using computed tomography (CT) images: A review. Eur. J. Radiol. 2021, 145, 109943. [Google Scholar] [CrossRef]
- Elhakim, T.; Trinh, K.; Mansur, A.; Bridge, C.; Daye, D. Role of Machine Learning-Based CT Body Composition in Risk Prediction and Prognostication: Current State and Future Directions. Diagnostics 2023, 13, 968. [Google Scholar] [CrossRef]
- Dabiri, S.; Popuri, K.; Cespedes Feliciano, E.M.; Caan, B.J.; Baracos, V.E.; Beg, M.F. Muscle segmentation in axial computed tomography (CT) images at the lumbar (L3) and thoracic (T4) levels for body composition analysis. Comput. Med. Imaging Graph. 2019, 75, 47–55. [Google Scholar] [CrossRef] [PubMed]
- Islam, S.; Kanavati, F.; Arain, Z.; Da Costa, O.F.; Crum, W.; Aboagye, E.O.; Rockall, A. Fully automated deep-learning section-based muscle segmentation from CT images for sarcopenia assessment. Clin. Radiol. 2022, 77, e363–e371. [Google Scholar] [CrossRef]
- Mai, D.V.C.; Drami, I.; Pring, E.T.; Gould, L.E.; Lung, P.; Popuri, K.; Chow, V.; Beg, M.F.; Athanasiou, T.; Jenkins, J.T.; et al. A systematic review of automated segmentation of 3D computed-tomography scans for volumetric body composition analysis. J. Cachexia Sarcopenia Muscle 2023, 14, 1973–1986. [Google Scholar] [CrossRef] [PubMed]
- Bedrikovetski, S.; Seow, W.; Kroon, H.M.; Traeger, L.; Moore, J.W.; Sammour, T. Artificial intelligence for body composition and sarcopenia evaluation on computed tomography: A systematic review and meta-analysis. Eur. J. Radiol. 2022, 149, 110218. [Google Scholar] [CrossRef] [PubMed]
- Kreher, R.; Hinnerichs, M.; Preim, B.; Saalfeld, S.; Surov, A. Deep-learning-based Segmentation of Skeletal Muscle Mass in Routine Abdominal CT Scans. In Vivo 2022, 36, 1807–1811. [Google Scholar]
- Dietz, M.V.; Popuri, K.; Janssen, L.; Salehin, M.; Ma, D.; Chow, V.T.Y.; Lee, H.; Verhoef, C.; Madsen, E.V.; Beg, M.F.; et al. Evaluation of a fully automated computed tomography image segmentation method for fast and accurate body composition measurements. Nutrition 2025, 129, 112592. [Google Scholar] [CrossRef]
- Kreher, R.; Hille, G.; Preim, B.; Hinnerichs, M.; Borggrefe, J.; Surov, A.; Saalfeld, S. Multilabel segmentation and analysis of skeletal muscle and adipose tissue in routine abdominal CT scans. Comput. Biol. Med. 2025, 186, 109622. [Google Scholar] [CrossRef]
- Isensee, F.; Wald, T.; Ulrich, C.; Baumgartner, M.; Roy, S.; Maier-Hein, K.; Jaeger, P.F. nnU-Net Revisited: A Call for Rigorous Validation in 3D Medical Image Segmentation. arXiv 2024, arXiv:2404.09556. [Google Scholar]
- Isensee, F.; Jaeger, P.F.; Kohl, S.A.A.; Petersen, J.; Maier-Hein, K.H. nnU-Net: A self-configuring method for deep learning-based biomedical image segmentation. Nat. Methods 2021, 18, 203–211. [Google Scholar] [CrossRef] [PubMed]
- Koitka, S.; Baldini, G.; Kroll, L.; van Landeghem, N.; Pollok, O.B.; Haubold, J.; Pelka, O.; Kim, M.; Kleesiek, J.; Nensa, F.; et al. SAROS: A dataset for whole-body region and organ segmentation in CT imaging. Sci. Data 2024, 11, 483. [Google Scholar] [CrossRef]
- Koitka, S.; Baldini, G.; Kroll, L.; van Landeghem, N.; Haubold, J.; Sung Kim, M.; Kleesiek, J.; Nensa, F.; Hosch, R. SAROS—A Large, Heterogeneous, and Sparsely Annotated Segmentation Dataset on CT Imaging Data. 2024. Available online: https://www.cancerimagingarchive.net/analysis-result/saros/ (accessed on 28 December 2023).
- Jensen, G.L.; Cederholm, T.; Correia, M.I.T.D.; Gonzalez, M.C.; Fukushima, R.; Higashiguchi, T.; de Baptista, G.A.; Barazzoni, R.; Blaauw, R.; Coats, A.J.; et al. GLIM Criteria for the Diagnosis of Malnutrition: A Consensus Report from the Global Clinical Nutrition Community. J. Parenter. Enter. Nutr. 2019, 43, 32–40. [Google Scholar] [CrossRef] [PubMed]
- Garbe, C.; Amaral, T.; Peris, K.; Hauschild, A.; Arenberger, P.; Basser-Seguin, N.; Bastholt, L.; Bataille, V.; del Marmol, V.; Dréno, B.; et al. European consensus-based interdisciplinary guideline for melanoma. Part 2: Treatment—Update 2019. Eur. J. Cancer 2020, 126, 159–177. [Google Scholar] [CrossRef]
- Berry, M.F. Esophageal cancer: Staging system and guidelines for staging and treatment. J. Thorac. Dis. 2014, 6, S289–S297. [Google Scholar]
- Silvestri, G.A.; Gonzalez, A.V.; Jantz, M.A.; Margolis, M.L.; Gould, M.K.; Tanoue, L.T.; Harris, L.J.; Detterbeck, F.C. Methods for staging non-small cell lung cancer: Diagnosis and management of lung cancer, 3rd ed: American College of Chest Physicians evidence-based clinical practice guidelines. Chest 2013, 143, e211S–e250S. [Google Scholar] [CrossRef]
- Lydiatt, W.M.; Patel, S.G.; O’Sullivan, B.; Brandwein, M.S.; Ridge, J.A.; Migliacci, J.C.; Loomis, A.M.; Shah, J.P. Head and neck cancers—Major changes in the American Joint Committee on cancer eighth edition cancer staging manual. CA Cancer J. Clin. 2017, 67, 122–137. [Google Scholar] [CrossRef]
- Scikit-Learn Developers. Scikit-Learn (1.3.2) API Reference, DBSCAN. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.cluster.DBSCAN.html (accessed on 17 December 2024).
- Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0: Fundamental algorithms for scientific computing in Python (1.10.1). Nat. Methods 2020, 17, 261–272. [Google Scholar] [CrossRef]
- Reinke, A.; Tizabi, M.D.; Baumgartner, M.; Eisenmann, M.; Heckmann-Nötzel, D.; Kavur, A.E.; Rädsch, T.; Sudre, C.H.; Acion, L.; Antonelli, M.; et al. Understanding metric-related pitfalls in image analysis validation. Nat. Methods 2024, 21, 182–194. [Google Scholar] [CrossRef]
- Maier-Hein, L.; Reinke, A.; Godau, P.; Tizabi, M.D.; Buettner, F.; Christodoulou, E.; Glocker, B.; Isensee, F.; Kleesiek, J.; Kozubek, M.; et al. Metrics reloaded: Recommendations for image analysis validation. Nat. Methods 2024, 21, 195–212. [Google Scholar] [CrossRef]
- Martin, L.; Birdsell, L.; MacDonald, N.; Reiman, T.; Clandinin, M.T.; McCargar, L.J.; Murphy, R.; Ghosh, S.; Sawyer, M.B.; Baracos, V.E. Cancer Cachexia in the Age of Obesity: Skeletal Muscle Depletion Is a Powerful Prognostic Factor, Independent of Body Mass Index. J. Clin. Oncol. 2013, 31, 1539–1547. [Google Scholar] [CrossRef]
- Zoabi, A. Adipose tissue composition determines its computed tomography radiodensity. Eur. Radiol. 2024, 34, 1635–1644. [Google Scholar] [CrossRef]
- Werf, A.; Dekker, I.M.; Meijerink, M.R.; Wierdsma, N.J.; Schueren, M.A.E.; Langius, J.A.E. Skeletal muscle analyses: Agreement between non-contrast and contrast CT scan measurements of skeletal muscle area and mean muscle attenuation. Clin. Physiol. Funct. Imaging 2018, 38, 366–372. [Google Scholar] [CrossRef]
- Victor, A. Artificial intelligence in global health: An unfair future for health in Sub-Saharan Africa? Health Aff. Sch. 2025, 3, qxaf023. [Google Scholar] [CrossRef]




| Test 1 | Dice Similarity Coefficient | Segmentation Surface Error | |||
|---|---|---|---|---|---|
| Correlation or Lowest DSC | p-Value | Correlation or Largest SSE | p-Value | ||
| Age (years) | Sp | corr. = −0.008 | 0.912 | corr. = −0.123 | 0.092 |
| BMI (kg/m2) | Sp | corr. = 0.517 | <0.001 * | corr. = 0.406 | <0.001 * |
| GLIM underweight (y/n) 2 | MW | 0.955 | <0.001 * | 5.195 cm2 | <0.001 * |
| Sex (M/F) | MW | 0.978 | 0.036 * | 3.917 cm2 | 0.498 |
| Cancer grade (1–4) | KW | 0.977 | 0.125 | 4.191 cm2 | 0.136 |
| Arm pos. (up vs. down) 3 | MW | 0.978 | 0.180 | 3.929 cm2 | 0.353 |
| Use of IV (y/n) | MW | 0.978 | 0.165 | 3.921 cm2 | 0.210 |
| CCI (1–4) | KW | 0.965 | 0.777 | 5.183 cm2 | 0.567 |
| Cancer types | KW | 0.974 | 0.037 * | 5.902 cm2 | 0.488 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Erenstein, H.; Van den Broeck, J.; van der Heij-Meijer, A.; Krijnen, W.P.; Scafoglieri, A.; Jager-Wittenaar, H.; Sealy, M.; van Ooijen, P. External Validation of an Open-Source Model for Automated Muscle Segmentation in CT Imaging of Cancer Patients. J. Imaging 2026, 12, 135. https://doi.org/10.3390/jimaging12030135
Erenstein H, Van den Broeck J, van der Heij-Meijer A, Krijnen WP, Scafoglieri A, Jager-Wittenaar H, Sealy M, van Ooijen P. External Validation of an Open-Source Model for Automated Muscle Segmentation in CT Imaging of Cancer Patients. Journal of Imaging. 2026; 12(3):135. https://doi.org/10.3390/jimaging12030135
Chicago/Turabian StyleErenstein, Hendrik, Jona Van den Broeck, Annemieke van der Heij-Meijer, Wim P. Krijnen, Aldo Scafoglieri, Harriët Jager-Wittenaar, Martine Sealy, and Peter van Ooijen. 2026. "External Validation of an Open-Source Model for Automated Muscle Segmentation in CT Imaging of Cancer Patients" Journal of Imaging 12, no. 3: 135. https://doi.org/10.3390/jimaging12030135
APA StyleErenstein, H., Van den Broeck, J., van der Heij-Meijer, A., Krijnen, W. P., Scafoglieri, A., Jager-Wittenaar, H., Sealy, M., & van Ooijen, P. (2026). External Validation of an Open-Source Model for Automated Muscle Segmentation in CT Imaging of Cancer Patients. Journal of Imaging, 12(3), 135. https://doi.org/10.3390/jimaging12030135

