Next Article in Journal
Sun-Exposed vs. Non-Sun-Exposed Areas: Epidemiology and Pathogenesis of Non-Metastatic Merkel Cell Carcinoma
Previous Article in Journal
Automated Assessment of Ki-67 Labeling Index Using Cell-Level Detection and Classification in Whole-Slide Images
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparison of Manual, Semi-Automatic, and Automatic CT-Based Methods for Liver Volume Segmentation

by
Berna Dogan
1,*,
Sadik Bugrahan Simsek
1,
Sefa Sonmez
1,
Merve Nur Ozgen Sonmez
1,
Omur Dasci
2 and
Zafer Ozmen
2
1
Department of Anatomy, Faculty of Medicine, Tokat Gaziosmanpasa University, Tokat 60100, Türkiye
2
Department of Radiology, Faculty of Medicine, Tokat Gaziosmanpasa University, Tokat 60100, Türkiye
*
Author to whom correspondence should be addressed.
Diagnostics 2026, 16(5), 817; https://doi.org/10.3390/diagnostics16050817
Submission received: 26 January 2026 / Revised: 2 March 2026 / Accepted: 6 March 2026 / Published: 9 March 2026
(This article belongs to the Special Issue Recent Advances in Abdominal Imaging)

Abstract

Background/Objectives: To evaluate whether semi-automatic and automatic CT-based liver segmentation methods can provide clinically acceptable volumetric agreement compared with manual segmentation while improving processing efficiency in routine practice. Methods: CT images from 86 individuals were retrospectively analyzed. Liver volumes were calculated using manual segmentation, RVX Semi-Automatic, RVX Deep Learning, and TotalSegmentator. Differences among methods were assessed using repeated-measures ANOVA. Agreement with manual segmentation was evaluated using a Bland–Altman analysis, while the Dice Similarity Coefficient (DICE) and Hausdorff Distance (HD) quantified spatial overlap and boundary deviation, respectively. Processing times were recorded. Results: Mean liver volumes were 1503.9 ± 356.0 cm3 (manual), 1512.6 ± 373.6 cm3 (RVX Semi-Automatic), 1549.8 ± 367.9 cm3 (RVX Deep Learning), and 1518.3 ± 365.8 cm3 (TotalSegmentator). RVX Deep Learning produced significantly higher volumes compared with manual segmentation (p < 0.001), whereas RVX Semi-Automatic and TotalSegmentator showed no significant differences (p > 0.05). DICE values were 0.911 ± 0.032, 0.946 ± 0.018, and 0.938 ± 0.021 for RVX Semi-Automatic, RVX Deep Learning, and TotalSegmentator, respectively. HD values were highest for the deep learning-based method. Processing times were shortest for RVX Deep Learning and longest for manual segmentation. Conclusions: Semi-automatic and automatic liver segmentation methods substantially reduce processing time while maintaining clinically acceptable volumetric agreement. Among the evaluated approaches, TotalSegmentator showed the closest agreement with manual segmentation, supporting its use in routine CT-based liver volumetry. Deep learning-based segmentation, although faster, tended to overestimate volume, potentially limiting its use in applications requiring high volumetric precision.

1. Introduction

Accurate and reliable assessment of liver volume has become increasingly important in clinical decision-making, particularly in the context of liver transplantation, major hepatectomy, and complex hepatic surgical procedures [1,2]. Precise volumetric evaluation is essential for estimating functional liver reserve, optimizing surgical planning, and reducing postoperative complications. In addition, liver volume measurement has gained relevance as an objective imaging-based parameter in the assessment of various hepatic conditions [3,4,5].
Computed tomography (CT) is the most commonly used imaging modality for liver volumetry [6]. Although manual segmentation is traditionally used as a clinical reference standard, it requires slice-by-slice delineation, is therefore time-consuming and operator-dependent, and does not represent an absolute ground truth [7,8,9]. Moreover, anatomical complexity and interindividual variability—particularly in the region of the porta hepatis—pose additional challenges for accurate segmentation [5]. To overcome these limitations, various semi-automatic and automatic segmentation methods have been developed, including deep learning-based approaches that aim to reduce processing time and enhance reproducibility. However, despite promising technical performance, these methods have not yet been widely adopted in routine clinical practice, mainly due to limited comparative validation and the lack of standardized evaluation across different segmentation strategies [10,11,12,13,14].
In this study, manual, semi-automatic, deep learning-based automatic, and automatic liver segmentation methods were systematically compared using contrast-enhanced CT images. The primary aim was to determine whether semi-automatic and automatic segmentation approaches can achieve clinically acceptable volumetric agreement comparable to manual segmentation.

2. Materials and Methods

This study was conducted in accordance with the principles of the Declaration of Helsinki. Ethical approval was obtained from the Tokat Gaziosmanpasa University Non-Interventional Clinical Research Ethics Committee (No: E-15235480-050.04-564671).

2.1. Study Population

In this retrospective study, contrast-enhanced abdominal CT images obtained between January 2025 and May 2025 at Tokat Gaziosmanpasa University Research and Application Hospital were reviewed. Images of healthy individuals who underwent abdominal CT scans for control purposes and had no known disease diagnosis were included. For each participant, anonymized demographic data, including age and sex, were recorded. Based on this list, the Picture Archiving and Communication System (PACS) database was accessed under the supervision of a radiologist. CT images that provided a precise evaluation of liver anatomy were included in the study. Images showing pathological formations in the abdominal region, hepatomegaly, a history of liver disease, or significant imaging artifacts were excluded from the analysis. According to these inclusion and exclusion criteria, abdominal CT images from a total of 86 individuals (41 males, 45 females) were included in the analysis.

2.2. CT Imaging Protocol

All abdominal CT images included in this study were obtained from a single institution using a 128-slice multidetector CT scanner (Optima CT660, GE Healthcare, Milwaukee, WI, USA). The imaging parameters were as follows: tube voltage of 120 kV, tube current of 150 mAs, collimation of 64 mm × 0.5 mm, slice thickness of 2.5 mm, reconstruction thickness of 1.25 mm, matrix size of 512 × 512 pixels, and gantry angle of 0°. For all participants, an iodinated contrast agent was administered intravenously at a dose of 1.5 mL/kg. Imaging was performed approximately 60 s after injection, in the portal venous phase, in the craniocaudal direction. Liver assessments were conducted during this phase, when maximum enhancement of the portal and hepatic veins within the parenchyma was achieved.

2.3. Segmentation Methods

All segmentation procedures were performed using modules available in the open-source software 3D Slicer (version 5.8.1; https://www.slicer.org/ (accessed on 1 April 2025)). The processing steps for each segmentation method are described below.

2.3.1. Manual Segmentation

Manual segmentation was performed using the Segment Editor module in 3D Slicer. Contrast-enhanced abdominal CT images in Digital Imaging and Communications in Medicine (DICOM) format were imported into the software. Each image was evaluated simultaneously in the axial, coronal, and sagittal planes. A new segment was created using the Add option in the Segment Editor module, and the liver was manually delineated slice by slice using the Paint tool. After completion of the labeling process, a three-dimensional model of the segment was generated using the Show 3D option. Volume measurements were automatically calculated through the Segment Statistics module.

2.3.2. Semi-Automatic Segmentation with the RVX Liver Module

For this segmentation, the R-Vessel-X (Robust Vascular Network Extraction and Understanding within Hepatic Biomedical Images, RVX) Liver Segmentation extension integrated into 3D Slicer was used [15]. The method was performed using a semi-automatic segmentation approach. First, CT images in DICOM format were imported into the software. Patient data were opened through the Load DICOM or Load Data modules within the RVX Liver Segmentation extension. Under the Liver tab, the Paint tool was used to manually label liver and non-liver regions with different colors. After the painting process, the Initialize option under the Grow from Seeds tab was selected. Any segmentation errors were manually corrected. Finally, liver volume was automatically calculated using the Segment Statistics module.

2.3.3. Deep Learning-Based Segmentation with the RVX Liver Module

After importing the CT images into 3D Slicer, the RVX Liver Segmentation module was used for liver segmentation. In the Data tab, threshold settings were applied as a preprocessing step. Then, under the Liver tab, the Segment CT command within the module was executed. To define segmentation boundaries, the Volume Rendering Region of Interest (ROI) option in the ROI sub-tab was selected and visualized using the Toggle ROI Visibility command. The liver region was manually adjusted using visual bounding boxes to include only the target organ. The segmentation process was finalized using the Apply command, and the liver volume was automatically calculated through the Quantification and Segment Statistics modules, located under the Modules menu. In both the RVX Semi-Automatic and RVX Deep Learning methods, when segmentation errors occurred (Figure 1), excess regions outside the liver boundaries were removed using the Erase tool, and missing parenchymal areas were added with the Paint tool. These corrections were limited to distinct errors; minor contour refinements were not performed.

2.3.4. Automatic Segmentation with the TotalSegmentator Module

After importing the CT images into 3D Slicer, the TotalSegmentator module [16] was selected. Segmentation was initiated using the Fast option, and the Apply command was executed to automatically separate the relevant anatomical structures, including the liver, into their respective segments. Three-dimensional visualization of the segmentation was provided using the Show 3D option. Once the process was completed, liver volume was automatically calculated using the Quantification and Segment Statistics modules. Examples of the four segmentation methods applied to the same axial CT slice are presented comparatively in Figure 2.

2.4. Segmentation Agreement Analysis

After completing all segmentation procedures, the liver masks obtained from each method were exported in nii.gz format. The Dice Similarity Coefficient (DICE) and Hausdorff Distance (HD) values were calculated using the SliverRTextension available in 3D Slicer. Within the software, the Segment Comparison module under the Radiotherapy section was accessed, and manual segmentation was defined as the reference segment. The comparison segments generated by each method (RVX Semi-Automatic, RVX Deep Learning, and TotalSegmentator) were uploaded individually. The options Compute Hausdorff Distance and Compute DICE Metrics were enabled, and the measurements were computed automatically. The resulting DICE and HD values were used to quantitatively evaluate the spatial agreement of each method with manual segmentation.

2.5. Statistical Analysis

Statistical analyses were performed using IBM SPSS Statistics software (version 28.0; IBM Corp., Armonk, NY, USA). Intra- and inter-observer reliability for the manual and semi-automatic segmentation methods was assessed using Intraclass Correlation Coefficients (ICCs) based on a two-way random-effects model with absolute agreement. For inter-observer analysis, segmentations were independently performed by two observers experienced in abdominal imaging and blinded to each other’s results. For intra-observer analysis, one observer repeated the segmentation after a minimum interval of two weeks. ICC values were interpreted as follows: <0.5 (poor), 0.5–0.75 (moderate), 0.75–0.9 (good), and >0.9 (excellent reliability). Data normality was assessed using the Kolmogorov–Smirnov test. As the data were normally distributed, differences in liver volumes among the four segmentation methods were evaluated using repeated-measures ANOVA with Bonferroni correction for multiple comparisons. Descriptive statistics were presented as mean ± standard deviation (SD). Each segmentation method was compared with manual segmentation. Absolute volume error was calculated as Vmanual − Vautomatic, and relative volume error as (Vmanual − Vautomatic)/Vmanual × 100 [17]. Absolute and relative volume differences were illustrated using boxplots showing the median and interquartile range. A p-value < 0.05 was considered statistically significant. Bland–Altman analysis was performed to assess volumetric agreement between manual segmentation and each alternative method by estimating the mean bias and 95% limits of agreement (bias ± 1.96 SD).

3. Results

The CT images of 41 male participants (mean age, 60.02 ± 17.32 years) and 45 female participants (mean age, 54.44 ± 18.29 years) were included in this study. There was no significant difference in mean age between the sexes (p = 0.151).
Two independent observers performed the manual and RVX Semi-Automatic segmentation procedures. Intra-observer reliability was assessed by repeating the segmentation of a randomly selected subset of 10 cases at two different time points, yielding ICC values of 0.997 and 0.996 for the two observers, respectively. Inter-observer reliability was evaluated by comparing the segmentations of the same subset performed by both observers. The results indicated excellent agreement for the manual method (ICC = 0.991) and good agreement for the semi-automatic method (ICC = 0.889). For subsequent analyses, the mean of the two observers’ volume measurements was used for these methods.
Repeated measures ANOVA was conducted to compare liver volumes obtained from the four segmentation methods. According to Mauchly’s Test of Sphericity, the assumption of equal variances was not met (p < 0.001). Therefore, the Greenhouse–Geisser correction was applied for interpretation. The analysis revealed that the segmentation method had a significant effect on liver volume measurements (F (2.348, 199.618) = 18.501, p < 0.001, η2 = 0.179).
The mean liver volumes (cm3), standard errors, and 95% confidence intervals (Lower–Upper Bound) for the different segmentation methods are presented in Table 1. Similar mean volume values were obtained among the methods, with the highest average liver volume observed using the RVX Deep Learning method.
The pairwise comparison of liver volumes obtained using the four segmentation methods is presented in Table 2. After applying the Bonferroni correction, liver volume measured by the RVX Deep Learning method was found to be significantly higher than that obtained with the manual, RVX Semi-Automatic, and TotalSegmentator methods (p < 0.05). No significant differences were observed among the other method pairs (p > 0.05).
The volumetric agreement between manual segmentation (reference method) and the RVX Semi-Automatic, RVX Deep Learning, and TotalSegmentator methods was evaluated using Bland–Altman analyses, where the differences were expressed as New Method—Manual Method. The mean difference for the RVX Semi-Automatic method was 8.63 cm3 (95% CI: −141.30 to 158.56 cm3). For the RVX Deep Learning method, the mean difference was 45.83 cm3 (95% CI: −72.40 to 164.86 cm3), indicating a systematic tendency to overestimate liver volume compared with manual segmentation. The TotalSegmentator method showed a mean difference of 14.37 cm3 (95% CI: −84.76 to 113.51 cm3). Measurements for all methods were mainly within the 95% limits of agreement; however, the lowest volumetric bias was observed in the RVX Semi-Automatic method. These findings suggest that the RVX Deep Learning method tends to yield higher liver volume estimates, whereas the TotalSegmentator and Semi-Automatic methods produce results closer to manual measurements (Figure 3).
Boxplots illustrating the absolute (Figure 4A) and relative (Figure 4B) liver volume differences between manual segmentation and the other three methods are presented (Figure 4).
The DICE values representing spatial overlap between each segmentation method and manual measurements were 0.911 ± 0.032 for the RVX Semi-Automatic method, 0.946 ± 0.018 for the RVX Deep Learning method, and 0.938 ± 0.021 for the TotalSegmentator method. In addition, the HD values, which evaluate boundary deviation, were 22.74 ± 6.24 mm, 40.26 ± 10.11 mm, and 31.33 ± 9.78 mm, respectively.
When the average segmentation times were evaluated across all methods, the shortest algorithm execution time was observed for the RVX Deep Learning method (0.64 ± 0.09 min). This was followed by TotalSegmentator (2.04 ± 0.47 min), RVX Semi-Automatic (10.42 ± 1.72 min), and manual segmentation (40.42 ± 5.86 min). In segmentations performed using the RVX Deep Learning method, over-segmentation of adjacent anatomical structures was identified in 8 of 86 cases (9.3%). These instances were manually corrected. In the cases requiring correction, the additional processing time was calculated as 5.2 ± 1.6 min. This time was not included in the automatic algorithm execution time. No manual correction was required in the remaining cases.

4. Discussion

This study compared manual, semi-automatic, and automatic liver segmentation methods for volume measurement with respect to volumetric agreement, processing time, reproducibility, and potential clinical applicability. Among the evaluated approaches, the RVX semi-automatic method demonstrated the closest agreement with manual segmentation, resulting in minimal volumetric deviation. TotalSegmentator yielded comparable liver volume measurements while substantially reducing processing time, supporting its potential use in routine clinical workflows. Although the RVX Deep Learning method provided the fastest segmentation, it consistently produced higher liver volume estimates than manual measurements, likely reflecting broader boundary delineation by the algorithm.
Several studies have demonstrated that automatic and semi-automatic segmentation methods provide advantages in both agreement and processing efficiency [18,19]. In benchmark datasets, semi-automatic segmentation approaches incorporating user interaction have shown lower variability and fewer outlier errors than automatic systems [20]. High agreement between automatic and manual liver volumetry has been reported in living donor liver transplantation (ICC = 0.994) [21], and a volumetric overlap error of 4.4% for semi-automatic segmentation has been considered acceptable within previously reported ranges [22]. These findings are consistent with the present results and support the reliability of semi-automatic segmentation as an alternative to manual volumetry.
Artificial intelligence-based segmentation algorithms have rapidly evolved, enabling fast and largely user-independent workflows; however, deep learning-based systems may exhibit limited generalizability due to overfitting and dataset variability, underscoring the need for broader validation across diverse imaging conditions [17,23,24,25]. In the present study, although the RVX Deep Learning method achieved the shortest processing time, it consistently produced higher liver volume estimates than manual segmentation, a tendency also reported for deep learning models trained on restricted datasets [17,23]. Algorithm performance is known to vary with anatomical complexity and contrast characteristics; for example, vascular segmentation studies have reported DICE values ranging from 0.75 to 0.83 [26,27]. Although the RVX Deep Learning model achieved a higher overlap coefficient (DICE = 0.94) for liver parenchyma in our cohort, this finding appears to reflect boundary overestimation rather than superior segmentation agreement. The corresponding increase in Hausdorff Distance values further supports the tendency toward boundary extension, leading to higher volumetric estimates.
Fully automatic tools such as TotalSegmentator provide reproducible volumetric measurements without user intervention [28]. In the present study, liver volume estimates obtained using TotalSegmentator did not differ significantly from manual measurements, while offering substantially shorter processing times and minimal operator dependency, supporting its suitability for routine clinical workflows. Automatic segmentation also facilitates surgical planning, treatment monitoring, and longitudinal assessment, providing greater precision than visual inspection or linear measurements in detecting conditions such as organomegaly [5,29].
Reported limits of agreement between CT-based semi-automatic and manual liver segmentation methods vary widely in the literature, ranging from −230 to 327 mL [30], −211 to 278 mL [21], and −503 to 509 mL [31]. Karlo et al. reported limits of −190 to 178 mL in liver resection specimens [32], while Nakayama et al. noted that automatic segmentation may produce larger volumetric errors in damaged livers [30]. Conversely, a recent study reported a mean volume difference of 3 mL with limits of agreement of −117 to 124 mL between semi-automatic and manual segmentation [22]. In the present study, mean differences were approximately 9 cm3 for RVX Semi-Automatic (−141 to 159 cm3), 14 cm3 for TotalSegmentator (−85 to 113 cm3), and 46 cm3 for RVX Deep Learning (−72 to 164 cm3), all within or narrower than previously reported ranges. Among these methods, RVX Semi-Automatic showed the slightest deviation from manual segmentation, whereas TotalSegmentator demonstrated the narrowest limits of agreement and a clear advantage in processing time.
Previous studies have reported substantial variability in liver segmentation time, influenced by factors such as software, hardware, and user experience [2,5,22,31]. In the present study, the RVX Semi-Automatic method reduced measurement time to approximately one-fourth of that required for manual segmentation. At the same time, TotalSegmentator achieved comparable volumetric agreement with an average processing time of about two minutes. Although the RVX Deep Learning method produced results in the shortest time, its volumetric discrepancies relative to manual segmentation warrant cautious interpretation.
This study has several limitations. Although manual segmentation was used as a clinical reference standard, it remains observer-dependent and does not represent an absolute gold standard, as no external ground truth (such as surgical specimen measurements or phantom validation) was available. Therefore, the present study evaluates inter-method agreement rather than absolute volumetric agreement. The single-center design and the use of CT images acquired with uniform acquisition parameters limited the assessment of scanner- or protocol-related effects. In addition, although major vessels were excluded during segmentation, the influence of intrahepatic vessels on volumetric measurements could not be fully controlled, potentially leading to minor variability in measurements. Liver segmentation was intentionally performed using the total liver volume, without subdivision by Couinaud classification or stratification by sex, to maintain a focused comparison of volumetric agreement and processing efficiency across segmentation techniques.

5. Conclusions

In conclusion, this study demonstrates that liver segmentation methods differ in their balance between volumetric agreement and processing efficiency. Semi-automatic segmentation achieved the closest agreement with manual measurements, whereas automatic segmentation approaches—particularly TotalSegmentator—provided comparable agreement with substantial gains in processing time. Deep learning-based segmentation, while fastest, tended to overestimate liver volume, underscoring the importance of careful method selection in clinical practice.

Author Contributions

Conceptualization, B.D.; methodology, B.D., S.B.S., S.S., M.N.O.S., O.D. and Z.O.; validation, B.D., S.B.S., S.S., M.N.O.S., O.D. and Z.O.; formal analysis, B.D. and S.B.S.; investigation, B.D., S.B.S., S.S. and M.N.O.S.; resources, O.D. and Z.O.; data curation, B.D., S.B.S., S.S. and M.N.O.S.; writing—original draft preparation, B.D. and S.B.S.; writing—review and editing, B.D., S.B.S., S.S. and M.N.O.S.; visualization, B.D. and S.B.S.; supervision, B.D.; project administration, B.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

This retrospective study was conducted in accordance with the principles of the Declaration of Helsinki. Ethical approval was obtained from the Tokat Gaziosmanpasa University Non-Interventional Clinical Research Ethics Committee (Approval Date: 22 April 2025, Approval No: E-15235480-050.04-564671).

Informed Consent Statement

Patient consent was waived due to the retrospective nature of the study and the use of anonymized imaging data.

Data Availability Statement

The data that support the findings of this study are not openly available due to ethical restrictions and are available from the corresponding author upon reasonable request.

Acknowledgments

The authors thank Niyazi Acer for his valuable guidance regarding the use of the 3D Slicer software.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
CTComputed tomography
ICCIntraclass correlation coefficient
ANOVAAnalysis of variance
DICEDice similarity coefficient
HDHausdorff distance
SDStandard deviation
ROIRegion of interest
RVXRobust vascular network extraction and understanding within hepatic biomedical images
DICOMDigital imaging and communications in medicine
PACSPicture archiving and communication system

References

  1. Ferrero, A.; Vigano, L.; Polastri, R.; Muratore, A.; Eminefendic, H.; Regge, D.; Capussotti, L. Postoperative liver dysfunction and future remnant liver: Where is the limit? Results of a prospective study. World J. Surg. 2007, 31, 1643–1651. [Google Scholar] [CrossRef]
  2. Gotra, A.; Chartrand, G.; Vu, K.N.; Vandenbroucke-Menu, F.; Massicotte-Tisluck, K.; de Guise, J.A.; Tang, A. Comparison of MRI- and CT-based semiautomated liver segmentation: A validation study. Abdom. Radiol. 2017, 42, 478–489. [Google Scholar] [CrossRef] [PubMed]
  3. Gloger, O.; Kuhn, J.; Stanski, A.; Völzke, H.; Puls, R. A fully automatic three-step liver segmentation method on LDA-based probability maps for multiple contrast MR images. Magn. Reson. Imaging 2010, 28, 882–897. [Google Scholar] [CrossRef] [PubMed]
  4. Joiner, B.J.; Simpson, A.L.; Leal, J.N.; D’angelica, M.I.; Do, R.K.G. Assessing splenic enlargement on CT by unidimensional measurement changes in patients with colorectal liver metastases. Abdom. Imaging 2015, 40, 2338–2344. [Google Scholar] [CrossRef] [PubMed]
  5. Pattanayak, P.; Turkbey, E.B.; Summers, R.M. Comparative evaluation of three software packages for liver and spleen segmentation and volumetry. Acad. Radiol. 2017, 24, 831–839. [Google Scholar] [CrossRef]
  6. Planz, V.B.; Lubner, M.G.; Pickhardt, P.J. Volumetric analysis at abdominal CT: Oncologic and non-oncologic applications. Br. J. Radiol. 2019, 92, 20180631. [Google Scholar] [CrossRef]
  7. Aljabri, M.; AlAmir, M.; AlGhamdi, M.; Abdel-Mottaleb, M.; Collado-Mesa, F. Towards a better understanding of annotation tools for medical imaging: A survey. Multimed. Tools Appl. 2022, 81, 25877–25911. [Google Scholar] [CrossRef]
  8. Kothari, G.; Woon, B.; Patrick, C.J.; Korte, J.; Wee, L.; Hanna, G.G.; Kron, T.; Hardcastle, N.; Siva, S. The impact of inter-observer variation in delineation on robustness of radiomics features in non-small cell lung cancer. Sci. Rep. 2022, 12, 12822. [Google Scholar] [CrossRef]
  9. Lenchik, L.; Heacock, L.; Weaver, A.A.; Boutin, R.D.; Cook, T.S.; Itri, J.; Filippi, C.G.; Gullapalli, R.P.; Lee, J.; Zagurovskaya, M.; et al. Automated segmentation of tissues using CT and MRI: A systematic review. Acad. Radiol. 2019, 26, 1695–1706. [Google Scholar] [CrossRef]
  10. Farooq, S.; Khan, Z. A survey of computer aided diagnosis (Cad) of liver in medical diagnosis. VAWKUM Trans. Comput. Sci. 2020, 8, 23–29. [Google Scholar] [CrossRef]
  11. Kalshabay, Y.; Zholdybay, Z.; Di Martino, M.; Medeubekov, U.; Baiguissova, D.; Ainakulova, A.; Doskhanov, M.; Baimakhanov, B. CT volume analysis in living donor liver transplantation: Accuracy of three different approaches. Insights Imaging 2023, 14, 82. [Google Scholar] [CrossRef]
  12. Khan, R.A.; Luo, Y.; Wu, F.X. Machine learning based liver disease diagnosis: A systematic review. Neurocomputing 2022, 468, 492–509. [Google Scholar] [CrossRef]
  13. Nayantara, P.V.; Kamath, S.; Manjunath, K.N.; Rajagopal, K. Computer-aided diagnosis of liver lesions using CT images: A systematic review. Comput. Biol. Med. 2020, 127, 104035. [Google Scholar] [CrossRef] [PubMed]
  14. Wang, S.; Liu, X.; Zhao, J.; Liu, Y.; Liu, S.; Liu, Y.; Zhao, J. Computer auxiliary diagnosis technique of detecting cholangiocarcinoma based on medical imaging: A review. Comput. Methods Programs Biomed. 2021, 208, 106265. [Google Scholar] [CrossRef] [PubMed]
  15. Affane, A.; Chetoui, M.A.; Lamy, J.; Lienemann, G.; Peron, R.; Beaurepaire, P.; Dollé, G.; Lebre, M.-A.; Magnin, B.; Merveille, O.; et al. The R-Vessel-X Project. IRBM 2025, 46, 100876. [Google Scholar] [CrossRef]
  16. Wasserthal, J.; Breit, H.C.; Meyer, M.T.; Pardella, M.; Hinck, D.; Sauter, A.W.; Heye, T.; Boll, D.T.; Cyriac, J.; Yang, S.; et al. TotalSegmentator: Robust segmentation of 104 anatomic structures in CT images. Radiol. Artif. Intell. 2023, 5, e230024. [Google Scholar] [CrossRef]
  17. Gross, M.; Huber, S.; Arora, S.; Pradella, M.; Hinck, D.; Sauter, A.W.; Heye, T.; Boll, D.T.; Cyriac, J.; Yang, S.; et al. Automated MRI liver segmentation for anatomical segmentation, liver volumetry, and extraction of radiomics. Eur. Radiol. 2024, 34, 5056–5065. [Google Scholar] [CrossRef]
  18. Dockès, J.; Varoquaux, G.; Poline, J.-B. Preventing dataset shift from breaking machine-learning biomarkers. GigaScience 2021, 10, giab055. [Google Scholar] [CrossRef]
  19. Lemke, A.J.; Brinkmann, M.J.; Schott, T.; Niehues, S.M.; Settmacher, U.; Neuhaus, P.; Felix, R. Living donor right liver lobes: Preoperative CT volumetric measurement for calculation of intraoperative weight and volume. Radiology 2006, 240, 736–742. [Google Scholar] [CrossRef]
  20. Heimann, T.; van Ginneken, B.; Styner, M.A.; Arzhaeva, Y.; Aurich, V.; Bauer, C.; Beck, A.; Becker, C.; Beichel, R.; Bekes, G.; et al. Comparison and evaluation of methods for liver segmentation from CT datasets. IEEE Trans. Med. Imaging 2009, 28, 1251–1265. [Google Scholar] [CrossRef]
  21. Suzuki, K.; Epstein, M.L.; Kohlbrenner, R.; Garg, S.; Hori, M.; Oto, A.; Baron, R.L. Quantitative radiology: Automated CT liver volumetry compared with interactive volumetry and manual volumetry. Am. J. Roentgenol. 2011, 197, W706–W712. [Google Scholar] [CrossRef]
  22. Gotra, A.; Chartrand, G.; Massicotte-Tisluck, K.; Morin-Roy, F.; Vandenbroucke-Menu, F.; de Guise, J.A.; Tang, A. Validation of a semiautomated liver segmentation method using CT for accurate volumetry. Acad. Radiol. 2015, 22, 1088–1098. [Google Scholar] [CrossRef] [PubMed]
  23. Gross, M.; Spektor, M.; Jaffe, A.; Kucukkaya, A.S.; Iseke, S.; Haider, S.P.; Strazzabosco, M.; Chapiro, J.; Onofrey, J.A. Improved performance and consistency of deep learning 3D liver segmentation with heterogeneous cancer stages in magnetic resonance imaging. PLoS ONE 2021, 16, e0260630. [Google Scholar] [CrossRef] [PubMed]
  24. Kavur, A.E.; Gezer, N.S.; Barış, M.; Sahin, Y.; Ozkan, S.; Baydar, B.; Yuksel, U.; Kilikcier, C.; Olut, S.; Akar, G.B.; et al. Comparison of semi-automatic and deep learning-based automatic methods for liver segmentation in living liver transplant donors. Diagn. Interv. Radiol. 2020, 26, 11–21. [Google Scholar] [CrossRef]
  25. Yu, A.C.; Mohajer, B.; Eng, J. External validation of deep learning algorithms for radiologic diagnosis: A systematic review. Radiol. Artif. Intell. 2022, 4, e210064. [Google Scholar] [CrossRef] [PubMed]
  26. Huang, Q.; Sun, J.; Ding, H.; Wang, X.; Wang, G. Robust liver vessel extraction using 3D U-Net with variant Dice loss function. Comput. Biol. Med. 2018, 101, 153–162. [Google Scholar] [CrossRef]
  27. Ibragimov, B.; Toesca, D.; Chang, D.; Koong, A.; Xing, L. Combining deep learning with anatomical analysis for segmentation of the portal vein for liver SBRT planning. Phys. Med. Biol. 2017, 62, 8943–8958. [Google Scholar] [CrossRef]
  28. Núñez, L.; Ferreira, C.; Mojtahed, A.; Lamb, H.; Cappio, S.; Husainy, M.A.; Dennis, A.; Pansini, M. Assessing the performance of AI-assisted technicians in liver segmentation, Couinaud division, and lesion detection: A pilot study. Abdom. Radiol. 2024, 49, 4264–4272. [Google Scholar] [CrossRef]
  29. Pettit, R.W.; Marlatt, B.B.; Corr, S.J.; Havelka, J.; Rana, A. nnU-Net deep learning method for segmenting parenchyma and determining liver volume from computed tomography images. Ann. Surg. Open 2022, 3, e0155. [Google Scholar] [CrossRef]
  30. Nakayama, Y.; Li, Q.; Katsuragawa, S.; Ikeda, R.; Hiai, Y.; Awai, K.; Kusunoki, S.; Yamashita, Y.; Okajima, H.; Inomata, Y.; et al. Automated hepatic volumetry for living related liver transplantation at multisection CT. Radiology 2006, 240, 743–748. [Google Scholar] [CrossRef]
  31. Luciani, A.; Rusko, L.; Baranes, L.; Pichon, E.; Loze, B.; Deux, J.-F.; Laurent, A.; Tran-Van-Nhieu, J.; Rahmouni, A. Automated liver volumetry in orthotopic liver transplantation using multiphase acquisitions on MDCT. Am. J. Roentgenol. 2012, 198, W568–W574. [Google Scholar] [CrossRef]
  32. Karlo, C.; Reiner, C.S.; Stolzmann, P.; Breitenstein, S.; Marincek, B.; Weishaupt, D.; Frauenfelder, T. CT- and MRI-based volumetry of resected liver specimens: Comparison with intraoperative volume and weight measurements and calculation of conversion factors. Eur. J. Radiol. 2010, 75, e107–e111. [Google Scholar] [CrossRef]
Figure 1. (A,B) Examples of erroneous segmentations obtained using the RVX Semi-Automatic (A) and RVX Deep Learning (B) methods. Images were taken from different cases and at different slice levels. Green areas represent the liver masks generated by the segmentation results.
Figure 1. (A,B) Examples of erroneous segmentations obtained using the RVX Semi-Automatic (A) and RVX Deep Learning (B) methods. Images were taken from different cases and at different slice levels. Green areas represent the liver masks generated by the segmentation results.
Diagnostics 16 00817 g001
Figure 2. Liver segmentation on the same CT slice using four different methods. (A) Manual segmentation performed using the Segment Editor module in 3D Slicer software. (B) Semi-automatic segmentation using the “Grow from Seeds” function within the RVX Liver Segmentation. (C) Segmentation using the RVX Liver Segmentation by deep learning. (D) Automatic segmentation performed using the TotalSegmentator module. Each method illustrates the delineation of liver boundaries on the same axial CT slice for visual comparison.
Figure 2. Liver segmentation on the same CT slice using four different methods. (A) Manual segmentation performed using the Segment Editor module in 3D Slicer software. (B) Semi-automatic segmentation using the “Grow from Seeds” function within the RVX Liver Segmentation. (C) Segmentation using the RVX Liver Segmentation by deep learning. (D) Automatic segmentation performed using the TotalSegmentator module. Each method illustrates the delineation of liver boundaries on the same axial CT slice for visual comparison.
Diagnostics 16 00817 g002
Figure 3. Bland–Altman plots showing the volume differences (New Method—Manual Method) between manual liver segmentation and each of the three methods —TotalSegmentator, RVX Deep Learning, and RVX Semi-Automatic—plotted against their mean volume. The solid line represents the mean bias, and the dashed lines indicate the 95% limits of agreement (±1.96 SD).
Figure 3. Bland–Altman plots showing the volume differences (New Method—Manual Method) between manual liver segmentation and each of the three methods —TotalSegmentator, RVX Deep Learning, and RVX Semi-Automatic—plotted against their mean volume. The solid line represents the mean bias, and the dashed lines indicate the 95% limits of agreement (±1.96 SD).
Diagnostics 16 00817 g003
Figure 4. Comparison of absolute (A) and relative (B) liver volume differences between manual segmentation and the three other segmentation methods using boxplots. The central line represents the median, and the boxes indicate the interquartile range (25th−75th percentiles).
Figure 4. Comparison of absolute (A) and relative (B) liver volume differences between manual segmentation and the three other segmentation methods using boxplots. The central line represents the median, and the boxes indicate the interquartile range (25th−75th percentiles).
Diagnostics 16 00817 g004
Table 1. Liver volume measurements obtained using different segmentation methods.
Table 1. Liver volume measurements obtained using different segmentation methods.
Estimates
Measure:Mean (cm3)Std. Error95% Confidence Interval
Lower
Bound
Upper
Bound
Manual1503.938.3821427.61580.2
RVX Semi-Automatic1512.640.2351432.51592.5
RVX Deep Learning1549.839.6291470.91628.5
TotalSegmentator1518.339.3901439.91596.6
Table 2. Pairwise comparisons of liver volume measurement differences among different segmentation methods.
Table 2. Pairwise comparisons of liver volume measurement differences among different segmentation methods.
Pairwise Comparisons
Measure:Mean
Difference
(I–J)
Std. ErrorSig. b95% Confidence Interval for Difference b
Lower BoundUpper Bound
Manual
RVX Semi-Automatic8.638.2471.000−30.91613.644
RVX Deep Learning45.83 *6.505<0.001−63.402−28.258
TotalSegmentator14.375.4540.060−29.1060.363
RVX Semi-Automatic
RVX Deep Learning37.19 *7.654<0.001−57.872−16.516
TotalSegmentator5.736.2101.000−22.51111.041
RVX Deep Learning
TotalSegmentator−31.45 *4.639<0.00118.92743.991
Based on estimated marginal means. *. The mean difference is significant at the 0.05 level. b. Adjustment for multiple comparisons: Bonferroni.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dogan, B.; Simsek, S.B.; Sonmez, S.; Ozgen Sonmez, M.N.; Dasci, O.; Ozmen, Z. Comparison of Manual, Semi-Automatic, and Automatic CT-Based Methods for Liver Volume Segmentation. Diagnostics 2026, 16, 817. https://doi.org/10.3390/diagnostics16050817

AMA Style

Dogan B, Simsek SB, Sonmez S, Ozgen Sonmez MN, Dasci O, Ozmen Z. Comparison of Manual, Semi-Automatic, and Automatic CT-Based Methods for Liver Volume Segmentation. Diagnostics. 2026; 16(5):817. https://doi.org/10.3390/diagnostics16050817

Chicago/Turabian Style

Dogan, Berna, Sadik Bugrahan Simsek, Sefa Sonmez, Merve Nur Ozgen Sonmez, Omur Dasci, and Zafer Ozmen. 2026. "Comparison of Manual, Semi-Automatic, and Automatic CT-Based Methods for Liver Volume Segmentation" Diagnostics 16, no. 5: 817. https://doi.org/10.3390/diagnostics16050817

APA Style

Dogan, B., Simsek, S. B., Sonmez, S., Ozgen Sonmez, M. N., Dasci, O., & Ozmen, Z. (2026). Comparison of Manual, Semi-Automatic, and Automatic CT-Based Methods for Liver Volume Segmentation. Diagnostics, 16(5), 817. https://doi.org/10.3390/diagnostics16050817

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop