MDPI - Publisher of Open Access Journals

16 pages, 23623 KB

Open AccessArticle

Deep Learning-Based Blood Segmentation and Temporal Characterization for the Robin Heart Surgical Robot

by Klaudia Senator, Dariusz Krawczyk and Zbigniew Nawrat

Surgeries 2026, 7(2), 70; https://doi.org/10.3390/surgeries7020070 - 15 Jun 2026

Viewed by 439

Background/Objectives: In laparoscopic and robot-assisted surgery, bleeding may rapidly impair operative-field readability and procedural safety. In the broader Robin Heart teleoperation framework, interpretation of such events is relevant not only for scene understanding but also as a potential prerequisite for future safety-oriented [...] Read more.

Background/Objectives: In laparoscopic and robot-assisted surgery, bleeding may rapidly impair operative-field readability and procedural safety. In the broader Robin Heart teleoperation framework, interpretation of such events is relevant not only for scene understanding but also as a potential prerequisite for future safety-oriented supervisory functions under communication-degraded conditions. The aim of this study was to assess whether a deep learning model for blood segmentation could provide outputs suitable for preliminary image-level temporal characterization of visible blood-region behavior in laparoscopic video. Methods: A U-Net-based binary blood-segmentation model was implemented in-house in PyTorch and evaluated on three paired image–mask datasets: a simulated bleeding dataset prepared under controlled laboratory conditions, an internal operative laparoscopic dataset, and an external-domain subset derived from the public GynSurg dataset. Segmentation performance was assessed using 5-fold cross-validation and reported using the Dice coefficient and Intersection over Union (IoU). Training dynamics were analyzed using training and validation loss and Dice curves. Additional baseline comparisons were performed on the internal operative dataset using U-Net++ and DeepLabV3+. Temporal analysis was performed on selected video fragments, including a low-motion reference sequence without active bleeding progression, internal bleeding-related sequences, and external-domain sequences, using mask-derived descriptors and auxiliary optical-flow-based motion descriptors computed after camera-motion compensation within the detected blood-related ROI. Results: In 5-fold cross-validation, the U-Net-based model achieved Dice coefficient and IoU values of 0.915 ± 0.012 and 0.851 ± 0.019 on the simulated dataset, 0.856 ± 0.013 and 0.756 ± 0.025 on the internal operative dataset, and 0.707 ± 0.053 and 0.570 ± 0.056 on the external-domain GynSurg subset, respectively. On the internal operative dataset, the proposed model performed comparably to U-Net++ and slightly above DeepLabV3+ under the same cross-validation protocol. The temporal descriptor set differentiated low-motion reference behavior, more spatially coherent progression, rapid coherent expansion, and dynamic or motion-active progression profiles. Peak dA/dt reflected abrupt visible blood-area expansion, temporal IoU described mask stability over time, and optical-flow-based descriptors provided additional information on local motion activity within the detected blood-related ROI. Conclusions: The results support the feasibility of combining deep-learning-based blood segmentation with temporal and optical-flow-based descriptors for exploratory image-level characterization of visible blood-region behavior in laparoscopic video. Within the Robin Heart development pathway, such descriptors may, in the future, serve as candidate components of image-analysis support modules for safety-oriented teleoperative scenarios. At this stage, they should be interpreted as exploratory image-derived indicators rather than clinically validated markers of bleeding severity. Full article

(This article belongs to the Special Issue The Application of Artificial Intelligence in Surgical Procedures)

► Show Figures

Figure 1

25 pages, 12181 KB

Open AccessArticle

Neural Minimum-Distance Estimation for Collision-Aware Operation of Multi-Arm Laparoscopy Surgical Robots Through Learning-from-Simulation

by Sarvin Ghiasi, Majid Roshanfar, Jake Barralet, Liane S. Feldman and Amir Hooshiar

Sensors 2026, 26(12), 3744; https://doi.org/10.3390/s26123744 - 12 Jun 2026

Viewed by 314

Abstract

This study presents an integrated framework for enhancing the safety and operational efficiency of robotic arms in laparoscopic surgery by addressing minimum distance estimation between multi-arm manipulators and the associated collision-aware warning. By combining analytical modeling, real time simulation, and machine learning, the [...] Read more.

This study presents an integrated framework for enhancing the safety and operational efficiency of robotic arms in laparoscopic surgery by addressing minimum distance estimation between multi-arm manipulators and the associated collision-aware warning. By combining analytical modeling, real time simulation, and machine learning, the framework offers a robust solution for ensuring safe robotic operations. An analytical model was developed to estimate the minimum distances between robotic arms based on their joint configurations, offering theoretical calculations that serve as both a validation tool and a benchmark. To complement this, a 3D simulation environment was created to model two 7 DOF Kinova robotic arms (Kinova Inc., Boisbriand, QC, Canada), generating a diverse dataset of configurations for distance estimation and collision warning. Using these insights, a deep residual neural network model was trained with joint configurations as inputs. On the held out validation set, the model achieves

R^{2} = 0.940

, RMSE

= 42.0

mm, MAE

= 28.7

mm, and a near zero mean bias, demonstrating strong predictive accuracy and consistent generalization across the workspace. The framework is intended as an early collision warning layer, where a warning is triggered when the predicted inter-arm distance falls below a 0.2 m threshold, which corresponds to a surface to surface clearance of approximately 50 mm given the Kinova Gen3 (Kinova Inc., Boisbriand, QC, Canada) cross sectional radius. This work demonstrates the effectiveness of combining analytical modeling with machine learning to enhance the precision and reliability of multi-arm robotic systems. Full article

(This article belongs to the Special Issue Novel Actuation, Sensing, and Intelligent Control in Medical Robotic Systems)

► Show Figures

Figure 1

16 pages, 2305 KB

Open AccessArticle

Surgical Phase Recognition in Laparoscopic Cholecystectomy Using Artificial Intelligence

by Stefanos P. Raptis, Charalampos Theocharopoulos, Achilleas Theocharopoulos, Georgios Levantis, Dimitrios N. Varvoglis, Dimitrios C. Ziogas, Nikolaos Machairas, Georgios C. Sotiropoulos, Elissaios Kontis and Aristidis G. Vrahatis

Gastroenterol. Insights 2026, 17(2), 35; https://doi.org/10.3390/gastroent17020035 - 2 Jun 2026

Viewed by 439

Abstract

Background/Objectives: The global adoption of minimally invasive surgery has generated extensive video repositories, creating new opportunities for data-driven surgical education and quality assessment. Automated surgical phase recognition enables objective trainee evaluation, standardized competency assessment, and systematic procedural documentation. However, class imbalance in surgical [...] Read more.

Background/Objectives: The global adoption of minimally invasive surgery has generated extensive video repositories, creating new opportunities for data-driven surgical education and quality assessment. Automated surgical phase recognition enables objective trainee evaluation, standardized competency assessment, and systematic procedural documentation. However, class imbalance in surgical workflows, where certain phases comprise 30–35% of frames while others represent only 5–10%, remains a significant challenge. This imbalance causes models to underperform on underrepresented yet clinically important phases. Methods: A retrospective analysis of laparoscopic cholecystectomy videos is performed with the implementation of a frame—based deep learning framework to develop and validate a surgical phase recognition pipeline based on ResNet-50 architecture with transfer learning. The model was designed to extract features from surgical video frames and classify them into seven distinct phases, without incorporating temporal context. We used the Cholec80 dataset and applied class balancing techniques to address inherent class imbalance. Results: The model achieved a mean balanced accuracy of 91.80% across five folds with consistent performance across all surgical phases. Per-phase F1-scores ranged from 0.89 to 0.95, demonstrating balanced classification without significant performance degradation on underrepresented phases. The confusion matrix revealed prediction errors primarily among adjacent or visually similar phases, reflecting the inherent ambiguity of surgical phase transitions. In practical terms, the model correctly identified the surgical phase in more than 9 out of 10 frames, enabling reliable automated segmentation of the operative workflow. Conclusions: This study demonstrates that artificial intelligence can reliably analyze surgical video data, achieving consistent and accurate phase recognition in laparoscopic cholecystectomy. Full article

(This article belongs to the Section Gastrointestinal Disease)

► Show Figures

Figure 1

30 pages, 10578 KB

Open AccessArticle

IMAU-Net: A Hybrid Multi-Scale Deep Learning Framework for Liver Segmentation from Laparoscopic Images

by Syeda Sitara Waseem, Sarang Shaikh and Syed Rizwan Hassan

Sensors 2026, 26(9), 2695; https://doi.org/10.3390/s26092695 - 27 Apr 2026

Viewed by 572

Abstract

Accurate liver segmentation in laparoscopic surgery is critical but remains challenging due to low contrast, occlusion, and irregular organ boundaries. While deep learning has advanced medical image segmentation, existing models often trade off between accuracy, computational efficiency, and boundary precision. We propose IMAU-Net, [...] Read more.

Accurate liver segmentation in laparoscopic surgery is critical but remains challenging due to low contrast, occlusion, and irregular organ boundaries. While deep learning has advanced medical image segmentation, existing models often trade off between accuracy, computational efficiency, and boundary precision. We propose IMAU-Net, a hybrid architecture integrating a pre-trained InceptionV3 encoder with a novel bottleneck combining Multi-Core Pooling (MCP) and enhanced Atrous Spatial Pyramid Pooling (ASPP). The MCP module captures fine-to-medium spatial details through parallel multi-kernel pooling, while ASPP extracts multi-scale contextual information via dilated convolutions. Evaluated on the M2CAI dataset with 5-fold cross-validation, IMAU-Net achieves a mean Dice coefficient of 0.9179 ± 0.012 and IoU of 0.8483 ± 0.015. Furthermore, external validation on the independent CholecSeg8K dataset (250 test samples) demonstrates generalizability across different laparoscopic procedures, achieving a Dice coefficient of 0.8745 ± 0.0312 and AUC of 0.9542, with a performance degradation of only 4.3% despite domain shift between liver surgery and cholecystectomy. Comparative analysis with state of the art methods demonstrates superior performance, with computational efficiency suitable for real-time applications (45 FPS, 42.3 M parameters). The proposed architecture provides an optimal balance between accuracy and efficiency for intraoperative guidance systems. While evaluated on retrospective laparoscopic image datasets rather than real-time intraoperative workflows, the model demonstrates potential for integration into surgical guidance systems pending prospective validation. Full article

(This article belongs to the Special Issue Advanced Sensing and Signal Processing Technologies for Medical Applications)

► Show Figures

Figure 1

34 pages, 921 KB

Open AccessSystematic Review

Artificial Intelligence in Gastrointestinal Surgery: A Systematic Review of Its Role in Laparoscopic and Robotic Surgery

by Ludovica Gorini, Roberto de la Plaza Llamas, Daniel Alejandro Díaz Candelas, Rodrigo Arellano González, Wenzhong Sun, Jaime García Friginal, María Fra López and Ignacio Antonio Gemio del Rey

J. Pers. Med. 2025, 15(11), 562; https://doi.org/10.3390/jpm15110562 - 19 Nov 2025

Cited by 5 | Viewed by 2940

Abstract

Background: Artificial intelligence (AI) is transforming surgical practice by enhancing training, intraoperative guidance, decision-making, and postoperative assessment. However, its specific role in laparoscopic and robotic general surgery remains to be clearly defined. The objective is to systematically review the current applications of [...] Read more.

Background: Artificial intelligence (AI) is transforming surgical practice by enhancing training, intraoperative guidance, decision-making, and postoperative assessment. However, its specific role in laparoscopic and robotic general surgery remains to be clearly defined. The objective is to systematically review the current applications of AI in laparoscopic and robotic general surgery and categorize them by function and surgical context. Methods: A systematic search of PubMed and Web of Science was conducted up to 22 June 2025, using predefined search terms. Eligible studies focused on AI applications in laparoscopic or robotic general surgery, excluding urological, gynecological, and obstetric fields. Original articles in English or Spanish were included. Data extraction was performed independently by two reviewers and synthesized descriptively by thematic categories. Results: A total of 152 original studies were included. Most were conducted in laparoscopic settings (n = 125), while 19 focused on robotic surgery and 8 involved both. The majority were technical evaluations or retrospective observational studies. Seven thematic categories were identified: surgical decision support and outcome prediction; skill assessment and training; workflow recognition and intraoperative guidance; object or structure detection; augmented reality and navigation; image enhancement; technical assistance; and surgeon perception and preparedness. Most studies applied deep learning, for classification, prediction, recognition, and real-time guidance in laparoscopic cholecystectomies, colorectal and gastric surgeries. Conclusions: AI has been widely adopted in various domains of laparoscopic and robotic general surgery. While most studies remain in early developmental stages, the evidence suggests increasing maturity and integration into clinical workflows. Standardization of evaluation and reporting frameworks will be essential to translate these innovations into widespread practice. Full article

(This article belongs to the Special Issue Update on Robotic Gastrointestinal Surgery, 2nd Edition)

► Show Figures

Graphical abstract

10 pages, 501 KB

Open AccessArticle

From Bedside to Bot-Side: Artificial Intelligence in Emergency Appendicitis Management

by Koray Ersahin, Sebastian Sanduleanu, Sithin Thulasi Seetha, Johannes Bremm, Cavid Abbasli, Chantal Zimmer, Tim Damer, Jonathan Kottlors, Lukas Goertz, Christiane Bruns, David Maintz and Nuran Abdullayev

Life 2025, 15(9), 1387; https://doi.org/10.3390/life15091387 - 1 Sep 2025

Cited by 1 | Viewed by 2121

Abstract

Introduction: Acute appendicitis (AA) is a common cause of abdominal pain that can lead to complications like perforation and intra-abdominal abscesses, increasing morbidity and mortality, often requiring emergency surgery. Nevertheless, appendectomy is performed in up to 95% of uncomplicated cases, while complications like [...] Read more.

Introduction: Acute appendicitis (AA) is a common cause of abdominal pain that can lead to complications like perforation and intra-abdominal abscesses, increasing morbidity and mortality, often requiring emergency surgery. Nevertheless, appendectomy is performed in up to 95% of uncomplicated cases, while complications like perforation and intra-abdominal abscesses increase morbidity and mortality. The current study compares the accuracy of GPT-4.5, DeepSeek R1, and machine learning in assisting with surgical decision-making for patients presenting with lower abdominal pain at the Emergency Department. Methods: In this multicenter retrospective study, 63 histopathologically confirmed appendicitis patients and 50 control patients with right abdominal pain presenting at the Emergency Department at two German hospitals between October 2022 and October 2023 were included. Using each patient’s clinical, laboratory, and radiological findings, DeepSeek (with and without Retrieval-Augmented Generation using 2020 Jerusalem guidelines) was compared in terms of accuracy with GPT-4.5 and a random forest-based machine-learning model, with a board-certified surgeon (reference standard) to determine the optimal treatment approach (laparoscopic exploration/appendectomy versus conservative antibiotic therapy). Results: Accuracy of agreement with board-certified surgeons in the decision-making of appendectomy versus conservative therapy increased non-significantly from 80.5% to 83.2% with DeepSeek and from 70.8 to 76.1% when GPT-4.5 was provided with the World Journal of Emergency Surgery 2020 Jerusalem guidelines on the diagnosis and treatment of acute appendicitis. The estimated machine-learning model training accuracy was 84.3%, while the validation accuracy for the model was 85.0%. Discussion: GPT-4.5 and DeepSeek R1, as well as the machine-learning model, demonstrate promise in aiding surgical decision-making for appendicitis, particularly in resource-constrained settings. Ongoing training and validation are required to optimize the performance of such models. Full article

(This article belongs to the Special Issue Language Models in Lab Coats: AI-Powered Biomedical Interpretation)

► Show Figures

Figure 1

11 pages, 594 KB

Open AccessReview

Applications of Deep Learning Models in Laparoscopy for Gynecology

by Fani Gkrozou, Vasileios Bais, Charikleia Skentou, Dimitrios Rafail Kalaitzopoulos, Georgios Grigoriadis, Anastasia Vatopoulou, Minas Paschopoulos and Angelos Daniilidis

Medicina 2025, 61(8), 1460; https://doi.org/10.3390/medicina61081460 - 14 Aug 2025

Cited by 2 | Viewed by 1762

Abstract

Background and Objectives: The use of Artificial Intelligence (AI) in the medical field is rapidly expanding. This review aims to explore and summarize all published research on the development and validation of deep learning (DL) models in gynecologic laparoscopic surgeries. Materials and [...] Read more.

Background and Objectives: The use of Artificial Intelligence (AI) in the medical field is rapidly expanding. This review aims to explore and summarize all published research on the development and validation of deep learning (DL) models in gynecologic laparoscopic surgeries. Materials and Methods: MEDLINE, IEEE Xplore, and Google scholar were searched for eligible studies published between January 2000 and May 2025. Selected studies developed a DL model using datasets derived from gynecologic laparoscopic procedures. The exclusion criteria included non-gynecologic datasets, non-laparoscopic datasets, non-Convolutional Neural Network (CNN) models, and non-English publications. Results: A total of 16 out of 621 studies met our inclusion criteria. The findings were categorized into four main application areas: (i) anatomy classification (n = 6), (ii) anatomy segmentation (n = 5), (iii) surgical instrument classification and segmentation (n = 5), and (iv) surgical action recognition (n = 5). Conclusions: This review emphasizes the growing role of AI in gynecologic laparoscopy, improving anatomy recognition, instrument tracking, and surgical action analysis. As datasets grow and computational capabilities advance, these technologies are poised to improve intraoperative guidance and standardize surgical training. Full article

(This article belongs to the Section Obstetrics and Gynecology)

► Show Figures

Figure 1

30 pages, 2644 KB

Open AccessReview

Artificial Intelligence and Uterine Fibroids: A Useful Combination for Diagnosis and Treatment

by Andrea Tinelli, Andrea Morciano, Radmila Sparic, Safak Hatirnaz, Lorenzo E. Malgieri, Antonio Malvasi, Antonio D’Amato, Giorgio Maria Baldini and Giovanni Pecorella

J. Clin. Med. 2025, 14(10), 3454; https://doi.org/10.3390/jcm14103454 - 15 May 2025

Cited by 9 | Viewed by 5522

Abstract

This manuscript examines the role of artificial intelligence (AI) in the diagnosis and treatment of uterine fibroids and uterine sarcomas, offering a comprehensive assessment of AI-supported diagnostic and therapeutic techniques. Through the use of radiomics, machine learning, and deep neural network models, AI [...] Read more.

This manuscript examines the role of artificial intelligence (AI) in the diagnosis and treatment of uterine fibroids and uterine sarcomas, offering a comprehensive assessment of AI-supported diagnostic and therapeutic techniques. Through the use of radiomics, machine learning, and deep neural network models, AI shows promise in identifying benign and malignant uterine lesions, directing therapeutic decisions, and improving diagnostic accuracy. It also demonstrates significant capabilities in the timely detection of fibroids. Additionally, AI improves surgical precision, real-time structure detection, and patient outcomes by transforming surgical techniques such as myomectomy, robot-assisted laparoscopic surgery, and High-Intensity Focused Ultrasound (HIFU) ablation. By helping to forecast treatment outcomes and monitor progress during procedures like uterine fibroid embolization, AI also offers a fresh and fascinating perspective for improving the clinical management of these conditions. This review critically assesses the current literature, identifies the advantages and limitations of various AI approaches, and provides future directions for research and clinical implementation. Full article

(This article belongs to the Section Obstetrics & Gynecology)

► Show Figures

Figure 1

14 pages, 3518 KB

Open AccessArticle

Object Detection in Laparoscopic Surgery: A Comparative Study of Deep Learning Models on a Custom Endometriosis Dataset

by Andrey Bondarenko, Vilen Jumutc, Antoine Netter, Fanny Duchateau, Henrique Mendonca Abrão, Saman Noorzadeh, Giuseppe Giacomello, Filippo Ferrari, Nicolas Bourdel, Ulrik Bak Kirk and Dmitrijs Bļizņuks

Diagnostics 2025, 15(10), 1254; https://doi.org/10.3390/diagnostics15101254 - 15 May 2025

Cited by 5 | Viewed by 2016

Abstract

Background: Laparoscopic surgery for endometriosis presents unique challenges due to the complexity of and variability in lesion appearances within the abdominal cavity. This study investigates the application of deep learning models for object detection in laparoscopic videos, aiming to assist surgeons in accurately [...] Read more.

Background: Laparoscopic surgery for endometriosis presents unique challenges due to the complexity of and variability in lesion appearances within the abdominal cavity. This study investigates the application of deep learning models for object detection in laparoscopic videos, aiming to assist surgeons in accurately identifying and localizing endometriosis lesions and related anatomical structures. A custom dataset was curated, comprising of 199 video sequences and 205,725 frames. Of these, 17,560 frames were meticulously annotated by medical professionals. The dataset includes object detection annotations for 10 object classes relevant to endometriosis, alongside segmentation masks for some classes. Methods: To address the object detection task, we evaluated the performance of two deep learning models—FasterRCNN and YOLOv9—under both stratified and non-stratified training scenarios. Results: The experimental results demonstrated that stratified training significantly reduced the risk of data leakage and improved model generalization. The best-performing FasterRCNN object detection model achieved a high average test precision of 0.9811 ± 0.0084, recall of 0.7083 ± 0.0807, and mAP50 (mean average precision at 50% overlap) of 0.8185 ± 0.0562 across all presented classes. Despite these successes, the study also highlights the challenges posed by the weak annotations and class imbalances in the dataset, which impacted overall model performances. Conclusions: In conclusion, this study provides valuable insights into the application of deep learning for enhancing laparoscopic surgical precision in endometriosis treatment. The findings underscore the importance of robust dataset curation and advanced training strategies in developing reliable AI-assisted tools for surgical interventions. The latter could potentially improve the guidance of surgical interventions and prevent blind spots occurring in difficult to reach abdominal regions. Future work will focus on refining the dataset and exploring more sophisticated model architectures to further improve detection accuracy. Full article

(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)

► Show Figures

Figure 1

15 pages, 14023 KB

Open AccessArticle

Using Masked Image Modelling Transformer Architecture for Laparoscopic Surgical Tool Classification and Localization

by Hisham ElMoaqet, Rami Janini, Mutaz Ryalat, Ghaith Al-Refai, Tamer Abdulbaki Alshirbaji, Nour Aldeen Jalal, Thomas Neumuth, Knut Moeller and Nassir Navab

Sensors 2025, 25(10), 3017; https://doi.org/10.3390/s25103017 - 10 May 2025

Cited by 3 | Viewed by 3684

Abstract

Artificial intelligence (AI) has shown its potential to advance applications in various medical fields. One such area involves developing integrated AI-based systems to assist in laparoscopic surgery. Surgical tool detection and phase recognition are key components to develop such systems, and therefore, they [...] Read more.

Artificial intelligence (AI) has shown its potential to advance applications in various medical fields. One such area involves developing integrated AI-based systems to assist in laparoscopic surgery. Surgical tool detection and phase recognition are key components to develop such systems, and therefore, they have been extensively studied in recent years. Despite significant advancements in this field, previous image-based methods still face many challenges that limit their performance due to complex surgical scenes and limited annotated data. This study proposes a novel deep learning approach for classifying and localizing surgical tools in laparoscopic surgeries. The proposed approach uses a self-supervised learning algorithm for surgical tool classification followed by a weakly supervised algorithm for surgical tool localization, eliminating the need for explicit localization annotation. In particular, we leverage the Bidirectional Encoder Representation from Image Transformers (BEiT) model for tool classification and then utilize the heat maps generated from the multi-headed attention layers in the BEiT model for the localizing of these tools. Furthermore, the model incorporates class weights to address the class imbalance issue resulting from different usage frequencies of surgical tools in surgeries. Evaluated on the Cholec80 benchmark dataset, the proposed approach demonstrated high performance in surgical tool classification, surpassing previous works that utilize both spatial and temporal information. Additionally, the proposed weakly supervised learning approach achieved state-of-the-art results for the localization task. Full article

(This article belongs to the Special Issue Advanced Deep Learning for Biomedical Sensing and Imaging)

► Show Figures

Figure 1

32 pages, 12463 KB

Open AccessEditor’s ChoiceArticle

Neuro-Visual Adaptive Control for Precision in Robot-Assisted Surgery

by Claudio Urrea, Yainet Garcia-Garcia, John Kern and Reinier Rodriguez-Guillen

Technologies 2025, 13(4), 135; https://doi.org/10.3390/technologies13040135 - 1 Apr 2025

Cited by 6 | Viewed by 2900

Abstract

This study introduces a Neuro-Visual Adaptive Control (NVAC) architecture designed to enhance precision and safety in robot-assisted surgery. The proposed system enables semi-autonomous guidance of the laparoscope based on image input. To achieve this, the architecture integrates the following: (1) a computer vision [...] Read more.

This study introduces a Neuro-Visual Adaptive Control (NVAC) architecture designed to enhance precision and safety in robot-assisted surgery. The proposed system enables semi-autonomous guidance of the laparoscope based on image input. To achieve this, the architecture integrates the following: (1) a computer vision system based on the YOLO11n model, which detects surgical instruments in real time; (2) a Model Reference Adaptive Control with Proportional–Derivative terms (MRAC-PD), which adjusts the robot’s behavior in response to environmental changes; and (3) Closed-Form Continuous-Time Neural Networks (CfC-mmRNNs), which efficiently model the system’s dynamics. These networks address common deep learning challenges, such as the vanishing gradient problem, and facilitate the generation of smooth control signals that minimize wear on the robot’s actuators. Performance evaluations were conducted in CoppeliaSim, utilizing real cholecystectomy images featuring surgical tools. Experimental results demonstrate that the NVAC achieves maximum tracking errors of 1.80 × 10⁻³ m, 1.08 × 10⁻⁴ m, and 1.90 × 10⁻³ m along the x, y, and z axes, respectively, under highly significant dynamic disturbances. This hybrid approach provides a scalable framework for advancing autonomy in robotic surgery. Full article

(This article belongs to the Special Issue Assistive Technologies in Care and Rehabilitation: Research, Developments, and International Initiatives)

► Show Figures

Figure 1

30 pages, 7046 KB

Open AccessArticle

Improving Surgical Scene Semantic Segmentation through a Deep Learning Architecture with Attention to Class Imbalance

by Claudio Urrea, Yainet Garcia-Garcia and John Kern

Biomedicines 2024, 12(6), 1309; https://doi.org/10.3390/biomedicines12061309 - 13 Jun 2024

Cited by 9 | Viewed by 3589

Abstract

This article addresses the semantic segmentation of laparoscopic surgery images, placing special emphasis on the segmentation of structures with a smaller number of observations. As a result of this study, adjustment parameters are proposed for deep neural network architectures, enabling a robust segmentation [...] Read more.

This article addresses the semantic segmentation of laparoscopic surgery images, placing special emphasis on the segmentation of structures with a smaller number of observations. As a result of this study, adjustment parameters are proposed for deep neural network architectures, enabling a robust segmentation of all structures in the surgical scene. The U-Net architecture with five encoder–decoders (U-Net5ed), SegNet-VGG19, and DeepLabv3+ employing different backbones are implemented. Three main experiments are conducted, working with Rectified Linear Unit (ReLU), Gaussian Error Linear Unit (GELU), and Swish activation functions. The applied loss functions include Cross Entropy (CE), Focal Loss (FL), Tversky Loss (TL), Dice Loss (DiL), Cross Entropy Dice Loss (CEDL), and Cross Entropy Tversky Loss (CETL). The performance of Stochastic Gradient Descent with momentum (SGDM) and Adaptive Moment Estimation (Adam) optimizers is compared. It is qualitatively and quantitatively confirmed that DeepLabv3+ and U-Net5ed architectures yield the best results. The DeepLabv3+ architecture with the ResNet-50 backbone, Swish activation function, and CETL loss function reports a Mean Accuracy (MAcc) of 0.976 and Mean Intersection over Union (MIoU) of 0.977. The semantic segmentation of structures with a smaller number of observations, such as the hepatic vein, cystic duct, Liver Ligament, and blood, verifies that the obtained results are very competitive and promising compared to the consulted literature. The proposed selected parameters were validated in the YOLOv9 architecture, which showed an improvement in semantic segmentation compared to the results obtained with the original architecture. Full article

(This article belongs to the Section Biomedical Engineering and Materials)

► Show Figures

Graphical abstract

13 pages, 2233 KB

Open AccessArticle

P-CSEM: An Attention Module for Improved Laparoscopic Surgical Tool Detection

by Herag Arabian, Tamer Abdulbaki Alshirbaji, Nour Aldeen Jalal, Sabine Krueger-Ziolek and Knut Moeller

Sensors 2023, 23(16), 7257; https://doi.org/10.3390/s23167257 - 18 Aug 2023

Cited by 8 | Viewed by 2452

Abstract

Minimal invasive surgery, more specifically laparoscopic surgery, is an active topic in the field of research. The collaboration between surgeons and new technologies aims to improve operation procedures as well as to ensure the safety of patients. An integral part of operating rooms [...] Read more.

Minimal invasive surgery, more specifically laparoscopic surgery, is an active topic in the field of research. The collaboration between surgeons and new technologies aims to improve operation procedures as well as to ensure the safety of patients. An integral part of operating rooms modernization is the real-time communication between the surgeon and the data gathered using the numerous devices during surgery. A fundamental tool that can aid surgeons during laparoscopic surgery is the recognition of the different phases during an operation. Current research has shown a correlation between the surgical tools utilized and the present phase of surgery. To this end, a robust surgical tool classifier is desired for optimal performance. In this paper, a deep learning framework embedded with a custom attention module, the P-CSEM, has been proposed to refine the spatial features for surgical tool classification in laparoscopic surgery videos. This approach utilizes convolutional neural networks (CNNs) integrated with P-CSEM attention modules at different levels of the architecture for improved feature refinement. The model was trained and tested on the popular, publicly available Cholec80 database. Results showed that the attention integrated model achieved a mean average precision of 93.14%, and visualizations revealed the ability of the model to adhere more towards features of tool relevance. The proposed approach displays the benefits of integrating attention modules into surgical tool classification models for a more robust and precise detection. Full article

(This article belongs to the Special Issue Optical and Acoustical Methods for Biomedical Imaging and Sensing)

► Show Figures

Figure 1

15 pages, 1971 KB

Open AccessArticle

Surgical Phase Recognition in Inguinal Hernia Repair—AI-Based Confirmatory Baseline and Exploration of Competitive Models

by Chengbo Zang, Mehmet Kerem Turkcan, Sanjeev Narasimhan, Yuqing Cao, Kaan Yarali, Zixuan Xiang, Skyler Szot, Feroz Ahmad, Sarah Choksi, Daniel P. Bitner, Filippo Filicori and Zoran Kostic

Bioengineering 2023, 10(6), 654; https://doi.org/10.3390/bioengineering10060654 - 27 May 2023

Cited by 18 | Viewed by 4126

Abstract

Video-recorded robotic-assisted surgeries allow the use of automated computer vision and artificial intelligence/deep learning methods for quality assessment and workflow analysis in surgical phase recognition. We considered a dataset of 209 videos of robotic-assisted laparoscopic inguinal hernia repair (RALIHR) collected from 8 surgeons, [...] Read more.

Video-recorded robotic-assisted surgeries allow the use of automated computer vision and artificial intelligence/deep learning methods for quality assessment and workflow analysis in surgical phase recognition. We considered a dataset of 209 videos of robotic-assisted laparoscopic inguinal hernia repair (RALIHR) collected from 8 surgeons, defined rigorous ground-truth annotation rules, then pre-processed and annotated the videos. We deployed seven deep learning models to establish the baseline accuracy for surgical phase recognition and explored four advanced architectures. For rapid execution of the studies, we initially engaged three dozen MS-level engineering students in a competitive classroom setting, followed by focused research. We unified the data processing pipeline in a confirmatory study, and explored a number of scenarios which differ in how the DL networks were trained and evaluated. For the scenario with 21 validation videos of all surgeons, the Video Swin Transformer model achieved ~0.85 validation accuracy, and the Perceiver IO model achieved ~0.84. Our studies affirm the necessity of close collaborative research between medical experts and engineers for developing automated surgical phase recognition models deployable in clinical settings. Full article

(This article belongs to the Special Issue Artificial Intelligence in Surgery)

► Show Figures

Graphical abstract

11 pages, 9286 KB

Open AccessArticle

A Multi-Task Convolutional Neural Network for Semantic Segmentation and Event Detection in Laparoscopic Surgery

by Giorgia Marullo, Leonardo Tanzi, Luca Ulrich, Francesco Porpiglia and Enrico Vezzetti

J. Pers. Med. 2023, 13(3), 413; https://doi.org/10.3390/jpm13030413 - 25 Feb 2023

Cited by 21 | Viewed by 4393

Abstract

The current study presents a multi-task end-to-end deep learning model for real-time blood accumulation detection and tools semantic segmentation from a laparoscopic surgery video. Intraoperative bleeding is one of the most problematic aspects of laparoscopic surgery. It is challenging to control and limits [...] Read more.

The current study presents a multi-task end-to-end deep learning model for real-time blood accumulation detection and tools semantic segmentation from a laparoscopic surgery video. Intraoperative bleeding is one of the most problematic aspects of laparoscopic surgery. It is challenging to control and limits the visibility of the surgical site. Consequently, prompt treatment is required to avoid undesirable outcomes. This system exploits a shared backbone based on the encoder of the U-Net architecture and two separate branches to classify the blood accumulation event and output the segmentation map, respectively. Our main contribution is an efficient multi-task approach that achieved satisfactory results during the test on surgical videos, although trained with only RGB images and no other additional information. The proposed multi-tasking convolutional neural network did not employ any pre- or postprocessing step. It achieved a Dice Score equal to 81.89% for the semantic segmentation task and an accuracy of 90.63% for the event detection task. The results demonstrated that the concurrent tasks were properly combined since the common backbone extracted features proved beneficial for tool segmentation and event detection. Indeed, active bleeding usually happens when one of the instruments closes or interacts with anatomical tissues, and it decreases when the aspirator begins to remove the accumulated blood. Even if different aspects of the presented methodology could be improved, this work represents a preliminary attempt toward an end-to-end multi-task deep learning model for real-time video understanding. Full article

► Show Figures

Graphical abstract

Search Results (21)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (21)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI