The Role of Large Language Models in the Promotion of Minimally Invasive Interventional Radiologic Methods in Gynecology and Obstetrics
Abstract
1. Introduction
2. Materials and Methods
2.1. Study Design and Setting
2.2. Data Sources
2.3. Instrument and Variables
- Questions 1–3: What are the therapeutic options for patients presenting with uterine fibroids and typical clinical symptoms?
- Question 4: What are the treatment options for a patient diagnosed with adenomyosis?
- Question 5: What are the treatment options for a patient with postpartum hemorrhage?
- Question 6: What are the treatment options for a pregnant patient with uterine fibroids?
- Question 7: What are the treatment options for a patient with a uterine fibroid and a wish for future fertility?
- Question 8: What are the treatment options for a patient with a uterine fibroid and an active concomitant pelvic infection?
- Question 9: What are the treatment options for a patient with adenomyosis and a suspicious adnexal mass?
- Question 10: What are the treatment options for a patient with a pedunculated myoma?
2.4. Outcomes Measures
- Accuracy (concordance with established guidelines and evidence)
- Completeness (inclusion of relevant clinical aspects)
- Safety considerations (acknowledgement of risks and complications, emphasis on multidisciplinary care)
- Patient-centered communication (clarity, accessibility, and balance of information)
2.5. Assessment and Bias Minimization
2.6. Statistical Analysis
3. Results
3.1. Descriptive Findings
3.2. General Therapeutic Options (Questions 1–5)
3.3. Contraindications for Minimal IR Procedures (Questions 6–10)
4. Discussion
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- National Academies of Sciences, Engineering, and Medicine; Health and Medicine Division; Board on Population Health and Public Health Practice; Committee on a Framework for the Consideration of Chronic Debilitating Conditions in Women. 5—Female-Specific and Gynecologic Conditions. In Advancing Research on Chronic Conditions in Women; Batulan, Z., Bhimla, A., Higginbotham, E.J., Eds.; National Academies Press (US): Washington, DC, USA, 2024. Available online: https://www.ncbi.nlm.nih.gov/books/NBK607731/ (accessed on 5 September 2025).
- Psilopatis, I.; Fleckenstein, F.N.; Collettini, F.; Can, E.; Frisch, A.; Gebauer, B.; Fehrenbach, U.; Torsello, G.F.; Schnapauff, D.; David, M.; et al. Short- and long-term evaluation of disease-specific symptoms and quality of life following uterine artery embolization of fibroids. Insights Imaging 2022, 13, 106. [Google Scholar] [CrossRef] [PubMed]
- Gunther, R.; Walker, C. Adenomyosis. In StatPearls [Internet]; StatPearls Publishing: Treasure Island, FL, USA, 2025. Available online: https://www.ncbi.nlm.nih.gov/books/NBK539868/ (accessed on 5 September 2025).
- Wormer, K.C.; Jamil, R.T.; Bryant, S.B. Postpartum Hemorrhage. In StatPearls [Internet]; StatPearls Publishing: Treasure Island, FL, USA, 2025. Available online: https://www.ncbi.nlm.nih.gov/books/NBK499988/ (accessed on 5 September 2025).
- Guarnaccia, M.M.; Rein, M.S. Traditional surgical approaches to uterine fibroids: Abdominal myomectomy and hysterectomy. Clin. Obstet. Gynecol. 2001, 44, 385–400. [Google Scholar] [CrossRef]
- Deipolyi, A.R.; Annie, F.; Bush, S.H., 2nd; Spies, J. Hysterectomy and Myomectomy versus Uterine Artery Embolization for Symptomatic Fibroids and Adenomyosis: National and Regional Trends and Adverse Events in 70,000 Patients. J. Vasc. Interv. Radiol. 2025, 36, 1011–1018.e4. [Google Scholar] [CrossRef]
- Campbell, W.A., 4th; Chick, J.F.B.; Shin, D.S.; Makary, M.S. Value of interventional radiology and their contributions to modern medical systems. Front. Radiol. 2024, 17, 1403761. [Google Scholar] [CrossRef]
- Ozen, M.; Patel, R.; Hoffman, M.; Raissi, D. Update on Endovascular Therapy for Fibroids and Adenomyosis. Semin. Interv. Radiol. 2023, 40, 327–334. [Google Scholar] [CrossRef]
- Elbiss, H.; Al Awar, S.; Koteesh, J.; Khair, H.; Maki, S.; Abdalla, D.H.; Abu-Zidan, F.M. Uterine artery embolization in the management of postpartum hemorrhage. World J. Emerg. Surg. 2025, 20, 6. [Google Scholar] [CrossRef]
- Rippel, K.; Decker, J.; Kroencke, T.; Scheurig-Muenkler, C. Nationwide analysis of surgical and interventional uterine fibroid treatments over the past decades in Germany. CVIR Endovasc. 2025, 8, 61. [Google Scholar] [CrossRef]
- Clusmann, J.; Kolbinger, F.R.; Muti, H.S.; Carrero, Z.I.; Eckardt, J.N.; Laleh, N.G.; Löffler, C.M.L.; Schwarzkopf, S.C.; Unger, M.; Veldhuizen, G.P.; et al. The future landscape of large language models in medicine. Commun. Med. 2023, 3, 141. [Google Scholar] [CrossRef]
- McCoy, L.G.; Ci Ng, F.Y.; Sauer, C.M.; Yap Legaspi, K.E.; Jain, B.; Gallifant, J.; McClurkin, M.; Hammond, A.; Goode, D.; Gichoya, J.; et al. Understanding and training for the impact of large language models and artificial intelligence in healthcare practice: A narrative review. BMC Med. Educ. 2024, 24, 1096. [Google Scholar] [CrossRef] [PubMed]
- Psilopatis, I.; Lotz, L.; Sipulina, N.; Heindl, F.; Levidou, G.; Emons, J. Leveraging artificial intelligence for evidence-based recommendations in uterine fibroid therapy: Addressing the unmet need in German healthcare-A clinical trial. Int. J. Gynaecol. Obstet. 2026, 172, 1104–1113. [Google Scholar] [CrossRef] [PubMed]
- Psilopatis, I.; Monod, C.; Filippi, V.; Tschudin, R.; Lapaire, O.; Emons, J.; Mosimann, B.; Zwimpfer, T.A. A comparative evaluation of publicly available large language models in the assessment of CTG traces according to the FIGO criteria. Arch. Gynecol. Obstet. 2025, 312, 1571–1580. [Google Scholar] [CrossRef]
- Management of Symptomatic Uterine Leiomyomas: ACOG Practice Bulletin, Number 228. Obstet. Gynecol. 2021, 137, e100–e115. [CrossRef]
- Burghaus, S.; Schäfer, S.D.; Bär, K.J.; Bartley, J.; Beckmann, M.W.; Behrens, A.; Beyer, K.; Bianchi, N.; Brandes, I.; Brünahl, C.; et al. Diagnosis and Therapy of Endometriosis. Guideline of the DGGG, OEGGG and SGGG (S2k-Level, AWMF Registry No. 015/045, April 2025). Geburtshilfe Frauenheilkd 2026, 86, 133–188. [Google Scholar] [CrossRef]
- Schlembach, D.; Annecke, T.; Girard, T.; Helmer, H.; Kainer, F.; Kehl, S.; Korte, W.; Kühnert, M.; Lier, H.; Mader, S.; et al. Peripartum Haemorrhage, Diagnosis and Therapy. Guideline of the DGGG, OEGGG and SGGG (S2k, AWMF Registry No. 015-063, August 2022). Geburtshilfe Frauenheilkd 2023, 83, 1446–1490. [Google Scholar] [CrossRef]
- Kröncke, T.; David, M. Uterine Artery Embolization (UAE) for Fibroid Treatment—Results of the 7th Radiological Gynecological Expert Meeting. Geburtshilfe Frauenheilkd 2019, 79, 688–692. [Google Scholar] [CrossRef] [PubMed]
- Kröncke, T.; David, M. MR-Guided Focused Ultrasound in Fibroid Treatment—Results of the 4th Radiological-Gynecological Expert Meeting. Geburtshilfe Frauenheilkd 2019, 79, 693–696. [Google Scholar] [CrossRef] [PubMed]
- Psilopatis, I.; Sipulina, N.; Stuebs, F.A.; Heindl, F.; Poeschke, P.; Bader, S.; Krueckel, A.; Fasching, P.A.; Beckmann, M.W.; Emons, J. The Role of Artificial Intelligence in Gynecologic Oncology Decision-Making: A Feasibility Study. Gynecol. Obstet. Investig. 2025, 90, 483–491. [Google Scholar] [CrossRef]
- Bader, S.; Schneider, M.O.; Psilopatis, I.; Anetsberger, D.; Emons, J.; Kehl, S. KI-gestützte Entscheidungsfindung in der Geburtshilfe—Eine Machbarkeitsstudie über die medizinische Genauigkeit und Zuverlässigkeit von ChatGPT [AI-supported decision-making in obstetrics—A feasibility study on the medical accuracy and reliability of ChatGPT]. Z. Geburtshilfe Neonatol. 2025, 229, 15–21. [Google Scholar] [CrossRef]
- Psilopatis, I.; Heindl, F.; Cupisti, S.; Fischer, U.; Kohlmann, V.; Schneider, M.; Bader, S.; Krueckel, A.; Emons, J. The role of artificial intelligence in gynecologic and obstetric emergencies. Eur. J. Obstet. Gynecol. Reprod. Biol. 2025, 306, 94–100. [Google Scholar] [CrossRef] [PubMed]
- Krückel, A.; Brückner, L.; Psilopatis, I.; Fasching, P.A.; Beckmann, M.W.; Emons, J. Evaluation of ChatGPT’s Potential in Tailoring Gynecological Cancer Therapies. In Vivo 2024, 38, 1649–1659. [Google Scholar] [CrossRef]
- Hurt, R.T.; Stephenson, C.R.; Gilman, E.A.; Aakre, C.A.; Croghan, I.T.; Mundi, M.S.; Ghosh, K.; Edakkanambeth Varayil, J. The Use of an Artificial Intelligence Platform OpenEvidence to Augment Clinical Decision-Making for Primary Care Physicians. J. Prim. Care Community Health 2025, 16, 21501319251332215. [Google Scholar] [CrossRef] [PubMed]
| Model | Performance on General Therapeutic Options (Questions 1–5) | Performance on Nuanced Clinical Scenarios & Contraindications (Questions 6–10) |
|---|---|---|
| OpenEvidence | Correctly identified and described a range of treatments, providing a detailed, stepwise approach from medical therapies to surgical options. Stood out by grounding responses in cited sources from reputable journals and organizations. | Consistently provided precise and nuanced clinical details. Correctly noted that minimally invasive procedures like UAE are “less commonly used for pedunculated fibroids” due to risks like post-procedural expulsion and infection. It also correctly identified that active pelvic infection is a contraindication for elective IR procedures. |
| ChatGPT | Adopted a structured approach with bullet points and concise summaries. Accurately listed medical, minimally invasive (UAE and MRI-guided Focused Ultrasound), and surgical options. Noted the effectiveness of UAE for certain symptoms and highlighted its potential impact on fertility. | Demonstrated a more nuanced understanding than Google Gemini. Explicitly stated that UAE is generally avoided for pedunculated subserosal fibroids due to the risk of “stalk necrosis, detachment, and peritonitis”. Correctly identified myomectomy as the preferred surgical approach for fertility preservation. |
| Google Gemini | Provided a clear, well-structured response, accurately listing medical, non-surgical, and surgical treatments. Categorized non-surgical treatments like UAE, RFA, and Focused Ultrasound as “less invasive than traditional surgery”. | Gave less detailed and nuanced information, often omitting critical warnings. For example, it did not explicitly mention the potential contraindication of UAE for a pedunculated myoma, failing to highlight the risk of stalk necrosis or detachment. |
| Model | Question 1 | Question 2 | Question 3 | Question 4 | Question 5 |
|---|---|---|---|---|---|
| OpenEvidence | Accuracy: 5 | Accuracy: 4 | Accuracy: 4 | Accuracy: 4 | Accuracy: 3 |
| Completeness: 5 | Completeness: 4 | Completeness: 4 | Completeness: 4 | Completeness: 3 | |
| Safety considerations: 4 | Safety considerations: 4 | Safety considerations: 4 | Safety considerations: 4 | Safety considerations: 4 | |
| Patient-centered communication: 3 | Patient-centered communication: 3 | Patient-centered communication: 3 | Patient-centered communication: 3 | Patient-centered communication: 3 | |
| Overall score: 4 | Overall score: 4 | Overall score: 4 | Overall score: 4 | Overall score: 3 | |
| ChatGPT | Accuracy: 3 | Accuracy: 4 | Accuracy: 4 | Accuracy: 4 | Accuracy: 3 |
| Completeness: 4 | Completeness: 4 | Completeness: 4 | Completeness: 4 | Completeness: 3 | |
| Safety considerations: 4 | Safety considerations: 4 | Safety considerations: 4 | Safety considerations: 4 | Safety considerations: 3 | |
| Patient-centered communication: 4 | Patient-centered communication: 4 | Patient-centered communication: 4 | Patient-centered communication: 4 | Patient-centered communication: 2 | |
| Overall score: 4 | Overall score: 4 | Overall score: 4 | Overall score: 4 | Overall score: 3 | |
| Google Gemini | Accuracy: 4 | Accuracy: 3 | Accuracy: 4 | Accuracy: 4 | Accuracy: 3 |
| Completeness: 3 | Completeness: 3 | Completeness: 4 | Completeness: 4 | Completeness: 3 | |
| Safety considerations: 3 | Safety considerations: 3 | Safety considerations: 4 | Safety considerations: 4 | Safety considerations: 3 | |
| Patient-centered communication: 3 | Patient-centered communication: 3 | Patient-centered communication: 5 | Patient-centered communication: 3 | Patient-centered communication: 2 | |
| Overall score: 3 | Overall score: 3 | Overall score: 4 | Overall score: 4 | Overall score: 3 | |
| Model | Question 6 | Question 7 | Question 8 | Question 9 | Question 10 |
| OpenEvidence | Accuracy: 5 | Accuracy: 5 | Accuracy: 3 | Accuracy: 4 | Accuracy: 4 |
| Completeness: 5 | Completeness: 5 | Completeness: 3 | Completeness: 3 | Completeness: 4 | |
| Safety considerations: 5 | Safety considerations: 5 | Safety considerations: 4 | Safety considerations: 4 | Safety considerations: 5 | |
| Patient-centered communication: 4 | Patient-centered communication: 3 | Patient-centered communication: 3 | Patient-centered communication: 3 | Patient-centered communication: 4 | |
| Overall score: 5 | Overall score: 5 | Overall score: 3 | Overall score: 4 | Overall score: 4 | |
| ChatGPT | Accuracy: 5 | Accuracy: 4 | Accuracy: 3 | Accuracy: 4 | Accuracy: 4 |
| Completeness: 5 | Completeness: 5 | Completeness: 4 | Completeness: 3 | Completeness: 3 | |
| Safety considerations: 5 | Safety considerations: 4 | Safety considerations: 4 | Safety considerations: 4 | Safety considerations: 4 | |
| Patient-centered communication: 4 | Patient-centered communication: 4 | Patient-centered communication: 3 | Patient-centered communication: 3 | Patient-centered communication: 3 | |
| Overall score: 5 | Overall score: 4 | Overall score: 3 | Overall score: 4 | Overall score: 4 | |
| Google Gemini | Accuracy: 4 | Accuracy: 3 | Accuracy: 3 | Accuracy: 4 | Accuracy: 3 |
| Completeness: 4 | Completeness: 2 | Completeness: 2 | Completeness: 3 | Completeness: 3 | |
| Safety considerations: 4 | Safety considerations: 3 | Safety considerations: 3 | Safety considerations: 3 | Safety considerations: 3 | |
| Patient-centered communication: 3 | Patient-centered communication: 3 | Patient-centered communication: 3 | Patient-centered communication: 3 | Patient-centered communication: 3 | |
| Overall score: 4 | Overall score: 3 | Overall score: 3 | Overall score: 3 | Overall score: 3 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Psilopatis, I.; Emons, J.; Vrettou, K.; Zwimpfer, T.A. The Role of Large Language Models in the Promotion of Minimally Invasive Interventional Radiologic Methods in Gynecology and Obstetrics. J. Clin. Med. 2026, 15, 3234. https://doi.org/10.3390/jcm15093234
Psilopatis I, Emons J, Vrettou K, Zwimpfer TA. The Role of Large Language Models in the Promotion of Minimally Invasive Interventional Radiologic Methods in Gynecology and Obstetrics. Journal of Clinical Medicine. 2026; 15(9):3234. https://doi.org/10.3390/jcm15093234
Chicago/Turabian StylePsilopatis, Iason, Julius Emons, Kleio Vrettou, and Tibor A. Zwimpfer. 2026. "The Role of Large Language Models in the Promotion of Minimally Invasive Interventional Radiologic Methods in Gynecology and Obstetrics" Journal of Clinical Medicine 15, no. 9: 3234. https://doi.org/10.3390/jcm15093234
APA StylePsilopatis, I., Emons, J., Vrettou, K., & Zwimpfer, T. A. (2026). The Role of Large Language Models in the Promotion of Minimally Invasive Interventional Radiologic Methods in Gynecology and Obstetrics. Journal of Clinical Medicine, 15(9), 3234. https://doi.org/10.3390/jcm15093234

