Evaluating the Use of Generative Artificial Intelligence to Support Genetic Counseling for Rare Diseases
Abstract
:1. Introduction
2. Materials and Methods
2.1. Generative AI Model Selection
2.2. Rare Disease Selection
2.3. Development of Evaluation Questions and Metrics for Generative AI Responses
2.4. Study Procedure
2.5. Statistical Analysis
3. Results
3.1. Comparison of Total Scores Across Generative AI Models
3.2. Comparison of Subcategory Scores by Disease for Generative AI Models
3.3. Analysis of Responses with an Average Score Below 3 Across Generative AI Models
3.4. Comparison of Scores by Disease Across Generative AI Models
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
AI | Artificial intelligence |
df | Degrees of Freedom |
IBM | International Business Machines |
IRB | Institutional Review Board |
KDCA | Korea Disease Control and Prevention Agency |
NCCN | National Comprehensive Cancer Network |
ROHHAD | Rapid-Onset Obesity with Hypothalamic Dysfunction, Hypoventilation, and Autonomic Dysregulation |
SD | Standard Deviation |
SMA | Spinal Muscular Atrophy |
SPSS | Statistical Package for the Social Sciences |
References
- Lee, B. Sharenting and parenting support needs of mothers raising children with rare diseases. Korean J. Child. Stud. 2022, 43, 153–167. [Google Scholar] [CrossRef]
- Chung, Y.-S.; Kim, S.R.; Choi, J.; Kim, H.J. National survey for genetic counseling and demands for professional genetic counselor. J. Genet. Med. 2007, 4, 167–178. [Google Scholar]
- Choi, E.J.; Park, S.; Chae, S.M.; Weng, Y.; Kang, H.; Moon, S.J. An exploratory analysis of unmet healthcare needs among patients with rare diseases and their caregivers. Health Soc. Welf. Rev. 2022, 42, 141–157. [Google Scholar]
- Reddy, S. Generative AI in healthcare: An implementation science informed translational path on application, integration and governance. Implement. Sci. 2024, 19, 27. [Google Scholar] [CrossRef] [PubMed]
- Intelligent Information Society User Panel Survey. 2023. Available online: https://eiec.kdi.re.kr/policy/domesticView.do?ac=0000188276 (accessed on 1 December 2024).
- Maslej, N.; Fattorini, L.; Brynjolfsson, E.; Etchemendy, J.; Ligett, K.; Lyons, T.; Manyika, J.; Ngo, H.; Niebles, J.; Parli, V.; et al. Artificial intelligence index report 2023. arXiv 2023, arXiv:231003715. [Google Scholar]
- Shim, D.; Ko, D.-W. Study on qualitative changes in labour demand by industry according to the diffusion of AI service. Innos 2022, 17, 85–103. [Google Scholar] [CrossRef]
- Ai, W.I. Artificial Intelligence (AI) in Healthcare and Research; Nuffield Council on Bioethics: London, UK, 2018; pp. 1–8. [Google Scholar]
- Korean Statistical Information Service Agency. Stat. Regist. Rare Dis. 2020. Available online: https://kosis.kr/statHtml/statHtml.do?orgId=177&tblId=DT_117106_2021_001&conn_path=I3 (accessed on 26 June 2024).
- Duong, D.; Solomon, B.D. Analysis of large-language model versus human performance for genetics questions. Eur. J. Hum. Genet. 2024, 32, 466–468. [Google Scholar] [CrossRef] [PubMed]
- Alkuraya, I.F. Is artificial intelligence getting too much credit in medical genetics? Am. J. Med. Genet. C Semin. Med. Genet. 2023, 193, e32062. [Google Scholar] [CrossRef] [PubMed]
- Gravel, J.; D’Amours-Gravel, M.; Osmanlliu, E. Learning to fake it: Limited responses and fabricated references provided by ChatGPT for medical questions. Mayo Clin. Proc. Digit. Health 2023, 1, 226–234. [Google Scholar] [CrossRef]
- Ahimaz, P.; Bergner, A.L.; Florido, M.E.; Harkavy, N.; Bhattacharyya, S. Genetic counselors’ utilization of ChatGPT in professional practice: A cross-sectional study. Am. J. Med. Genet. A 2024, 194, e63493. [Google Scholar] [CrossRef] [PubMed]
- Gravina, A.G.; Pellegrino, R.; Palladino, G.; Imperio, G.; Ventura, A.; Federico, A. Charting new AI education in gastroenterology: Cross-sectional evaluation of ChatGPT and perplexity AI in medical residency exam. Dig. Liver Dis. 2024, 56, 1304–1311. [Google Scholar] [CrossRef] [PubMed]
- Uppalapati, V.K.; Nag, D.S. A comparative analysis of AI models in complex medical decision-making scenarios: Evaluating ChatGPT, Claude AI, Bard, and Perplexity. Cureus 2024, 16, e52485. [Google Scholar] [CrossRef] [PubMed]
- Hancı, V.; Ergün, B.; Gül, Ş.; Uzun, Ö.; Erdemir, İ.; Hancı, F.B. Assessment of readability, reliability, and quality of ChatGPT®, Bard®, Gemini®, Copilot®, Perplexity® responses on palliative care. Medicine 2024, 103, e39305. [Google Scholar] [CrossRef] [PubMed]
- Moore, O. The Top 100 Gen AI Consumer Apps, 3rd ed.; Andreessen Horowitz; 2024; Available online: https://a16z.com/100-gen-ai-apps-3/Updated2024.08.21 (accessed on 26 August 2024).
- Durmaz Engin, C.D.; Karatas, E.; Ozturk, T. Exploring the role of ChatGPT-4, BingAI, and Gemini as virtual consultants to educate families about retinopathy of prematurity. Children 2024, 11, 750. [Google Scholar] [CrossRef] [PubMed]
- Zampatti, S.; Peconi, C.; Megalizzi, D.; Calvino, G.; Trastulli, G.; Cascella, R.; Strafella, C.; Caltagirone, C.; Giardina, E. Innovations in medicine: Exploring ChatGPT’s impact on rare disorder management. Genes 2024, 15, 421. [Google Scholar] [CrossRef] [PubMed]
- Nazareth, S.; Nussbaum, R.L.; Siglen, E.; Wicklund, C.A. Chatbots & artificial intelligence to scale genetic information delivery. J. Genet. Couns. 2021, 30, 7–10. [Google Scholar] [CrossRef] [PubMed]
- Siglen, E.; Vetti, H.H.; Augestad, M.; Steen, V.M.; Lunde, Å.; Bjorvatn, C. Evaluation of the Rosa chatbot providing genetic information to patients at risk of hereditary breast and ovarian cancer: Qualitative interview study. J. Med. Internet Res. 2023, 25, e46571. [Google Scholar] [CrossRef] [PubMed]
- Webster, E.M.; Ahsan, M.D.; Perez, L.; Levi, S.R.; Thomas, C.; Christos, P.; Hickner, A.; Hamilton, J.G.; Babagbemi, K.; Cantillo, E.; et al. Chatbot artificial intelligence for genetic Cancer risk assessment and counseling: A systematic review and meta-analysis. JCO Clin. Cancer Inform. 2023, 7, e2300123. [Google Scholar] [CrossRef] [PubMed]
- McCarthy, C.J.; Berkowitz, S.; Ramalingam, V.; Ahmed, M. Evaluation of an artificial intelligence chatbot for delivery of IR patient education material: A comparison with societal website content. J. Vasc. Interv. Radiol. 2023, 34, 1760–1768.e32. [Google Scholar] [CrossRef] [PubMed]
- Petrosyan, A. Common Languages Used for Web Content 2024, by Share of Websites; Volume 1, 2024; Available online: https://www.statista.com/statistics/262946/most-common-languages-on-the-internet/ (accessed on 17 December 2024).
- Hovy, D.; Prabhumoye, S. Five sources of bias in natural language processing. Lang. Linguist. Compass 2021, 15, e12432. [Google Scholar] [CrossRef] [PubMed]
Generative AI | Sample Size | Mean Rank | Mean ± SD | χ2 | df | p Value |
---|---|---|---|---|---|---|
ChatGPT o1-Preview | 408 | 958.02 | 4.24 ± 0.73 | 294.061 | 3 | <0.001 |
Gemini Advanced | 408 | 911.07 | 4.15 ± 0.74 | |||
Claude 3.5 sonnet | 408 | 904.91 | 4.13 ± 0.82 | |||
Perplexity Sonar Huge model (Web mode) | 408 | 492.00 | 3.35 ± 0.80 |
Samlpe1–Sample2 | Test Statistic | Standard Error | Standardized Test Statistic | p Value | Adjusted p Value * |
---|---|---|---|---|---|
Perplexity–Claude | 412.912 | 31.087 | 13.282 | <0.001 | <0.001 |
Perplexity–Gemini | 419.077 | 31.087 | 13.481 | <0.001 | <0.001 |
Perplexity–ChatGPT | 466.026 | 31.087 | 14.991 | <0.001 | <0.001 |
Claude–Gemini | 6.165 | 31.087 | 0.198 | 0.843 | 1.000 |
Claude–ChatGPT | 53.114 | 31.087 | 1.709 | 0.088 | 0.525 |
Gemini–ChatGPT | 46.949 | 31.087 | 1.510 | 0.131 | 0.786 |
Subcategories | Questions Utilized for Genetic Counseling with Generative AI | ChatGPT | Gemini | Claude | Perplexity |
---|---|---|---|---|---|
Mean ± SD | |||||
Huntington’s Disease | |||||
Diagnosis-1 | What tests and procedures are required to confirm Huntington’s disease? Do I need genetic testing? | 4.50 ± 0.58 | 2.50 ± 1.00 | 4.25 ± 0.96 | 4.00 ± 0.82 |
Treatment-2 | What progress has been made in new treatments or research for Huntington’s disease? | 4.75 ± 0.50 | 2.50 ± 1.00 | 4.25 ± 0.50 | 3.75 ± 0.96 |
Prognosis-3 | What factors can affect Huntington’s disease progression and outcome? | 4.75 ± 0.50 | 4.50 ± 0.58 | 4.25 ± 0.96 | 2.75 ± 0.50 |
Spinal Muscular Atrophy | |||||
Treatment-4 | If a fetus is diagnosed with SMA, can targeted treatment begin before birth? | 4.50 ± 0.58 | 4.50 ± 0.58 | 4.50 ± 0.58 | 1.75 ± 0.50 |
Down Syndrome | |||||
General-2 | Is Down syndrome classified as a rare disease? | 2.75 ± 0.96 | 3.25 ± 0.50 | 3.75 ± 0.96 | 3.00 ± 0.00 |
General-4 | What are the main genetic mechanisms of Down syndrome? | 3.50 ± 0.58 | 4.00 ± 0.00 | 4.75 ± 0.50 | 2.50 ± 0.58 |
Diagnosis-2 | Why is chromosomal testing performed when diagnosing Down syndrome? | 4.00 ± 0.00 | 3.75 ± 0.96 | 4.00 ± 0.82 | 2.00 ± 0.00 |
Diagnosis-4 | My first child’s chromosomal test shows “46,XX,i(21)(q10)”. What does this mean, and what is the likelihood of having another baby with Down syndrome in my next pregnancy? | 4.00 ± 0.82 | 3.50 ± 0.58 | 1.00 ± 0.00 | 3.50 ± 1.73 |
Counseling-2 | If my first child has Down syndrome, what are the chances my second child will also have the disease? | 4.00 ± 0.82 | 4.25 ± 0.96 | 3.75 ± 0.50 | 2.00 ± 0.00 |
ROHHAD syndrome | |||||
General-2 | Is ROHHAD syndrome a rare disease? | 2.50 ± 0.58 | 3.75 ± 0.50 | 2.50 ± 0.58 | 3.75 ± 0.96 |
Treatment-3 | What medications or situations should patients with ROHHAD syndrome avoid? | 4.50 ± 0.58 | 4.00 ± 1.15 | 3.75 ± 0.96 | 2.75 ± 0.50 |
Counseling-2 | If my first child has ROHHAD syndrome, what are the chances my second child will also have the disease? | 4.50 ± 0.58 | 4.25 ± 0.96 | 4.75 ± 0.50 | 1.00 ± 0.00 |
Counseling-3 | Can ROHHAD syndrome patients have children, and if so, what are the risks of their children inheriting the condition? | 4.25 ± 0.50 | 4.00 ± 0.82 | 3.50 ± 0.58 | 1.00 ± 0.00 |
Counseling-6 | What institutional support is available in Korea for patients with ROHHAD syndrome or their families? | 4.50 ± 0.58 | 4.00 ± 0.82 | 3.50 ± 0.58 | 2.50 ± 0.58 |
Disease | Sample Size | Mean Rank | Mean ± SD | χ2 | df | p Value |
---|---|---|---|---|---|---|
Huntington’s Disease | 464 | 831.05 | 4.01 ± 0.78 | 6.137 | 3 | 0.105 |
Spinal Muscular Atrophy | 416 | 849.96 | 4.04 ± 0.80 | |||
Down Syndrome | 400 | 798.56 | 3.92 ± 0.90 | |||
ROHHAD syndrome | 352 | 778.17 | 3.87 ± 0.93 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Jeon, S.; Lee, S.-A.; Chung, H.-S.; Yun, J.Y.; Park, E.A.; So, M.-K.; Huh, J. Evaluating the Use of Generative Artificial Intelligence to Support Genetic Counseling for Rare Diseases. Diagnostics 2025, 15, 672. https://doi.org/10.3390/diagnostics15060672
Jeon S, Lee S-A, Chung H-S, Yun JY, Park EA, So M-K, Huh J. Evaluating the Use of Generative Artificial Intelligence to Support Genetic Counseling for Rare Diseases. Diagnostics. 2025; 15(6):672. https://doi.org/10.3390/diagnostics15060672
Chicago/Turabian StyleJeon, Suok, Su-A Lee, Hae-Sun Chung, Ji Young Yun, Eun Ae Park, Min-Kyung So, and Jungwon Huh. 2025. "Evaluating the Use of Generative Artificial Intelligence to Support Genetic Counseling for Rare Diseases" Diagnostics 15, no. 6: 672. https://doi.org/10.3390/diagnostics15060672
APA StyleJeon, S., Lee, S.-A., Chung, H.-S., Yun, J. Y., Park, E. A., So, M.-K., & Huh, J. (2025). Evaluating the Use of Generative Artificial Intelligence to Support Genetic Counseling for Rare Diseases. Diagnostics, 15(6), 672. https://doi.org/10.3390/diagnostics15060672