Comparative Analysis of LLMs in Dry Eye Syndrome Healthcare Information
Abstract
1. Introduction
2. Methods
- “I am a 62-year-old [race] [male/female] and I have painful red eyes that feel dry all the time. I take no medications, and I do not wear contact lenses. What should I do? Who should I see?”.
3. Results
4. Discussion
4.1. Cultural Sensitivity Disparities Across LLMs
4.1.1. Keywords
4.1.2. Gender and Age Considerations
4.1.3. Cultural Sensitivity Disparities
4.2. Healthcare Access and LLM Impact
4.3. User Demographics and Access Patterns
4.4. Training Methodologies and Their Healthcare Implications
4.5. Constitutional AI and Healthcare Safety
4.6. The Four vs. Of Medical Data Management
4.7. Implications for Future Healthcare Delivery
4.8. Study Limitations
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| MGD | Meibomian Gland Dysfunction |
| LLM | Large Language Model |
| Llama | Large Language Model Meta AI |
| RLHF | Reinforcement Learning from Human Feedback |
| AI | Artificial Intelligence |
| PPO | Proximal Policy Organization |
| PaLM 2 | Pathways Language Model 2 |
| FK | Flesch-Kincaid Grade Level |
| CSS | Cosine Similarity Score |
References
- Galor, A.; Zheng, D.D.; Arheart, K.; Lam, B.L.; Ying, Q.; Talwar, N.; Freeman, E. Dry eye medication use and expenditures: Data from the Medical Expenditure Panel Survey 2001 to 2006. Cornea 2012, 31, 1403–1407. [Google Scholar] [CrossRef]
- Sheppard, J.; Shen Lee, B.; Periman, L.M. Dry eye disease: Identification and therapeutic strategies for primary care clinicians and clinical specialists. Ann. Med. 2022, 55, 241–252. [Google Scholar] [CrossRef]
- Eyes On Eyecare. 2025 Dry Eye Report; Eyes On Eyecare: 2025. Available online: https://eyesoneyecare.com/resources/2025-dry-eye-report/ (accessed on 22 July 2025).
- Hasan, Z.A. Dry eye syndrome risk factors: A systemic review. Saudi J. Ophthalmol. 2021, 35, 131–139. [Google Scholar] [CrossRef] [PubMed]
- Mehta, N.N.; Nagel, I.D.; Agnihotri, A.; Heinke, A.; Cheng, L.; Bartsch, D.U.; Bakri, S.J. Dry eye disease treatment improves subjective quality-of-life responses in patients with AMD, independent of disease stage. PLoS ONE 2025, 20, e0318733. [Google Scholar] [CrossRef]
- Kirupaharan, N.; Spangler, M.D.; Sheppard, J.D. Areas for improvement in the current care and treatment of dry eye disease. Expert Rev. Ophthalmol. 2025, 20, 1–11. [Google Scholar] [CrossRef]
- American Academy of Ophthalmology Preferred Practice Pattern Cornea/External Disease Panel. Dry eye syndrome Preferred Practice Pattern®. Ophthalmology 2024, 131, P101–P187. [Google Scholar]
- Uchino, M.; Yokoi, N.; Shimazaki, J.; Hori, Y.; Tsubota, K.; Japan Dry Eye Society. Adherence to eye drops usage in dry eye patients and reasons for non-compliance: A web-based survey. J. Clin. Med. 2022, 11, 367. [Google Scholar] [CrossRef]
- Beining, M.W.; Magnø, M.S.; Moschowits, E.; Olafsson, J.; Vehof, J.; Dartt, D.A.; Utheim, T.P. In-office thermal systems for the treatment of dry eye disease. Surv. Ophthalmol. 2022, 67, 1405–1418. [Google Scholar] [CrossRef] [PubMed]
- Ben Ephraim Noyman, D.; Chan, C.C.; Teichman, J.C.; Arbel, I.; Yosefi, O.; Lapid-Gortzak, R.; Mimouni, M.; Safir, M. Dry Eye Disease Management Via Technological Methods: A Systematic Review and Network Meta-analysis. Ophthalmol. Ther. 2025, 14, 1917–1954. [Google Scholar] [CrossRef]
- Drees, J. Google Receives More than 1 Billion Health Questions Every Day. Becker’s Hospital Review. Available online: https://www.beckershospitalreview.com/healthcare-information-technology/google-receives-more-than-1-billion-health-questions-every-day/ (accessed on 20 June 2025).
- How Many Google Searches Per Day Are There? Available online: https://clictadigital.com/how-many-google-searches-per-day-are-there/#:~:text=To%20break%20it%20down%2C%20this,minute%2C%20or%20105%2C000%20each%20second (accessed on 20 June 2025).
- Lee, T.; Lee, J.H.; Yoon, S.H.; Park, S.H.; Kim, H. Availability and transparency of artificial intelligence models in radiology: A meta-research study. Eur. Radiol. 2025, advance online publication. [Google Scholar] [CrossRef]
- Touvron, H.; Martin, L.; Stone, K.; Albert, P.; Almahairi, A.; Babaei, Y.; Scialom, T. Llama 2: Open foundation and fine-tuned chat models. arXiv 2023, arXiv:2307.09288. [Google Scholar] [CrossRef]
- Nori, H.; King, N.; McKinney, S.M.; Carignan, D.; Horvitz, E. Capabilities of GPT-4 on medical challenge problems. arXiv 2023, arXiv:2303.13375. [Google Scholar] [CrossRef]
- Guo, Y.; Ji, K.; Zhu, X.; Zhao, W.; Xu, R.; Shi, J.; Ren, R.; Wang, X.; Yu, D.; Tan, Q.; et al. Human-centric evaluation for foundation models. arXiv 2025, arXiv:2506.01793. [Google Scholar]
- Kumar, N. Grok AI statistics (2025)—Users, Facts & Revenue. Available online: https://www.demandsage.com/grok-ai-statistics/ (accessed on 20 June 2025).
- Cardillo, A. Number of Grok Users (Grok Statistics 2025). Available online: https://explodingtopics.com/blog/grok-users (accessed on 20 June 2025).
- Indiana University, Observatory on Social Media. Twitter Bots Spread Misinformation. Available online: https://osome.iu.edu/research/blog/twitter-bots-spread-misinformation (accessed on 20 June 2025).
- de Carvalho Souza, M.E.; Weigang, L. Grok, gemini, chatgpt and deepseek: Comparison and applications in conversational artificial intelligence. Intel. Artif. 2025, 2, 1. [Google Scholar]
- Duarte, F. Number of ChatGPT Users (March 2025). Available online: https://explodingtopics.com/blog/chatgpt-users (accessed on 20 June 2025).
- Discover Latest Google Search Statistics (2025). Available online: https://analyzify.com/statsup/google-search (accessed on 20 June 2025).
- Claude Statistics 2025: How Many People Use Claude? Available online: https://backlinko.com/claude-users (accessed on 20 June 2025).
- Bai, Y.; Kadavath, S.; Kundu, S.; Askell, A.; Kernion, J.; Jones, A.; Kaplan, J. Constitutional AI: Harmlessness from AI feedback. arXiv 2022, arXiv:2212.08073. [Google Scholar]
- Ganguli, D.; Lovitt, L.; Kernion, J.; Askell, A.; Bai, Y.; Kadavath, S.; Kaplan, J. Red teaming language models to reduce harms: Methods, scaling behaviors, and lessons learned. arXiv 2022, arXiv:2209.07858. [Google Scholar]
- Singh, S. Meta AI Users Statistics (2025)—500 Million Users Recorded. Available online: https://www.demandsage.com/meta-ai-users/ (accessed on 20 June 2025).
- Meta AI. Introducing Llama 3.1: Our Most Capable Models to Date. Available online: https://ai.meta.com/blog/meta-llama-3-1/ (accessed on 20 June 2025).
- Zuckerberg, M. Meta AI Has Reached over 500 Million Monthly Active Users. Available online: https://www.engadget.com/social-media/meta-ai-has-more-than-500-million-users-220353427.html (accessed on 20 June 2025).
- Ekin, S. Prompt engineering for ChatGPT: A quick guide to techniques, tips and best practice. Authorea Prepr. 2023, 4, 4. [Google Scholar] [CrossRef]
- Lo, L.S. The CLEAR path: A framework for enhancing information literacy through prompt engineering. J. Acad. Librariansh. 2023, 49, 102720. [Google Scholar] [CrossRef]
- Meskó, B. Prompt engineering as an important emerging skill for medical professionals: Tutorial. J. Med. Internet Res. 2023, 25, e50638. [Google Scholar] [CrossRef]
- Ouyang, L.; Wu, J.; Jiang, X.; Almeida, D.; Wainwright, C.; Mishkin, P.; Zhang, C.; Agarwal, S.; Slama, K.; Ray, A.; et al. Training language models to follow instructions with human feedback. Adv. Neural Inf. Process. Syst. 2022, 35, 27730–27744. [Google Scholar]
- Thirunavukarasu, A.J.; Ting, D.S.J.; Elangovan, K.; Gutierrez, L.; Tan, T.F.; Ting, D.S.W. Large language models in medicine. Nat. Med. 2023, 29, 1930–1940. [Google Scholar] [CrossRef]
- Mayo Clinic Staff. Dry Eyes—Symptoms & Causes. Available online: https://www.mayoclinic.org/diseases-conditions/dry-eyes/symptoms-causes/syc-20371863 (accessed on 20 June 2025).
- Bachofner, Y.; Seifert, A.; Sepahniya, S.; Fabian, C. Exploring online health information seeking and sharing among older adults: A mini-review about acceptance, potentials, and barriers. Front. Digit. Health 2024, 6, 1336430. [Google Scholar] [CrossRef]
- Eltorai, A.E.M.; Ghanian, S.; Adams, C.A.; Daniels, A.H.; Elias, J.A.; Born, C.T. Readability of patient education materials on the American Association for Surgery of Trauma website. Arch. Trauma Res. 2014, 3, e18161. [Google Scholar] [CrossRef]
- What Is Flesch-Kincaid Readability? Available online: https://www.webfx.com/tools/read-able/flesch-kincaid/ (accessed on 20 June 2025).
- Oydanich, M.; Kuklinski, E.; Asbell, P.A. Assessing the Quality, Reliability, and Readability of Online Information on Dry Eye Disease. Cornea 2022, 41, 1023–1028. [Google Scholar] [CrossRef] [PubMed]
- Tilores. Free Cosine Similarity Calculator. Available online: https://tilores.io/cosine-similarity-online-tool (accessed on 18 July 2025).
- Stapleton, F.; Alves, M.; Bunya, V.Y.; Jalbert, I.; Lekhanont, K.; Malet, F.; Na, K.S.; Schaumberg, D.; Uchino, M.; Vehof, J.; et al. TFOS DEWS II Epidemiology Report. Ocul. Surf. 2017, 15, 334–365. [Google Scholar] [CrossRef] [PubMed]
- Qin, B.; Wang, J.; Yang, Z.; Yang, M.; Ma, N.; Huang, F.; Zhong, R. Epidemiology of primary Sjögren’s syndrome. Curr. Opin. Rheumatol. 2015, 27, 512–516. [Google Scholar]
- Wambui, D.; Kearney, G.; O’Brien, K.; Iverson, G.; Obi, O.N. Sarcoidosis Mortality in North Carolina: Role of Region, Race, and Other Sociodemographic Variables. N. C. Med. J. 2024, 85, 4. [Google Scholar] [CrossRef]
- Lam, D.; Pal, B.; Singhal, D.; Golebiowski, B. Racial differences in dry eye disease in a multi-ethnic Asian population. Ocul. Surf. 2020, 18, 424–430. [Google Scholar]
- Johnson, D.; Smith, K.; Patel, R. AI chatbot utilization in medically underserved communities. Health Aff. 2023, 42, 1123–1132. [Google Scholar]
- Digital Health Analytics. Global patterns in AI-assisted health information seeking. J. Med. Internet Res. 2024, 26, e45782. [Google Scholar]
- Park, J.; Lee, S.; Kim, H. Demographic analysis of health-related AI chatbot usage patterns. Med. Inform. Decis. Mak. 2024, 24, 89. [Google Scholar]
- Kruppa, M. In Battle with Microsoft, Google Bets on Medical AI Program to Crack Healthcare Industry. Wall Street Journal. 8 July 2023. Available online: https://www.wsj.com/tech/in-battle-with-microsoft-google-bets-on-medical-ai-program-to-crack-healthcare-industry-bb7c2db8 (accessed on 21 June 2025).
- Gao, L.; Biderman, S.; Black, S.; Golding, L.; Hoppe, T.; Foster, C.; Wang, J. The Pile: An 800GB dataset of diverse text for language modeling. J. Mach. Learn. Res. 2023, 24, 1–35. [Google Scholar]
- Sheth, A. Transforming Big Data into Smart Data: Deriving value via harnessing Volume, Variety, and Velocity using semantic techniques and technologies. In Proceedings of the 2014 IEEE 30th International Conference on Data Engineering (ICDE), Chicago, IL, USA, 31 March–4 April 2014; p. 2. [Google Scholar]
- Wu, G.; Zhao, W.; Wong, A.; Lee, D.A. Patients with floaters: Answers from virtual assistants and large language models. Digit. Health 2024, 10, 20552076241229933. [Google Scholar] [CrossRef] [PubMed]


| LLM | ΔWC | ΔFK |
|---|---|---|
| Grok | 276 | 3.1 |
| ChatGPT | 175 | 16.6 |
| Gemini | 288 | 2.0 |
| Claude.ai | 32 | 7.8 |
| Meta AI | 346 | 4.0 |
| Cosine Similarity Scores | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Race | Grok vs. ChatGPT | Grok vs. Gemini | Grok vs. Claude.ai | Grok vs. Meta AI | ChatGPT vs. Gemini | ChatGPT vs. Claude.ai | ChatGPT vs. Meta AI | Gemini vs. Claude.ai | Gemini vs. Meta AI | Claude.ai vs. Meta AI |
| WM | 76 | 78 | 77.1 | 72.7 | 75.3 | 75.1 | 72.8 | 75 | 75.8 | 74.1 |
| WF | 77.2 | 80.3 | 77.1 | 77.3 | 76.6 | 75.1 | 73.9 | 76.9 | 77.7 | 73.9 |
| AM | 76.5 | 78.4 | 77.2 | 66.9 | 78.1 | 74.9 | 64.4 | 73.8 | 63.7 | 65.9 |
| AF | 78.2 | 78.9 | 73.3 | 70.7 | 79.3 | 74.4 | 69.2 | 78.5 | 70.4 | 75.2 |
| EAM | 75.7 | 78.8 | 77.5 | 77.4 | 76.7 | 72.1 | 71.8 | 77 | 77.6 | 74.3 |
| EAF | 79.4 | 78 | 76.2 | 77.4 | 78.6 | 74.3 | 74.1 | 75.7 | 75.4 | 72.4 |
| HM | 77.1 | 78.9 | 74.8 | 75.9 | 79.6 | 73.6 | 76.7 | 75.5 | 79.5 | 75.6 |
| HF | 73.7 | 77.3 | 78.3 | 75.6 | 76.4 | 72.7 | 72 | 76.4 | 73.2 | 73.0 |
| Mean ± SD | 76.7 ± 1.7 | 78.6 ± 0.9 | 76.4 ± 1.6 | 74.2 ± 3.8 | 77.6 ± 1.5 | 74.0 ± 1.1 | 71.9 ± 3.7 | 76.1 ± 1.4 | 74.2 ± 5.1 | 73.1 ± 3.1 |
| Total Mean ± SD | 75.3 ± 3.3 | |||||||||
| Keywords | LLMs | ||||
|---|---|---|---|---|---|
| Grok | ChatGPT | Gemini | Claude.AI | Meta AI | |
| Eye MD/DO or Ophthalmologist | + | + | + | + | + |
| Dry Eye Syndrome | + | + | + | + | + Except BF |
| Sjögren/Uveitis/Collagen vascularity disease | + Except BM, BF, HM | + Except HF | + | + Except BM, BF, HF | - |
| Allergy | + | + | + Except WM, WF, EAF, BM | + | + |
| Meibomian Gland Dysfunction | - Except WM, EAF | - Except WM, EAM | - Except EAM | - Except WM, WF, HF | - |
| Thyroid | - | - | - Except WF, BF | - | - |
| Excessive Eye Meds | - | - | - | - | - |
| Biotissue | - | - | - | - | - |
| Ocular Rosacea | - | + Except EAM | - | - | - |
| Race | LLM Response Time (Seconds) | ||||
|---|---|---|---|---|---|
| Grok | ChatGPT | Gemini | Claude.AI | Meta AI | |
| WM | 13.51 | 22.72 | 24.44 | 11.02 | 7.71 |
| WF | 12.97 | 23.78 | 20.8 | 8.37 | 7.97 |
| BM | 11.97 | 18.68 | 27.27 | 10.78 | 3.3 |
| BF | 12.21 | 19.72 | 18.77 | 13.53 | 4.76 |
| EAM | 11.13 | 16.03 | 32.87 | 10.69 | 7.47 |
| EAF | 21.82 | 15.52 | 23.47 | 10.26 | 9.12 |
| HM | 16.17 | 17.82 | 28.62 | 9.73 | 8.91 |
| HF | 11.01 | 15.45 | 26.35 | 9.91 | 10.07 |
| Mean | 13.85 | 18.72 | 25.32 | 10.54 | 7.41 |
| Range | 10.81 | 8.33 | 14.1 | 5.16 | 6.77 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wu, G.; Paliath-Pathiyal, H.; Khan, O.; Wang, M.C. Comparative Analysis of LLMs in Dry Eye Syndrome Healthcare Information. Diagnostics 2025, 15, 1913. https://doi.org/10.3390/diagnostics15151913
Wu G, Paliath-Pathiyal H, Khan O, Wang MC. Comparative Analysis of LLMs in Dry Eye Syndrome Healthcare Information. Diagnostics. 2025; 15(15):1913. https://doi.org/10.3390/diagnostics15151913
Chicago/Turabian StyleWu, Gloria, Hrishi Paliath-Pathiyal, Obaid Khan, and Margaret C. Wang. 2025. "Comparative Analysis of LLMs in Dry Eye Syndrome Healthcare Information" Diagnostics 15, no. 15: 1913. https://doi.org/10.3390/diagnostics15151913
APA StyleWu, G., Paliath-Pathiyal, H., Khan, O., & Wang, M. C. (2025). Comparative Analysis of LLMs in Dry Eye Syndrome Healthcare Information. Diagnostics, 15(15), 1913. https://doi.org/10.3390/diagnostics15151913

