Vaccine Perception on Digital Platforms: Topic Modeling of YouTube Comments
Abstract
1. Introduction
2. Materials and Methods
2.1. Data Collection
2.2. Preparation of the Data
2.3. Topic Modeling
2.3.1. Hierarchical Dirichlet Process (hLDA)
2.3.2. Latent Semantic Analysis (LSA)
2.3.3. Non-Negative Matrix Factorization (NMF)
3. Findings
3.1. Analysis Results of Video Comments Using the Hierarchical Dirichlet Process Method
3.2. Findings from the Analysis of Video Comments Using the Latent Semantic Analysis Method
3.3. Findings from the Analysis of Video Comments Using the Non-Negative Matrix Factorization Method
3.4. Performance Comparison of Subject Modeling Methods
4. Discussion
5. Limitations
6. Conclusions and Recommendations
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Simonenko, V.B.; Abashin, V.G. Vaccination as protection against particularly dangerous infections and infectious processes that create public health emergencies of international importance. Clin. Med. (Russ. J.) 2022, 100, 5–10. [Google Scholar]
- Philippon, A. La saga des vaccins à ARNm. Bull. l’Académie Vétérinaire Fr. 2023, 176, 326–329. [Google Scholar] [CrossRef]
- Bezbaruah, R.; Sailo, N.; Mawii, Z.; Deka, K.; Bhutia, Y.D.; Kakoti, B. Myths and facts about vaccination. In Advanced Vaccination Technologies for Infectious and Chronic Diseases; Chavda, V.P., Vora, L.K., Apostolopoulos, V., Eds.; Stacy Masucci: London, UK, 2024. [Google Scholar]
- Plotkin, S.L.; Plotkin, S.A. A Short History of Vaccination. In Plotkin’s Vaccines, 7th ed.; Plotkin, S.A., Orenstein, W.A., Offit, P.A., Edwards, K.M., Eds.; Elsevier: Philadelphia, PA, USA, 2018. [Google Scholar]
- Immunization Agenda 2030 Partners. Immunization agenda 2030: A global strategy to leave no one behind. Vaccine 2024, 42, S5–S14. [Google Scholar] [CrossRef]
- Boccalini, S. Value of Vaccinations: A Fundamental Public Health Priority to Be Fully Evaluated. Vaccines 2025, 13, 479. [Google Scholar] [CrossRef]
- O’Brien, K.L.; Lemango, E.; Nandy, R.; Lindstrand, A. The immunization Agenda 2030: A vision of global impact, reaching all, grounded in the realities of a changing world. Vaccine 2024, 42, S1–S4. [Google Scholar] [CrossRef]
- Saman, S.; Chauhan, I.; Srivastava, N. Vaccines: An Important Tool for Infectious Disease. Recent Adv. Anti-Infect. Drug Discov. 2023, 18, 88–109. [Google Scholar] [CrossRef]
- Chatterjee, S.; Zehmakan, A.N. Effective Vaccination Strategies in Network-based SIR Model. arXiv 2023, arXiv:2305.16458v1. [Google Scholar] [CrossRef]
- Contreras, M.; Grazlewska, W.; Kasaija, P.D.; Fuente, J. Vaccines. In Handbook of Molecular Biotechnology; Liu, D., Ed.; CRC Press: Boca Raton, FL, USA, 2024; pp. 1–21. [Google Scholar]
- Kayser, V.; Ramzan, I. Vaccines and vaccination: History and emerging issues. Hum. Vaccin. Immunother. 2021, 17, 5255–5268. [Google Scholar] [CrossRef]
- Dasari, S. Advances in vaccination to combat pandemic outbreaks. In Pandemic Outbreaks in the 21st Century; Viswanath, B., Ed.; Academic Press: Cambridge, MA, USA; Elsevier: Amsterdam, The Netherlands, 2021. [Google Scholar]
- Zhong, J.; Zhuang, Y.; Zhang, M. Impact of epidemic prevention policies on public vaccination willingness: Empirical research in China. Front. Public Health 2024, 12, 1329228. [Google Scholar] [CrossRef]
- Siwakoti, S.; Evans, N.; Saphiro, J.N. Less reliable media drive interest in anti-vaccine information. Harv. Kennedy Sch. Misinformation Rev. 2023, 4, 1–15. [Google Scholar] [CrossRef]
- Green, J.; Druckman, J.N.; Baum, M.A.; Ognyanova, K.; Simonson, M.D.; Perlis, R.H.; Lazer, D. Media use and vaccine resistance. PNAS Nexus 2023, 2, pgad146. [Google Scholar] [CrossRef]
- Vučurović, M.; Jovanovic, V.; Krcmar, O. Anti-Vaccination movement as a supracultural phenomenon. Eur. J. Public Health 2023, 33, 558–569. [Google Scholar] [CrossRef]
- Stenberg, J.; Chea, E.; Tse, J.; Thornock, B.; Stephenson, L.G. A Physician’s Approach to the Vaccine-Hesitant Patient. Ann. Public Health Rep. 2023, 7, 323–327. [Google Scholar]
- Amendola, A.; Canuti, M. Vaccine-Preventable Diseases. In Global Health Essentials; Raviglione, M.C., Tediosi, F., Villa, S., Casamitjana, N., Plasència, A., Eds.; Sustainable Development Goals Series; Springer: Cham, Switzerland, 2023; pp. 117–127. [Google Scholar]
- Raulji, J.K.; Saini, J.R. Sanskrit lemmatizer for improvisation of morphological analyzer. J. Stat. Manag. Syst. 2019, 22, 613–625. [Google Scholar] [CrossRef]
- Trishala, G.; Mamatha, H.R. Implementation of Stemmer and Lemmatizer for a Low-Resource Language-Kannada. In Proceedings of International Conference on Intelligent Computing, Information and Control Systems; Springer: Singapore, 2021; pp. 345–358. [Google Scholar]
- Mohamed, S.A.; Mohamed, M.A. Lexicon and Rule-Based Word Lemmatization Approach for the Somali Language. arXiv 2023, arXiv:2308.01785. [Google Scholar] [CrossRef]
- Toporkov, O.; Agerri, R. On the Role of Morphological Information for Contextual Lemmatization. arXiv 2023, arXiv:2302.00407v3. [Google Scholar] [CrossRef]
- Teh, Y.W.; Jordan, M.I.; Beal, M.J.; Blei, D.M. Hierarchical Dirichlet Processes. J. Am. Stat. Assoc. 2006, 101, 1566–1581. [Google Scholar] [CrossRef]
- Buntine, W. Understanding Hierarchical Processes. Entropy 2022, 24, 1703. [Google Scholar] [CrossRef]
- Sheluhın, O.I.; Vanyushına, A.V.; Zhelnov, M.S. Use of Latent-Semantic Analysis in Preparation of Data For Identification of Anonymous Users By Digital Fingerprints. H&ES Res. 2022, 14, 36–44. [Google Scholar] [CrossRef]
- Kountouri, I.; Manousakis, E.G.; Tsekrekos, A.E. Latent semantic analysis of corporate social responsibility reports (with an application to Hellenic firms). Int. J. Discl. Gov. 2019, 14, 1–19. [Google Scholar] [CrossRef]
- Jorge-Botana, G.; Olmos, R.; Luzón, J.M. Bridging the theoretical gap between semantic representation models without the pressure of a ranking: Some lessons learnt from LSA. Cogn. Process. 2019, 21, 1–21. [Google Scholar] [CrossRef]
- Babalola, O.; Ojokoh, B.; Boyinbode, O. Comprehensive Evaluation of LDA, NMF, and BERTopic’s Performance on News Headline Topic Modeling. J. Comput. Theo. App. 2024, 2, 268–289. [Google Scholar] [CrossRef]
- Pavithra, C.B.; Savitha, J. Topic Modeling for Evolving Textual Data Using LDA, HDP, NMF, BERTOPIC, and DTM With a Focus on Research Papers. J. Technol. Inform. 2024, 5, 53–63. [Google Scholar] [CrossRef]
- Dutta, P.; De, R.K. MDSR-NMF: Multiple deconstruction single reconstruction deep neural network model for non-negative matrix factorization. Netw. Comput. Neural Syst. 2023, 34, 306–342. [Google Scholar] [CrossRef] [PubMed]
- Kobayashi, T.; Watanabe, K. End-to-End Trainable Weakly Non-Negative Factorization. In 2023 IEEE International Conference on Image Processing (ICIP); IEEE: Piscataway, NJ, USA, 2023; pp. 490–494. [Google Scholar]
- Brie, D.; Gillis, N.; Gillis, N.; Moussao, S. Non-negative Matrix Factorization. In Source Separation in Physical-Chemical Sensing; Jutten, C., Duarte, L.T., Moussaoui, S., Eds.; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2023. [Google Scholar] [CrossRef]
- Mimno, D.; Wallach, H.M.; Talley, E.; Leenders, M.; McCallum, A. Optimizing semantic coherence in topic models. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP ‘11); Association for Computational Linguistics: Stroudsburg, PA, USA, 2011; pp. 262–272. [Google Scholar]
- Röder, M.; Both, A.; Hinneburg, A. Exploring the Space of Topic Coherence Measures. In Proceedings of the Eighth ACM International Conference on Web Search and Data Mining (WSDM ‘15); Association for Computing Machinery: New York, NY, USA, 2015; pp. 399–408. [Google Scholar] [CrossRef]
- Johnson, N.F.; Velásquez, N.; Restrepo, N.J.; Leahy, R.; Gabriel, N.; El Oud, S.; Lupu, Y. The online competition between pro-and anti-vaccination views. Nature 2020, 582, 230–233. [Google Scholar] [CrossRef]
- Carpentras, D.; Lüders, A.; Quayle, M. Mapping the global opinion space to explain anti-vaccine attraction. Sci. Rep. 2022, 12, 6188. [Google Scholar] [CrossRef]
- Cinelli, M.; De Francisci Morales, G.; Galeazzi, A.; Quattrociocchi, W.; Starnini, M. The echo chamber effect on social media. Proc. Natl. Acad. Sci. USA 2021, 118, e2023301118. [Google Scholar] [CrossRef]
- Puri, N.; Coomes, E.A.; Haghbayan, H.; Gunaratne, K. Social media and vaccine hesitancy: New updates for the era of COVID-19 and globalized infectious diseases. Hum. Vaccin. Immunother. 2020, 16, 2586–2593. [Google Scholar] [CrossRef]
- Sütçü, K.; Tekerek, B.; Özler, G. Aşı karışıtı Twitter paylaşımlarının metin madenciliği ve içerik analizi yöntemiyle incelenmesi. Hacet. Sağlık İdaresi Derg. 2022, 25, 827–838. [Google Scholar]
- Yousefinaghani, S.D. Prediction of COVID-19 waves using social media and Google search: A case study of the US and Canada. Front. Public Health 2021, 9, 656635. [Google Scholar] [CrossRef] [PubMed]
- Tellez, E. The Link Between Vaccination and Autism. Virginia Community College System. 2018, pp. 1–8. Available online: https://commons.vccs.edu/cgi/viewcontent.cgi?article=1026&context=student_writing (accessed on 30 May 2026).
- DeStefano, F.; Shimabukuro, T.T. The MMR Vaccine and Autism. Annu. Rev. Virol. 2019, 6, 585–600. [Google Scholar] [CrossRef]
- Matthew, Z.; Daniel, A.; Salmon, N.; Neal, A.; Halsey, W. Do Vaccines Cause Autism? In The Clinician’s Vaccine Safety Resource Guide; Springer: Cham, Switzerland, 2018; pp. 197–204. [Google Scholar]
- Agrawal, S.; Dayama, S.; Galthora, A. Vaccinate every child against COVID-19: A scientific or socioeconomic need? J. Fam. Med. Prim. Care 2022, 11, 1658–1663. [Google Scholar] [CrossRef]
- Weber, D.J.; Zimmerman, K.O.; Tartof, S.Y.; McLaughlin, J.M.; Pather, S. Risk of COVID-19 in Children throughout the Pandemic and the Role of Vaccination: A Narrative Review. Vaccines 2024, 12, 989. [Google Scholar] [CrossRef]
- Silva, B.C.; Pinto, F.F.; Araujo, I.; Barros, G.B.; Silvério, A.S. The Current Relevance of Vaccination For Covıd-19, Influenza, Poliomyelitis, Measles And Smallpox. Int. J. Dev. Res. 2023, 13, 62935–62939. [Google Scholar]
- Subramaniam, D.S.; Al Juboori, R.; Bianco, A.; Subramaniam, D.P. Knowledge and behavioral beliefs related to vaccination hesitancy among healthcare workers. Front. Public Health 2024, 12, 1518112. [Google Scholar] [CrossRef]
- Youssef, D.; Abou-Abass, L.; Issa, O.; Youssef, J.; Hassan, H. Unlocking the keys to COVID-19 vaccine acceptance: Insights from healthcare workers and the general population. Discov. Soc. Sci. Health 2024, 4, 79. [Google Scholar] [CrossRef]








| Keywords | Number of Comments | Comment % | Number of Videos | Video % |
|---|---|---|---|---|
| Covid Vaccine | 27,540 | 29.28 | 45 | 20.93 |
| Vaccine | 23,919 | 25.43 | 48 | 22.33 |
| Zero Dose Vaccine | 19,227 | 20.44 | 28 | 13.02 |
| Vaccination | 11,805 | 12.55 | 27 | 12.56 |
| mRNA | 4313 | 4.58 | 5 | 2.33 |
| Monkeypox | 2778 | 2.95 | 13 | 6.05 |
| HIV Vaccine | 2118 | 2.25 | 13 | 6.05 |
| Other Vaccines (BCG, Chickenpox, Flu, Polio, AIDS, Ebola, Hepatitis B) | 2372 | 2.52 | 36 | 16.74 |
| Total | 94,072 | 100 | 215 | 100 |
| Topic 1 | company (0.02), cure (0.02), wonder (0.02), money (0.02), court (0.02), smart (0.02), medication (0.02), help (0.01), pay (0.01), medicine (0.01), hurt (0.01), cured (0.01), fake (0.01), cancer (0.01), life (0.01) |
| Topic 2 | vaccine (0.03), child (0.02), vaccinated (0.02), autism (0.01), kid (0.01), person (0.01), risk (0.01), idiot (0.01), parent (0.01), actually (0.01), wrong (0.01), die (0.01), stupid (0.01), mean (0.01), unvaccinated (0.01) |
| Topic 3 | virus (0.05), body (0.03), immune (0.02), cell (0.02), mrna (0.02), protein (0.02), vaccine (0.01), disease (0.01), natural (0.01), immunity (0.01), dna (0.01), kill (0.01), spread (0.01), fight (0.01), response (0.01) |
| Topic 4 | covid (0.03), year (0.02), death (0.02), effect (0.01), sick (0.01), flu (0.01), vaccine (0.01), vaccination (0.01), vaccinated (0.01), life (0.01), shot (0.01), polio (0.01), hospital (0.01), booster (0.01), day (0.01) |
| Topic 5 | anti (0.07), vax (0.05), vaxxers (0.02), nice (0.01), video (0.01), interesting (0.01), explanation (0.01), change (0.01), afraid (0.01), experiment (0.01), mind (0.01), control (0.01), dangerous (0.01) |
| Topic 6 | covid (0.07), jab (0.02), heart (0.02), amp (0.02), conspiracy (0.02), died (0.01), comment (0.01), dead (0.01), lie (0.01), pfizer (0.01), attack (0.01), stupidity (0.01), brain (0.01), lost (0.01), theory (0.01) |
| Topic 7 | antivaxxers (0.03), propaganda (0.02), pox (0.01), smallpox (0.01), news (0.01), monkey (0.01), earth (0.01), history (0.01), argue (0.01), woman (0.01), sheep (0.01), flat (0.01), awesome (0.01), seriously (0.01), daughter (0.01) |
| Topic 8 | government (0.02), trump (0.02), medical (0.02), trust (0.02), science (0.02), doctor (0.01), public (0.01), pharma (0.01), research (0.01), medium (0.01), drug (0.01), health (0.01) |
| Topic 9 | blood (0.02), sad (0.01), literally (0.01), clot (0.01), poison (0.01), humanity (0.01), vaxxed (0.01), hilarious (0.01), dumb (0.01), sound (0.01), lab (0.01), herb (0.01), pure (0.01) |
| Topic 10 | good (0.02), mask (0.01), safe (0.01), feel (0.01), republican (0.01), job (0.01), hard (0.01), mandate (0.01), truth (0.01), sense (0.01), effective (0.01), listen (0.01), guy (0.01), hate (0.01) |
| Topic 1 | long, died, jab, life, disease, vaccination, effect, kid, body, death, good, child, virus, vaccinated, covid |
| Topic 2 | failure, stage, ivermectin, strain, bed, fatigue, update, version, fruit, positive, beast, scam, scurvy, citrus, covid |
| Topic 3 | hope, feel, doc, nice, stuff, informative, girl, idea, science, news, animation, information, explanation, covid, good |
| Topic 4 | vaccination, die, anti, risk, school, measles, covid, autism, unvaccinated, vaccinate, parent, good, kid, child, vaccinated |
| Topic 5 | antibody, infected, person, hiv, spread, protein, sick, body, immune, cell, corona, unvaccinated, good, virus, vaccinated |
| Topic 6 | hiv, disease, measles, autism, protein, body, immune, corona, cell, vaccinate, good, covid, parent, virus, child |
| Topic 7 | stupid, immune, cell, covid, vaccinate, parent, corona, good, body, child, vaxxers, kid, virus, vax, anti |
| Topic 8 | measles, feel, school, mrna, needle, cell, jab, poor, autism, covid, vaccinate, parent, virus, poison, kid |
| Topic 9 | covid, sick, blood, kid, amazing, dna, produce, vaccination, disease, vaccinated, immune, protein, mrna, cell, body |
| Topic 10 | effective, believe, company, medical, covid, virus, safe, pharma, money, poison, lie, science, body, government, trust |
| Topic 1 | infection, positive, against, mrna, tested, rate, month, die, symptom, flu, vax, pandemic, booster, death, covid |
| Topic 2 | heart, sick, bad, risk, problem, believe, health, flu, medical, safe, death, disease, vaccination, jab, effect |
| Topic 3 | doc, harm, nice, stuff, hope, informative, pretty, bad, girl, idea, news, information, explanation, job, good |
| Topic 4 | cured, dead, natural, herpes, herbal, protein, fight, cell, kill, spread, immune, cure, hiv, corona, virus |
| Topic 5 | infected, population, forced, fine, protected, measles, school, spread, die, risk, protect, against, sick, unvaccinated, vaccinated |
| Topic 6 | baby, protect, polio, school, adult, mother, vaccinating, autistic, risk, disease, measles, autism, vaccinate, parent, child |
| Topic 7 | wrong, dumb, vac, against, dislike, idiot, mom, mandate, die, movement, stupid, vaxx, vaxxers, vax, anti |
| Topic 8 | mom, care, polio, healthy, leave, needle, die, measles, poor, school, autism, poison, vaccinate, parent, kid |
| Topic 9 | cancer, part, natural, attack, poison, fight, antibody, blood, produce, dna, immune, protein, mrna, cell, body |
| Topic 10 | paid, evil, scientist, poison, truth, company, medical, speed, believe, money, pharma, lie, government, science, trust |
| Method | Number of Subjects | Coherence Score (c_v) | Study Duration (sn) |
|---|---|---|---|
| LSA | 10 | 0.5395 | 99 |
| hLDA | 10 | 0.7197 | 103,406 |
| NMF | 10 | 0.5803 | 280 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Sert, U.; Ersoy, E.; Tosun, Ö.; Hatıpoğlu, I. Vaccine Perception on Digital Platforms: Topic Modeling of YouTube Comments. Computers 2026, 15, 360. https://doi.org/10.3390/computers15060360
Sert U, Ersoy E, Tosun Ö, Hatıpoğlu I. Vaccine Perception on Digital Platforms: Topic Modeling of YouTube Comments. Computers. 2026; 15(6):360. https://doi.org/10.3390/computers15060360
Chicago/Turabian StyleSert, Uğurcan, Esra Ersoy, Ömür Tosun, and Irmak Hatıpoğlu. 2026. "Vaccine Perception on Digital Platforms: Topic Modeling of YouTube Comments" Computers 15, no. 6: 360. https://doi.org/10.3390/computers15060360
APA StyleSert, U., Ersoy, E., Tosun, Ö., & Hatıpoğlu, I. (2026). Vaccine Perception on Digital Platforms: Topic Modeling of YouTube Comments. Computers, 15(6), 360. https://doi.org/10.3390/computers15060360

