From Headlines to Thumbnails: Comparative Analysis of Web Publications in Bulgarian Digital Media and YouTube
Abstract
1. Introduction
- Do Bulgarian news agencies maintain a consistent thematic focus across their primary websites and official YouTube channels, or do their priorities diverge?
- What are the specific thematic differences in content strategy between text-based headlines (websites) and video-based titles (YouTube)?
- How do news agencies strategically use visual elements (thumbnails) on YouTube to complement, or alter, the messaging of their textual titles?
2. Materials and Methods
2.1. Web Scraping
- Bulgarian News Agency: As the publicly funded, national news agency of Bulgaria, BTA has a long-standing history and a mandate for comprehensive, factual coverage of politics, economy, culture, and international affairs. It operates as a formal, journalistic source. The official BTA website delivers up-to-date news from Bulgaria and around the world, focusing on socially significant events, economics, politics, and culture (Bulgarian News Agency, 2025b). The official BTA YouTube channel features video reports, press conferences, and interviews, often complementing the website’s content with multimedia formats (Bulgarian News Agency, 2025a).
- BGNES Agency: One of the largest private, commercially funded news agencies in the country, BGNES Agency offers timely reports and analysis. It is known for its aggressive, high-volume “media flow” style, focusing heavily on domestic and global developments. The BGNES website offers national and international news, analyses, and reports, with an emphasis on balanced coverage of current events (BGNES Agency, 2025b). The official BGNES YouTube channel includes video interviews, reports, and press events, synchronized with the publications on the website (BGNES Agency, 2025a).
- Blitz News: A prominent private, commercially funded agency, Blitz is known for its high-traffic, sensationalist, tabloid-style reporting. Its content is often visually intense and designed to maximize clicks (clickbait). The Blitz website publishes high-frequency news covering a wide range of topics—from politics and economics to lifestyle and sports (Blitz News, 2025b). The Blitz YouTube channel presents video materials, news bulletins, and reports that visually enrich and complement the articles published on the website (Blitz News, 2025a).
2.2. Quantitative Analysis
2.3. Qualitative Analysis
- Emotional valence (presence of strong emotions on faces: anger, surprise).
- Text overlays (use of provocative quotes, all-caps, bold colors).
- Iconography (use of symbols, flags, party logos).
- Branding (agency logos, consistent color schemes).
3. Results
3.1. Keyword Analysis (TF-IDF)
3.2. YouTube Thumbnails Added Value Analysis
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| BTA | Bulgarian News (Telegraph) Agency |
| CTR | Click-Through Rate |
| IDF | Inverse Document Frequency |
| NLP | Natural language processing |
| SEO | Search Engine Optimization |
| TF | Term Frequency |
| TF-IDF | Term Frequency-Inverse Document Frequency |
Appendix A. TF-IDF Calculations
References
- Ahmed, E., Xue, L., Sankalp, A., Kong, H., Matos, A., Silenzio, V., & Singh, V. K. (2023). Predicting loneliness through digital footprints on Google and YouTube. Electronics, 12(23), 4821. [Google Scholar] [CrossRef]
- Arreeras, S., Phonsitthangkun, S., Arreeras, T., & Arimura, M. (2024). Spatial analysis on the service coverage of emergency facilities for fire disaster risk in an urban area using a web scraping method: A case study of Chiang Rai City, Thailand. Urban Science, 8(3), 140. [Google Scholar] [CrossRef]
- Avraam, E., Veglis, A., & Dimoulas, C. (2022). Comparison of publishing and consumption patterns in Greek media websites. Journalism and Media, 3(1), 134–143. [Google Scholar] [CrossRef]
- BGNES Agency. (2025a). Home. YouTube. Available online: https://www.youtube.com/@BGNESAgency (accessed on 1 July 2025).
- BGNES Agency. (2025b). News. Available online: https://bgnes.bg/ (accessed on 1 July 2025).
- Bhagat, K. K., Mishra, S., Dixit, A., & Chang, C.-Y. (2021). Public opinions about online learning during COVID-19: A sentiment analysis approach. Sustainability, 13(6), 3346. [Google Scholar] [CrossRef]
- Blitz News. (2025a). Home. YouTube. Available online: https://www.youtube.com/@BlitzBGNews (accessed on 1 July 2025).
- Blitz News. (2025b). News. Available online: https://blitz.bg/ (accessed on 1 July 2025).
- Bulgarian News Agency. (2025a). Home. Available online: https://www.youtube.com/@BulgarianNewsAgency (accessed on 1 July 2025).
- Bulgarian News Agency. (2025b). News. Available online: https://bta.bg/ (accessed on 1 July 2025).
- Chang, C.-W. (2022). Developing a multicriteria decision-making model based on a three-layer virtual internet of things algorithm model to rank players’ value. Mathematics, 10(14), 2369. [Google Scholar] [CrossRef]
- Eizmendi-Iraola, M., Peña-Fernández, S., & Morales-i-Gras, J. (2025). The gender gap in science communication on TikTok and YouTube: How platform dynamics shape the visibility of female science communicators. Journalism and Media, 6(3), 108. [Google Scholar] [CrossRef]
- Garganas, O. (2024). Digital video advertising: Breakthrough or extension of TV advertising in the new digital media landscape? Journalism and Media, 5(2), 749–765. [Google Scholar] [CrossRef]
- Giannakoulopoulos, A., Pergantis, M., Lamprogeorgos, A., & Lampoura, S. (2025). A data-driven approach to estimating the impact of audiovisual art events through web presence. Information, 16(2), 88. [Google Scholar] [CrossRef]
- Hassanien, H. E.-D. (2019). Web scraping scientific repositories for augmented relevant literature search using CRISP-DM. Applied System Innovation, 2(4), 37. [Google Scholar] [CrossRef]
- Hayes, D. R., Cappa, F., & Cardon, J. (2018). A framework for more effective dark web marketplace investigations. Information, 9(8), 186. [Google Scholar] [CrossRef]
- Ioscote, F., Gonçalves, A., & Quadros, C. (2024). Artificial intelligence in journalism: A ten-year retrospective of scientific articles (2014–2023). Journalism and Media, 5(3), 873–891. [Google Scholar] [CrossRef]
- Jung, W.-C., Kim, J., & Park, N. (2021). Web-browsing application using web scraping technology in Korean network separation application. Symmetry, 13(8), 1550. [Google Scholar] [CrossRef]
- Kirilov, R. (2021). Approaches for building information systems for monitoring the realization of students. Economic Alternatives, 3, 469–481. [Google Scholar] [CrossRef]
- Li, B., Kou, X., & Bonk, C. J. (2023). Embracing the disrupted language teaching and learning field: Analyzing YouTube content creation related to ChatGPT. Languages, 8(3), 197. [Google Scholar] [CrossRef]
- Liu, T.-M. (2025). Using snake roadkill patterns to indicate effects of climate change on snakes in Taiwan. Sustainability, 17(4), 1580. [Google Scholar] [CrossRef]
- Louro, J., Fidalgo, F., & Oliveira, Â. (2024). Recognition of food ingredients—Dataset analysis. Applied Sciences, 14(13), 5448. [Google Scholar] [CrossRef]
- Lu, J. C.-C. (2023). Creating effective educational videos on YouTube in higher education. Engineering Proceedings, 38(1), 32. [Google Scholar] [CrossRef]
- Luţan, E.-R., & Bădică, C. (2022). Emotion-based literature book classification using online reviews. Electronics, 11(20), 3412. [Google Scholar] [CrossRef]
- Mabrouk, A., Redondo, R. P. D., & Kayed, M. (2021). SEOpinion: Summarization and exploration of opinion from E-commerce websites. Sensors, 21(2), 636. [Google Scholar] [CrossRef]
- MacLean, C., & Cavallucci, D. (2024). Assessing fine-tuned NER models with limited data in French: Automating detection of new technologies, technological domains, and startup names in renewable energy. Machine Learning and Knowledge Extraction, 6(3), 1953–1968. [Google Scholar] [CrossRef]
- Manning, C. D., Raghavan, P., & Schütze, H. (2008). Introduction to information retrieval. Cambridge University Press. [Google Scholar]
- McGarry, K. (2023). Analyzing social media data using sentiment mining and bigram analysis for the recommendation of YouTube videos. Information, 14(7), 408. [Google Scholar] [CrossRef]
- Musleh, D. A., Alkhwaja, I., Alkhwaja, A., Alghamdi, M., Abahussain, H., Alfawaz, F., Min-Allah, N., & Abdulqader, M. M. (2023). Arabic sentiment analysis of YouTube comments: NLP-based machine learning approaches for content evaluation. Big Data and Cognitive Computing, 7(3), 127. [Google Scholar] [CrossRef]
- Naganawa, H., & Hirata, E. (2025a). Enhancing policy generation with GraphRAG and YouTube data: A logistics case study. Electronics, 14(7), 1241. [Google Scholar] [CrossRef]
- Naganawa, H., & Hirata, E. (2025b). Social media and logistics: Uncovering challenges and solutions through YouTube data. Logistics, 9(2), 56. [Google Scholar] [CrossRef]
- Naing, I., Aung, S. T., Wai, K. H., & Funabiki, N. (2024). A reference paper collection system using web scraping. Electronics, 13(14), 2700. [Google Scholar] [CrossRef]
- Osman, N. S., Kim, J.-H., Park, J.-H., & Park, H.-W. (2025). Identifying the impacts of social movement mobilization on YouTube: Social network analysis. Information, 16(1), 55. [Google Scholar] [CrossRef]
- Panagiotidis, K., & Veglis, A. (2020). Transitions in journalism—Toward a semantic-oriented technological framework. Journalism and Media, 1(1), 1–17. [Google Scholar] [CrossRef]
- Park, Y., & Shin, Y. (2022). Novel scratch programming blocks for web scraping. Electronics, 11(16), 2584. [Google Scholar] [CrossRef]
- Pérez, V., & Aybar, C. (2024). Challenges in geocoding: An analysis of R packages and web scraping approaches. ISPRS International Journal of Geo-Information, 13(6), 170. [Google Scholar] [CrossRef]
- Polpanich, O.-U., Bhatpuria, D., Santos Santos, T. F., & Krittasudthacheewa, C. (2022). Leveraging multi-source data and digital technology to support the monitoring of localized water changes in the Mekong Region. Sustainability, 14(3), 1739. [Google Scholar] [CrossRef]
- Rodríguez-Almonacid, D. V., Ramírez-Gil, J. G., Higuera, O. L., Hernández, F., & Díaz-Almanza, E. (2023). A comprehensive step-by-step guide to using data science tools in the gestion of epidemiological and climatological data in rice production systems. Agronomy, 13(11), 2844. [Google Scholar] [CrossRef]
- Salton, G., Wong, A., & Yang, C. S. (1975). A vector space model for automatic indexing. Communications of the ACM, 18(11), 613–620. [Google Scholar] [CrossRef]
- Santos, F., & Acosta, N. (2023). An approach based on web scraping and denoising encoders to curate food security datasets. Agriculture, 13(5), 1015. [Google Scholar] [CrossRef]
- Santos Duarte, A. L., Souza, E. R. d., Silva, M. P. d. O., Monte, M. B. d. S., Correia, N. O. d. A., de Carvalho, V. D. H., & Taques, F. H. (2025). Data protection in Brazil: Applying text mining in court documents. Engineering Proceedings, 87(1), 57. [Google Scholar] [CrossRef]
- Sarker, K. U., Saqib, M., Hasan, R., Mahmood, S., Hussain, S., Abbas, A., & Deraman, A. (2022). A ranking learning model by K-means clustering technique for web scraped movie data. Computers, 11(11), 158. [Google Scholar] [CrossRef]
- Sergiacomi, C., Vuletić, D., Paletto, A., Barbierato, E., & Fagarazzi, C. (2022). Exploring national park visitors’ judgements from social media: The case study of Plitvice Lakes National Park. Forests, 13(5), 717. [Google Scholar] [CrossRef]
- Silvallana, D. F., Elias, C., & Catalan-Matamoros, D. (2025). Exploring vaccine hesitancy in the Philippines: A content analysis of comments on national TV channel YouTube videos. International Journal of Environmental Research and Public Health, 22(6), 819. [Google Scholar] [CrossRef]
- Skoulikaris, C., & Krestenitis, Y. (2020). Cloud data scraping for the assessment of outflows from dammed rivers in the EU. A case study in South Eastern Europe. Sustainability, 12(19), 7926. [Google Scholar] [CrossRef]
- Sparck Jones, K. (1972). A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation, 28(1), 11–21. [Google Scholar] [CrossRef]
- Tanasescu, L. G., Vines, A., Bologa, A. R., & Vaida, C. A. (2022). Big data ETL process and its impact on text mining analysis for employees’ reviews. Applied Sciences, 12(15), 7509. [Google Scholar] [CrossRef]
- Tsiourlini, M., Tzafilkou, K., Karapiperis, D., & Tjortjis, C. (2024). Text analytics on YouTube comments for food products. Information, 15(10), 599. [Google Scholar] [CrossRef]


| Media | Type | Style/Tone |
|---|---|---|
| Bulgarian News Agency | Public | Formal, factual, journalistic |
| BGNES Agency | Private | Aggressive, commercial, media flow |
| Blitz News | Private | Sensational, clickbait, visually intense |
| Digital Media | Website | Publications |
|---|---|---|
| Bulgarian News Agency | https://bta.bg (accessed on 1 July 2025) | 206,950 |
| BGNES Agency | https://bgnes.bg (accessed on 1 July 2025) | 21,492 |
| Blitz News | https://blitz.bg (accessed on 1 July 2025) | 86,661 |
| Channel | Homepage | Videos |
|---|---|---|
| Bulgarian News Agency | https://www.youtube.com/@BulgarianNewsAgency (accessed on 1 July 2025) | 5285 |
| BGNES Agency | https://www.youtube.com/@BGNESAgency (accessed on 1 July 2025) | 1095 |
| Blitz News | https://www.youtube.com/@BlitzBGNews (accessed on 1 July 2025) | 581 |
| Keyword | Mean TF-IDF Score |
|---|---|
| ukrayna (Ukraine) | 0.038 |
| borisov (former Prime Minister (PM) of Bulgaria) | 0.035 |
| voynata (the war) | 0.026 |
| rusiya (Russia) | 0.025 |
| radev (President of Bulgaria) | 0.023 |
| pravitelstvo (government) | 0.023 |
| kiril (Kiril Petkov, former PM of Bulgaria) | 0.022 |
| petkov (former PM of Bulgaria) | 0.022 |
| sofia (Sofia) | 0.021 |
| tsena (price) | 0.021 |
| ukrainski (Ukrainian) | 0.019 |
| ruski (Russian) | 0.018 |
| gaz (gas) | 0.018 |
| putin (President of Russia) | 0.017 |
| evropeyskiya (the European) | 0.016 |
| Keyword | Mean TF-IDF Score |
|---|---|
| teatur (theater) | 0.045 |
| izlozhba (exhibition) | 0.044 |
| kontsert (concert) | 0.038 |
| varna (coastal city in Bulgaria) | 0.031 |
| evropeyskiya (the European) | 0.028 |
| bulgaria (Bulgaria) | 0.027 |
| obshtina (municipality) | 0.027 |
| mezhdunarodniya (the international) | 0.026 |
| festival (fest) | 0.025 |
| plovdiv (second largest city in Bulgaria) | 0.024 |
| Keyword | Mean TF-IDF Score |
|---|---|
| borisov (former PM of Bulgaria) | 0.052 |
| tsena (price) | 0.046 |
| gaz (gas) | 0.042 |
| radev (President of Bulgaria) | 0.040 |
| pravitelstvo (government) | 0.039 |
| kiril (Kiril Petkov, former PM of Bulgaria) | 0.038 |
| petkov (former PM of Bulgaria) | 0.038 |
| gerb (political party) | 0.035 |
| asen (Asen Vasilev, former Minister of Finance of Bulgaria) | 0.034 |
| vasilev (former Minister of Finance of Bulgaria) | 0.034 |
| Keyword | Mean TF-IDF Score |
|---|---|
| ukrayna (Ukraine) | 0.051 |
| voynata (the war) | 0.043 |
| rusiya (Russia) | 0.039 |
| borisov (former PM of Bulgaria) | 0.038 |
| putin (President of Russia) | 0.032 |
| shok (schock) | 0.028 |
| tragediya (tragedy) | 0.027 |
| kiev (Kiev) | 0.027 |
| ruskata (the Russian) | 0.025 |
| petkov (former PM of Bulgaria) | 0.024 |
| Keyword | Mean TF-IDF Score |
|---|---|
| borisov (former PM of Bulgaria) | 0.053 |
| pravitelstvo (government) | 0.040 |
| kiril (Kiril Petkov, former PM of Bulgaria) | 0.038 |
| petkov (former PM of Bulgaria) | 0.038 |
| radev (President of Bulgaria) | 0.036 |
| izbori (elections) | 0.033 |
| rumen (Rumen Radev, President of Bulgaria) | 0.032 |
| gerb (political party) | 0.031 |
| asen (Asen Vasilev, former Minister of Finance of Bulgaria) | 0.031 |
| vasilev (former Minister of Finance of Bulgaria) | 0.031 |
| boyko (Boyko Borisov, former PM of Bulgaria) | 0.029 |
| mandata (the mandate) | 0.026 |
| glasuva (votes) | 0.024 |
| proektokabineta (the project-cabinet) | 0.023 |
| zhelyazkov (PM of Bulgaria) | 0.022 |
| Keyword | Mean TF-IDF Score |
|---|---|
| pravitelstvo (government) | 0.095 |
| proektokabineta (the project-cabinet) | 0.063 |
| glasuva (votes) | 0.063 |
| bulgaria (Bulgaria) | 0.056 |
| zhelyazkov (PM of Bulgaria) | 0.053 |
| radev (President of Bulgaria) | 0.052 |
| mandata (the mandate) | 0.049 |
| rumen (Rumen Radev, President of Bulgaria) | 0.046 |
| pravitelstvoto (the government) | 0.044 |
| ministri (ministers) | 0.040 |
| Keyword | Mean TF-IDF Score |
|---|---|
| borisov (former PM of Bulgaria) | 0.075 |
| izbori (elections) | 0.066 |
| gurtsiya (Greece) | 0.056 |
| pravitelstvo (government) | 0.055 |
| gerb (political party) | 0.048 |
| boyko (Boyko Borisov, former PM of Bulgaria) | 0.046 |
| asen (Asen Vasilev, former Minister of Finance of Bulgaria) | 0.045 |
| vasilev (former Minister of Finance of Bulgaria) | 0.045 |
| evrozonata (the eurozone) | 0.042 |
| finansi (finances) | 0.042 |
| Keyword | Mean TF-IDF Score |
|---|---|
| borisov (former PM of Bulgaria) | 0.076 |
| petkov (former PM of Bulgaria) | 0.054 |
| kiril (Kiril Petkov, former PM of Bulgaria) | 0.054 |
| cska (football club) | 0.052 |
| radev (President of Bulgaria) | 0.045 |
| asen (Asen Vasilev, former Minister of Finance of Bulgaria) | 0.042 |
| vasilev (former Minister of Finance of Bulgaria) | 0.042 |
| rumen (Rumen Radev, President of Bulgaria) | 0.038 |
| boyko (Boyko Borisov, former PM of Bulgaria) | 0.037 |
| kostadinov (Bulgarian politician) | 0.033 |
| Channel | Dominant Keyword Themes | Observed Thumbnail Strategy | Added Value |
|---|---|---|---|
| Bulgarian News Agency | Institutional Processes, Formal Politics | Formal, objective imagery depicting institutional symbols. Clean, professional aesthetic with minimal text overlays. Schematic: [wide shot of parliament hall + Agency logo] | Reinforces brand authority and objectivity. Provides context of scale and officiality. Appeals to audience seeking formal information. |
| BGNES Agency | Domestic and Economic Policy, International Relations | Conceptual, collage-style images juxtaposing a key person with a visual symbol of the topic. Schematic: [Politician’s photo + Euro symbol overlay] | Visualizes complex or abstract topics. Makes content more accessible. Directly links figures to specific issues. |
| Blitz News | Political Confrontation, Sports Figures | Emotional, close-up shots of key figures displaying strong emotions. Frequent use of large, provocative text overlays. Schematic: [Close-up of politician’s angry face + Red border + “SCANDAL!” in yellow text] | Dramatizes news into personal conflicts. Maximizes emotional engagement. Creates strong CTR appeal. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Milev, P.H.; Tabov, Y.N. From Headlines to Thumbnails: Comparative Analysis of Web Publications in Bulgarian Digital Media and YouTube. Journal. Media 2025, 6, 202. https://doi.org/10.3390/journalmedia6040202
Milev PH, Tabov YN. From Headlines to Thumbnails: Comparative Analysis of Web Publications in Bulgarian Digital Media and YouTube. Journalism and Media. 2025; 6(4):202. https://doi.org/10.3390/journalmedia6040202
Chicago/Turabian StyleMilev, Plamen Hristov, and Yavor Nikolov Tabov. 2025. "From Headlines to Thumbnails: Comparative Analysis of Web Publications in Bulgarian Digital Media and YouTube" Journalism and Media 6, no. 4: 202. https://doi.org/10.3390/journalmedia6040202
APA StyleMilev, P. H., & Tabov, Y. N. (2025). From Headlines to Thumbnails: Comparative Analysis of Web Publications in Bulgarian Digital Media and YouTube. Journalism and Media, 6(4), 202. https://doi.org/10.3390/journalmedia6040202

