Next Article in Journal
Special Issue on the Curative Power of Medical Data
Previous Article in Journal
CaosDB—Research Data Management for Complex, Changing, and Automated Research Workflows
Previous Article in Special Issue
Health Care, Medical Insurance, and Economic Destitution: A Dataset of 1042 Stories
Article Menu
Issue 2 (June) cover image

Export Article

Open AccessArticle

Pedagogical Demonstration of Twitter Data Analysis: A Case Study of World AIDS Day, 2014

1
Department of Biostatistics, Epidemiology and Environmental Health Sciences, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro, GA 30460, USA
2
School of Journalism and Communication, Chinese University of Hong Kong, Hong Kong Special Administrative Region, China
3
Journalism and Media Studies Centre, The University of Hong Kong, HongKong, China
4
School of Electrical and Computer Engineering, College of Engineering, The University of Georgia, Athens, GA 30602, USA
5
College of Community Innovation and Education, The University of Central Florida, Orlando, FL 32816, USA
*
Author to whom correspondence should be addressed.
Received: 7 May 2019 / Revised: 24 May 2019 / Accepted: 5 June 2019 / Published: 10 June 2019
(This article belongs to the Special Issue Big Data and Digital Health)
  |  
PDF [242 KB, uploaded 12 June 2019]

Abstract

As a pedagogical demonstration of Twitter data analysis, a case study of HIV/AIDS-related tweets around World AIDS Day, 2014, was presented. This study examined if Twitter users from countries with various income levels responded differently to World AIDS Day. The performance of support vector machine (SVM) models as classifiers of relevant tweets was evaluated. A manual coding of 1,826 randomly sampled HIV/AIDS-related original tweets from November 30 through December 2, 2014 was completed. Logistic regression was applied to analyze the association between the World Bank-designated income level of users’ self-reported countries and Twitter contents. To identify the optimal SVM model, 1278 (70%) of the 1826 sampled tweets were randomly selected as the training set, and 548 (30%) served as the test set. Another 180 tweets were separately sampled and coded as the held-out dataset. Compared with tweets from low-income countries, tweets from the Organization for Economic Cooperation and Development countries had 60% lower odds to mention epidemiology (adjusted odds ratio, aOR = 0.404; 95% CI: 0.166, 0.981) and three times the odds to mention compassion/support (aOR = 3.080; 95% CI: 1.179, 8.047). Tweets from lower-middle-income countries had 79% lower odds than tweets from low-income countries to mention HIV-affected sub-populations (aOR = 0.213; 95% CI: 0.068, 0.664). The optimal SVM model was able to identify relevant tweets from the held-out dataset of 180 tweets with an accuracy (F1 score) of 0.72. This study demonstrated how students can be taught to analyze Twitter data using manual coding, regression models, and SVM models. View Full-Text
Keywords: global health; health promotion; HIV/AIDS; social media; supervised machine learning; Twitter global health; health promotion; HIV/AIDS; social media; supervised machine learning; Twitter
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Supplementary material

SciFeed

Share & Cite This Article

MDPI and ACS Style

Fung, I.C.-H.; Yin, J.; Pressley, K.D.; Duke, C.H.; Mo, C.; Liang, H.; Fu, K.-W.; Tse, Z.T.H.; Hou, S.-I. Pedagogical Demonstration of Twitter Data Analysis: A Case Study of World AIDS Day, 2014. Data 2019, 4, 84.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Data EISSN 2306-5729 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top