You are currently viewing a new version of our website. To view the old version click .
Foods
  • Article
  • Open Access

14 January 2025

Foodborne Event Detection Based on Social Media Mining: A Systematic Review

,
,
,
and
Unit of Biostatistics, Epidemiology and Public Health, Department of Cardiac, Thoracic, and Vascular Sciences, University of Padova, via Loredan, 18, 35121 Padova, Italy
*
Authors to whom correspondence should be addressed.
This article belongs to the Section Food Microbiology

Abstract

Foodborne illnesses represent a significant global health challenge, causing substantial morbidity and mortality. Conventional surveillance methods, such as laboratory-based reporting and physician notifications, often fail to enable early detection, prompting the exploration of innovative solutions. Social media platforms, combined with machine learning (ML), offer new opportunities for real-time monitoring and outbreak analysis. This systematic review evaluated the role of social networks in detecting and managing foodborne illnesses, particularly through the use of ML techniques to identify unreported events and enhance outbreak response. This review analyzed studies published up to December 2024 that utilized social media data and data mining to predict and prevent foodborne diseases. A comprehensive search was conducted across PubMed, EMBASE, CINAHL, Arxiv, Scopus, and Web of Science databases, excluding clinical trials, case reports, and reviews. Two independent reviewers screened studies using Covidence, with a third resolving conflicts. Study variables included social media platforms, ML techniques (shallow and deep learning), and model performance, with a risk of bias assessed using the PROBAST tool. The results highlighted Twitter and Yelp as primary data sources, with shallow learning models dominating the field. Many studies were identified as having high or unclear risk of bias. This review underscored the potential of social media and ML in foodborne disease surveillance and emphasizes the need for standardized methodologies and further exploration of deep learning models.

1. Introduction

Foodborne illnesses represent a significant and preventable public health challenge that affects millions of individuals worldwide every year [1]. The consequences of these diseases range from mild discomfort to severe health complications, including hospitalization and even death [2]. The latest statistics on foodborne diseases, worldwide and in Europe, highlight a significant health burden. Globally, nearly 1 in 10 people fall ill each year because of eating contaminated food, resulting in more than 420,000 deaths. Children under the age of 5 are disproportionately affected, accounting for 125,000 deaths annually [3]. The World Health Organization (WHO) also estimated that each year, unsafe food causes 600 million cases of foodborne diseases globally [4]. In Europe, this phenomenon is also of great concern. The European Food Safety Authority (EFSA) reported that, in 2021, there were 4005 foodborne outbreaks in the EU, representing a 29.8% increase compared to 2020 [5]. A more detailed report from EFSA and the European Center for Disease Prevention and Control (ECDC) noted that campylobacteriosis and salmonellosis were the most reported zoonoses (diseases transmitted from animals to humans) in 2021. Notably, more foodborne outbreaks and cases were documented in 2021 than in 2020 in the EU. Salmonella Enteritidis is the most frequently reported in these outbreaks [6].
In the United States, the Centers for Disease Control and Prevention (CDC) estimates that foodborne diseases make 48 million people fall ill, hospitalize 128,000, and cause 3000 deaths each year [7]. In 2020, 299 outbreaks of foodborne diseases were reported, causing 5987 illnesses, 641 hospitalizations, and 14 deaths [8]. These statistics underscore the ongoing challenges in ensuring food safety and reducing the incidence of foodborne diseases worldwide. Traditional methods of detecting and responding to foodborne illnesses often rely on formal reporting systems, laboratory tests, and periodic inspections, which may be limited in scope, timeliness, and effectiveness [9,10,11]. Traditional methods of detecting and responding to foodborne illnesses offer advantages over social media data, including greater reliability, validated laboratory-confirmed diagnoses, standardized reporting systems, and higher accuracy, as they are less prone to misinformation and bias often found in unverified social media sources [11].
In recent years, integrating digital platforms, social networks, and sophisticated computational methods has revolutionized public health monitoring and management in various areas [12], not just tracking foodborne illnesses. In disease surveillance, social networks, mobile applications, and online platforms have significantly impacted the real-time monitoring and follow-up of COVID-19 cases [13]. They facilitate rapid dissemination of information, enabling authorities and the public to respond quickly to evolving situations [14]. Although the Google Flu Trends platform no longer exists, it attempted to use search query data to monitor flu activity, showcasing the potential for digital surveillance [15]. AI has been used to predict infectious disease dynamics and the effects of interventions. By analyzing data trends, AI can predict how diseases might spread under different circumstances, helping to prepare and plan for outbreaks [16]. While social media and AI-driven approaches offer significant potential for detecting foodborne illnesses, they also come with notable disadvantages. Data derived from social networks can be unreliable and prone to misinformation, as posts may reflect personal anecdotes, rumors, or panic rather than verified cases. A supposed outbreak reported on social media may be exaggerated or entirely fabricated, making it difficult to distinguish real events from false alarms [17,18].
These novel approaches can revolutionize the detection of outbreaks, understanding of public responses, and implementation of preventive measures [19].
This systematic review aimed to explore and synthesize the current literature on using ML and other innovative technologies to detect and manage foodborne illnesses using social media or online review platforms, where users can leave a star rating and a detailed review of a business. In particular, this review aimed to assess the effectiveness of social media analysis by investigating how platforms like Twitter and Yelp are used to identify unreported foodborne events, extract relevant information, and analyze potential outbreaks. It also examined the development and application of machine learning models to perform real-time detection, prediction, and response to foodborne illness incidents. It analyzes various online tools, algorithms, and frameworks to enhance traditional surveillance systems, including graph neural networks (GNNs), text-mining (TM), and natural language processing (NLP). The work can hint at how the public reacts to foodborne outbreaks, including their concerns, behaviors, and interactions on social media platforms.
This study aimed to explore the various methods for predicting foodborne diseases using social media data.

2. Materials and Methods

Inclusion criteria encompass all studies dedicated to preventing foodborne diseases through social networks, focusing on those that use data mining techniques for preventive measures. On the contrary, clinical trials, case reports, case series, and reviews fall under exclusion criteria. Furthermore, exclusions extend to studies that do not leverage social media data or employ data mining techniques.

2.1. Information Sources and Search Strategy

MEDLINE was searched through PubMed, EMBASE, CINAHL, Web of Science, Arxiv, and Scopus. There was no other restriction in the search string or date. The last search was performed on 15 December 2024.

2.2. Selection Process

Two independent reviewers screened the retrieved articles using Covidence systematic review software, Veritas Health Innovation, Melbourne, Australia (available at www.covidence.org) [20]. Discrepancies were resolved by consensus or consultation with a third reviewer.

2.3. Data Extraction

Article characteristics are presented as means or percentages based on variable types. The following variables were collected during the research process: author, year, type of study, social media platform, country in which the study was conducted, language analyzed, structure involved, purpose of the survey, results, use of machine learning (ML), ML models tested, best ML model, best performance, and value (%). These parameters were systematically collected to evaluate and analyze the studies considered. We employed a dual categorization to evaluate the use of machine learning algorithms, dividing them into ‘shallow’ and ‘deep’ learning methods. Shallow learning, often referred to as traditional ML, involves algorithms that function with minimal layers of computation. These methods are characterized by their simplicity and efficiency in handling structured and rectangular data. Shallow learning algorithms typically include logistic regression, support vector machines, decision trees, and k-nearest neighbors. In contrast, deep learning (DL) consists of a subset of machine learning techniques (MLTs) that involve complex neural networks with multiple layers of processing and abstraction. DL methods, such as convolutional neural networks (CNNs) and recurrent neural networks (RNNs), are known for their proficiency in handling unstructured data such as images, sound, raw signals, and text.

2.4. Assessment of Risk of Bias

The Prediction Model Risk of Bias (RoB) Assessment Tool (PROBAST) was employed to determine the RoB in the included studies. PROBAST is designed to assess RoB in diagnostic and prognostic prediction model studies and comprises four domains—participants, predictors, outcomes, and analysis—covering 20 signaling questions (Qs) [21]. A domain was judged to have a “low RoB” if all signaling questions were answered as ‘Yes’ (Y) or ‘Probably Yes’ (PY). On the other hand, a response of ‘No’ (N) or ‘Probably No’ (PN) to one or more questions within a domain denoted a potential for bias. A response of ‘No Information’ (NI) indicated insufficient information to assess the risk of bias in that domain. The PROBAST evaluation was conducted independently by two reviewers (AC and SS), and disagreements were resolved through discussion or consultation with a third reviewer, as necessary.

3. Results

3.1. Characteristics of the Studies Included

This systematic review covered studies conducted over a decade, from 2012 to 2024. Figure S1 reports the PRISMA flowchart of the article screening process. Table 1 reports the list and main characteristics of the articles included in the review. Articles are the preferred type of communication, comprising 76.92% of publications (24 out of 31) [22,23,24,25], 27 (p. 201), [26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45].
Table 1. Overview of research publications analyzing social media platforms across various countries and languages from 2012 to 2024 and involved structures.
In terms of the platforms used to conduct research, Twitter is the most explored, being the focal point in 1.94% of the studies (13 out of 31) [22,25,31,35,38,39,44,45,46,47,50,51,52]. Yelp also attracts attention, being crucial in 22.58% of research efforts (7 out of 31) [23,26,27,30,33,48,49]. Other platforms, such as Weibo and Facebook, represent a smaller portion of the studies, with Weibo being a key component in 9.67% of the investigations (3 out of 31) [28,32,36], Facebook being investigated in 6.45% of the studies (2 out of 31) [37,40], and feedback data from agencies like the Singapore Food Agency’s CRMS contributing to 3.22% of studies (1 out of 31) [41].
Geographically, the USA stands out as the main focus, with 23 studies conducted there [23,24,25,26,27,29,30,31,33,34,35,38,39,40,42,43,44,46,48,49,50,51]. The European Union (EU) and China are also subjects of exploration, with the EU representing 9.67% of the studies (3 out of 31) [22,28,32] and China accounting for 6.45% (2 out of 31) [36,37]. Additionally, Singapore and Australia contributed new geographic diversity with one study each (3.22%) [41,45].
In terms of linguistic focus, English emerged as the dominant language analyzed, enveloping a considerable 90.2% of the studies (26 out of 31) [23,24,25,26,27,29,30,31,32,33,34,35,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52]. Other languages or combinations, such as English and German or exclusively Chinese and Italian, are notably less prevalent, each contributing to 3.23%% of the studies (1 out of 31) [22,32,37].

3.2. Characteristics of the Settings Considered

Examining the public settings involved in the food chain analyzed through social media platforms, restaurants emerged as a prominent point of interest, being integral in 90.48% of the studies (19 out of 31) [22,32,37].
Alternatives and combinations of other structures, such as homes, supermarkets, mass gatherings, trips, soccer games, and online retail platforms, are explored sporadically, each representing approximately 4.76% of the studies (1 out of 21) [17,20,29]. All these data are summarized in Table 1.

3.3. Keywords Used

Analyzing the 31 studies, 19 (61.29%) [23,25,26,27,30,32,33,34,35,36,37,38,39,42,43,44,45,47,48] reported keywords used to filter data related to food safety and health (Table 2).
Table 2. List of keywords found in the analyzed papers, grouped by categories.
Keywords such as “food poisoning” and “vomit” were recurrent, appearing in seven instances [27,30,33,38,39,43,44,47,48]. Other frequently used terms include “sick”, “illness”, “stomachache”, “diarrhea”, “nausea”, and “puke”, as highlighted in these studies [42,43,44,45]. Hashtags such as # foodsecurity, # foodinsecure, and # foodequity were also notable in studies exploring food security and related themes [45].
Co-occurrence analysis revealed frequent keyword pairings, such as (“vomit”, and “diarrhoea”) [30,33,38,43,44,48,52], and (“sick”, “vomit”) [27,30,42,48]. Notably, terms like “food poisoning” and “illness” emerged prominently in studies, while additional keywords such as “stomachache,” “puke,” and “nausea” were also observed. Hashtags, including # foodsecurity, # foodinsecure, and # foodequity, featured significantly in studies exploring food security themes [45]. Methodological analysis showed varied approaches in keyword generation, ranging from using pre-defined lists [23,25,36,38,40] to computational methodologies, such as ML or frequency-based selections [31,33,34,44,52]. Some methods also incorporated manual validation by experts [38]. Some methods also incorporated manual validation by experts (Table 3).
Table 3. Evaluation of machine learning models in the analyzed studies, highlighting performance metrics and best achieved results.

3.4. Machine Learning Techniques Applied

Among the studies explored, a discernible inclination toward the utilization of ML is evident, with a total of 26 studies employing ML techniques [22,24,25,26,27,28,29,30,31,32,33,34,36,38,39,40,41,42,43,44,45,46,47,49,50,52], as opposed to 5 that opted not to [23,35,37,48,51]. Delving deeper into the ML approaches adopted, shallow learning models were more prevalently utilized, being featured in 16 studies [22,24,25,26,27,28,29,30,32,33,40,42,45,47,49,50], whereas DL models appeared in 9 studies [31,34,36,38,39,41,43,44,52]. On the deep learning front, techniques like BERT, DistilBERT, XLNet [41], and EGAL [44] were notable. The authors of [43] introduced a novel approach combining pretrained BERT models (BERTweet, RoBERTa, BiLSTM, and MGADE), while latent Dirichlet allocation (LDA) was applied for topic modeling [43]. Support vector machines (SVMs) emerged as the most frequently utilized model, being the approach of choice in five studies [22,24,28,40,50]. This perhaps underscores its renowned versatility and efficacy in tackling various classification problems. An overview of performance metrics reveals a notable divergence in reporting practices across studies. F1 Score, a harmonic mean of precision and recall, surfaced as the most frequently reported metric, being cited in eight studies [24,29,30,31,34,38,39,49]. This suggests a tendency to evaluate models based on their ability to balance type I and type II errors once a threshold was selected, especially pertinent in imbalanced datasets. However, it is crucial to highlight the substantial variability in the performance metrics reported. Metrics like accuracy and AUC (area under curve) illustrate a broader range of values, ranging from 66.4% [47] to 92.0% [49] and 93.0% [26] to 98.0% [30], respectively (Table 2).

3.5. Risk of Bias Assessment

The detailed RoB assessment for each included study is compiled in Table 4 and Table S2. Overall, as depicted in Table 4, Domain 1—Participants—saw a significant proportion of studies (15) categorized as having a “High RoB”, while a substantial number (15) remained ambiguous, falling under the “Unclear RoB” category. In Domain 2 Predictors, because Q2 was predominantly (77%) characterized by PN responses, the majority (20) of the studies were identified with a “High RoB”, with only one research, was considered to have a “Low RoB”. A similar assessment was reported in Domain 3—Outcome—with 21 instances marked as “High Risk”. Importantly, Domain 4—Analysis—was evaluated as “Unclear RoB” in 23 scrutinized publications. In this case, a prevalent trend of NI responses was observed in multiple questions.
Table 4. Overall judgment of risk of bias for each domain.

4. Discussion

This systematic review explored the effectiveness of social networks and ML in detecting foodborne diseases. This review, which spans a decade of studies, demonstrated the central role of platforms such as Twitter and Yelp in using user-generated content for epidemiological research, with a significant focus on the English language and the US region. This could indicate various factors, such as data availability, technological infrastructure, or specific socio-cultural phenomena. Comparatively, those conducted by [53] also underscored the importance of social media in tracking infectious diseases or health trends. This similarity reinforces the potential of social networks as a valuable tool in public health monitoring and disease surveillance. It is important to emphasize that this approach serves as a complementary tool rather than a replacement for traditional surveillance methods. When used with great care, rigorously validated, and integrated with assessments by health authorities, social networks and machine learning can significantly enhance public health monitoring and improve the detection and response to foodborne disease outbreaks.
Despite the rapid advances in DL, our review indicated a noticeable preference for shallow ML models. Shallow learning models, while efficient and interpretable, may lack the capacity to analyze highly complex and unstructured data such as noisy social media posts [54]. This preference may arise from the desire for simpler, more interpretable models, as seen in previous studies [55]. Deep learning models (e.g., BERT and CNN) showed promising performance but remained underutilized due to computational challenges and dataset limitations. This is consistent with the findings of [56], where dataset limitations influenced model choice. This presents an exciting opportunity for further research, providing more profound insights into the prevailing practices and tendencies in applying ML in various studies.
Performance metrics were inconsistently reported across studies, with significant variability (e.g., accuracy ranging from 66.4% to 98%). F1 Score was the most frequently reported metric but was not universally applied. This inconsistency hinders cross-study comparisons and meta-analysis. A standardized approach for reporting machine learning performance metrics, including precision, recall, and F1 Score, is essential to ensure comparability. The variability may also reflect imbalanced datasets, where certain outcomes (e.g., disease vs. healthy) are over-represented, potentially leading to biased results.
The last couple of years have seen an increase in the application of Large Language Models (LLMs) for disease outbreaks. However, the use for detecting foodborne illnesses on social media has not been identified. For instance, it has been demonstrated that GPT prompting can effectively analyze social media posts to detect potential conjunctivitis outbreaks with accuracy comparable to human analysis [57], and also help determine the severity and prevalence of COVID-19 [58]. Moreover, GPT models and their open-source counterparts can be leveraged to quickly and efficiently understand public sentiments, such as those surrounding online discussions about vaccination [59]. Therefore, LLMs appear to be valuable tools for public health monitoring.
In this systematic review, the retrieved studies predominantly focus on the USA (23 studies), with far fewer studies conducted in Asia (e.g., China, India, and Japan) or other regions such as Europe and Australia. Surprisingly, specific studies detailing the use of social media platforms for this purpose in countries like India and Japan were not identified. This gap is notable given the significant application of digital technologies in the public health sectors of these countries. For instance, in India, modern technologies such as the internet and mobile phones are critical in bridging accessibility gaps in public health systems, particularly in rural areas facing significant health equity challenges. Similarly, Japan’s Society 5.0 initiative provides a robust framework for integrating advanced technologies like artificial intelligence, big data, and cloud computing to address various societal challenges, including public health [60]. Interestingly, we only identified one study set in Australia [45]. Nonetheless, in Australia, the past decade has witnessed a significant transformation in the healthcare sector with a rapid shift towards virtual healthcare services, illustrating a broader trend of digital adoption in public health [61]. This gap highlights a potential area for further research into how social media could be utilized for foodborne disease surveillance in these countries, building on their existing digital health infrastructure.
Our findings indicated that Twitter and Yelp dominate the current body of literature, collectively accounting for approximately 66% of the studies reviewed. This predominance is likely due to the public availability of data on these platforms, the structured nature of Yelp reviews, and the extensive use of Twitter for real-time communication. However, the future of academic engagement on Twitter appears uncertain, particularly following the shutdown of the academic research API, which significantly hindered scholars’ ability to access and analyze data on the platform. This change, along with Musk’s unscientific use of Twitter’s “polls” feature, his promotion of conspiracy theories about US elected officials, and his puerile demeanor, collectively contributed to an environment that many academics found increasingly unpalatable, leading them to either quit Twitter altogether or reduce their engagement with the platform [62].
In addition, this focus on a limited number of platforms may introduce a source bias, potentially overlooking valuable insights from alternative platforms, such as Facebook, Reddit, Instagram, or region-specific digital tools. It is important to recognize that different platforms may capture unique user behaviors and reporting patterns, particularly in under-represented regions. Therefore, future research should expand the scope of analysis to include diverse data sources, integrating alternative social media platforms and other digital tools to ensure broader and more equitable foodborne disease surveillance.
In recent years, real-time data acquisition technologies, such as those facilitated by the Internet of Things (IoT), have emerged as a complementary approach to social media-based foodborne disease detection. IoT devices in food factories and along supply chains allow for the continuous monitoring of critical parameters, such as temperature, humidity, and contamination levels, enabling the early detection of potential risks at the source [63]. While our review focused on user-generated content from social media platforms, the integration of real-time IoT data with social media mining could significantly enhance surveillance systems. This combined approach holds promise for providing a more comprehensive and immediate response to food safety threats, particularly in production environments where contamination risks are high. Future studies should explore the synergy between IoT-based monitoring and machine learning models applied to digital platforms to advance the field of foodborne disease surveillance.
Most studies (84.2%) focused on restaurants as the primary setting for foodborne disease surveillance. Other settings like homes, supermarkets, or mass gatherings were rarely considered. The heavy focus on restaurants may overlook other critical environments where foodborne diseases originate, such as households, schools, or supply chains.

Limitations

This systematic review indicated a conspicuous sparsity in reporting specific performance metrics in all studies. This hampers the comprehensive evaluation of ML models and poses challenges in conducting meta-analyses, which could be crucial in synthesizing findings and forging advancements in the field. This highlights a potential opportunity for improvement in future research, with the implementation of a standardized protocol for reporting various performance metrics, enhancing the comprehensiveness and comparability of studies within the field, as suggested by [36].
Our comprehensive assessment using the PROBAST tool also revealed significant concerns about the risk of bias in the 26 studies reviewed. Most studies were evaluated as having a high RoB for three (Participants, Predictors, Outcome) out of four domains. This aligns with the findings of [46], suggesting that high RoB is common in published clinical prediction models and is associated with poor discriminative performance. The last domain, that is, analysis, was evaluated as an unclear RoB for most studies. The limited representation of “Low RoB” evaluations, particularly for Domains 1, 3, and 4, may underline the need for more rigorous methodologies in future research.
The current focus on a limited number of platforms, such as Twitter and Yelp, may introduce source bias and overlook valuable insights from alternatives like Facebook, Reddit, Instagram, or region-specific tools. Expanding the scope to include diverse data sources, as well as integrating real-time IoT-based monitoring systems, could significantly improve foodborne disease surveillance.
Finally, the predominant emphasis on restaurant settings suggests a gap in addressing other critical environments where foodborne illnesses may originate, such as homes, schools, supermarkets, and supply chains. Addressing these limitations through interdisciplinary approaches that combine advanced ML methods, diverse data streams, and rigorous validation will be essential for advancing foodborne disease surveillance and enhancing public health outcomes.

5. Conclusions

Summarizing a decade of research highlights the utility of social media platforms, particularly Twitter and Yelp, for improving epidemiological research and public health surveillance of foodborne events. ML models, in particular shallow models, have been instrumental in deriving insights from user-generated content. However, future research efforts could benefit significantly from adopting standardized reporting protocols for ML model performance and exploring the potential of DL models in contexts where sufficient data is available. Ensuring that methods evolve as technology advances will be critical to maintaining the effectiveness of social media mining in detecting food-related events.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/foods14020239/s1, Figure S1. PRISMA Flowchart. Table S1. Search strategy for PubMed, last run 15 December 2024. Table S2. Risk of Bias evaluation for each domain using PROBAST scale. Refs. [22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52] are cited in Supplementary Materials.

Author Contributions

Conceptualization, D.G. and C.L.; methodology, H.O.; software, H.O.; formal analysis, S.S.; resources, D.G.; data curation, S.S. and A.C.; writing—original draft preparation, S.S.; writing—review and editing, H.O., C.L. and A.C.; visualization, S.S. and A.C.; supervision, D.G.; project administration, D.G.; funding acquisition, D.G. All authors have read and agreed to the published version of the manuscript.

Funding

The research project was financed with the allocated fund from the PON “Research and Innovation” 2014–2020, Action IV.6 “Research contracts on Green topics”.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Pires, S.M.; Desta, B.N.; Mughini-Gras, L.; Mmbaga, B.T.; Fayemi, O.E.; Salvador, E.M.; Gobena, T.; Majowicz, S.E.; Hald, T.; Hoejskov, P.S.; et al. Burden of Foodborne Diseases: Think Global, Act Local. Curr. Opin. Food Sci. 2021, 39, 152–159. [Google Scholar] [CrossRef] [PubMed]
  2. Lee, H.; Yoon, Y. Etiological Agents Implicated in Foodborne Illness World Wide. Food Sci. Anim. Resour. 2021, 41, 1–7. [Google Scholar] [CrossRef] [PubMed]
  3. Foodborne Diseases. Available online: https://www.who.int/health-topics/foodborne-diseases (accessed on 16 October 2023).
  4. Estimating the Burden of Foodborne Diseases. Available online: https://www.who.int/activities/estimating-the-burden-of-foodborne-diseases (accessed on 16 October 2023).
  5. Monitoring of Foodborne Diseases | EFSA. Available online: https://www.efsa.europa.eu/en/topics/topic/monitoring-foodborne-diseases (accessed on 16 October 2023).
  6. The European Union One Health 2021 Zoonoses Report | EFSA. Available online: https://www.efsa.europa.eu/en/efsajournal/pub/7666 (accessed on 16 October 2023).
  7. Estimates of Foodborne Illness in the United States | Estimates of Foodborne Illness | CDC. Available online: https://www.cdc.gov/foodborneburden/index.html (accessed on 16 October 2023).
  8. Zheng, Y.; Gracia, A.; Hu, L. Predicting Foodborne Disease Outbreaks with Food Safety Certifications: Econometric and Machine Learning Analyses. J. Food Prot. 2023, 86, 100136. [Google Scholar] [CrossRef]
  9. López-Campos, G.; Martínez-Suárez, J.V.; Aguado-Urda, M.; López-Alonso, V. Detection, Identification, and Analysis of Foodborne Pathogens. In Microarray Detection and Characterization of Bacterial Foodborne Pathogens; López-Campos, G., Martínez-Suárez, J.V., Aguado-Urda, M., López-Alonso, V., Eds.; SpringerBriefs in Food, Health, and Nutrition; Springer: Boston, MA, USA, 2012; pp. 13–32. ISBN 978-1-4614-3250-0. [Google Scholar]
  10. Quintela, I.A.; Vasse, T.; Lin, C.-S.; Wu, V.C.H. Advances, Applications, and Limitations of Portable and Rapid Detection Technologies for Routinely Encountered Foodborne Pathogens. Front. Microbiol. 2022, 13, 1054782. [Google Scholar] [CrossRef]
  11. Oldroyd, R.A.; Morris, M.A.; Birkin, M. Identifying Methods for Monitoring Foodborne Illness: Review of Existing Public Health Surveillance Techniques. JMIR Public Health Surveill. 2018, 4, e57. [Google Scholar] [CrossRef] [PubMed]
  12. Zatsu, V.; Shine, A.E.; Tharakan, J.M.; Peter, D.; Ranganathan, T.V.; Alotaibi, S.S.; Mugabi, R.; Muhsinah, A.B.; Waseem, M.; Nayik, G.A. Revolutionizing the Food Industry: The Transformative Power of Artificial Intelligence—A Review. Food Chem. X 2024, 24, 101867. [Google Scholar] [CrossRef] [PubMed]
  13. Fallatah, D.I.; Adekola, H.A. Digital Epidemiology: Harnessing Big Data for Early Detection and Monitoring of Viral Outbreaks. Infect. Prev. Pract. 2024, 6, 100382. [Google Scholar] [CrossRef] [PubMed]
  14. Budd, J.; Miller, B.S.; Manning, E.M.; Lampos, V.; Zhuang, M.; Edelstein, M.; Rees, G.; Emery, V.C.; Stevens, M.M.; Keegan, N.; et al. Digital Technologies in the Public-Health Response to COVID-19. Nat. Med. 2020, 26, 1183–1192. [Google Scholar] [CrossRef] [PubMed]
  15. Ginsberg, J.; Mohebbi, M.H.; Patel, R.S.; Brammer, L.; Smolinski, M.S.; Brilliant, L. Detecting Influenza Epidemics Using Search Engine Query Data. Nature 2009, 457, 1012–1014. [Google Scholar] [CrossRef] [PubMed]
  16. Syrowatka, A.; Kuznetsova, M.; Alsubai, A.; Beckman, A.L.; Bain, P.A.; Craig, K.J.T.; Hu, J.; Jackson, G.P.; Rhee, K.; Bates, D.W. Leveraging Artificial Intelligence for Pandemic Preparedness and Response: A Scoping Review to Identify Key Use Cases. NPJ Digit. Med. 2021, 4, 96. [Google Scholar] [CrossRef]
  17. Chapman, B.; Raymond, B.; Powell, D. Potential of Social Media as a Tool to Combat Foodborne Illness. Perspect. Public Health 2014, 134, 225–230. [Google Scholar] [CrossRef] [PubMed]
  18. Duzen, Z.; Riveni, M.; Aktas, M.S. Analyzing the Spread of Misinformation on Social Networks: A Process and Software Architecture for Detection and Analysis. Computers 2023, 12, 232. [Google Scholar] [CrossRef]
  19. Gijsen, V.; Maddux, M.; Lavertu, A.; Gonzalez-Hernandez, G.; Ram, N.; Reeves, B.; Robinson, T.; Ziesenitz, V.; Shakhnovich, V.; Altman, R. #Science: The Potential and the Challenges of Utilizing Social Media and Other Electronic Communication Platforms in Health Care. Clin. Transl. Sci. 2020, 13, 26–30. [Google Scholar] [CrossRef] [PubMed]
  20. Covidence—Better Systematic Review Management. Available online: https://www.covidence.org/ (accessed on 19 January 2024).
  21. Moons, K.G.M.; Wolff, R.F.; Riley, R.D.; Whiting, P.F.; Westwood, M.; Collins, G.S.; Reitsma, J.B.; Kleijnen, J.; Mallett, S. PROBAST: A Tool to Assess Risk of Bias and Applicability of Prediction Model Studies: Explanation and Elaboration. Ann. Intern. Med. 2019, 170, W1–W33. [Google Scholar] [CrossRef] [PubMed]
  22. Denecke, K.; Krieck, M.; Otrusina, L.; Smrz, P.; Dolog, P.; Nejdl, W.; Velasco, E. How to Exploit Twitter for Public Health Monitoring? Methods Inf. Med. 2013, 52, 326–339. [Google Scholar] [CrossRef] [PubMed]
  23. Nsoesie, E.O.; Kluberg, S.A.; Brownstein, J.S. Online Reports of Foodborne Illness Capture Foods Implicated in Official Foodborne Outbreak Reports. Prev. Med. 2014, 67, 264–269. [Google Scholar] [CrossRef]
  24. Kate, K.; Negi, S.; Kalagnanam, J. Monitoring Food Safety Violation Reports from Internet Forums. In e-Health—For Continuity of Care; IOS Press: Amsterdam, The Netherlands, 2014; pp. 1090–1094. [Google Scholar]
  25. Widener, M.J.; Li, W. Using Geolocated Twitter Data to Monitor the Prevalence of Healthy and Unhealthy Food References across the US. Appl. Geogr. 2014, 54, 189–197. [Google Scholar] [CrossRef]
  26. Schomberg, J.P.; Haimson, O.L.; Hayes, G.R.; Anton-Culver, H. Supplementing Public Health Inspection via Social Media. PLoS ONE 2016, 11, e0152117. [Google Scholar] [CrossRef]
  27. Wang, Z.; Balasubramani, B.S.; Cruz, I.F. Predictive Analytics Using Text Classification for Restaurant Inspections. In Proceedings of the 3rd ACM SIGSPATIAL Workshop on Smart Cities and Urban Analytics, Redondo Beach, CA, USA, 7–10 November 2017; Association for Computing Machinery: New York, NY, USA, 2017; pp. 1–4. [Google Scholar]
  28. Cui, W.; Wang, P.; Du, Y.; Chen, X.; Guo, D.; Li, J.; Zhou, Y. An Algorithm for Event Detection Based on Social Media Data. Neurocomputing 2017, 254, 53–58. [Google Scholar] [CrossRef]
  29. Sadilek, A.; Caty, S.; DiPrete, L.; Mansour, R.; Schenk, T.; Bergtholdt, M.; Jha, A.; Ramaswami, P.; Gabrilovich, E. Machine-Learned Epidemiology: Real-Time Detection of Foodborne Illness at Scale. NPJ Digit. Med. 2018, 1, 36. [Google Scholar] [CrossRef] [PubMed]
  30. Effland, T.; Lawson, A.; Balter, S.; Devinney, K.; Reddy, V.; Waechter, H.; Gravano, L.; Hsu, D. Discovering Foodborne Illness in Online Restaurant Reviews. J. Am. Med. Inform. Assoc. 2018, 25, 1586–1592. [Google Scholar] [CrossRef]
  31. Șerban, O.; Thapen, N.; Maginnis, B.; Hankin, C.; Foot, V. Real-Time Processing of Social Media with SENTINEL: A Syndromic Surveillance System Incorporating Deep Learning for Health Classification. Inf. Process. Manag. 2019, 56, 1166–1184. [Google Scholar] [CrossRef]
  32. Zhang, M.; Guo, D.; Hu, J.; Jin, W. Risk Prediction and Assessment of Foodborne Disease Based on Big Data. In Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management, Chicago, IL, USA, 5 November 2019; Association for Computing Machinery: New York, NY, USA, 2020; pp. 1–6. [Google Scholar]
  33. Mejia, J.; Mankad, S.; Gopal, A. A for Effort? Using the Crowd to Identify Moral Hazard in New York City Restaurant Hygiene Inspections. Inf. Syst. Res. 2019, 30, 1363–1386. [Google Scholar] [CrossRef]
  34. Maharana, A.; Cai, K.; Hellerstein, J.; Hswen, Y.; Munsell, M.; Staneva, V.; Verma, M.; Vint, C.; Wijaya, D.; Nsoesie, E.O. Detecting Reports of Unsafe Foods in Consumer Product Reviews. JAMIA Open 2019, 2, 330–338. [Google Scholar] [CrossRef] [PubMed]
  35. Glowacki, E.M.; Glowacki, J.B.; Chung, A.D.; Wilcox, G.B. Reactions to Foodborne Escherichia Coli Outbreaks: A Text-Mining Analysis of the Public’s Response. Am. J. Infect. Control 2019, 47, 1280–1282. [Google Scholar] [CrossRef]
  36. Gao, W.; Fang, Y.; Li, L.; Tao, X. Event Detection in Social Media via Graph Neural Network. In Proceedings of the Web Information Systems Engineering—WISE 2021, Melbourne, VIC, Australia, 26–29 October 2021; Zhang, W., Zou, L., Maamar, Z., Chen, L., Eds.; Springer International Publishing: Cham, Switzerland, 2021; pp. 370–384. [Google Scholar]
  37. Rizzoli, V.; Mascarello, G.; Pinto, A.; Crovato, S.; Ruzza, M.; Tiozzo, B.; Ravarotto, L. ‘Don’t Worry, Honey: It’s Cooked’: Addressing Food Risk during Pregnancy on Facebook Italian Posts. Foods 2021, 10, 2484. [Google Scholar] [CrossRef] [PubMed]
  38. Hu, R.; Zhang, D.; Tao, D.; Hartvigsen, T.; Feng, H.; Rundensteiner, E. TWEET-FID: An Annotated Dataset for Multiple Foodborne Illness Detection Tasks. arXiv 2022, arXiv:2205.10726. [Google Scholar]
  39. Tao, D.; Zhang, D.; Hu, R.; Rundensteiner, E.; Feng, H. Crowdsourcing and Machine Learning Approaches for Extracting Entities Indicating Potential Foodborne Outbreaks from Social Media. Sci. Rep. 2021, 11, 21678. [Google Scholar] [CrossRef]
  40. Erraguntla, M.; Zapletal, J.; Lawley, M. Framework for Infectious Disease Analysis: A Comprehensive and Integrative Multi-Modeling Approach to Disease Prediction and Management. Health Inform. J. 2019, 25, 1170–1187. [Google Scholar] [CrossRef]
  41. Vasanthakumar, U.; Bryna Goh, J.R.; Hui, S.C.; Lam, K.Y.; Er, B.; Fua’di, M.T.; Aung, K.T. Fine-Tuning Pre-Trained Language Model for Urgency Classification on Food Safety Feedback. In Proceedings of the 2023 10th International Conference on ICT for Smart Society (ICISS), Bandung, Indonesia, 6–7 September 2023; pp. 1–10. [Google Scholar]
  42. Lee, C.K.H. Predicting Food Safety Violations via Social Media to Improve Public Health Surveillance. ECSM 2023, 10, 109–116. [Google Scholar] [CrossRef]
  43. Tao, D.; Hu, R.; Zhang, D.; Laber, J.; Lapsley, A.; Kwan, T.; Rathke, L.; Rundensteiner, E.; Feng, H. A Novel Foodborne Illness Detection and Web Application Tool Based on Social Media. Foods 2023, 12, 2769. [Google Scholar] [CrossRef] [PubMed]
  44. Hu, R.; Zhang, D.; Tao, D.; Zhang, H.; Feng, H.; Rundensteiner, E. UCE-FID: Using Large Unlabeled, Medium Crowdsourced-Labeled, and Small Expert-Labeled Tweets for Foodborne Illness Detection. In Proceedings of the 2023 IEEE International Conference on Big Data (BigData), Sorrento, Italy, 15–18 December 2023. [Google Scholar]
  45. Molenaar, A.; Lukose, D.; Brennan, L.; Jenkins, E.L.; McCaffrey, T.A. Using Natural Language Processing to Explore Social Media Opinions on Food Security: Sentiment Analysis and Topic Modeling Study. J. Med. Internet Res. 2024, 26, e47826. [Google Scholar] [CrossRef] [PubMed]
  46. Harris, J.K.; Mansour, R.; Choucair, B.; Olson, J.; Nissen, C.; Bhatt, J. Health Department Use of Social Media to Identify Foodborne Illness—Chicago, Illinois, 2013–2014. MMWR Morb. Mortal Wkly. Rep. 2014, 63, 681–685. [Google Scholar] [PubMed]
  47. Harris, J.K.; Hawkins, J.B.; Nguyen, L.; Nsoesie, E.O.; Tuli, G.; Mansour, R.; Brownstein, J.S. Using Twitter to Identify and Respond to Food Poisoning: The Food Safety STL Project. J. Public Health Manag. Pract. 2017, 23, 577–580. [Google Scholar] [CrossRef] [PubMed]
  48. Harrison, C.; Jorder, M.; Stern, H.; Stavinsky, F.; Reddy, V.; Hanson, H.; Waechter, H.; Lowe, L.; Gravano, L.; Balter, S. Using Online Reviews by Restaurant Patrons to Identify Unreported Cases of Foodborne Illness—New York City, 2012–2013. MMWR Morb. Mortal Wkly. Rep. 2014, 63, 441–445. [Google Scholar] [PubMed]
  49. Joaristi, M.; Serra, E.; Spezzano, F. Evaluating the Impact of Social Media in Detecting Health-Violating Restaurants. In Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), Davis, CA, USA, 18–21 August 2016; pp. 626–633. [Google Scholar]
  50. Sadilek, A.; Kautz, H.; DiPrete, L.; Labus, B.; Portman, E.; Teitel, J.; Silenzio, V. Deploying nEmesis: Preventing Foodborne Illness by Data Mining Social Media. AI Mag. 2017, 38, 37–48. [Google Scholar] [CrossRef]
  51. Tegtmeyer, R.; Potts, L.; Hart-Davidson, W. Tracing and Responding to Foodborne Illness. In Proceedings of the 30th ACM International Conference on Design of Communication, Seattle, WA, USA, 3–5 October 2012; Association for Computing Machinery: New York, NY, USA, 2012; pp. 369–370. [Google Scholar]
  52. Zou, B.; Lampos, V.; Gorton, R.; Cox, I.J. On Infectious Intestinal Disease Surveillance Using Social Media Content. In Proceedings of the 6th International Conference on Digital Health Conference, Montreal, QC, Canada, 11–13 April 2016; Association for Computing Machinery: New York, NY, USA, 2016; pp. 157–161. [Google Scholar]
  53. Chen, J.; Wang, Y. Social Media Use for Health Purposes: Systematic Review. J. Med. Internet Res. 2021, 23, e17917. [Google Scholar] [CrossRef]
  54. Ahmed, S.F.; Alam, M.S.B.; Hassan, M.; Rozbu, M.R.; Ishtiak, T.; Rafa, N.; Mofijur, M.; Shawkat Ali, A.B.M.; Gandomi, A.H. Deep Learning Modelling Techniques: Current Progress, Applications, Advantages, and Challenges. Artif. Intell. Rev. 2023, 56, 13521–13617. [Google Scholar] [CrossRef]
  55. Meir, Y.; Tevet, O.; Tzach, Y.; Hodassman, S.; Gross, R.D.; Kanter, I. Efficient Shallow Learning as an Alternative to Deep Learning. Sci. Rep. 2023, 13, 5423. [Google Scholar] [CrossRef] [PubMed]
  56. Stevens, L.M.; Mortazavi, B.J.; Deo, R.C.; Curtis, L.; Kao, D.P. Recommendations for Reporting Machine Learning Analyses in Clinical Research. Circ. Cardiovasc. Qual. Outcomes 2020, 13, e006556. [Google Scholar] [CrossRef]
  57. Deiner, M.S.; Deiner, N.A.; Hristidis, V.; McLeod, S.D.; Doan, T.; Lietman, T.M.; Porco, T.C. Use of Large Language Models to Assess the Likelihood of Epidemics from the Content of Tweets: Infodemiology Study. J. Med. Internet Res. 2024, 26, e49139. [Google Scholar] [CrossRef]
  58. Hasan, A.; Levene, M.; Weston, D.; Fromson, R.; Koslover, N.; Levene, T. Monitoring COVID-19 on Social Media: Development of an End-to-End Natural Language Processing Pipeline Using a Novel Triage and Diagnosis Approach. J. Med. Internet Res. 2022, 24, e30397. [Google Scholar] [CrossRef] [PubMed]
  59. Espinosa, L.; Salathé, M. Use of Large Language Models as a Scalable Approach to Understanding Public Health Discourse. PLoS Digit. Health 2024, 3, e0000631. [Google Scholar] [CrossRef] [PubMed]
  60. Narvaez Rojas, C.; Alomia Peñafiel, G.A.; Loaiza Buitrago, D.F.; Tavera Romero, C.A. Society 5.0: A Japanese Concept for a Superintelligent Society. Sustainability 2021, 13, 6567. [Google Scholar] [CrossRef]
  61. Digital Health. Available online: https://www.aihw.gov.au/reports/australias-health/digital-health (accessed on 20 December 2024).
  62. Bisbee, J.; Munger, K. The Vibes Are Off: Did Elon Musk Push Academics Off Twitter? In PS: Political Science & Politics; Cambridge University Press: Cambridge, UK, 2024; pp. 1–8. [Google Scholar] [CrossRef]
  63. Hassoun, A.; Jagtap, S.; Trollman, H.; Garcia-Garcia, G.; Abdullah, N.A.; Goksen, G.; Bader, F.; Ozogul, F.; Barba, F.J.; Cropotova, J.; et al. Food Processing 4.0: Current and Future Developments Spurred by the Fourth Industrial Revolution. Food Control 2023, 145, 109507. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.