Next Article in Journal
InvMap and Witness Simplicial Variational Auto-Encoders
Next Article in Special Issue
3t2FTS: A Novel Feature Transform Strategy to Classify 3D MRI Voxels and Its Application on HGG/LGG Classification
Previous Article in Journal
Special Issue “Selected Papers from CD-MAKE 2020 and ARES 2020”
Previous Article in Special Issue
Using Resistin, Glucose, Age and BMI and Pruning Fuzzy Neural Network for the Construction of Expert Systems in the Prediction of Breast Cancer
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Systematic Review

Machine Learning and Prediction of Infectious Diseases: A Systematic Review

1
Regional Health Care and Social Agency of Lodi, ASST Lodi, 26900 Lodi, Italy
2
Synbrain Srl, 90128 Palermo, Italy
3
Hospital Management, AOUP “G. Rodolico-San Marco” of Catania, 95123 Catania, Italy
4
Hospital Hygiene Unit, AOUP “G. Rodolico-San Marco” of Catania, 95123 Catania, Italy
5
Hospital Management, Local Health Unit of Pescara, 65122 Pescara, Italy
*
Author to whom correspondence should be addressed.
Mach. Learn. Knowl. Extr. 2023, 5(1), 175-198; https://doi.org/10.3390/make5010013
Submission received: 7 December 2022 / Revised: 20 January 2023 / Accepted: 30 January 2023 / Published: 1 February 2023
(This article belongs to the Special Issue Machine Learning for Biomedical Data Processing)

Abstract

:
The aim of the study is to show whether it is possible to predict infectious disease outbreaks early, by using machine learning. This study was carried out following the guidelines of the Cochrane Collaboration and the meta-analysis of observational studies in epidemiology and the preferred reporting items for systematic reviews and meta-analyses. The suitable bibliography on PubMed/Medline and Scopus was searched by combining text, words, and titles on medical topics. At the end of the search, this systematic review contained 75 records. The studies analyzed in this systematic review demonstrate that it is possible to predict the incidence and trends of some infectious diseases; by combining several techniques and types of machine learning, it is possible to obtain accurate and plausible results.

1. Introduction

1.1. Burden of Infectious Diseases

Infectious diseases are now an increasing part of the global health burden: every year, millions of deaths are due to infectious diseases [1]. To be able to develop protective measures against infectious diseases, it is crucial to identify the factors of disease diffusion in order to apply control and prevention measures [2]. Identifying the factors of disease diffusion could allow for making predictions that can be useful for policymakers to make appropriate decisions on, for example, the purchase of vaccines, public awareness campaigns and training programs for health professionals [3,4].

1.2. Machine Learning Applied to Infectious Diseases—Overview

Machine-learning algorithms can contribute to the control of infectious diseases by helping to both spatially and temporally predict the evolution and spread of infectious diseases [5]. Machine-learning algorithms are capable of analyzing large, complex data sets and identifying patterns and trends that may be difficult for humans to detect. This makes them well suited for the prediction of infectious diseases, which often involve multiple factors such as population demographics, environmental conditions, and individual behaviors. In recent years, many studies have applied machine-learning techniques to the prediction of infectious diseases, and the results have been promising. One of the key challenges in using machine learning for disease prediction is the availability of high-quality, comprehensive data. Infectious disease surveillance systems often collect data on a variety of factors, including the number of reported cases, the locations of outbreaks, and the demographics of infected individuals. However, these data are often incomplete, biased, or noisy, which can affect the performance of machine-learning models. Additionally, many infectious diseases have long incubation periods, which means that data on past outbreaks may not accurately reflect current conditions. To overcome these challenges, researchers have employed a range of machine-learning algorithms, including decision trees, random forests, support vector machines, and deep-learning networks. These algorithms have been applied to a variety of data sets, including electronic health records, genomic data, and social media posts. In general, the results of these studies have shown that machine-learning algorithms can accurately predict the spread and onset of infectious diseases, with performance comparable with or better than traditional statistical methods. Machine-learning algorithms have shown great promise in predicting the spread and onset of infectious diseases. For example, some studies have used machine learning to forecast the number of cases of a particular disease in a given region on the basis of historical data and current conditions [6]. Others have used machine learning to identify the most likely sources of an outbreak on the basis of the genetic makeup of the pathogen and the patterns of infection [7,8]. Still others have used machine learning to predict the likelihood of an individual contracting an infectious disease on the basis of their personal characteristics and behaviors [9]. Overall, the use of machine learning in the prediction of infectious diseases is a promising area of research, with potential applications in public health, epidemiology, and clinical practice. However, there are also significant challenges and limitations to using machine learning in this context, including the need for high-quality data, the complexity of the underlying phenomena, and the potential for bias and overfitting [10]. Despite the promising results of these studies, there are also significant challenges and limitations to using machine learning in the prediction of infectious diseases. One of the main challenges is the need for high-quality data, which are often difficult to obtain owing to issues such as missing values, incomplete records, and varying data formats. Additionally, the complexity of the underlying phenomena, such as the transmission of infectious diseases, can make it difficult to develop accurate models. Finally, there is a potential for bias and overfitting in machine-learning models, which can lead to inaccurate predictions [10]. In this systematic review, we will explore the current state of the art in the use of machine learning for the prediction of infectious diseases. We will focus on recent, high-quality studies that have applied machine-learning techniques to real-world data and have evaluated the performance of these models in predicting the spread and onset of infectious diseases. We will also discuss the challenges and limitations of using machine learning in this context and provide insights into future directions for research in this area.

1.3. Aim

The aim of the study is to show whether it is possible to predict infectious disease outbreaks early, by using machine learning.

2. Materials and Methods

2.1. Search Strategy and Data Sources

The Cochrane Collaboration [11] and the meta-analysis of observational studies in epidemiology (MOOSE) guidelines [12] were followed in order to conduct the current systematic review. The preferred reporting items for systematic reviews and meta-analyses [13] guidelines [14] were used to report the process and the results. A bibliographic search was conducted on 9 November 2022, on the Scopus and PubMed/MEDLINE databases, combining keywords by using the Boolean operators “AND”, ”OR” and “NOT”. The search strategy is reported in Supplementary Table S1. No time filter was used. Given the innovative nature of the study and its recent field of application halfway between medicine, epidemiology, and information technology, it was not always possible to apply all the items of the PRISMA checklist (more details are given in the study limitations section).

2.2. Inclusion and Exclusion Criteria

Studies had to meet the following criteria to be considered eligible: (i) language: written in English; (ii) population: human; (iii) interventions: machine learning; (iv) comparators/control: infectious diseases; (v) outcomes: prediction/forecasting outbreaks infectious diseases; (vi) type of study: epidemiologic studies (case-control, cross-sectional, or cohort studies).
Exclusion criteria were as follows: (i) articles not published in English; (ii) not human; (iii) full text not available; (iv) interventions: not about machine learning; (v) comparators/control: not about infectious diseases; (vi) outcomes: not about prediction/forecasting outbreaks infectious diseases; (vii) type of study: review article, meta-analysis, trial, expert opinion, commentary, editorial, case report, letter to the editor, or book chapters. See Supplementary Table S2 where the detailed description of the inclusion/exclusion criteria is reported.

2.3. Selection Process and Data Extraction

Titles and abstracts of manuscripts found using the search strategy and those retrieved from additional sources were independently assessed by two reviewers (D.G. and C.F.). Subsequently, the same authors assessed the eligibility of the articles and independently reviewed the full downloaded text. When there was an unresolved disagreement between the two evaluators, the discussion was resolved by discussing the case with a senior reviewer (O.E.S.). Full texts were downloaded only for potentially eligible studies.
Data extraction was conducted only for those articles that met all the inclusion criteria, and it was performed using a predefined and prepiloted spreadsheet elaborated in Microsoft Excel for Windows. The extracted data included the author, publication year, study period, country where the study was conducted, disease, data source, model and/or techniques, aim, main results, accuracy/best model, space/time resolution, order of magnitude modeled populations, funds, and conflicts of interest.

2.4. Strategy for Data Synthesis

By following the PRISMA 2020 guidelines, a flowchart (Figure S1) was created showing the number of references at each stage of the review process [15]. Summary tables were created showing the qualitative results of the literature. A full report was produced; in this, there is a general overview of the main findings of the review.

2.5. Critical Appraisal

A critical evaluation of the articles using the Newcastle–Ottawa scale (NOS) was independently carried out by two authors (O.E.S. and D.G.) [16]; this was a bias-risk assessment tool for observational studies that can assign up to nine points for the lowest risk of bias in three domains: (i) study group selection; (ii) comparability; and (iii) assessment of exposure and outcomes for case-control and cohort studies, respectively.
An adapted version of the NOS was used to assess cross-sectional studies [17]. According to these criteria and on the standard cutoff used in the previous literature [18,19], studies were classified as being of high, moderate, or low quality when their NOS score was ≥7, 4–6, and ≤3, respectively.

3. Results

3.1. Literature Search

First, 375 and 333 records were found on Scopus and PubMed/MEDLINE, respectively, and the total was 708. Second, 89 records were eliminated because they were duplications. In the end, 619 records were evaluated for admissibility. By evaluating the title and abstract, 537 records were deleted because the topic was not related (n = 530), the articles were not original (n = 3), they were not written in English (n = 3), and one was not about humans (n = 1). The full text of the 82 records was downloaded, and 7 records were excluded with reasons following an in-depth assessment. At the end of the process, 75 records were included in our review [20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94]. Figure S1 shows the selection flowchart. There was a 0.7% disagreement among the authors during the first screening. Table 1 lists the characteristics of the included studies in alphabetical order by author. The articles appear from Absar N [20] to Zhong R [73], and the proceedings/conference papers appear from Ajith A [74] to the end.

3.2. Characteristics of Included Studies

Table 1 shows the characteristics of the included studies: the first part reports the studies published in international journals, the second part of Table 1 shows the studies extracted from the proceedings, the lines are highlighted in gray, and the authors are listed in alphabetical order. As for the studies excluding the proceedings, all studies were performed after 2016 [20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73]. Further, nine studies were carried out in China [38,40,49,60,66,69,71,73,94]; seven in the US [21,41,47,48,59,67,71]; three in Bangladesh [20,39,50]; two each in Spain [51,53], Iran [28,56], Hong Kong [44,70], India [55,65], Turkey [23,24], and South Korea [27,37]; and the other studies in Jordan, Brazil, Cameroon, Canada, Mexico, Singapore, and Vietnam [25,33,36,52,54,57,62]. Finally, the remaining studies were multicentric.
Most of the studies were related to COVID-19 (n = 30) [20,21,22,25,26,27,29,31,33,34,37,39,41,42,45,46,50,51,53,56,58,59,60,63,65,66,67,68,71,72] and then influenza and influenza-like illness (ILI) [44,47,54,64,70]; for other details of the studies, see Table 1. No funds were reported by five studies [20,22,37,46,48], funds were not reported for nine studies [27,31,39,42,45,50,59,60,63], and funds were reported in the other studies (n = 40). In two studies, the authors declared the conflicts of interest [37,47], and in six, the authors did not declare the conflicts of interest [25,34,42,45,50,59]. Furthermore, 21 proceedings/conference papers fulfilling the search criteria were included in this review. Of these, 17 reported on COVID-19 [75,77,78,79,80,82,83,84,85,86,87,88,89,90,91,92,93], one study West Nile virus [74], one malaria [76], one dengue [81], and one hand, foot, and mouth disease, hepatitis B [94]. Most of the studies were performed on worldwide data [75,85,86,90,91,92,93] and all were published after 2019 [74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94].

3.3. Quality Assessment

The quality of the 75 studies ranged from 6 to 9. The assessment revealed a medium–high quality level for cohort studies and cross-sectional ones. See Supplementary Table S3 for a complete overview that is based on the NOS checklist.

4. Discussion

Several infectious diseases are emerging and threatening the human health condition across the world. The burden of infectious diseases is certainly a planetary issue, annually causing millions of deaths [24]. Therefore, the study of infectious disease behavior has been a subject of scientific interest for many years; the early identification of emerging infectious disease outbreak patterns is critical and offers great advantages [36,43]. Indeed, as is evident from the studies under observation, accurate and reliable predictions of infectious diseases can be invaluable to public health organizations planning interventions to reduce or prevent disease transmission [38] and mitigate the negative impacts of diseases [35]. As seen by Ketu S. et al., the viral epidemic, in addition to exerting direct damage on people’s lives, can affect a country’s economy [42,43]. As reiterated by Roster K. et al., recent epidemic outbreaks, such as the COVID-19 pandemic and the zika epidemic in Brazil, have demonstrated the importance and difficulty of accurately predicting new infectious diseases [57]. A lack of knowledge about new infectious diseases and their consequences, along with complicated social and governmental factors, may influence the spread of every newly emerging disease [42]. It is, in fact, essential to try to estimate the future movement and pattern of a new disease [39], so that preventive measures such as closing schools, shopping centers, and theaters; closing borders; suspending public services; and stopping travel can be quickly implemented [42]. However, because the epidemic spread of an infectious disease usually occurs sporadically and rapidly, it is not easy to predict whether an infectious disease will emerge and how. In addition, collecting data on a specific infectious disease is not always easy. Knowledge about the transmission paths of emerging diseases, the level and duration of immunity to reinfection, and other parameters needed to build realistic epidemiological models are often scarce. For these reasons, it is necessary to find appropriate and useful information sources and data and build up reliable prediction models with these [43,57]. Indeed, to develop increasingly effective control and prevention strategies, reliable computational tools that may help to understand disease dynamics and predict future cases are needed. Policymakers can use these computational tools to make decisions that are more informed [24]. Several approaches have been proposed in the literature to produce accurate and timely predictions and potentially improve public health response [32]. Time series forecasting and machine learning, while less dependent on disease assumptions, require large amounts of data that may not be available in the early stages of an outbreak [57]. Modeling the spread of infectious diseases in space and time must account for complex dependencies and uncertainties. Machine-learning methods, especially neural networks, are useful for modeling these kinds of complex problems, even if they in some cases lack probabilistic interpretations [53]. Predicting the evolution of contagion dynamics is still an open problem, to which mechanical models offer only a partial answer. To remain mathematically or computationally tractable, these models must rely on simplifying assumptions, thus limiting the quantitative accuracy of their predictions and the complexity of the dynamics they can model [51]. Mathematical modeling is the most scientific technique for understanding the evolution of natural phenomena, including the spread of infectious diseases. Therefore, these modeling tools have been widely used in epidemiology to predict risks and inform decision-making [46]. While imperfect, these models offer an additional input for decision makers of infectious disease responses. These results could be useful insofar as informing decisions on planning, resource allocation, and social-distancing policies [37]. Deep learning offers a new and complementary perspective to build effective models of contagion dynamics on networks, as demonstrated by Murphy C. et al. [51]. The analysis found that models based on combining multiple machine-learning methods, incorporating information from different models that are based on multiple data sources, produced the most robust and most accurate results [47].

4.1. Acute Respiratory Infection (ARI)

Acute respiratory infections (ARIs) are one of the main causes of morbidity and mortality in the world, particularly in children under 5 years and adults over 65 years [36]. Gónzalez-Bandala et al. [36] propose a methodology that merges the predictions of a computational model with machine learning, a projection model, and a proposed smoothed endemic channel calculation. The predictions are made on weekly acute respiratory infection (ARI) data obtained from epidemiological reports in Mexico, along with the usage of key terms in the Google search engine. The results obtained with this methodology were compared with state-of-the-art techniques resulting in reduced root-mean-square percentage error (RMPSE) and maximum absolute percent error (MAPE) metrics, achieving a MAPE of 21.7%. The results show that the combination of different data analysis techniques (FFNN, SoS, and smoothed endemic channels) can provide an accurate prediction for ARI data 1 week in advance [36].

4.2. Brucellosis

Brucellosis (Malta fever) is one of the most common zoonotic diseases and has long been one of the most important health concerns for humans and animals [61]. Two studies emerged in our review: one by Bagheri H. et al. [28], performed in Iran, and one by Shen L. et al. [61], performed in Europe. In both studies, the authors demonstrated that their models can predict brucellosis cases in humans; to demonstrate this, they used different models. Bagheri [28] compared RBF and MLP, stating that RBF is a more common type of neural network learning that responds to a limited section of the input space; it has a faster and more accurate and yet simpler network structure compared to other neural networks, while the MLP is more generalizable. Shen L. used LSTM, LSTM, and ARIMA as convolutional models, and the prediction results have shown that the LSTM and ConvLSTM models have higher forecast precision.

4.3. Campylobacteriosis, Q-Fever, and Typhoid

Only one study addressed these diseases, and it was Dixon S. et al. [35], where multiple models were compared: RF, XGB, MLP, ARIMA, ARIMAX, GLARMA, and SARIMA. The end result was that the XGB models performed the best for all diseases, and in general, tree-based ML models performed the best when looking at data splits [35]. According to the authors, this study demonstrated the power of ML approaches to incorporate a wide range of factors to more accurately forecast various diseases, regardless of location, than traditional statistical approaches [35].

4.4. Chickenpox

Only one study addressed chickenpox; however, it was not dedicated only to this disease but rather took into consideration various infectious diseases with respiratory transmission [32]. The study by Chen et al. [32], using the LASSO model, showed that predictions made more than 4 weeks in advance were increasingly discrepant from the real scenario and that the prediction model was more accurate in capturing the epidemic but less sensitive to predicting the size of the epidemic, probably because the climatic variables have different levels of importance in the accuracy of the forecasts [32].

4.5. Clostridiodes Difficile

The study by Marra et al. [48] was the one that obtained the least accuracy. It was a study on Clostridiodes difficile, and despite the large variability of the models used (LR, FR, naïve Bayes, K-nearest neighbor, MLP, Lib SVM, decision tree (J48), AdaBoost (M1), bagging, and radial basis function classifier), these machine-learning models have produced only modest results in a real-world population [32]. The logistic regression, random forest, and naïve Bayes models yielded the highest performance: 0.6 [32].

4.6. Crimean-Congo Hemorrhagic Fever (CCHF)

Crimeane-Congo hemorrhagic fever (CCHF) is a tick-borne viral infection usually transmitted by tick bites or through contact with tissues, blood, or other bodily fluids from infected people and animals [23,24]. In our systematic review, there were two studies conducted by the same author: Ak Ç [23,24]. The author demonstrated in both studies that Gaussian process formulation obtained better results than two frequently used standard machine-learning algorithms (i.e., random forests and boosted regression trees) under temporal, spatial, and spatiotemporal prediction scenarios. The Gaussian process was the best model to spatiotemporally show CCHF outbreaks [23,24].

4.7. COVID-19

Most of the studies under observation concern the COVID-19 pandemic [20,21,22,25,26,27,29,31,33,34,37,39,41,42,45,46,50,51,53,56,58,59,60,63,65,66,67,68,71,72]. Ever since the coronavirus pandemic (COVID-19) emerged in Wuhan, China, and was recognized as a global threat, several national and global studies have been conducted to try to predict the epidemic, with various levels of reliability and accuracy [34], as has been carried out in Korea [37] and India [65], among others. Academic researchers all around the world have proposed various predictive models to allow policymakers to make better decisions, to apply appropriate control measures [26,29,32,63], and to reduce the burden in hospitals [58]. Recently, new machine-learning approaches have been used to understand the dynamic trend of the COVID-19 spread, as was shown in the study by Verma H. et al., who used the temporal deep-learning architecture to predict COVID-19 cases in India [65]. In the study by Pourghasemi H.R. et al., in order to assess the risk of COVID-19 outbreak in Fars province, Iran, a machine-learning algorithm (MLA) based on a geographic information system (GIS) and a support vector machine (SVM) was used, all while daily observations of infected cases were analyzed with polynomial and autoregressive integrated moving average (ARIMA) models to examine the virus infestation patterns in the province and Iran [56]. The deep-learning model was used as a sustainable prognostic method of the COVID-19 outbreak in Bangladesh, by Mohammad Masum A.K. et al. [50]. Kumar S.L. et al. showed how, through statistical modeling on COVID-19 data, they performed linear regressions, random forests, ARIMA, and LSTM to estimate the empirical indication of COVID-19 infection and intensity in four countries (the US, India, Brazil, and Russia) in order to arrive at better validation [45]. Wang Y. et al. proposed the ARIMA, SARIMA, and PROPHET models to predict daily new cases and cumulative confirmed cases in the US, Brazil, and India over the next 30 days on the basis of the data set on new confirmed cases and cumulative confirmed cases of COVID-19 published by the WHO [68]. Wang X. et al. created a method to predict the daily number of confirmed cases of infectious diseases by combining an ordinary differential equation mechanistic (ODE) model for infectious classes and a generalized machine-learning model (GBM) to predict how public health policies and mobility data influence the transmission rate in the ODE model [67]. In addition to monitoring general research and publication activities, the use of machine-learning approaches and a theoretical understanding of information-sharing behaviors is a productive approach to improve the effectiveness of infosurveillance [60], as demonstrated by the study conducted by Zhang Y. et al., whose experimental results suggest that it is feasible to use Twitter data to provide the surveillance and prediction of COVID-19 in the United States to support decision-making by health departments [72]. Shaghaghi N. et al. demonstrated that the use of eVision, an epidemic prediction system that combines machine learning (ML) in the form of a recursive neural network (RNN) long short-term memory (LSTM) and search engine statistics in order to make accurate predictions of the weekly number of cases of highly communicable diseases, was able to achieve 89% accuracy in predicting the progress of the COVID-19 pandemic in the United States [59]. Among the proceedings concerning COVID-19 from an accuracy point of view, it is worth mentioning the study by Rohini et al. [89], carried out in India using K-nearest neighbors (KNN); the authors declared to have reached a predictive accuracy of 98.34%. Moreover, the studies on world data by Andreas et al. [75] and Satu et al. [90] should also be noted, which through a model created ad hoc by Andreas and the use of PROPHET by Satu, reached an R2 greater than or equal to 0.99.

4.8. Dengue

Some of the included studies were conducted on dengue fever prediction models [55], as in the study conducted by Nguyen V.H. et al. It aimed to develop an accurate prediction model of dengue fever in Vietnam by using a wide range of weather factors as inputs to inform public health responses for outbreak prevention in the context of future climate change [52], by comparing convolutional neural network (CNN), transformer, long short-term memory (LSTM), and attention-enhanced LSTM (LSTM-ATT) models with traditional machine-learning models on weather-based DF forecasting. Interesting results were found by Shy et al., who demonstrated that statistical models built with machine-learning methods such as LASSO have the potential to greatly improve forecasting techniques for recurrent outbreaks of infectious diseases such as dengue [62], and by Xu J. et al., whose results, based on the use of LSTM, provide a more accurate dengue prediction model and could be used for other dengue-like infectious diseases [69]. As for the proceedings, Kolesnikov et al. [81] in their study indicate that the most effective predictions were given by a mathematical model based on a combination of spatial analysis techniques (MGWR) and neural networks based on the LSTM architecture.

4.9. Epatitis B

The quality of the study is modest because it is a conference paper. The study by Zhang P. et al. [94] implemented various models: oriented attention model (OAM), AR, LSTM, gated recurrent unit, encoder-decoder, CNN, CNN-RNN, LSTM-attn, GRU-attn, ED-attn, CNN-attn, and CNNRNN-attn. The self-attention significantly improves the predictive accuracy of all comparable methods. The MAE and RMSE values were decreased by 51.67% and 39.43% at most, respectively. The R2 increased by 52.99% at most [94].

4.10. Epatitis E

Guo Y. et al. investigated what might be the most appropriate model to predict the incidence of hepatitis E. By comparing ARIMA, SVM, and LSTM, they found that nonlinear models (SVM, LSTM) outperform linear models (ARIMA). LSTM obtained the best performance according to all three metrics: RSME, MAPE, and MAE. Hence, LSTM is the most suitable for predicting the hepatitis E monthly incidence and cases numbers [38].

4.11. Hand, Foot, and Mouth Disease

Studies on a variety of other infectious diseases were included in the review, such as hand-foot-mouth disease (HFMD), an increasingly prominent public health problem that has caused an epidemic in China every year since 2008. Predicting the incidence of HFMD and analyzing the key factors that may play a role are of great importance for its prevention [49,63], which has been researched in the study conducted by Meng D. et al. They proposed two machine-learning algorithms, random forest and eXtreme gradient boosting (XGBoost), for the analysis and prediction of hand, foot, and mouth disease [49]. Among the proceedings was a study by Zhang et al. [94], where the authors stated that the self-attention methodology applied to various models increased the accuracy of their predictions.

4.12. Influenza/Influenza-Like Illness (ILI)

Several studies included in the review concern influenza and influenza-like illness (ILI) [44,47,54,64,70]. Influenza epidemics are a major public health challenge worldwide and annually cause thousands of deaths, posing a serious threat to worldwide health [44,47]. In assessing the utility of influenza-like illness (ILI), surveillance systems and developing approaches for predicting future trends are important for pandemic preparedness [54]. Building forecasting models and accurate systems that track influenza activity at the city level are necessary to provide usable information for clinical, hospital, and community outbreak preparedness [44,47]. Such propositions find application in studies such as the one conducted by Venkatramanan S. et al., which can contribute to the development of the timely forecasting of infectious diseases on a global scale by using human mobility data, expanding its applications in the area of infectious disease epidemiology [64]; that of Xu Q. et al., who attempted to predict influenza in Hong Kong with Google search queries and statistical model fusion [70]; and that of Lu F.S. et al., who demonstrated how information from Internet-based data sources, when combined with using an informed, robust methodology, can be effectively used as early indicators of influenza activity at fine geographic resolutions [47].

4.13. Malaria

Another application of machine learning was studied by Kamana E. et al. They demonstrated how the LSTMSeq2Seq model can be effectively applied in the prediction of malaria re-emergence [40]. The LSTMSeq2Seq model achieved an average prediction accuracy of 87.3% [40]. A lower accuracy was calculated by Brock et al. [76], who, using the BRT model, estimated an accuracy of between 55% and 82%.

4.14. West Nile Virus

West Nile fever is a disease caused by the West Nile virus (WNV), which was discovered in Africa in 1979 and which quickly spread to Western Asia, Europe, Australia, and the US thanks to its natural reservoirs: birds and mosquitoes [74]. In this study, the author says that random forest was able to correctly predict the probability of WNV’s presence with the highest accuracy; it also is possible to know not only the possibility of WNV’s occurrence but also how it could spread. This could help policymakers to implement safety measures to prevent the deadly spread of WNV [74].

4.15. Zika

This study by Roster K. et al. [57] took into consideration various diseases, including COVID-19 and zika, which we will briefly discuss. Human infection with zika virus is a viral disease transmitted by the bite of infected mosquitoes. If the subject is stung by a carrier mosquito and stung again by an uninfected mosquito, this can trigger a chain capable of giving rise to an endemic outbreak, in which human-to-human contagion, even if possible, is modest and unlikely [57]. The study was conducted in Brazil, and the models used were RF, TrAdaBoost, and neurol network (NN) [57].

4.16. Strengths and Limitations

As with any study, this systematic literature review has its limitations. First, we limited our search to articles published in English, and this might have reduced the total number of potential eligible studies, although English is the most commonly used language in the scientific community. The number of papers included in the systematic review was low, despite the fact that the initial search comprised more than 700 articles. Nevertheless, we believe this did not significantly affect our results; indeed, the small number of retrieved articles could be rather due to the fact that this is a relatively new area of research, proved by the fact that articles have been published only in the past few years: the 75 papers included in the study were published between 2016 and 2022. Second, this study reviewed only published journal articles and proceedings; however, normally important studies are disseminated mainly by scientific journal articles and not by editorials or commentaries. Another important limitation of the study is due to the fact that among the articles included are proceedings and conference papers, which could have diminished the quality of this systematic review. The study also bears limitations on the applicability of all items of the PRISMA checklist; in fact, items 12, 13, and 15 have not always been developed or completed adequately. For item 12, it was not possible to specify the effect measures for each result, because in most cases, the effect measures such as odds ratios were completely absent. This systematic review explored a new and objective field of the study, which was as specified in paragraph 1.3, to show whether it is possible to predict infectious disease outbreaks early by using machine learning. For item 13, the study was not composed of clinical trials, and a meta-analysis was not performed; therefore, most of the subitems were not applicable to this systematic review. For the applicable items, 13a to 13c, the synthesis methods were applied. In the assessment of certainty (item 15) as related to item 12 (measures of effect), the authors tried to evaluate the quality of the studies through NOS even if for this item it may not have been the appropriate tool.
Lastly, our paper also has some important strengths. First, as the title suggests, this article is a systematic literature review. Second, our search strategy was developed with several keywords. Third, the review was conducted in accordance with international guidelines.

5. Conclusions

The use of machine-learning models for the early or real-time verification of epidemics is a new and innovative field that today presents many study methodologies, and this makes it difficult to compare studies from a methodological point of view even if almost all of them come to the same conclusion, which is that the outbreak of major infectious diseases can be monitored.
Concerning public health and preventive strategies, this study suggests that machine learning is a new tool that can be widely used for public health practice. On the basis of the evidence collected so far, we can hypothesize that the research objectives explored so far and the results generated can be considered preliminary and that new research questions and new applications of this method can be developed in the future. In light of the recent outbreaks of infectious diseases that have occurred around the world over the past few decades, machine-learning models offer an opportunity to monitor the introduction and spread. Furthermore, their use in scientific research is expected to grow and become more and more used in the daily life of the healthcare world.
The results showed that with the association of multiple machine-learning models, it is possible to spatially and temporally predict the trends or the incidence of infectious diseases, and future research efforts will allow the construction of more-precise and more-plausible models. In conclusion, the combination of different data sets improves the results. The most probable ones are obtained when the data source is not unique, but if different data are used in this way, the results are more likely to be accurate. For future research, the integration of machine-learning models is suggested to improve existing standard epidemiological models in terms of accuracy.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/make5010013/s1, Figure S1: PRISMA 2020 flow diagram; Table S1: Full search strategy; Table S2: Detailed description of inclusion/exclusion criteria, based on PICOS (population, intervention, comparison, outcomes and study design); Table S3: Quality assessment of the included studies, using the Newcastle–Ottawa scale (NOS).

Author Contributions

O.E.S. conceptualized the study, O.E.S. designed the study, and O.E.S. performed the literature search. O.E.S. performed resource analysis and data extraction. O.E.S., V.G., S.P., D.G. and F.C. wrote the first draft. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. World Health Organization. Global Report on Infection Prevention and Control; World Health Organization: Geneva, Switzerland, 2022; Licence: CC BY-NC-SA 3.0 IGO. [Google Scholar]
  2. Provenzano, S.; Santangelo, O.E.; Giordano, D.; Alagna, E.; Piazza, D.; Genovese, D.; Calamusa, G.; Firenze, A. Predicting disease outbreaks: Evaluating measles infection with Wikipedia Trends. Recenti. Prog. Med. 2019, 110, 292–296. [Google Scholar] [CrossRef] [PubMed]
  3. Gianfredi, V.; Santangelo, O.E.; Provenzano, S. Correlation between flu and Wikipedia’s pages visualization. Acta Biomed. 2021, 92, e2021056. [Google Scholar] [CrossRef] [PubMed]
  4. Santangelo, O.; Provenzano, S.; Piazza, D.; Giordano, D.; Calamusa, G.; Firenze, A. Digital epidemiology: Assessment of measles infection through Google Trends mechanism in Italy. Ann Ig. 2019, 31, 385–391. [Google Scholar]
  5. World Health Organization. Ethics and Governance of Artificial Intelligence for Health: WHO Guidance; World Health Organization: Geneva, Switzerland, 2021; Licence: CC BY-NC-SA 3.0 IGO. [Google Scholar]
  6. Palaniappan, S.; V, R.; David, B.; S, P.N. Prediction of Epidemic Disease Dynamics on the Infection Risk Using Machine Learning Algorithms. SN. Comput. Sci. 2022, 3, 47. [Google Scholar] [CrossRef]
  7. Roy, S.; Biswas, P.; Ghosh, P. Spatiotemporal tracing of pandemic spread from infection data. Sci. Rep. 2021, 11, 17689. [Google Scholar] [CrossRef]
  8. Ghannam, R.B.; Techtmann, S.M. Machine learning applications in microbial ecology, human microbiome studies, and environmental monitoring. Comput. Struct. Biotechnol. J. 2021, 19, 1092–1107. [Google Scholar] [CrossRef]
  9. Atkinson, A.; Ellenberger, B.; Piezzi, V.; Kaspar, T.; Salazar-Vizcaya, L.; Endrich, O.; Leichtle, A.B.; Marschall, J. Extending outbreak investigation with machine learning and graph theory: Benefits of new tools with application to a nosocomial outbreak of a multidrug-resistant organism. Infect. Control. Hosp. Epidemiol. 2022, 16, 1–7, Advance online publication. [Google Scholar] [CrossRef]
  10. Roth, J.A.; Battegay, M.; Juchler, F.; Vogt, J.E.; Widmer, A.F. Introduction to Machine Learning in Digital Healthcare Epidemiology. Infect. Control. Hosp. Epidemiol. 2018, 39, 1457–1462. [Google Scholar] [CrossRef]
  11. Higgins, J.P.T.; Altman, D.G.; Gøtzsche, P.C.; Jüni, P.; Moher, D.; Oxman, A.D.; Savović, J.; Schulz, K.F.; Weeks, L.; Sterne, J.A.C.; et al. The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials. BMJ 2011, 343, d5928. [Google Scholar] [CrossRef]
  12. Stroup, D.F.; Berlin, J.A.; Morton, S.C.; Olkin, I.; Williamson, G.D.; Rennie, D.; Moher, D.; Becker, B.J.; Sipe, T.A.; Thacker, S.B. Meta-analysis of observational studies in epidemiology: A proposal for reporting. Meta-analysis Of Observational Studies in Epidemiology (MOOSE) group. JAMA 2000, 283, 2008–2012. [Google Scholar] [CrossRef]
  13. Moher, D.; Liberati, A.; Tetzlaff, J.; Altman, D.G.; PRISMA Group. Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. PLoS Med. 2009, 6, e1000097. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Liberati, A.; Altman, D.G.; Tetzlaff, J.; Mulrow, C.; Gøtzsche, P.C.; Ioannidis, J.P.; Clarke, M.; Devereaux, P.J.; Kleijnen, J.; Moher, D. The PRISMA Statement for Reporting Systematic Reviews and Meta-Analyses of Studies That Evaluate Health Care Interventions: Explanation and Elaboration. Ann. Intern. Med. 2009, 151, W65–W94. [Google Scholar] [CrossRef] [PubMed]
  15. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 Statement: An Updated Guideline for Reporting Systematic Reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef] [PubMed]
  16. Wells, G.A.; Shea, B.; O’Connell, D.; Pereson, J.; Welch, V.; Losos, M.; Tugwell, P. The Newcastle–Ottawa Scale (NOS) for Assessing the Quality of Nonrandomised Studies in Meta-Analyses. Available online: http://www.ohri.ca/programs/clinical_epidemiology/oxford.asp (accessed on 1 November 2022).
  17. Herzog, R.; Álvarez-Pasquin, M.J.; Díaz, C.; Del Barrio, J.L.; Estrada, J.M.; Gil, Á. Are healthcare workers’ intentions to vaccinate related to their knowledge, beliefs and attitudes? A systematic review. BMC Public Health 2013, 13, 154. [Google Scholar] [CrossRef]
  18. Nucci, D.; Santangelo, O.E.; Provenzano, S.; Fatigoni, C.; Nardi, M.; Ferrara, P.; Gianfredi, V. Dietary Fiber Intake and Risk of Pancreatic Cancer: Systematic Review and Meta-Analysis of Observational Studies. Int. J. Environ. Res. Public Health 2021, 18, 11556. [Google Scholar] [CrossRef]
  19. Nucci, D.; Santangelo, O.E.; Provenzano, S.; Nardi, M.; Firenze, A.; Gianfredi, V. Altered Food Behavior and Cancer: A Systematic Review of the Literature. Int. J. Environ. Res. Public Health 2022, 19, 10299. [Google Scholar] [CrossRef]
  20. Absar, N.; Uddin, N.; Khandaker, M.U.; Ullah, H. The efficacy of deep learning based LSTM model in forecasting the outbreak of contagious diseases. Infect. Dis. Model. 2022, 7, 170–183. [Google Scholar] [CrossRef]
  21. Adiga, A.; Wang, L.; Hurt, B.; Peddireddy, A.; Porebski, P.; Venkatramanan, S.; Lewis, B.L.; Marathe, M. All Models Are Useful: Bayesian Ensembling for Robust High Resolution COVID-19 Forecasting. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD ’21), Singapore, 14–18 August 2021; Association for Computing Machinery: New York, NY, USA, 2021; pp. 2505–2513. [Google Scholar] [CrossRef]
  22. Ahmad, H.F.; Khaloofi, H.; Azhar, Z.; Algosaibi, A.; Hussain, J. An Improved COVID-19 Forecasting by Infectious Disease Modelling Using Machine Learning. Appl. Sci. 2021, 11, 11426. [Google Scholar] [CrossRef]
  23. Ak, Ç.; Ergonul, O.; Gönen, M. A prospective prediction tool for understanding Crimean–Congo haemorrhagic fever dynamics in Turkey. Clin. Microbiol. Infect. 2020, 26, 123.e1–123.e7. [Google Scholar] [CrossRef]
  24. Ak, C.; Ergonul, O.; Sencan, I.; Torunoglu, M.A.; Gonen, M. Spatiotemporal prediction of infectious diseases using structured Gaussian processes with application to Crimean-Congo hemorrhagic fever. PLoS Negl. Trop. Dis. 2018, 12, e0006737. [Google Scholar] [CrossRef] [Green Version]
  25. Alsmadi, M.K. Modified SEIRand machine learning prediction of the trend of the epidemic of COVID-19 in Jordan under lockdowns impact. IJECE 2022, 12, 5455–5466. [Google Scholar] [CrossRef]
  26. Ardabili, S.; Mosavi, A.; Ghamisi, P.; Ferdinand, F.; Varkonyi-Koczy, A.; Reuter, U.; Rabczuk, T.; Atkinson, P. COVID-19 Outbreak Prediction with Machine Learning. Algorithms 2020, 13, 249. [Google Scholar] [CrossRef]
  27. Asfahan, S.; Gopalakrishnan, M.; Dutt, N.; Niwas, R.; Chawla, G.; Agarwal, M.; Garg, M.K. Using a Simple Open-Source Automated Machine Learning Algorithm to Forecast COVID-19 Spread: A Modelling Study. Adv. Respir. Med. 2020, 88, 400–405. [Google Scholar] [CrossRef] [PubMed]
  28. Bagheri, H.; Tapak, L.; Karami, M.; Hosseinkhani, Z.; Najari, H.; Karimi, S.; Cheraghi, Z. Forecasting the monthly incidence rate of brucellosis in west of Iran using time series and data mining from 2010 to 2019. PLoS ONE 2020, 15, e0232910. [Google Scholar] [CrossRef] [PubMed]
  29. Balogh, A.; Harman, A.; Kreuter, F. Real-Time Analysis of Predictors of COVID-19 Infection Spread in Countries in the European Union Through a New Tool. Int. J. Public Health 2022, 67, 1604974. [Google Scholar] [CrossRef] [PubMed]
  30. Benedum, C.M.; Shea, K.M.; Jenkins, H.E.; Kim, L.Y.; Markuzon, N. Weekly dengue forecasts in Iquitos, Peru; San Juan, Puerto Rico; and Singapore. PLoS Negl. Trop. Dis. 2020, 14, e0008710. [Google Scholar] [CrossRef]
  31. Chaurasia, V.; Pal, S. Application of machine learning time series analysis for prediction COVID-19 pandemic. Res. Biomed. Eng. 2020, 38, 35–47. [Google Scholar] [CrossRef]
  32. Chen, Y.; Chu, C.W.; Chen, M.; Cook, A.R. The utility of LASSO-based models for real time forecasts of endemic infectious diseases: A cross country comparison. J. Biomed. Inform. 2018, 81, 16–30. [Google Scholar] [CrossRef]
  33. Chimmula, V.K.R.; Zhang, L. Time series forecasting of COVID-19 transmission in Canada using LSTM networks. Chaos, Solitons Fractals 2020, 135, 109864. [Google Scholar] [CrossRef]
  34. Dash, S.; Chakraborty, C.; Giri, S.K.; Pani, S.K.; Frnda, J. BIFM: Big-Data Driven Intelligent Forecasting Model for COVID-19. IEEE Access 2021, 9, 97505–97517. [Google Scholar] [CrossRef]
  35. Dixon, S.; Keshavamurthy, R.; Farber, D.H.; Stevens, A.; Pazdernik, K.T.; Charles, L.E. A Comparison of Infectious Disease Forecasting Methods across Locations, Diseases, and Time. Pathogens 2022, 11, 185. [Google Scholar] [CrossRef]
  36. Gónzalez-Bandala, D.; Cuevas-Tello, J.; Noyola, D.; Comas-García, A.; García-Sepúlveda, C. Computational Forecasting Methodology for Acute Respiratory Infectious Disease Dynamics. Int. J. Environ. Res. Public Health 2020, 17, 4540. [Google Scholar] [CrossRef] [PubMed]
  37. Goo, T.; Apio, C.; Heo, G.; Lee, D.; Lee, J.H.; Lim, J.; Han, K.; Park, T. Forecasting of the COVID-19 pandemic situation of Korea. Genom. Informatics 2021, 19, e11. [Google Scholar] [CrossRef] [PubMed]
  38. Guo, Y.; Feng, Y.; Qu, F.; Zhang, L.; Yan, B.; Lv, J. Prediction of hepatitis E using machine learning models. PLoS ONE 2020, 15, e0237750. [Google Scholar] [CrossRef] [PubMed]
  39. Haq, I.; Hossain, I.; Saleheen, A.A.S.; Nayan, I.H.; Mila, M.S. Prediction of COVID-19 Pandemic in Bangladesh: Dual Application of Susceptible-Infective-Recovered (SIR) and Machine Learning Approach. Interdiscip. Perspect. Infect. Dis. 2022, 2022, 8570089. [Google Scholar] [CrossRef] [PubMed]
  40. Kamana, E.; Zhao, J.; Bai, D. Predicting the impact of climate change on the re-emergence of malaria cases in China using LSTMSeq2Seq deep learning model: A modelling and prediction analysis study. BMJ Open 2022, 12, e053922. [Google Scholar] [CrossRef]
  41. Katragadda, S.; Bhupatiraju, R.T.; Raghavan, V.; Ashkar, Z.; Gottumukkala, R. Examining the COVID-19 case growth rate due to visitor vs. local mobility in the United States using machine learning. Sci. Rep. 2022, 12, 12337. [Google Scholar] [CrossRef]
  42. Ketu, S.; Mishra, P.K. Enhanced Gaussian process regression-based forecasting model for COVID-19 outbreak and significance of IoT for its detection. Appl. Intell. 2021, 51, 1492–1512. [Google Scholar] [CrossRef]
  43. Kim, J.; Ahn, I. Infectious disease outbreak prediction using media articles with machine learning models. Sci. Rep. 2021, 11, 4413. [Google Scholar] [CrossRef]
  44. Kim, J.; Ahn, I. Weekly ILI patient ratio change prediction using news articles with support vector machine. BMC Bioinform. 2019, 20, 259. [Google Scholar] [CrossRef]
  45. Kumar, S.L.; Sarobin, M.V.R.; Anbarasi, L.J. Predictive Analytics of COVID-19 Pandemic: Statistical Modelling Perspective. Walailak J. Sci. Technol. WJST 2021, 18, 15583. [Google Scholar] [CrossRef]
  46. Lmater, M.A.; Eddabbah, M.; Elmoussaoui, T.; Boussaa, S. Modelization of COVID-19 pandemic spreading: A machine learning forecasting with relaxation scenarios of countermeasures. J. Infect. Public Health 2021, 14, 468–473. [Google Scholar] [CrossRef] [PubMed]
  47. Lu, F.S.; Hou, S.; Baltrusaitis, K.; Shah, M.; Leskovec, J.; Sosic, R.; Hawkins, J.; Brownstein, J.; Conidi, G.; Gunn, J.; et al. Accurate Influenza Monitoring and Forecasting Using Novel Internet Data Streams: A Case Study in the Boston Metropolis. JMIR Public Health Surveill. 2018, 4, e4. [Google Scholar] [CrossRef]
  48. Marra, A.R.; Alzunitan, M.; Abosi, O.; Edmond, M.B.; Street, W.N.; Cromwell, J.W.; Salinas, J.L. Modest Clostridiodes difficile infection prediction using machine learning models in a tertiary care hospital. Diagn. Microbiol. Infect. Dis. 2020, 98, 115104. [Google Scholar] [CrossRef]
  49. Meng, D.; Xu, J.; Zhao, J. Analysis and prediction of hand, foot and mouth disease incidence in China using Random Forest and XGBoost. PLoS ONE 2021, 16, e0261629. [Google Scholar] [CrossRef]
  50. Masum, A.K.M.; Khushbu, S.A.; Keya, M.; Abujar, S.; Hossain, S.A. COVID-19 in Bangladesh: A Deeper Outlook into The Forecast with Prediction of Upcoming Per Day Cases Using Time Series. Procedia Comput. Sci. 2020, 178, 291–300. [Google Scholar] [CrossRef]
  51. Murphy, C.; Laurence, E.; Allard, A. Deep learning of contagion dynamics on complex networks. Nat. Commun. 2021, 12, 4720. [Google Scholar] [CrossRef]
  52. Nguyen, V.-H.; Tuyet-Hanh, T.T.; Mulhall, J.; Van Minh, H.; Duong, T.Q.; Van Chien, N.; Nhung, N.T.T.; Lan, V.H.; Cuong, D.; Bich, N.N.; et al. Deep learning models for forecasting dengue fever based on climate data in Vietnam. PLoS Negl. Trop. Dis. 2022, 16, e0010509. [Google Scholar] [CrossRef] [PubMed]
  53. Niraula, P.; Mateu, J.; Chaudhuri, S. A Bayesian machine learning approach for spatio-temporal prediction of COVID-19 cases. Stoch. Environ. Res. Risk Assess. 2022, 36, 2265–2283. [Google Scholar] [CrossRef] [PubMed]
  54. Nsoesie, E.O.; Oladeji, O.; Abah, A.S.A.; Ndeffo-Mbah, M.L. Forecasting influenza-like illness trends in Cameroon using Google Search Data. Sci. Rep. 2021, 11, 6713. [Google Scholar] [CrossRef]
  55. Patil, S.; Pandya, S. Forecasting Dengue Hotspots Associated with Variation in Meteorological Parameters Using Regression and Time Series Models. Front. Public Health 2021, 9, 798034. [Google Scholar] [CrossRef]
  56. Pourghasemi, H.R.; Pouyan, S.; Farajzadeh, Z.; Sadhasivam, N.; Heidari, B.; Babaei, S.; Tiefenbacher, J.P. Assessment of the outbreak risk, mapping and infection behavior of COVID-19: Application of the autoregressive integrated-moving average (ARIMA) and polynomial models. PLoS ONE 2020, 15, e0236238. [Google Scholar] [CrossRef]
  57. Roster, K.; Connaughton, C.; Rodrigues, F.A. Forecasting new diseases in low-data settings using transfer learning. Chaos Solitons Fractals 2022, 161, 112306. [Google Scholar] [CrossRef]
  58. Saba, T.; Abunadi, I.; Shahzad, M.N.; Khan, A.R. Machine learning techniques to detect and forecast the daily total COVID-19 infected and deaths cases under different lockdown types. Microsc. Res. Tech. 2021, 84, 1462–1474. [Google Scholar] [CrossRef] [PubMed]
  59. Shaghaghi, N.; Calle, A.; Kouretas, G.; Mirchandani, J.; Castillo, M. eVision: Epidemic Forecasting on COVID-19. Curr. Dir. Biomed. Eng. 2021, 7, 839–842. [Google Scholar] [CrossRef]
  60. Shen, C.; Chen, A.; Luo, C.; Zhang, J.; Feng, B.; Liao, W. Using Reports of Symptoms and Diagnoses on Social Media to Predict COVID-19 Case Counts in Mainland China: Observational Infoveillance Study. J. Med. Internet Res. 2020, 22, e19421. [Google Scholar] [CrossRef] [PubMed]
  61. Shen, L.; Jiang, C.; Sun, M.; Qiu, X.; Qian, J.; Song, S.; Hu, Q.; Yelixiati, H.; Liu, K. Predicting the Spatial-Temporal Distribution of Human Brucellosis in Europe Based on Convolutional Long Short-Term Memory Network. Can. J. Infect. Dis. Med. Microbiol. 2022, 2022, 7658880. [Google Scholar] [CrossRef]
  62. Shi, Y.; Liu, X.; Kok, S.-Y.; Rajarethinam, J.; Liang, S.; Yap, G.; Chong, C.-S.; Lee, K.-S.; Tan, S.S.; Chin, C.K.Y.; et al. Three-Month Real-Time Dengue Forecast Models: An Early Warning System for Outbreak Alerts and Policy Decision Support in Singapore. Environ. Health Perspect. 2016, 124, 1369–1375. [Google Scholar] [CrossRef]
  63. Tiwari, D.; Bhati, B.S.; Al-Turjman, F.; Nagpal, B. Pandemic coronavirus disease (COVID-19): World effects analysis and prediction using machine-learning techniques. Expert Syst. 2022, 39, e12714. [Google Scholar] [CrossRef] [PubMed]
  64. Venkatramanan, S.; Sadilek, A.; Fadikar, A.; Barrett, C.L.; Biggerstaff, M.; Chen, J.; Dotiwalla, X.; Eastham, P.; Gipson, B.; Higdon, D.; et al. Forecasting influenza activity using machine-learned mobility map. Nat. Commun. 2021, 12, 726. [Google Scholar] [CrossRef]
  65. Verma, H.; Mandal, S.; Gupta, A. Temporal deep learning architecture for prediction of COVID-19 cases in India. Expert Syst. Appl. 2022, 195, 116611. [Google Scholar] [CrossRef] [PubMed]
  66. Wang, H.; Qiu, J.; Li, C.; Wan, H.; Yang, C.; Zhang, T. Applying the Spatial Transmission Network to the Forecast of Infectious Diseases Across Multiple Regions. Front. Public Health 2022, 10, 774984. [Google Scholar] [CrossRef]
  67. Wang, X.; Wang, H.; Ramazi, P.; Nah, K.; Lewis, M. From Policy to Prediction: Forecasting COVID-19 Dynamics Under Imperfect Vaccination. Bull. Math. Biol. 2022, 84, 90. [Google Scholar] [CrossRef]
  68. Wang, Y.; Yan, Z.; Wang, D.; Yang, M.; Li, Z.; Gong, X.; Di Wu, D.; Zhai, L.; Zhang, W.; Wang, Y. Prediction and analysis of COVID-19 daily new cases and cumulative cases: Times series forecasting and machine learning models. BMC Infect. Dis. 2022, 22, 495. [Google Scholar] [CrossRef] [PubMed]
  69. Xu, J.; Xu, K.; Li, Z.; Meng, F.; Tu, T.; Xu, L.; Liu, Q. Forecast of Dengue Cases in 20 Chinese Cities Based on the Deep Learning Method. Int. J. Environ. Res. Public Health 2020, 17, 453. [Google Scholar] [CrossRef] [PubMed]
  70. Xu, Q.; Gel, Y.R.; Ramirez, L.L.R.; Nezafati, K.; Zhang, Q.; Tsui, K.-L. Forecasting influenza in Hong Kong with Google search queries and statistical model fusion. PLoS ONE 2017, 12, e0176690. [Google Scholar] [CrossRef]
  71. Yang, Z.; Zeng, Z.; Wang, K.; Wong, S.-S.; Liang, W.; Zanin, M.; Liu, P.; Cao, X.; Gao, Z.; Mai, Z.; et al. Modified SEIR and AI prediction of the epidemics trend of COVID-19 in China under public health interventions. J. Thorac. Dis. 2020, 12, 165–174. [Google Scholar] [CrossRef]
  72. Zhang, Y.; Chen, K.; Weng, Y.; Chen, Z.; Zhang, J.; Hubbard, R. An intelligent early warning system of analyzing Twitter data using machine learning on COVID-19 surveillance in the US. Expert Syst. Appl. 2022, 198, 116882. [Google Scholar] [CrossRef]
  73. Zhong, R.; Wu, Y.; Cai, Y.; Wang, R.; Zheng, J.; Lin, D.; Wu, H.; Li, Y. Forecasting hand, foot, and mouth disease in Shenzhen based on daily level clinical data and multiple environmental factors. Biosci. Trends 2018, 12, 450–455. [Google Scholar] [CrossRef]
  74. Ajith, A.; Manoj, K.; Kiran, H.; Pillai, P.J.; Nair, J.J. A Study on Prediction and Spreading of Epidemic Disease. In Proceedings of the 2020 International Conference on Communication and Signal Processing (ICCSP), Chennai, India, 28–30 July 2020; pp. 1265–1268. [Google Scholar] [CrossRef]
  75. Andreas, A.; Mavromoustakis, C.X.; Mastorakis, G.; Mumtaz, S.; Batalla, J.M.; Pallis, E. Modified Machine Learning Techique for Curve Fitting on Regression Models for COVID-19 projections. In Proceedings of the 2020 IEEE 25th International Workshop on Computer Aided Modeling and Design of Communication Links and Networks (CAMAD), Pisa, Italy, 14–16 September 2020; pp. 1–6. [Google Scholar] [CrossRef]
  76. Brock, P.M.; Fornace, K.M.; Grigg, M.J.; Anstey, N.M.; William, T.; Cox, J.; Drakeley, C.J.; Ferguson, H.; Kao, R.R. Predictive analysis across spatial scales links zoonotic malaria to deforestation. Proc. Biol. Sci. 2019, 286, 20182351. [Google Scholar] [CrossRef]
  77. Chumachenko, D.; Chumachenko, T.; Meniailov, I.; Muradyan, O.; Zholtkevych, G. Forecasting of COVID-19 Epidemic Process by Lasso Regression. In Proceedings of the 2021 IEEE International Conference on Information and Telecommunication Technologies and Radio Electronics (UkrMiCo), Odesa, Ukraine, 29 November–3 December 2021; pp. 80–83. [Google Scholar] [CrossRef]
  78. Chumachenko, D.; Meniailov, I.; Bazilevych, K.; Krivtsov, S. Forecasting of COVID-19 Epidemic Process by Random Forest Method. In Proceedings of the 2021 IEEE 8th International Conference on Problems of Infocommunications, Science and Technology (PIC S&T), Kharkiv, Ukraine, 5–7 October 2021; pp. 491–494. [Google Scholar] [CrossRef]
  79. Fan, X.-R.; Zuo, J.; He, W.-T.; Liu, W. Stacking based prediction of COVID-19 Pandemic by integrating infectious disease dynamics model and traditional machine learning. In Proceedings of the 2022 5th International Conference on Big Data and Internet of Things (BDIOT ’22), Chongqing, China, 12–14 August 2022; Association for Computing Machinery: New York, NY, USA, 2022; pp. 20–26. [Google Scholar] [CrossRef]
  80. Hasri, H.; Aris, S.A.M.; Ahmad, R. Linear Regression and Holt’s Winter Algorithm in Forecasting Daily Coronavirus Disease 2019 Cases in Malaysia: Preliminary Study. In Proceedings of the 2021 IEEE National Biomedical Engineering Conference (NBEC), Kuala Lumpur, Malaysia, 9–10 November 2021; pp. 157–160. [Google Scholar] [CrossRef]
  81. Kolesnikov, A.A.; Kikin, P.M.; Portnov, A.M. Diseases spread prediction in tropical areas by machine learning methods ensembling and spatial analysis techniques. Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci. 2019, 8XLII–3/W, 221–226. [Google Scholar] [CrossRef]
  82. Kumari, P.; Toshniwal, D. Real-time estimation of COVID-19 cases using machine learning and mathematical models—The case of India. In Proceedings of the 2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS), Rupnagar, India, 26–28 November 2020; pp. 369–374. [Google Scholar] [CrossRef]
  83. Liu, Z.; Zuo, J.; Lv, R.; Liu, S.; Wang, W. Coronavirus Epidemic (COVID-19) Prediction and Trend Analysis Based on Time Series. In Proceedings of the 2021 IEEE International Conference on Artificial Intelligence and Industrial Design (AIID), Guangzhou, China, 28–30 May 2021; pp. 35–38. [Google Scholar] [CrossRef]
  84. Maaliw, R.R.; Ballera, M.A.; Mabunga, Z.P.; Mahusay, A.T.; Dejelo, D.A.; Seno, M.P. An Ensemble Machine Learning Approach For Time Series Forecasting of COVID-19 Cases. In Proceedings of the 2021 IEEE 12th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), Vancouver, BC, Canada, 27–30 October 2021; pp. 0633–0640. [Google Scholar] [CrossRef]
  85. Mahima, Y.; Ginige, T. COVID-19 Spread prediction Based on Food Categories using Data Science. In Proceedings of the 2020 IEEE International Conference for Innovation in Technology (INOCON), Bangluru, India, 6–8 November 2020; pp. 1–7. [Google Scholar] [CrossRef]
  86. Mei, W.; Liu, Z.; Long, B.; Su, Y. Infectious Diseases Dynamic Transmissibility with Age Structure and Medical Resources. In Proceedings of the 2021 China Automation Congress (CAC), Beijing, China, 22–24 October 2021; pp. 1543–1548. [Google Scholar] [CrossRef]
  87. Patayon, U.B. Time Series Analysis of Infected COVID-19 Cases in the Zamboanga Peninsula, Philippines using Long Short-Term Memory Neural Networks. In Proceedings of the 2021 4th International Conference of Computer and Informatics Engineering (IC2IE), Depok, Indonesia, 14–15 September 2021; pp. 106–111. [Google Scholar] [CrossRef]
  88. Pickering, L.; Viana, J.; Li, X.; Chhabra, A.; Patel, D.; Cohen, K. Identifying Factors in COVID—19 AI Case Predictions. In Proceedings of the 2020 7th International Conference on Soft Computing & Machine Intelligence (ISCMI), Stockholm, Sweden, 14–15 November 2020; pp. 192–196. [Google Scholar] [CrossRef]
  89. Rohini, M.; Naveena, K.; Jothipriya, G.; Kameshwaran, S.; Jagadeeswari, M. A Comparative Approach to Predict Corona Virus Using Machine Learning. In Proceedings of the 2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS), Coimbatore, India, 25–27 March 2021; pp. 331–337. [Google Scholar] [CrossRef]
  90. Satu, S.; Rahman, K.; Alam Rony, M.; Shovon, A.R.; Alam Adnan, J.; Howlader, K.C.; Kaiser, M.S. COVID-19: Update, Forecast and Assistant—An Interactive Web Portal to Provide Real-Time Information and Forecast COVID-19 Cases in Bangladesh. In Proceedings of the 2021 International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD), Dhaka, Bangladesh, 27–28 February 2021; pp. 456–460. [Google Scholar] [CrossRef]
  91. Sri, S.; Nagarathinam, S.; Ishvarya, K.; Srinidhi, S. COVID-19 Prediction Using FbProphet. In Proceedings of the 2022 8th International Conference on Advanced Computing and Communication Systems (ICACCS), Coimbatore, India, 25–26 March 2022; pp. 1–5. [Google Scholar] [CrossRef]
  92. Wang, H.; Tao, G.; Ma, J.; Jia, S.; Chi, L.; Yang, H.; Zhao, Z.; Tao, J. Predicting the Epidemics Trend of COVID-19 Using Epidemiological-Based Generative Adversarial Networks. IEEE J. Sel. Top. Signal Process. 2022, 16, 276–288. [Google Scholar] [CrossRef]
  93. Zhou, Q.; Tao, W.; Jiang, Y.; Cui, B. A Comparative Study on the Prediction Model of COVID-19. In Proceedings of the 2020 IEEE 9th Joint International Information Technology and Artificial Intelligence Conference (ITAIC), Chongqing, China, 11–13 December 2020; pp. 1348–1352. [Google Scholar] [CrossRef]
  94. Zhang, P.; Wang, Z.; Chao, G.; Huang, Y.; Yan, J. An Oriented Attention Model for Infectious Disease Cases Prediction. In Advances and Trends in Artificial Intelligence. Theory and Practices in Artificial Intelligence. IEA/AIE 2022. Lecture Notes in Computer Science; Fujita, H., Fournier-Viger, P., Ali, M., Wang, Y., Eds.; Springer: Cham, Switzerland, 2022; Volume 13343. [Google Scholar] [CrossRef]
Table 1. Qualitative characteristics of included studies, reported in alphabetical order. From the study of Ajith A [74] the proceedings/conference papers.
Table 1. Qualitative characteristics of included studies, reported in alphabetical order. From the study of Ajith A [74] the proceedings/conference papers.
Author [Ref.]Publication YearCountry of StudyStudy PeriodDiseaseData SourceModel and/or TechniquesAimMain ResultsAccuracy/Best ModelSpace/Time ResolutionOrder of Magnitude Modeled PopulationsFundsConflicts of Interest
Absar N. [20]2022BangladeshMarch 2020–August 2021COVID-19Health division of the government of the Republic of BangladeshLSTMPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseLSTM shows a high accuracy for predicting cases, shows a medium–low accuracy especially in the long term for predicting death cases17-month forecast divided dailyThousands of casesNoNone
Adiga A. [21]2021USAugust 2022–January 2021COVID-19Centers for Disease ControlLSTM, BMA, AR, ARIMA, EnKF, SEIRPredicting the trend of COVID-19 pandemicIt is possible to make a more accurate prediction by integrating machine learning with other statistical and mathematical modelsLSTM which is trained over short observation windows, is able to learn accurately the sharp rise or drop in cases relatively quickly, and start obtaining high weights in the subsequent weeks5-month forecast divided weeklyThousands of casesYesNone
Ahmad H.F. [22]2021KSA, Kuwait, Bahrain, UAEMarch 2020–January 2021COVID-19Kaggle. Novel Corona Virus 2019 Data Set *ARIMA, SIR, bi-LSTM, LRM, SVRPredicting the trend of COVID-19 pandemicLong-term outbreaks of infectious diseases cannot be predictedBi-LSTM is the best model with the highest accuracy9-month forecast divided dailyThousands of casesNoNone
Ak Ç. [23]2020TurkeyJanuary 2004–December 2017Crimean-Congo hemorrhagic fever (CCHF)Ministry of Health of TurkeyGaussian process regression (GPR)Predicting Crimean-Congo hemorrhagic fever in 2016 and 2017The model predicts annual cases in 2016 e 2017Annual predictions for 2016 and 2017 are accurate, but the predictions for individual provinces are not as much accurate24-month forecast divided monthly; identify the province where there will be more casesHundreds of casesYesNone
Ak Ç. [24]2018TurkeyJanuary 2004–December 2015Crimean-Congo hemorrhagic fever (CCHF)Ministry of Health of TurkeyGaussian process regression (GPR), BRT, RFRPredicting Crimean-Congo hemorrhagic feverGaussian process formulation obtained better results than two frequently used standard machine-learning algorithms (i.e., random forests and boosted regression trees) under temporal, spatial, and spatiotemporal prediction scenariosGPR algorithm did a better job than RFR and BRT algorithms by predicting CCHF case counts more accurately24-month forecast divided monthly; identify the province where there will be more casesHundreds of casesYesNone
Alsmadi M.K. [25]2022Jordan2020–2021COVID-19John Hopkins University **; Worldometer ***Modified SEIR, DNNPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe DNN is more efficient and accurate compared with the traditional prediction methods, decision trees, and linear regression3-month forecast divided dailyThousands of casesYesNot reported
Ardabili S.F. [26]2020Italy, German, China, US, IranJanuary–March 2020COVID-19Worldmeter ***MLP, GA, PSO, GWO, ANFIS, SIR, SEIRPredicting the trend of COVID-19 pandemicOutbreak prediction can be realized by integrating machine-learning and SEIR modelsIntegration of machine learning and SIR/SEIR models is suggested to enhance the existing standard epidemiological models in terms of accuracy and longer lead time3-month forecast divided dailyThousands of casesYesNone
Asfahan S. [27]2020South KoreaJanuary–March 2020COVID-19Korea’s center for disease controlPROPHET (open-source automated machine learning)Predicting the trend of COVID-19 pandemicThe difference between predicted and observed values ranged from 4% to 12% in naïve population and in short termMAPE index of authors model for 1 week was 7.42%, which is indicative of a highly accurate forecasting model1-month forecast divided dailyThousands of casesNot reportedNone
Bagheri H. [28]2020Iran2010–2018BrucellosisHealth Surveillance System of IranRBF, MLPPredicting human brucellosis casesThe model could be effectively used in predicting infectious diseaseRBF is a more common type of neural network learning that responds to a limited section of the input space; it has a faster and more accurate and yet simpler network structure compared with other neural networks, while the MLP is more generalizable18-month forecast divided monthly; identify the province where there will be more casesDozens of casesYesNone
Balogh A. [29]2022European Union2020–2022COVID-19Global COVID-19 Trends and Impact Survey, Johns Hopkins University **, Worldmeter ***, European Centre for Disease Prevention and Control, National Centers for Environmental Information, EurostatRFPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseNot reported16-month forecast divided dailyThousands of casesYesNone
Benedum C.M. [30]2020Perù, Puerto Rico, Singapore1990–2016DenguePassive surveillance systems for Perù and Puerto Rico, Ministry of Health for SingaporeRF, RF-UFA, ARIMA, LASSOPredicting weekly dengue casesFor near-term predictions of weekly case counts and when using surveillance data, ML models had 21% and 33% less error than regression and time series models, respectivelyWhen using dengue surveillance, population, temporal, and weather data as model inputs, RF was more accurate than both Poisson regression and ARIMA models, for near-term predictions, while the ARIMA model performed best for long-term predictions; when predicting dengue outbreaks, RF-UFA outperformed both RF and logistic regression models when using only population, temporal, and weather data as model inputs48-month forecast divided weeklyThousands of casesYesNone
Chaurasia V. [31]2022WorldJanuary 2020–May 2020COVID-19WHOARIMA, Holt linear trend method, Holt’s Winter method, naïve method, simple average, moving average, single exponential smoothingPredicting the trend of COVID-19 pandemicThe models were to be adjusted, often not adequate.The naïve method was best suited as described, better than all other methods3-month forecast divided dailyMilions of casesNot reportedNone
Chen J. [32]2018Japan, Taiwan, Thailandia, Singapore2001–2014Dengue, malaria, chickenpox, hand foot and mouth diseaseNational Institute of Infectious Diseases for Japan, Taiwan National Infectious Disease Statistics System, Ministry of Health of Singapore,LASSO modelsPredicting the trend of dengue, malaria, chickenpox, hand foot and mouth diseaseThe model could be effectively used in predicting infectious diseaseFor LASSO, the models used for prediction, including different sets of predictors, have varying effects in different situations; short-term predictions generally perform better than longer term predictionsCases in the next 14 days; identify the province/states where there will be more casesThousands of casesYesNone
Chimmula V.K.R. [33]2020CanadaJanuary–March 2020COVID-19Johns Hopkins University **, Canadian Health AuthorityLSTM, deep learningPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe LSTM model has an accuracy of about 93%3-month forecast divided dailyThousands of casesYesNone
Dash S. [34]2021Brazil, US, India, France, UK, RussiaApril 2020–April 2021COVID-19Johns Hopkins University **, WHO, CDC, COVID19 India °ARIMA modelsPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe model achieved 85% accuracy for all the countries and all states of India2-month forecast divided dailyThousands of casesYesNot reported
Dixon S. [35]2022Australia, Israel, Norway, Sweden, US, Japan2008–2019Campylobacteriosis, Q-fever, typhoidWHO, epiarchive.bsvgateway.org, gadm.org, NASA, diva-gis.org, United Nations, naturalearthdata.comRF, XGB, MLP, ARIMA, ARIMAX, GLARMA, SARIMAPredicting of campylobacteriosis, Q-fever, typhoid outbreaksThe model could be effectively used in predicting infectious diseaseThe XGB models performed the best for all diseases, and in general, tree-based ML models performed the best when looking at data splitsDivided daily, time interval varies depending on the diseaseHundreds of casesYesNone
Gónzalez-Bandala D.A. [36]2020Mexico2002–2019Acute respiratory infection (ARI)Mexican Health MinistrySoS, FFNNPredicting of ARIThe model could be effectively used in predicting infectious diseaseThe results show that the combination of different data analysis techniques (FFNN, SoS, and smoothed endemic channels) can provide an accurate prediction for ARI data 1 week in advance; the final model could be used, along with the endemic channels, to detect possible outbreaks12-month forecast divided weeklyThousands of casesYesNone
Goo T. [37]2021South KoreaJanuary 2020–February 2021COVID-19Kaggle. Novel Corona Virus 2019 Data Set *, Ministry of Health of South KoreaGBM, LSTM, SEIR, local linear regression (LLR), negative binomial (NB), segment PoissonPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseLLR, GBM, SEIR, NB, and LSTM, separately, performed well in the forecasting of the pandemic situation of the country2-month forecast divided dailyThousands of casesNoDeclared
Guo Y. [38]2020ChinaJuly 2015–December 2017Hepatitis EShandong Center for Disease Control and Prevention (SCDC)ARIMA, SVM, LSTMPredicting incidence of hepatitis EThe model could be effectively used in predicting infectious diseaseLSTM is the most suitable for predicting hepatitis E monthly incidence and cases number30-month forecast divided monthly; identify the province where there will be more casesDozens of casesYesNone
Haq I. [39]2022BangladeshApril 2020–August 2021COVID-19Institute of Epidemiology Disease Control and Research (IEDCR) of Bangladesh, Worldometer **, Directorate General of Health Services of BangladeshSIR, PROPHET (an open-source automated machine learning)Predicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseNo data on accuracy, but the authors say it appears that the PROPHET algorithm is appropriate for pandemic data with a growing trend12-month forecast divided dailyThousands of casesNot reportedNone
Kamana E. [40]2022ChinaJanuary 2004–December 2016MalariaChinese Center for Disease Control and PreventionXGB, LSTM, LSTMSeq2SeqPredicting incidence of MalariaThe LSTMSeq2Seq model significantly improved the prediction of malaria re-emergence according to the influence of climatic factorsThe LSTMSeq2Seq model achieved an average prediction accuracy of 87.3%24-month forecast divided monthly; identify the province where there will be more casesThousands of casesYesNone
Katragadda S. [41]2022USMarch–December 2020COVID-19Corona Data Scraper open-source projectLR, SVR, K-nearest neighbor regression, MLP, RF, XGBPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe prediction accuracy improved by 33.78% for the whole duration of the pandemic in 2020 (March–December) when visitor mobility was used in the forecasting model; the prediction accuracy improved by 33.78% for the whole duration of the pandemic in 2020 (March–December) when visitor mobility was used in the forecasting modelCases in the next 14 days; identify the province/states where there will be more casesThousands of casesYesNone
Ketu S. [42]2021World, China, India, ItalyDecember 2019–June 2020COVID-19WHOGaussian process regression, MTGP, LR, SVR, RFPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseEnhanced multitask Gaussian process regression (MTGP) model on the COVID-19 outbreak forecasting is better than the traditional forecasting model6-month forecast divided bi-weeklyMilions of casesNot reportedNot reported
Kim J. [43]2021237 different CountriesJanuary–December 2019Multiple infectious diseasesMedia articles, MedisysSVM, SSL, DNNPredicting incidence of infectious diseasesThe model could be effectively used in predicting infectious diseasesSSL shows outstanding performance compared to other two models, SVM and DNN also show reasonable performance, showing average accuracy over 0.7Cases in the next 3 months; identify the province/states where there will be more casesMilions of casesYesNone
Kim J. [44]2019Hong KongJanuary 2004–January 2018Influenza-like illness (ILI)Media articles, Centre for Health Protection (CHP) of Hong KongSVMPredicting weekly incidence of ILIThe model could be effectively used in predicting infectious diseaseThe prediction result using news text data with SVM exhibited a mean accuracy of 86.7%48-month forecast divided weeklyThousands of casesYesNone
Kumar S.L. [45]2021US, India, Brazil, RussiaUntil 25 November of 2020COVID-19World in Data by University of OxfordLR, ARIMA, RF, LSTMPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseARIMA outperformed linear regression and random forest in terms of accuracy prediction of test data; ARIMA and LSTMs were compared again with the death-forecasting task, in which LSTMs were able to provide very high accuracy in comparison6-month forecast divided dailyThousands of casesNot reportedNot reported
Lmater M.A. [46]2021Belgium, Morocco, Netherlands, RussiaFebruary–November 2020COVID-19Worldmeter ***SIDR modelPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseNot reported6-month forecast divided dailyThousands of casesNoNone
Lu F.S. [47]2018USSeptember 2012–May 2017Influenza-like illness (ILI)Boston Public Health Commission, Google searches, Twitter posts, FNY mobile app reports, EHRsARGOPredicting weekly incidence of ILIIt is possible to forecast influenza 1 week ahead of the current dateEnsemble-based methods incorporating information from diverse models that are based on multiple data sources, including ARGO, produced the most robust and accurate results; the observed Pearson correlations between our out-of-sample flu activity estimates and those historically reported by the BPHC were 0.98 in nowcasting influenza and 0.94 in forecasting influenza 1 week ahead of the current date12-month forecast divided dailyThousands of casesYesDeclared
Marra A.R. [48]2020US2015–2017Clostridiodes difficileEHRsLR, FR, naïve Bayes, K-nearest neighbor, MLP, Lib SVM, decision tree (J48), AdaBoost (M1), bagging, radial basis function classifierPredicting incidence of Clostridiodes difficileMultiple machine-learning models yielded only modest results in a real-world populationLogistic regression, random forest and naïve Bayes models yielded the highest performance: 0.6Not applicableDozens of casesNoNone
Meng D. [49]2021ChinaJanuary 2009–December 2017Hand, foot, and mouth diseaseThe Dara-Center of China Public Health Science, China Meteorological Data Service Centre, Tencent Location Big Data, The 2018 China Statistical YearbookRF, XGBPredicting the trend of hand, foot, and mouth diseaseThe model could be effectively used in predicting infectious diseaseThe prediction capability of XGBoost model was better than that of random forest model from the overall perspective2-month forecast divided daily; identify the province where there will be more casesThousands of casesYesNone
Mohammad Masum A.K. [50]2020BangladeshMay–June 2020COVID-19Institute of Epidemiology Disease Control & Research of BangladeshLSTM, ML, RFR, SVRPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseLSTM is perfect fitted of real-time analysis; LSTM is best mod1-month forecast divided dailyThousands of casesNot reportedNot reported
Murphy C. [51]2021SpainJanuary 2020–December 2021COVID-19COVID-19 en España https://cnecovid.isciii.es (accessed on 29 January 2023), Observatorio del Transporte y la Logística en España https://observatoriotransporte.mitma.gob.es/estudio-experimenta (accessed on 29 January 2023)GNN, MLEPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseGNN provides more-accurate predictions than the MLE in general and across all degrees14-month forecast divided dailyThousands of casesYesNone
Nguyen V.H. [52]2022Vietnam1997–2016DengueNational Institute of Hygiene and Epidemiology of Vietnam, Vietnam Institute of Meteorology, Hydrology and EnvironmentLSTM-ATT, LSTM, weather-based dengue fever forecasting, XGB, SARIMA, Poisson regression, SVR, SVR-L, CNN, TFPredicting incidence of dengueThe model could be effectively used in predicting infectious diseaseLSTM-ATT displayed the highest performanceIdentify the province where there will be more casesHundreds of casesYesNone
Niraula P. [53]2022SpainMarch 2020–February 2021COVID-19Open data portal of Castilla-Leon, Barcelona Supercomputing CenterLSTM, Poisson regressionPredicting the trend of COVID-19 pandemicThe results show that a Bayesian model informed by a neural network method is generally able to predict the number of cases of COVID-19 in both space and timeNot reported3-month forecast divided daily; identify the province where there will be more casesThousands of casesYesNone
Nsoesie E.O. [54]2021Cameroon2012–2018Influenza-like illness (ILI)Google searches for influenza symptomsARIMA, SVM, RF, multivariable linear regressionPredicting weekly incidence of ILIThe model could be effectively used in predicting infectious diseaseRF and SVM had the highest average R2 (0.78 and 0.88, respectively) for predicting ILI per 100,000 persons at the country level72-month forecast divided weekly; identify the province where there will be more casesThousands of casesYesNone
Patil S. [55]2021India2009–2023DengueIndian Meteorological Department, National Vector-Borne Disease Control ProgramSVR, ARIMA, SARIMA, PROPHET, RF, ElasticNet regression, multiple linear regression, polynomial regression, decision tree regressionPredicting incidence of dengueThe model could be effectively used in predicting infectious diseaseNo model prevails over the other; the accuracy is quite high, but depending on the city, one model is better than the other12-month forecast divided daily; identify the province where there will be more casesThousands of casesYesNone
Pourghasemi H.R. [56]2020IranMarch–June 2020COVID-19Iranian’s Ministry of Health and Medical EducationSVM, ARIMAPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe SVM model showing good forecast accuracy was used for mapping the outbreak risk of COVID-19; among the four kernel functions of SVM, RBF has been proved to generate high accuracy models; the generated model using SVM had a good predictive accuracy—0.786 and 0.7994-month forecast divided daily; identify the province where there will be more casesThousands of casesYesNone
Roster K. [57]2022Brazil2014–2021Dengue, zika, influenza, COVID-19Notifiable Diseases Information System of BrazilRF, TrAdaBoost, Neurol Network (NN)Predicting incidence of dengue, zika, and COVID-19The model could be effectively used in predicting infectious diseasesRF models performed better for zika forecasts, while NN models were more successful in predicting COVID-19; NN transfer models predicted the most similar target disease with greater accuracy, while TrAdaBoost and the direct RF model fared better for less similar target diseasesNot applicableThousands of casesYesNone
Saba T. [58]2021India, Iran, Greece, Bulgaria, China, Sweden, Netherlands, Iceland, RussiaJanuary 2020–January 2021COVID-19https://github.com/CSSEGISandData (accessed on 29 January 2023)RF, polynomial regression, SVR, GBR, KNN, decision tree, SARIMA, A’IMA, Holt’s WinterPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseIt is impossible to recommend a single approach to model and forecasting for all data sets in respect of obtained results; as the different data sets exhibited different trends, depending upon the size, nature, and type of the lockdown14-month forecast divided weeklyThousands of casesYesNone
Shaghaghi N. [59]2021USNot reportedCOVID-19New York Times COVID-19 Data, Google TrendsLSTM, RNNPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseeVision has been able to achieve an accuracy of 89% for predicting the trend of the COVID-19 outbreak in the United StatesIdentify the province where there will be more casesThousands of casesNot reportedNot reported
Shen C. [60]2021ChinaNovember 2019–March 2020COVID-19Weibo, Chinese Center for Disease Control and PreventionDecision tree, extra trees, KNN, MLP, SVM, RFPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseSVM, RF, MLP, KNN, DT, and extra tree have accuracy better than 0.8Identify the province where there will be more casesThousands of casesNot reportedNone
Shen L. [61]2022Europe2008–2018BrucellosisEuropean Center for Disease Control and prevention, World Animal Health Information SystemConvolutional LSTM, LSTM, ARIMAPredicting human brucellosis casesThe model could be effectively used in predicting infectious diseaseThe prediction results have shown that LSTM and ConvLSTM models have higher forecast precision12-month forecast divided monthly; identify the province where there will be more casesDozens of casesYesNone
Shi Y. [62]2016Singapore2001–2013DengueSingapore’s Ministry of HealthLASSO modelsPredicting incidence of dengueThe model could be effectively used in predicting infectious diseaseThe model specifically optimizes predictive accuracy over a 3-month time horizon12-month forecast divided weeklyThousands of casesYesNone
Tiwari D. [63]2022WorldJanuary–May 2020COVID-19Kaggle *Naïve Bayes, SVM, LRPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseNaïve Bayes is more accurate than the other models in the study for predicting future COVID-19 trends3-month forecast divided daily; identify the province where there will be more casesMilions of casesNot reportedNone
Venkatramanan S. [64]2021US, Australia2009–2018InfluenzaNational Notifiable Disease Surveillance Syste of Australia Government, CDC FluView, The New Jersey Department of Health, Australian Bureau of Statistics, American Community Survey, Google,Machine-learned anonymized mobility mapPredicting incidence of influenzaThe model could be effectively used in predicting infectious diseaseThe model performs better during early weeks, especially before onset, but as the season progresses, its performance deteriorates in comparison to the other network models3-month forecast divided weekly; identify the province where there will be more casesThousands of casesYesNone
Verma H. [65]2022India2020–2021COVID-19COVID-19India.orgVanilla LSTM, stacked LSTM, ED_LSTM, Bi-LSTM, CNN, and hybrid CNN + LSTMPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe stacked-LSTM and hybrid CNN + LSTM models perform best relative to other models3-week forecast divided dailyThousands of casesYesNone
Wang H. [66]2022China2010–2017COVID-19Influenza-Like Illness (ILI)SEIRS, LSTM, spatial transmission network (STN)Predicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe results illustrated that the STN not only had good accuracy in forecasting performance but also indicated the spreading directions of infectious diseases among multiple regions, to a certain extent5-month forecast divided weekly; identify the province where there will be more casesThousands of casesYesNone
Wang X. [67]2022USApril 2020–April 2021COVID-19Worldmeters ***, https://www.google.com/covid19/mobility/ (accessed on 29 January 2023)GBM, ordinary differential equation (ODE) modelPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe ODE model predicted the number of daily infected cases up to 35 days in the future, with an average mean absolute percentage error of 20.15%, which was further improved to 14.88% when combined with human mobility data12-month forecast divided dailyThousands of casesYesNone
Wang Y. [68]2022US, Brazil, India2020–2021COVID-19WHOPROPHET (an open-source automated machine learning), ARIMA, SARIMAPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe PROPHET model showed more accuracy in the daily COVID-19 new cases in the US; the ARIMA model is more suitable for predicting cases in Brazil and India12-month forecast divided dailyThousands of casesYesNone
Xu J. [69]2020ChinaJanuary 2005–December 2018DengueChina National Notifiable Disease Surveillance SystemLSTMPredicting incidence of dengueThe model could be effectively used in predicting infectious diseaseThe LSTM model achieved superior performance in predicting dengue cases as compared with other previously published forecasting models24-month forecast divided monthlyThousands of casesYesNone
Xu Q. [70]2017Hong KongJanuary 2014–December 2015Influenza-like illness (ILI)Hong Kong Centers for Health Protection, Google searches for influenza symptoms, Meteorological data are available from Hong Kong ObservatoryGLM, LASSO, ARIMA, DL, FNN, Bayesian model averagingPredicting incidence of ILIThe model could be effectively used in predicting infectious diseaseDL with FNN remains the preferred method for predicting locations of influenza peaks12-month forecast divided weeklyThousands of casesYesNone
Yang Z. [71]2020ChinaTraining on 2003 data of SARSCOVID-19National Health Commission of ChinaModified SEIR, LSTMPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe modified SEIR model is better than LSTM2-month forecast divided daily; identify the province where there will be more casesThousands of casesYesNone
Zhang Y. [72]2022USJanuary–April 2020COVID-19TwitterLR, KNN, SVM, Deep pyramid convolutional neural network, fine-tuning BERTPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseFine-tuning BERT as a tweet classification method and achieve a 0.98 and could to be an early warning on outbreak disease3-month forecast divided daily; identify the province/States where there will be more casesThousands of casesYesNone
Zhong R. [73]2018ChinaJanuary 2017–September 2017Hand, foot, and mouth diseaseShenzhen Health Information Center, Weather Underground app (https://www.wunderground.com/) (accessed on 29 January 2023)XGBPredicting the trend of hand, foot, and mouth diseaseThe model could be effectively used in predicting infectious diseaseCompared with the model only using the previous HFMD rate and temperature factors, the addition of the air-quality factors could make the model better by nearly 16.7%6-month forecast divided dailyThousands of casesYesNone
Ajith A. [74]2020Not reported2014–2020West Nile virus (WNV)Kaggle *RF, naïve Bayes classifier, adaptive boostPredicting the trend of West Nile virusThe model could be effectively used in predicting infectious diseaseRandom forest is the best model for accurately predicting WNV cases6-year forecast divided weeklyThousands of casesNot reportedNot reported
Andreas A. [75]2020World2020COVID-19Worldmeters ***Mathematical forecasting framework based on machine learning and the cloud computing systemPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe proposed model has a R2 equal to 0.994-month forecast divided dailyMilions of casesYesNot reported
Brock P.M. [76]2019Malaysia2000–2014MalariaCase-control studies, global positioning system (GPS)Boosted regression trees (BRT)Predicting the outbreak of MalariaThe model could be effectively used in predicting infectious diseaseThe BRT models to predict P. knowlesi occurrence varied from an AUC of 0.55 (little better than a random model) to a maximum of 0.82Identify the province where there will be more casesThousands of casesYesNone
Chumachenko D. [77]2021Ukraine, UK, Germany, Japan2020–2021COVID-19https://github.com/CSSEGISandData (accessed on 29 January 2023)LASSOPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe model shows more-accurate results when forecasting for 10 days or fewerNot reportedThousands of casesYesNot reported
Chumachenko D. [78]2021Ukraine, UK, Germany, Japan2020–2021COVID-19https://github.com/CSSEGISandData (accessed on 29 January 2023)RFPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe RF method showed the most accurate result among K-nearest neighbor regression, linear regression, LASSO regression, ridge regression, and gradient boostingNot reportedThousands of casesYesNot reported
Fan X.R. [79]2022ChinaJanuary–February 2020COVID-19National Health Commission of the People’s Republic of ChinaISIR, MLR, XGBoost, LightGBM, StackCCPred (stacking based prediction of COVID-19 pandemic)Predicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe results positively proved that the proposed StackCPPred model outperformed the existing models for COVID-19 (R2 = 0.902)1-week forecast divided dailyThousands of casesYesNot reported
Hasri H. [80]2021Malaysia1–14 August 2021COVID-19Ministry of Health of MalaysiaLR, Holt’s WinterPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseHolt’s Winter is more accurate than the LR model: LR model accuracy is 82%, Holt’s Winter accuracy is 89%2-week forecast divided dailyThousands of casesYesNot reported
Kolesnikov A.A. [81]2019Not reportedNot reportedDengueNational Oceanic and Atmospheric Administration in the US Department of Commerce, Philippine Department of Health, CDC, International Society for Infectious Diseases, Landsat, SentinelRF, LSTM, XGB, SARIMA, LightGBM, LR, KNN, CatBoost, KerasPredicting incidence of dengueThe model could be effectively used in predicting infectious diseaseThe most effective predictions were given by a mathematical model based on a combination of spatial analysis techniques (MGWR) and neural networks based on the LSTM architectureNot reportedThousands of casesNot reportedNot reported
Kumari P. [82]2020IndiaJanuary–July 2020COVID-19Government of IndiaArtificial neural networkPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe proposed model is found to be highly accurate in estimating the growth of COVID-19 related parametersNot reportedThousands of casesNot reportedNot reported
Liu Z. [83]2021US, India, BrazilJanuary 2020–January 2021COVID-19https://github.com/CSSEGISandData (accessed on 29 January 2023)Nonlinear autoregressive neural network (NAR), LR, ARIMA, SEIRPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseNAR dynamic neural network is better than the comparison model in the time prediction of the new crown epidemic, with the maximum error of 3.6% and the minimum error of –0.3%2-month forecast divided dailyThousands of casesYesNot reported
Maaliw R.R. [84]2021Philippines, US, India, BrazilMarch 2020–June 2021COVID-19https://github.com/CSSEGISandData (accessed on 29 January 2023)ARIMA, S-LSTMPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe quantitative results show that the ensemble model outperforms stand-alone models of ARIMA and S-LSTM for a 15-day forecast accuracy of 93.50% (infected cases) and 87.97% (death cases)1-week forecast divided dailyThousands of casesYesNot reported
Mahima Y. [85]2020WorldNot reportedCOVID-19Kaggle *LR, bagging regression, RF, KNPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe bagging regressor model gives the highest accuracy3-month forecast divided daily; identify the province/states where there will be more casesThousands of casesNot reportedNot reported
Mei W. [86]2021WorldNot reportedCOVID-19Not reportedTime-variant relevance-based infected recovered extreme learning machinePredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseAuthors say that the proposed method can achieve higher accuracy than existed methodsNot reportedMilions of casesYesNot reported
Patayon U.B. [87]2021PhilippinesMarch 2020–May 2021COVID-19Department of Health Zamboanga Peninsula–Center for Health DevelopmentVanilla LSTM, stacked LSTM, bidirectional LSTM, ConvLSTMPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseConvLSTM trained using Adam and RMSProp delivers the best results as it closely adapted to the trend of actual data1-month forecast divided dailyThousands of casesYesNot reported
Pickering L. [88]2020US2018–2020COVID-19NASA Space AppsSVR, multidimensional regression with interactions, stepwise regression methodPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseAll three methods predict a rise in cases similar to the actual rise in cases and are all able to predict to a certain degree the unexpected dip in cases on the 10th and 11th day of prediction4-month forecast divided dailyThousands of casesNot reportedNot reported
Rohini M. [89]2021IndiaNot reportedCOVID-19Kaggle *KNN, DT, SVM, RTPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe model developed using K-nearest neighbors (KNN) is effective, with a prediction accuracy of 98.34%Not reportedThousands of casesNot reportedNot reported
Satu M.S. [90]2021World2020COVID-19https://github.com/CSSEGISandData (accessed on 29 January 2023)LR, PR, SVR, MLP, poly-MLP, PROPHETPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseasePROPHET is the best model among those proposed: its R2 is equal to 1Not reportedThousands of casesNot reportedNot reported
Sri S.B. [91]2022WorldJanuary 2020–November 2021COVID-19WHOPROPHETPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThrough training, the model is able to predict the cases of the next 7 daysNot reportedThousands of casesNot reportedNot reported
Wang H. [92]2022World2020–2021COVID-19https://github.com/CSSEGISandData (accessed on 29 January 2023)T-SIRGAN (susceptible–infected–recovered, generative adversarial networks), T-GAN, T-COVID, LSTM-SIRGAN, LSTM-GAN, LSTM-COVID, ARIMA, DT, SVM, KNNPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe authors’ T-SIRGAN model has better accuracy than the others used as a comparison4-month forecast divided daily; identify the states where there will be more casesThousands of casesYesNot reported
Zhou Q. [93]2020WorldJanuary–July 2020COVID-19WHO, Baidu BaikeLogistic model, ARIMA, SIR, SEIRPredicting the trend of COVID-19 pandemicThe model could be effectively used in predicting infectious diseaseThe improved SEIR model is the best model and can more precisely predict the future development trend of the global epidemic6-month forecast divided dailyThousands of casesYesNot reported
Zhang P. [94]2022ChinaJanuary 2015–December 2019Hand, foot, and mouth disease, hepatitis BXiamen Center for Disease Control and PreventionOriented attention model (OAM), AR, LSTM, gated recurrent unit, encoder-decoder, CNN, CNN-RNN, LSTM-attn, GRU-attn, ED-attn, CNN-attn and CNNRNN-attnPredicting the trend of hand, foot, and mouth disease and hepatitis BThe model could be effectively used in predicting infectious diseaseThe self-attention significantly improves the predictive accuracy of all comparable methods; the MAE and RMSE values were decreased by 51.67% and 39.43% at most, respectively; the R2 is increasing by 52.99% at mostNot reportedThousands of casesYesNot reported
UK: United Kingdom, US: United States, KSA: Kingdom of Saudi Arabia, UAE: United Arab Emirates, LSTM: long short-term memory, Bi-LSTM: bidirectional LSTM, ED: encoder-decoder, LSTM-ATT: attention-enhanced LSTM, ARIMA: autoregressive integrated moving average, AR: autoregressive, EnKF: nonlinear ensemble Kalman Filter, SEIR: susceptible–exposed–infectious–removed, SIR: susceptible–infected–recovered, SIDR: susceptible–infected–diagnosed–recovered, BMA: Bayesian model averaging, PR: polynomial regression, LRM: linear regression model, SVR: support vector regression, SVR-L: support vector regressor with linear kernel, SVM: support vector machine, DNN: deep neural network, MLP: multilayered perceptron, RF: random forest, ANFIS: adaptive neuro fuzzy inference system, GA: genetic algorithm, PSO: particle swarm optimization, GWO: gray wolf optimizer, RNN: recursive neural network, UFA: univariate flagging algorithm, SARIMA: seasonal ARIMA, SEIAR: susceptible-exposed-symptomatic infected-asymptomatic infected-removed, LASSO: least absolute shrinkage and selection operator algorithm, XGB: extreme gradient boosted, GLARMA generalized linear autoregressive moving averages, ARIMAX: ARIMA with exogenous variables, CNN: convolutional neural network, FFNN: feed forward neural network, SoS: sum of sines, DT: decision tree, GBM: tree-based gradient-boosting machine, MTGP: multitask Gaussian process regression, SSL: semi-supervised learning, TF: transformer, ARGO: autoregression with general online information, WHO: World Health Organization, N.A.: not available. * Available online: https://www.kaggle.com/sudalairajkumar/novel-corona-virus-2019-dataset (accessed on 29 January 2023); ** Available online: https://coronavirus.jhu.edu/map.html (accessed on 29 January 2023); *** Available online: https://www.worldometers.info/coronavirus (accessed on 29 January 2023); ° Available online: https://www.covid19india.org/ (accessed on 29 January 2023).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Santangelo, O.E.; Gentile, V.; Pizzo, S.; Giordano, D.; Cedrone, F. Machine Learning and Prediction of Infectious Diseases: A Systematic Review. Mach. Learn. Knowl. Extr. 2023, 5, 175-198. https://doi.org/10.3390/make5010013

AMA Style

Santangelo OE, Gentile V, Pizzo S, Giordano D, Cedrone F. Machine Learning and Prediction of Infectious Diseases: A Systematic Review. Machine Learning and Knowledge Extraction. 2023; 5(1):175-198. https://doi.org/10.3390/make5010013

Chicago/Turabian Style

Santangelo, Omar Enzo, Vito Gentile, Stefano Pizzo, Domiziana Giordano, and Fabrizio Cedrone. 2023. "Machine Learning and Prediction of Infectious Diseases: A Systematic Review" Machine Learning and Knowledge Extraction 5, no. 1: 175-198. https://doi.org/10.3390/make5010013

Article Metrics

Back to TopTop