You are currently on the new version of our website. Access the old version .
  • Article
  • Open Access

25 June 2020

Predicting the Epidemiological Outbreak of the Coronavirus Disease 2019 (COVID-19) in Saudi Arabia

,
,
,
and
1
Computer Science Department, College of Science and Humanities in Jubail, Imam Abdulrahman Bin Faisal University, Jubail P.O. Box 31961, Saudi Arabia
2
Department of Engineering and Mathematics, Sheffield Hallam University, Sheffield S1 1WB, UK
3
Computer Science and Information Department, College of Computer Science and Engineering, University of Ha’il, Hail 8145, Saudi Arabia
*
Author to whom correspondence should be addressed.

Abstract

The coronavirus diseases 2019 (COVID-19) outbreak continues to spread rapidly across the world and has been declared as pandemic by World Health Organization (WHO). Saudi Arabia was among the countries that was affected by the deadly and contagious virus. Using a real-time data from 2 March 2020 to 15 May 2020 collected from Saudi Ministry of Health, we aimed to give a local prediction of the epidemic in Saudi Arabia. We used two models: the Logistic Growth and the Susceptible-Infected-Recovered for real-time forecasting the confirmed cases of COVID-19 across Saudi Arabia. Our models predicted that the epidemics of COVID-19 will have total cases of 69,000 to 79,000 cases. The simulations also predicted that the outbreak will entering the final-phase by end of June 2020.

1. Introduction

Coronaviruses are a large family of viruses that are distributed among humans and animals such as livestock, birds, bats and other wild animals. These viruses cause serious illnesses for human when infecting respiratory, hepatic, gastrointestinal and neurological [1,2,3]. In December 2019, China notified the World Health Organization (WHO) about a novel coronavirus—severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)—that caused many cases of respiratory illnesses that were mostly related to people who had visited a live animal seafood market in Wuhan city [4]. The disease, now formally called COVID-19 (coronavirus disease 2019), caused an outbreak of a typical pneumonia through human-to-human transmission starting from Wuhan, a highly populated city with more than eleven million residents, then rapidly spread in China [5]. The Chinese authority put Wuhan city and several cities in Hubei province on lockdown, and all the public transportation was stopped to prevent any further spread of the virus [6,7]. However, the confirmed cases have increased daily in China and in many countries around the globe. On 11 March 2020, WHO officially declared COVID-19 outbreak as a global pandemic as the virus has spread to well over 200 countries around the world [8]. To date, COVID-19 has infected more than 4.5 million people and killed over 300,000 people across the world [8].
Saudi Arabia was among the countries that was affected by the virus. On 2 March 2020, the Saudi Ministry of Health reported the first confirmed COVID-19 case in the country. From the beginning of March, the number of confirmed COVID-19 cases gradually increased and the highest number of cases reported in one day was 2307 on 15 May 2020 with a total of 49,176 confirmed cases of COVID-19 infections and 292 deaths have been confirmed in Saudi Arabia (see Figure 1).
Figure 1. Number of (a) confirmed, (b) recovered and (c) deaths cases related to COVID-19 in Saudi Arabia from 2 March 2020 to 15 May 2020.
Mathematical models and simulations are considered an important tools to predict the possibility and severity of disease outbreak and provide main information for determining the type and intensity of disease intervention. Resulting in decreasing the transmission of the diseases and a more accurate approaches to manage the epidemic. Recently, mathematical modeling has been used to predict epidemics such as Foot-and-Mouth Disease (FMD), SARS, Zika and Ebola [9,10,11,12].
This article aims to give a local prediction of the epidemic peak for COVID-19 in Saudi Arabia by using the real-time data from 2 March 2020 to 15 May 2020.
The remainder of this article is arranged as follows. Section 2 presents the related work on COVID-19. Section 3 describes data and the models used in prediction of the epidemic peak for COVID-19 in Saudi Arabia. Section 4 presents our simulation results. Finally, Section 5 draws the conclusion.

3. Methodology

3.1. Data

We collected the daily number of confirmed, recovered, and deaths COVID-19 cases released by Saudi Ministry of Health’s Twitter account from 2 March 2020 to 15 May 2020 to construct a real-time database. The data were organized in a matrix with the rows representing the date and columns representing the number of the new confirmed cases, the number of accumulated cases, the number of accumulated recovered cases and the number of accumulated deaths cases as shown in Table 1 and Table 2.
Table 1. Number of cases in Saudi Arabia from 2 March 2020 to 15 April 2020.
Table 2. Number of cases in Saudi Arabia from 16 April 2020 to 15 May 2020.

3.2. Models

We generated short-term forecasts in real-time using two models namely, the Logistic Growth and SIR models.

3.2.1. Logistic Growth Model

The phenomenological Logistic Growth model has been widely used to model population growth with limited resources and space. The model was originally developed by Haberman in [24] and have been used to predict 2015 Ebola epidemic [11,12]. The dynamics of the epidemic, typically expressed as a cumulative number of cases, can use the similar model when the primary method of control is quarantine—as in the case of a novel viral infection such as COVID-19.
In the Logistic Growth model, the epidemic can be defined by the differential equation
d C d t = r C 1 C K ,
where C is the cumulative cases at time t, r is the early growth rate, and K is the final epidemic size.

3.2.2. Susceptible-Infected-Recovered Model

For comparison, we also simulate the well-known SIR model to represent the spreading process of COVID-19 epidemic in Saudi Arabia. SIR model framework is an epidemic spreading model inspired by the seminal work of Kermack and McKendric [25]. In epidemiology, SIR model is belong to compartmental epidemic models. The basic in compartmental epidemic model is proposed by Hamer in 1906 where he suggested that the infection spread should depend on the number of susceptible population and the number of infected population [26].
We estimate the parameters of model to get a best fit on reported data of COVID-19 outbreak in Saudi Arabia. In SIR model, we assume that the modeling timescale is short, no vital dynamics (births and deaths), and the host population size (N) is constant. In SIR-type models, individuals are classified into three separate groups (or compartments) based on their infectious status:
  • Susceptible (S): group of individuals that not currently infected but may catch the disease.
  • Infected (I): group of individuals that are currently infectious.
  • Recovered or Removed (R): group of individuals that are no longer infectious. They are either recovered, become immune, or have died.
The SIR model is represented by the following system of nonlinear Ordinary Differential Equations (ODEs) [27]:
d S d t = β S I N
d I d t = β S I N γ I
d R d t = γ I
where t is time (in day), β is the contact rate, g a m m a is the remove rate or the inverse of infectious period, S ( t ) is the number of susceptible population at time t, I ( t ) is the number of infected population at time t, and R ( t ) is the number of recovered population at time t (see Figure 2).
Figure 2. SIR model.
In the SIR model, one key parameter to understand the basic epidemiological characteristics of the epidemic is the basic reproduction number ( R 0 ). R 0 is the average number of secondary persons in a complete susceptible population infected by a single infected person during its spreading life. It indicates how contagious is the infectious diseases. The true value of R 0 is uncertain until the outbreak is over. R 0 depends on three factors:
  • Duration of infectiousness;
  • Probability of infection being transmitted during contact between an infected person and a susceptible person;
  • The average rate of contact between infected and susceptible individuals.
R 0 is represented as
R 0 = β γ
when R 0 > 1 virus is currently spreading in the population and when R 0 < 1 virus is stop spreading due to run out of susceptible and the decrease of new cases.

4. Results

4.1. Results of Logistic Growth Model

We used MATLAB to simulate the Logistic Growth and SIR model based on [17,28] models. In the epidemic simulation graphs, regions color separate epidemic into three phases:
  • Red: fast growth phase;
  • Yellow: transition to steady-state phase;
  • Green: ending phase (plateau stage).
The Logistic Growth model results of COVID-19 epidemic in Saudi Arabia are shown in Figure 3 and Figure 4.
Figure 3. COVID-19 epidemic in Saudi Arabia prediction based on Logistic Growth model.
Figure 4. COVID-19 epidemic in Saudi Arabia prediction based on Logistic Growth model (infection rate).
The simulation is data-driven which is rely on the historical time-series data. From the simulations, it is shown that the Logistic Growth prediction is more optimistic compare to the other models. From the Logistic Growth model prediction, the peak of the infection rate is 7 May 2020 (see Figure 3). The transition to steady-state phase starts on 28 May 2020 and the ending phase starts on 14 June 2020. The Logistic Growth model predicts that final number of case is around 69,000 cases.

4.2. Results of Susceptible-Infected-Recovered Model

Several assumptions were taken in the simulation of SIR model: constant total population, uniform mixing of the people, and equally likely recovery of infected. The SIR model of COVID-19 epidemic simulation in Saudi Arabia are shown in Figure 5 and Figure 6.
Figure 5. COVID-19 epidemic in Saudi Arabia prediction based on SIR model.
Figure 6. COVID-19 epidemic in Saudi Arabia prediction based on SIR model (infection rate).
In SIR-model the parameters β and γ are estimated from the actual number of confirmed cases. The variable R 0 changes with respect to time. It will change every day based on the number of cases confirmed. From the SIR-model prediction, the peak of the infection rate is 1 May 2020. The transition to steady-state phase starts on 2 June 2020 and the ending phase starts on 24 June 2020. The SIR-model predict that final number of case is around 79,000 cases.

5. Conclusions

Predicting the epidemic evolution based on limited data and with no past epidemiological data is not trivial task. We present predictions for reported cases of COVID-19 in Saudi Arabia from 2 March to 15 May 2020 using mathematical modeling and simulation. We used two models: Logistic Growth and SIR models. Across both predictions, Logistic Growth and SIR-model provide different results (Figure 3 and Figure 5). Both models are similar in predicting the epidemic trends but with a slightly different timing, the SIR model predicts about a few days late than the Logistic Growth. Both models also have a gap in predicting the final number of cases with SIR model has a higher number of cases compared to the Logistic Growth.
In conclusion, while our models predict the COVID-19 outbreak in Saudi Arabia still in fast growth phase, our predictions need to be interpreted with caution given the dynamic case definition and reporting patterns. Without mass testing, the confirmed case number might be only a subset of the true total infected cases. Also, the asymptomatic infected individuals who are not tested and then recovered do not get counted. Under-reporting and asymptomatic people are observed in many countries worldwide and may lead to under-estimation of the accumulated cases. Therefore, mass testing is needed to identify patients and to contain the spread of the disease. Our prediction based on current data suggests that the epidemic continue to spreading in Saudi Arabia. It is suggested that warmer weather may contribute to slowing down the spread of coronavirus but this would need further investigation when more data is available. It should be noted that the MERS coronavirus has spread in Saudi Arabia in the summer (August) [29].

Author Contributions

Conceptualization, D.A. (Dabiah Alboaneen); Data Collection: D.A. (Dabiah Alboaneen), D.A. (Dhahi Alshammari), R.A., N.A.; Analysis: D.A. (Dabiah Alboaneen), B.P.; Visualisation: B.P., D.A.(Dhahi Alshammari), D.A. (Dabiah Alboaneen); Writing—original draft preparation, D.A. (Dabiah Alboaneen), B.P., D.A.(Dhahi Alshammari), N.A, R.A.; Writing—review and editing, D.A. (Dabiah Alboaneen), B.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Chen, Y.; Liu, Q.; Guo, D. Emerging coronaviruses: Genome structure, replication, and pathogenesis. J. Med. Virol. 2020, 92, 418–423. [Google Scholar] [CrossRef] [PubMed]
  2. Ge, X.Y.; Li, J.L.; Yang, X.L.; Chmura, A.A.; Zhu, G.; Epstein, J.H.; Mazet, J.K.; Hu, B.; Zhang, W.; Peng, C.; et al. Isolation and characterization of a bat SARS-like coronavirus that uses the ACE2 receptor. Nature 2013, 503, 535–538. [Google Scholar] [CrossRef] [PubMed]
  3. Wang, L.F.; Shi, Z.; Zhang, S.; Field, H.; Daszak, P.; Eaton, B.T. Review of bats and SARS. Emerg. Infect. Dis. 2006, 12, 1834. [Google Scholar] [CrossRef] [PubMed]
  4. Novel Coronavirus (2019-nCoV); World Health Organization: Geneva, Switzerland, 2020.
  5. Huang, C.; Wang, Y.; Li, X.; Ren, L.; Zhao, J.; Hu, Y.; Zhang, L.; Fan, G.; Xu, J.; Gu, X.; et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 2020, 395, 497–506. [Google Scholar] [CrossRef]
  6. Du, Z.; Wang, L.; Cauchemez, S.; Xu, X.; Wang, X.; Cowling, B.J.; Meyers, L.A. Risk for Transportation of 2019 Novel Coronavirus (COVID-19) from Wuhan to Cities in China. medRxiv 2020. [Google Scholar] [CrossRef]
  7. Wu, J.T.; Leung, K.; Leung, G.M. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: A modelling study. Lancet 2020, 395, 689–697. [Google Scholar] [CrossRef]
  8. COVID-19 Coronavirus Pandemic. Available online: https://www.worldometers.info/coronavirus/ (accessed on 17 April 2020).
  9. Shanafelt, D.W.; Jones, G.; Lima, M.; Perrings, C.; Chowell, G. Forecasting the 2001 foot-and-mouth disease epidemic in the UK. EcoHealth 2018, 15, 338–347. [Google Scholar] [CrossRef]
  10. Wang, W.; Ruan, S. Simulating the SARS outbreak in Beijing with limited data. J. Theor. Biol. 2004, 227, 369–379. [Google Scholar] [CrossRef]
  11. Chowell, G.; Simonsen, L.; Viboud, C.; Kuang, Y. Is West Africa approaching a catastrophic phase or is the 2014 Ebola epidemic slowing down? Different models yield different answers for Liberia. PLoS Curr. 2014, 6. [Google Scholar] [CrossRef]
  12. Pell, B.; Kuang, Y.; Viboud, C.; Chowell, G. Using phenomenological models for forecasting the 2015 Ebola challenge. Epidemics 2018, 22, 62–70. [Google Scholar] [CrossRef]
  13. Al-qaness, M.A.; Ewees, A.A.; Fan, H.; Abd El Aziz, M. Optimization Method for Forecasting Confirmed Cases of COVID-19 in China. J. Clin. Med. 2020, 9, 674. [Google Scholar] [CrossRef] [PubMed]
  14. Li, Q.; Feng, W. Trend and forecasting of the COVID-19 outbreak in China. arXiv 2020, arXiv:2002.05866. [Google Scholar]
  15. Roosa, K.; Lee, Y.; Luo, R.; Kirpich, A.; Rothenberg, R.; Hyman, J.; Yan, P.; Chowell, G. Real-time forecasts of the COVID-19 epidemic in China from February 5th to February 24th, 2020. Infect. Dis. Model. 2020, 5, 256–263. [Google Scholar] [CrossRef] [PubMed]
  16. Yang, S.; Cao, P.; Du, P.; Wu, Z.; Zhuang, Z.; Yang, L.; Yu, X.; Zhou, Q.; Feng, X.; Wang, X.; et al. Early estimation of the case fatality rate of COVID-19 in mainland China: A data-driven analysis. Ann. Transl. Med. 2020, 8, 128. [Google Scholar] [CrossRef]
  17. Batista, M. Estimation of the final size of the COVID-19 epidemic. medRxiv 2020. [Google Scholar] [CrossRef]
  18. Liu, Z.; Magal, P.; Seydi, O.; Webb, G. Predicting the cumulative number of cases for the COVID-19 epidemic in China from early data. arXiv 2020, arXiv:2002.12298. [Google Scholar] [CrossRef]
  19. Zhao, S.; Musa, S.S.; Lin, Q.; Ran, J.; Yang, G.; Wang, W.; Lou, Y.; Yang, L.; Gao, D.; He, D.; et al. Estimating the unreported number of novel coronavirus (2019-nCoV) cases in China in the first half of January 2020: A data-driven Modelling analysis of the early outbreak. J. Clin. Med. 2020, 9, 388. [Google Scholar] [CrossRef]
  20. Nishiura, H.; Kobayashi, T.; Yang, Y.; Hayashi, K.; Miyama, T.; Kinoshita, R.; Linton, N.M.; Jung, S.m.; Yuan, B.; Suzuki, A.; et al. The rate of underascertainment of novel coronavirus (2019-nCoV) infection: Estimation using Japanese passengers data on evacuation flights. J. Clin. Med. 2020, 9, 419. [Google Scholar] [CrossRef]
  21. Tang, B.; Wang, X.; Li, Q.; Bragazzi, N.L.; Tang, S.; Xiao, Y.; Wu, J. Estimation of the transmission risk of the 2019-nCoV and its implication for public health interventions. J. Clin. Med. 2020, 9, 462. [Google Scholar] [CrossRef]
  22. Thompson, R.N. Novel coronavirus outbreak in Wuhan, China, 2020: Intense surveillance Is vital for preventing sustained transmission in new locations. J. Clin. Med. 2020, 9, 498. [Google Scholar] [CrossRef]
  23. Jung, S.m.; Akhmetzhanov, A.R.; Hayashi, K.; Linton, N.M.; Yang, Y.; Yuan, B.; Kobayashi, T.; Kinoshita, R.; Nishiura, H. Real-time estimation of the risk of death from novel coronavirus (COVID-19) infection: Inference using exported cases. J. Clin. Med. 2020, 9, 523. [Google Scholar] [CrossRef] [PubMed]
  24. Haberman, R. Mathematical Models: Mechanical Vibrations, Population Dynamics, and Traffic Flow; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 1998; Volume 21. [Google Scholar]
  25. Kermack, W.O.; McKendrick, A.G. A contribution to the mathematical theory of epidemics. Proc. R. Soc. Lond. Ser. A Contain. Pap. Math. Phys. Charact. 1927, 115, 700–721. [Google Scholar]
  26. Hamer, W.H. Epidemic Disease in England: The Evidence of Variability and of Persistency of Type; Bedford Press: Bedford, UK, 1906. [Google Scholar]
  27. Hethcote, H.W. The mathematics of infectious diseases. SIAM Rev. 2000, 42, 599–653. [Google Scholar] [CrossRef]
  28. Batista, M. Estimation of the final size of the second phase of the coronavirus epidemic by the logistic model. medRxiv 2020. [Google Scholar] [CrossRef]
  29. Le Page, M. Will heat kill the coronavirus? New Sci. 2020, 245, 6–7. [Google Scholar] [CrossRef]

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.