Real-Time Estimation of the Risk of Death from Novel Coronavirus (COVID-19) Infection: Inference Using Exported Cases

Jung, Sung-mok; Akhmetzhanov, Andrei R.; Hayashi, Katsuma; Linton, Natalie M.; Yang, Yichi; Yuan, Baoyin; Kobayashi, Tetsuro; Kinoshita, Ryo; Nishiura, Hiroshi

doi:10.3390/jcm9020523

Open AccessFeature PaperArticle

Real-Time Estimation of the Risk of Death from Novel Coronavirus (COVID-19) Infection: Inference Using Exported Cases

by

Sung-mok Jung

¹

,

Andrei R. Akhmetzhanov

¹

,

Katsuma Hayashi

¹,

Natalie M. Linton

¹

,

Yichi Yang

¹,

Baoyin Yuan

¹,

Tetsuro Kobayashi

¹,

Ryo Kinoshita

¹

and

Hiroshi Nishiura

^1,2,*

¹

Graduate School of Medicine, Hokkaido University, Kita 15 Jo Nishi 7 Chome, Kita-ku, Sapporo-shi, Hokkaido 060-8638, Japan

²

Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Honcho 4-1-8, Kawaguchi, Saitama 332-0012, Japan

^*

Author to whom correspondence should be addressed.

J. Clin. Med. 2020, 9(2), 523; https://doi.org/10.3390/jcm9020523

Submission received: 29 January 2020 / Revised: 11 February 2020 / Accepted: 12 February 2020 / Published: 14 February 2020

(This article belongs to the Special Issue Real Time Clinical and Epidemiological Investigations on Novel Coronavirus - Part I)

Download

Browse Figures

Versions Notes

Abstract

:

The exported cases of 2019 novel coronavirus (COVID-19) infection that were confirmed outside China provide an opportunity to estimate the cumulative incidence and confirmed case fatality risk (cCFR) in mainland China. Knowledge of the cCFR is critical to characterize the severity and understand the pandemic potential of COVID-19 in the early stage of the epidemic. Using the exponential growth rate of the incidence, the present study statistically estimated the cCFR and the basic reproduction number—the average number of secondary cases generated by a single primary case in a naïve population. We modeled epidemic growth either from a single index case with illness onset on 8 December 2019 (Scenario 1), or using the growth rate fitted along with the other parameters (Scenario 2) based on data from 20 exported cases reported by 24 January 2020. The cumulative incidence in China by 24 January was estimated at 6924 cases (95% confidence interval [CI]: 4885, 9211) and 19,289 cases (95% CI: 10,901, 30,158), respectively. The latest estimated values of the cCFR were 5.3% (95% CI: 3.5%, 7.5%) for Scenario 1 and 8.4% (95% CI: 5.3%, 12.3%) for Scenario 2. The basic reproduction number was estimated to be 2.1 (95% CI: 2.0, 2.2) and 3.2 (95% CI: 2.7, 3.7) for Scenarios 1 and 2, respectively. Based on these results, we argued that the current COVID-19 epidemic has a substantial potential for causing a pandemic. The proposed approach provides insights in early risk assessment using publicly available data.

Keywords:

mortality; censoring; travel; migration; importation; emerging infectious diseases

1. Introduction

Since 8 December 2019, clusters of pneumonia cases of unknown etiology have emerged in Wuhan City, Hubei Province, China [1,2]. Virological investigation suggests that the causative agent of this pneumonia is a novel coronavirus (COVID-19) [3]. As of 27 January 2020, a total of 4515 cases including 106 deaths were confirmed [4]. Forty-one cases of COVID-19 infections were also reported outside China, in other Asian countries, the United States, France, Australia, and Canada.

A local market selling seafood and wildlife in Wuhan was visited by many cases in the initial cluster, indicating that a common-source zoonotic exposure may have been the main mode of transmission [5]. However, even after shutting down the market, the number of cases continued to grow across China and several instances of household transmission were reported [6]. It is now speculated that sustained human-to-human transmission aided in the establishment of the epidemic [7] and that reported case counts greatly underestimated the actual number of infections in China [8].

Early assessment of the severity of infection and transmissibility can help quantify the pandemic potential of COVID-19 and anticipate the likely number of deaths by the end of the epidemic. One important epidemiological measure of severity is case fatality risk (CFR), which can be measured using three different approaches, by estimating (i) the proportion of the cumulative number of deaths out of the cumulative number of cases at a point in time, (ii) the ratio of the cumulative number of deaths to the cumulative number of infected individuals whose clinical outcome is known (i.e., the deceased or recovered), and (iii) the risk of death among confirmed cases, explicitly accounting for the time from illness onset to death [9]. Estimating the CFR using the ratio of deaths to confirmed cases (cCFR), with an adjustment of the time delay from illness onset to death (i.e., method (iii)), can provide insight into severity of the disease, because the naïve CFR based on method (i) tends to be an underestimate due to the real-time nature of the growth of fatal cases. For example, during the early stage of an epidemic, failing to right-censor cases with respect to the time delay from illness onset to death may lead to underestimation of the CFR. This is because death due to infection may yet occur following case identification [9,10,11].

When the CFR denominator only includes confirmed cases it is referred to as the cCFR [10] and may overestimate the actual CFR among all infected individuals due to under-ascertainment of infections in the population. Nonetheless, the cCFR is still a valuable measure of the upper bound of the CFR among all symptomatic cases (sCFR), particularly in circumstances of high uncertainty, such as the emergence of a new human pathogen (i.e., COVID-19).

The basic reproduction number (

R_{0})

, the average number of secondary cases generated by a single primary case in a fully susceptible population, represents an epidemiological measurement of the transmissibility, helping us to quantify the pandemic potential of COVID-19. Here, we define a pandemic as the worldwide spread of a newly emerged disease, in which the number of simultaneously infected individuals exceeds the capacity for treatment [12]. Using the growth rate of the estimated cumulative incidence from exportation cases and accounting for the time delay from illness onset to death, the present study aims to estimate the cCFR of COVID-19 in real-time.

2. Methods

2.1. Epidemiological Data

Information on exported COVID-19 cases who were confirmed in other countries and deaths due to COVID-19 infection in China were retrieved from the first announcement date of the current outbreak (i.e., 31 December 2019) through 24 January 2020. The cutoff time of 24 January was specifically selected to reflect the governmental ban on public transportation from Wuhan (including flights) which was invoked on 23 January 2020 [13]. All data were collected either from government websites or from media quoting government announcements. Our data include 51 cases diagnosed outside China who had illness onset through 24 January and were reported by 9 February 2020 and the dates of illness onset and death among 41 deceased cases in China.

2.2. Estimation of the Delay Distributions

The observed incidence i(t) by date of illness onset t is modeled using an exponential growth model with the rate r:

i (t) = i_{0} e^{r t}

, where i₀ is the expected number of infected cases at time t = 0. The cumulative incidence I(t) is an integral of i(t) over the time interval from zero to t that can be written as

I (t) = \int_{0}^{t} i (s) d s = i_{0} (e^{r t} - 1) / r

. The cumulative incidence is adjusted to the date of report by the factor u dependent on the parameters of the delay distribution. For the estimation of time delay distribution from illness onset to death, we accounted for right truncation and modeled and used a lognormal distribution with parameters adopted from an earlier study [14]. Let

f (t; θ)

be the lognormal distribution with parameters

θ_{d} = {a_{d}, b_{d}}

. Then, the cumulative incidence I(t) by date of report t can be adjusted to the time from illness onset to death and report by simply multiplying it by the factor

u (r, θ_{d})

, which is a consequence of the exponentially growing epidemic [10]. The factor u is defined by the following integral:

u (r, θ_{d}) = \int_{0}^{\infty} \exp (- r s) f (s; θ_{d}) d s

(1)

The adjusted cumulative incidence then obeys

u (r, θ_{d}) \cdot I (t)

. The cumulative number of deaths D(t) reported by date t is the result of Binomial sampling:

D (t) ~ binom (size = u (r, θ_{d}) \cdot I (t), prob = C F R (t))

(2)

Let

a_{e}

and

b_{e}

be the shape and inverse scale of the gamma distribution modeling the distribution of time interval from illness onset to the report for observed exported cases. The cumulative incidence in China can be then adjusted by the multiplicative factor

\tilde{u} (r, θ_{e} = {a_{e}, b_{e}})

. The number of exported cases E(t) can then be sampled from another Binomial distribution:

E (t) ~ binom (size = \tilde{u} (r, θ_{e}) \cdot I (t), prob = p)

(3)

where p describes the probability of finding a traveler from Wuhan among all travelers from China subject to the detection time window of the virus T = 12.5 days [8]. Given that the total volume of inbound passengers from China is M = 5.56 million passengers per year, the fraction of Wuhan travelers is

ϕ

= 0.021%, and the population of Wuhan is n = 11 million, the probability p is given by

p = \frac{ϕ M T}{365 n} \approx 0.0036

(4)

2.3. Statistical Inference

First, we fitted the delay distribution of the time from illness onset to report

Δ T_{e, k}

(

k \leq K_{e}

) to the gamma distribution. We define the log-likelihood as follows:

l o g L_{e} (θ_{e} | Δ T_{e, k}) = \sum_{k} \log (gamma (Δ T_{e, k} | shape = a_{e}, scale = b_{e})

(5)

that is maximized to find the mean values of the parameters

{\tilde{θ}}_{e} = {{\tilde{a}}_{e}, {\tilde{b}}_{e}}

, used in the following step.

Second, we fitted the observed counts of exported cases and deaths by considering two other likelihoods respective to each process:

l o g L_{E} ({r, i_{0}} | {E (t), t_{e} \leq t \leq T}) = \sum_{t = t_{e}}^{T} \log (binom (E (t) | size = \tilde{u} (r, {\tilde{θ}}_{e}) \cdot I (t), prob = p))

(6)

l o g L_{D} ({r, i_{0}, C F R (t)} | {D (t), t_{d} \leq t \leq T}) = \sum_{t = t_{d}}^{T} \log (binom (D (t) | size = u (r, θ_{d}) \cdot I (t), prob = C F R (t)))

(7)

where

t_{e}

and

t_{d}

are the times at which the first exportation event and the first death are observed. The total log-likelihood is given by the sum of two log-likelihoods:

l o g L_{Σ} (θ_{Σ} = {r, i_{0}, CFR (t)} | D (t), E (t)) = l o g L_{E} ({r, i_{0}} | E (t)) + l o g L_{D} ({r, i_{0}, C F R (t)} | D (t))

(8)

which is then maximized to determine the best-fit parameters

{r, i_{0}, C F R (t)}

.

We considered two possible scenarios to fit to the data. In the first scenario (referred to as “Scenario 1”), the parameter

i_{0}

is fixed at one on the date of illness onset for the first COVID-19 confirmed case (i.e., 8 December 2019), providing a fixed starting point for the exponential growth of the cumulative incidence. Although 8 December 2019 is assumed as the starting point for exponential growth of the cumulative incidence in Scenario 1, there is still uncertainty using this date as a proxy for the beginning of exponential growth due to inconsistencies in reported illness onset for the earliest reported cases [2,5]. Therefore, we conducted a sensitivity analysis where the start date for exponential growth was varied between 1 December and 10 December 2019. In the second scenario (referred to as “Scenario 2”), all parameters

{r, i_{0}, C F R (t)}

are variable, and calculation begins on the date the first exported case was observed (i.e., 13 January 2020). For both scenarios, we conducted sensitivity analyses that estimated the growth rate, cumulative incidence, and cCFR value using different cut-off times, values of the detection window T, and sizes of the catchment population of the Wuhan International airport n.

The value of the basic reproduction number

R_{0}

for the COVID-19 epidemic was calculated as

R_{0} = 1 + r S_{}

(9)

where r is the estimated growth rate for each scenario and S is the mean serial interval for successive COVID-19 infections. The value of the serial interval was adopted from a published study [2], and a gamma distribution was fitted to six known infector-infectee pairs (mean = 7.5 days, standard deviation [SD] = 3.4 days).

In our analysis, we employed Markov chain Monte Carlo (MCMC) simulations with eight chains of 4250 samples each, with 2500 samples used for the tuning stage, leveraging the No-U-Turn sampler (NUTS) part of the Python PyMC3 package. However, one of the input variables for the Binomial distribution

\tilde{u} (r, θ_{e}) \cdot I (t)

or

u (r, θ_{d}) \cdot I (t)

was modeled with a continuous variable in our approach. We used the gamma distribution, matching the first two moments to obtain the transformation of the discrete Binomial distribution to its continuous approximation [15,16]. We avoided joint estimation of all parameters

{θ_{Σ}, θ_{e}, θ_{d}}

due to heterogeneity in the aggregated data—a similar issue is discussed in [10]. Instead, we implemented sequential fitting, wherein first we considered only the likelihood

L_{e}

, then we fit the likelihood

L_{Σ}

using the mean values of the estimated parameters obtained in the previous round. Finally, we verified the obtained fit by calculating pointwise estimates using maximum likelihood estimation with confidence intervals derived as profile likelihood-based intervals. We found that both approaches were at complete agreement in their results.

3. Results

Figure 1A,B show the mean and SD of the time from illness onset to reporting and death, respectively. Employing the gamma distribution, the mean time from illness onset to reporting was estimated at 7.1 days (95% confidence interval [CI]: 5.9, 8.4). The mean time from illness onset to death—adopted from a previous study [14]—was estimated at 20.2 days (95% CI: 15.1, 29.5), which led to a lognormal distribution with a location parameter of 2.84 and a scale parameter of 0.52.

Subsequently, the cumulative incidence was estimated from exported case data by fitting an exponentially growing incidence curve for both Scenarios 1 and 2 (Figure 2). As of 24 January 2020, 20 exported cases were reported, and the cumulative incidence in China was estimated at 6924 cases (95% CI: 4885, 9211) in Scenario 1 and 19,289 cases (95% CI: 10,901, 30,158) in Scenario 2. Table 1 shows the real-time update of the estimated cumulative incidence. The exponential growth rates (r), derived from the growth rate of cumulative incidence, were estimated at 0.15 per day (95% CI: 0.14, 0.15) and 0.29 per day (95% CI: 0.22, 0.36) in Scenarios 1 and 2, respectively.

Figure 2C,D show the estimated cCFR value accounting for the time delay from illness onset to death under Scenarios 1 and 2, respectively. A total of 41 confirmed deaths were reported as of 24 January 2020, the cCFR value was estimated at 5.3% (95% CI: 3.5%, 7.5%) for Scenario 1 and 8.4% (95% CI: 5.3%, 12.3%) for Scenario 2, respectively.

We estimated the basic reproduction number for the COVID-19 infection, using the estimated exponential growth (r) and accounting for possible variations of the mean serial interval (Figure 3). Assuming that the mean serial interval was 7.5 days [2], the basic reproduction number was estimated at 2.10 (95% CI: 2.04, 2.16) and 3.19 (95% CI: 2.66, 3.69) for Scenarios 1 and 2, respectively. However, as the mean serial interval varies, the estimates can range from 1.6 to 2.6 and 2.2 to 4.2 for Scenarios 1 and 2, respectively.

To address the uncertainty in the unobserved date of illness onset of the index case in Scenario 1, cCFR was estimated by varying the starting date of the exponential growth in the incidence by placing the single index case between 1 and 10 December 2019 (Figure S1 and Table S1). When we assumed the date of illness onset of the index case was 1 December 2019, the estimated incidence in China and the cCFR on 24 January 2020 were estimated at 4718 (95% CI: 3328, 6278) and 5.3% (95% CI: 3.5, 7.6). The sensitivity analyses for varied cutoff dates between 15 and 24 January 2020 were conducted. Depending on the number of time points, the estimates of the cumulative incidence and cCFR have similar values in Scenario 1 (Figure S2) but slightly decreased in Scenario 2, when the cutoff date was earlier (Figure S3). In addition, considering the uncertainty of two fixed parameters (i.e., detection time window and catchment population in Wuhan airport), sensitivity was assessed by varying those parameters. As the catchment population increases and the detection window time decreases, estimated cCFR on 24 January 2020 gets smaller due to increased number of incidence in China (Tables S2 and S3).

4. Discussion

The present study estimated the risk of death among confirmed cases while addressing ascertainment bias by using data from cases diagnosed outside mainland China and a right-censored likelihood for modeling the count of deceased cases. The estimated cCFR value was 5.3% (95% CI: 3.5, 7.5) when the date of illness onset for the index case was fixed a priori at 8 December 2019 (Scenario 1), and 8.4% (95% CI: 5.3%, 12.3%) when the timing of the exponential growth of the epidemic was fitted to data alongside with other model parameters (Scenario 2). The estimated value of u in Scenario 2, which adjusts the time delay from illness onset to death, was smaller than that in Scenario 1 due to the larger value of the growth rate of incidence, while the estimated cCFR value as of 24 January 2020 was larger in Scenario 2 compared with Scenario 1. Depending on the available data, the estimate of the cCFR (i.e., the delay-adjusted risk of death among confirmed cases) may vary in time and show some fluctuations. In addition to that, we estimated the value of the basic reproduction number

R_{0}

in the range of 1.6–2.6 for Scenario 1 and 2.2–4.2 for Scenario 2. From either estimate, we conclude that COVID-19 has substantial potential to spread via human-to-human transmission. However, R₀ > 1 does not guarantee that a single exported (and untraced) case would immediately lead to a major epidemic in the destination country as government responses such as border control, isolation of suspected cases, and intensive surveillance should serve to reduce opportunities for transmission to occur [17,18,19].

Our cCFR estimates of 5.3% and 8.4% indicate that the severity of COVID-19 is not as high as that of other diseases caused by coronaviruses, including severe acute respiratory syndrome (SARS), which had an estimated CFR of 17% in Hong Kong [9,10,20], and Middle East respiratory syndrome, which had an estimated CFR of 20% in South Korea [21]. Nonetheless, considering the overall magnitude of the ongoing epidemic, a 5%–8% risk of death is by no means insignificant. In addition to quantifying the overall risk of death, future research must identify groups at risk of death (e.g., the elderly and people with underlying comorbidities) [22,23]. Moreover, considering that about 9% of all infected individuals are ascertained and reported [24], the infection fatality risk (IFR), i.e., the risk of death among all infected individuals, would be on the order of 0.5% to 0.8%.

Our estimated

R_{0}

range of 1.6–4.2 for COVID-19 is consistent with other preliminary estimates posted on public domains [25,26,27,28], and is comparable to the

R_{0}

of SARS, which was in the range of 2–5 during the 2003 outbreak in Singapore [15]. Between our two estimates, the latter scenario yielded a greater value than the former, and there was an increasingly improved ascertainment in early January 2020. The virus was identified and sequenced on 7 January 2020 and subsequently the primer was widely distributed, allowing for rapid laboratory identification of cases and contributing to a time-dependent increase in the number of confirmed cases out of China. Consequently, Scenario 2, which was fully dependent on the growth rate of exported cases, could have overestimated the intrinsic growth rate of cases. Considering the estimated value of

R_{0}

, the possibility of presymptomatic transmission in the ongoing epidemic is a critical question, as it would have a substantial impact on public health response to the epidemic (e.g., whether the contact tracing should be prioritized or not) as well as overall predictability of the epidemic during the containment stage [29,30].

From the technical side, it should be emphasized that our proposed approach can be especially useful during the early stage of an epidemic when local surveillance is affected by substantial ascertainment bias and export and death data are available and better ascertained. Nonetheless, caution must be used when implementing similar estimations for the COVID-19 epidemic, as all flights from Wuhan airport were grounded as of 23 January 2020 [13] and this intervention abruptly changed the human migration network. Despite the decrease in the outbound flow of travelers from Wuhan, there is a substantial risk that the next epidemic wave will originate from other cities.

There are five main limitations in the present study. First, our results present an estimate for the cCFR which only addresses fatality among confirmed cases. More precise IFR estimates that include infected individuals other than confirmed cases can only be estimated using additional data (e.g., seroepidemiological data or outpatient clinic visits). It should be noted that not only the denominator but also the numerator values are subject to better estimation (e.g. excess mortality estimate). Second, our study relied on limited empirical data that were extracted from publicly available data sources. Thus, future studies with greater sample size and precision are needed. Nonetheless, we believe that this study will improve the situational assessment of the ongoing epidemic. Third, our assumed date of illness onset for the index case in Scenario 1 is based on initial reports of the earliest onset date for a case, and the continued exponential growth with the rate r is the authors’ extrapolation. However, we conducted a sensitivity analysis and ensured that the resulting statistical estimates would not greatly vary from our main results. Fourth, there is an uncertainty in the detection window time T. Since the epidemiological investigations are being actively implemented outside China, we believe that the sum of the incubation period and the infectious period can be a plausible estimation of the detection time window. Fifth, heterogeneous aspects of death (e.g. age and risk groups) need to be addressed in the future studies.

In conclusion, the present study has estimated cCFR to be on the order of 5%–8% and R₀ to be 1.6–4.2, endorsing the notion that COVID-19 infection in the ongoing epidemic possesses the potential to become a pandemic. The proposed approach can also help direct risk assessment in other settings with the use of publicly available datasets.

Supplementary Materials

The following are available online at https://www.mdpi.com/2077-0383/9/2/523/s1, Figure S1: Sensitivity analysis with varying the start point of exponential growth in cumulative incidence from 1 to 10 December 2019, Figure S2: Sensitivity analysis with varying the cutoff date from 15 to 24 January 2020 in estimation Scenario 1, Figure S3: Sensitivity analysis with varying the cutoff date from 15 to 24 January 2020 in estimation Scenario 2, Table S1: Sensitivity analysis with varying the start point of exponential growth in cumulative incidence from 1 to 10 December 2019 in estimation Scenario 1, Table S2: Sensitivity analysis with varying the catchment population size in Wuhan airport, Table S3: Sensitivity analysis with varying the detection time window.

Author Contributions

S.-m.J., A.R.A., and H.N. conceived the study and participated in the study design. All authors assisted in collecting the data. S.-m.J. and A.R.A. analyzed the data and S.-m.J., A.R.A., K.H. and H.N. drafted the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

H.N. received funding from the Japan Agency for Medical Research and Development (AMED) [grant number: JP18fk0108050]; the Japan Society for the Promotion of Science (JSPS) KAKENHI [grant numbers, H.N.: 17H04701, 17H05808, 18H04895 and 19H01074; R.K.: 18J21587], the Inamori Foundation, and the Japan Science and Technology Agency (JST) CREST program [grant number: JPMJCR1413]. S.M.J. and N.M.L. receive graduate study scholarships from the Ministry of Education, Culture, Sports, Science and Technology, Japan. B.Y. wishes to thank China Scholarship Council.

Conflicts of Interest

The authors declare no conflict of interest.

References

World Health Organization. Novel Coronavirus—China Disease Outbreak News. Available online: https://www.who.int/csr/don/12-january-2020-novel-coronavirus-china/en/ (accessed on 9 February 2020).
Li, Q.; Guan, X.; Wu, P.; Wang, X.; Zhou, L.; Tong, Y.; Ren, R.; Leung, K.S.M.; Lau, E.H.Y.; Wong, J.Y.; et al. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N. Engl. J. Med. 2020. [Google Scholar] [CrossRef] [PubMed]
Cui, J.; Li, F.; Shi, Z.L. Origin and evolution of pathogenic coronaviruses. Nat. Rev. Microbiol. 2019, 17, 181–192. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Center for Disease Control and Prevention. 2019 Novel Coronavirus, Wuhan, China. Available online: https://www.cdc.gov/coronavirus/COVID-19/about/transmission.html (accessed on 24 January 2020).
Huang, C.; Wang, Y.; Li, X.; Ren, L.; Zhao, J.; Hu, Y.; Zhang, L.; Fan, G.; Xu, J.; Gu, X.; et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 2020. [Google Scholar] [CrossRef] [Green Version]
Chan, J.F.W.; Yuan, S.; Kok, K.H.; To, K.K.W.; Chu, H.; Yang, J.; Xing, J.; Xing, F.; Liu, J.; Yip, C.C.; et al. A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: A study of a family cluster. Lancet 2020. [Google Scholar] [CrossRef] [Green Version]
World Health Organization. Novel Coronavirus (COVID-19) Situation Report-3. Available online: https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200123-sitrep-3-COVID-19.pdf?sfvrsn=d6d23643_4 (accessed on 23 January 2020).
Nishiura, H.; Jung, S.; Linton, N.M.; Kinoshita, R.; Yang, Y.; Hayashi, K.; Kobayashi, T.; Yuan, B.; Akhmetzhanov, A.R. The extent of transmission of novel coronavirus in Wuhan, China, 2020. J. Clin. Med. 2020, 9, 330. [Google Scholar] [CrossRef] [Green Version]
Ghani, A.C.; Donnelly, C.A.; Cox, D.R.; Griffin, J.T.; Fraser, C.; Lam, T.H.; Ho, L.M.; Chan, W.S.; Anderson, R.M.; Hedley, A.J.; et al. Methods for estimating the case fatality ratio for a novel, emerging infectious disease. Am. J. Epidemiol. 2005, 16, 479–486. [Google Scholar] [CrossRef]
Nishiura, H.; Klinkenberg, D.; Roberts, M.; Heesterbeek, J.A. Early epidemiological assessment of the virulence of emerging infectious diseases: A case study of an influenza pandemic. PLoS ONE 2009, 4, e6852. [Google Scholar] [CrossRef] [Green Version]
Garske, T.; Legrand, J.; Donnelly, C.A.; Ward, H.; Cauchemez, S.; Fraser, C.; Ferguson, N.M.; Ghani, A.C. Assessing the severity of the novel influenza A/H1N1 pandemic. BMJ 2009, 339, b2840. [Google Scholar] [CrossRef]
Thompson, R.N.; Gilligan, C.A.; Cunniffe, N.J. When Does A Minor Outbreak Become A Major Epidemic? Linking the Risk from Invading Pathogens to Practical Definitions of A Major Epidemic. Available online: https://www.biorxiv.org/content/10.1101/768853v2 (accessed on 9 February 2020).
BBC NEWS. Coronavirus: Wuhan Shuts Public Transport over Outbreak. Available online: https://www.bbc.com/news/world-asia-china-51215348 (accessed on 28 January 2020).
Linton, N.M.; Kobayashi, T.; Yang, Y.; Hayashi, K.; Akhmetzhanov, A.R.; Jung, S.; Yuan, B.; Kinoshita, R.; Nishiura, H. Epidemiological characteristics of novel coronavirus infection: A statistical analysis of publicly available case data. J. Clin. Med. 2020, in press. [Google Scholar]
Lipsitch, M.; Cohen, T.; Cooper, B.; Robins, J.M.; Ma, S.; James, L.; Gopalakrishna, G.; Chew, S.K.; Tan, C.C.; Samore, M.H.; et al. Transmission dynamics and control of severe acute respiratory syndrome. Science 2003, 300, 1966–1970. [Google Scholar] [CrossRef] [Green Version]
Li, M.; Dushoff, J.; Bolker, B.M. Fitting mechanistic epidemic models to data: A comparison of simple Markov chain Monte Carlo approaches. Stat. Methods Med. Res. 2018, 27, 1956–1967. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Kucharski, A.J.; Russell, T.W.; Diamond, C.; CMMID nCoV Working Group; Funk, S.; Eggo, R.M. Early dynamics of transmission and control of COVID-19: A mathematical modelling study. medRxiv 2020. [Google Scholar] [CrossRef]
Thompson, R.N. 2019–2020 Wuhan coronavirus outbreak: Intense surveillance is vital for preventing sustained transmission in new locations. biorxiv 2020. [Google Scholar] [CrossRef] [Green Version]
Boldog, P.; Tekeli, T.; Vizi, Z.; Denes, A.; Bartha, F.; Rost, G. Risk assessment of novel coronavirus COVID-19 outbreaks outside China. J. Clin. Med. 2020, 9, 571. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Donnelly, C.A.; Ghani, A.C.; Leung, G.M.; Hedley, A.J.; Fraser, C.; Riley, S.; Abu-Raddad, L.J.; Ho, L.-M.; Thach, T.-Q.; Chau, P.; et al. Epidemiological determinants of spread of causal agent of severe acute respiratory syndrome in Hong Kong. Lancet 2003, 361, 1761–1766. [Google Scholar] [CrossRef] [Green Version]
Mizumoto, K.; Saitoh, M.; Chowell, G.; Miyamatsu, Y.; Nishiura, H. Estimating the risk of Middle East respiratory syndrome (MERS) death during the course of the outbreak in the Republic of Korea, 2015. Int. J. Infect. Dis. 2015, 39, 7–9. [Google Scholar] [CrossRef] [Green Version]
Majumder, M.S.; Kluberg, S.A.; Mekaru, S.R.; Brownstein, J.S. Mortality risk factors for Middle East Respiratory Syndrome outbreak, South Korea, 2015. Emerg. Infect. Dis. 2015, 21, 2088–2090. [Google Scholar] [CrossRef]
Mizumoto, K.; Endo, A.; Chowell, G.; Miyamatsu, Y.; Saitoh, M.; Nishiura, H. Real-time characterization of risks of death associated with the Middle East respiratory syndrome (MERS) in the Republic of Korea, 2015. BMC Med. 2015, 13, 228. [Google Scholar] [CrossRef] [Green Version]
Nishiura, H.; Kobayashi, T.; Yang, Y.; Hayashi, K.; Miyama, T.; Kinoshita, R.; Linton, N.M.; Jung, S.-M.; Yuan, B.; Suzuki, A.; et al. The rate of under ascertainment of novel coronavirus (COVID-19) infection: Estimation using Japanese passengers data on evacuation flights. J. Clin. Med. 2020, 9, 419. [Google Scholar] [CrossRef] [Green Version]
Zhao, S.; Ran, J.; Musa, S.S.; Yang, G.; Lou, Y.; Gao, D.; Yang, L.; He, D. Preliminary Estimation of the Basic Reproduction Number of Novel Coronavirus (COVID-19) in China, from 2019 to 2020: A Data-Driven Analysis in the Early Phase of the Outbreak. Available online: https://www.biorxiv.org/content/10.1101/2020.01.23.916395v1 (accessed on 29 January 2020).
Imperial College London-MRC Centre for Global Infectious Disease Analysis. News/Wuhan Coronavirus. Available online: https://www.imperial.ac.uk/mrc-global-infectious-disease-analysis/news--wuhan-coronavirus/ (accessed on 28 January 2020).
The University of Hong Kong-HKUMed WHO Collaborating Centre for Infectious Disease Epidemiology and Control. Nowcasting and Forecasting the Wuhan COVID-19 Outbreak. Available online: https://www.med.hku.hk/news/press/HKUMed-WHO-Collaborating-Centre-for-Infectious-Disease-Epidemiology-and-Control-releases%20real-time-nowcast-on-the-likely-extent-of-the-Wuhan-coronavirus (accessed on 28 January 2020).
Read, J.M.; Bridgen, J.R.; Cummings, D.A.; Ho, A.; Jewell, C.P. Novel Coronavirus COVID-19: Early Estimation of Epidemiological Parameters and Epidemic Predictions. medRxiv 2020. [Google Scholar] [CrossRef]
Fraser, C.; Riley, S.; Anderson, R.M.; Ferguson, N.M. Factors that make an infectious disease outbreak controllable. Proc. Natl. Acad. Sci. USA 2004, 101, 6146–6151. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Thompson, R.N.; Gilligan, C.A.; Cunniffe, N.J. Detecting presymptomatic infection is necessary to forecast major epidemics in the earliest stages of infectious disease outbreaks. PLoS Comput. Biol. 2016, 12, e1004836. [Google Scholar] [CrossRef] [PubMed] [Green Version]

Figure 1. Estimates of the mean and standard deviation (SD) of the time from illness onset to reporting and death cases, accounting for right truncation, with novel coronavirus (COVID-19) infection in China, 2020. Inference of A and B was conducted among (A) exported cases observed in other countries and (B) deceased cases in China. (A) Frequency distribution of the time from illness onset to reporting among exported cases employing a gamma distribution with a mean of 7.1 days (95% confidence interval [CI]: 5.9, 8.4) and SD of 4.4 days (95% CI: 3.5, 5.7). (B) Frequency distribution of the time from illness onset to death with a mean of 19.9 days (95% CI: 14.9, 29.0) (shown in black) and SD of 11.4 days (95% CI: 6.5, 21.6) employing a lognormal distribution and accounting for right truncation. For reference, the estimate of the mean and its 95% credible intervals without accounting for right truncation are shown in grey. The values for distribution of time from illness onset to death are adopted from an earlier study [14]. The blue bars show empirically observed data collected from governmental reports (as of 24 January 2020).

Figure 2. Cumulative incidence and the confirmed case fatality risk of the novel coronavirus (COVID-19) outbreak in China, 2020. (A,B) Observed and estimated cumulative number of cases in China by the date of report. An exponential growth curve was extrapolated using the exported case data. Scenario 1 extrapolated the exponential growth from December to first case on 8 December 2019, while Scenario 2 started the estimation of the exponential growth only from 13 January 2020. The black line and shaded area represent median and 95% credible interval of the cumulative incidence in China, respectively. The blue bars show the cumulative number of reported cases from the government of mainland China. The cumulative number of reported cases was not used for fitting, but it was shown for comparison between the cumulative number of reported and estimated cases in China. There is a decrease in the cumulative number of reported cases in early January, because only 41 cases tested positive for the novel coronavirus among the reported 59 cases on 10 January 2020. Left-top panels on both A and B show the cumulative numbers of exported cases observed in other countries and the cumulative number of deaths in China, represented by dark and light green bars, respectively. (C,D) Confirmed case fatality risk (cCFR) by the date of reporting. Each value of cCFR was estimated as the ratio of cumulative number of estimated incidence to death at time t. The points and error bars represent the median and its 95% credible interval of the cCFR. All 95% credible intervals were derived from Markov chain Monte Carlo simulations.

Figure 3. Basic reproduction number of novel coronavirus (COVID-19) infections in China, 2020. Black lines and grey shades represent the median and 95% credible intervals of the basic reproduction number. Panel A shows the result of Scenario 1, in which an exponential growth started from the assumed illness onset date of index case, while Panel B shows the result from exponential growth from the first exported case (Scenario 2). The 95% credible intervals were derived from Markov chain Monte Carlo method.

Table 1. Exportation events and estimated incidence in China, 2020.

Importing Locations	Date of Report (2020)	Cumulative Count	Estimated Incidence in China (95% CI)
Importing Locations	Date of Report (2020)	Cumulative Count	Scenario 1	Scenario 2
Thailand	13 January	1	1828 (1397 2288)	1369 (1003 1782)
Japan	16 January	2	2120 (1605 2672)	1829 (1392 2309)
Thailand	17 January	3	2458 (1845 3119)	2444 (1894 3033)
South Korea	20 January	4	3832 (2802 4962)	5882 (4252 7629)
Taiwan, United States	21 January	6	4443 (3220 5792)	7901 (5425, 10,662)
Thailand	22 January	8	5151 (3700 6761)	10,626 (6897, 15,003)
Singapore, Vietnam	23 January	11	5972 (4252 7892)	14,308 (8661, 21,250)
Japan, Nepal, South Korea, Singapore, Thailand, United States	24 January	20	6924 (4885 9211)	19,289 (10,901, 30,158)

CI, confidence interval (the 95% CI was derived from the Markov chain Monte Carlo method). Scenario 1 indicates the estimated exponential growth rate with the assumed illness onset date of the first COVID-19 case (i.e., 8 December 2019), while Scenario 2 presents the estimated exponential growth rate from the date of first exportation event (i.e., 13 January 2020).

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jung, S.-m.; Akhmetzhanov, A.R.; Hayashi, K.; Linton, N.M.; Yang, Y.; Yuan, B.; Kobayashi, T.; Kinoshita, R.; Nishiura, H. Real-Time Estimation of the Risk of Death from Novel Coronavirus (COVID-19) Infection: Inference Using Exported Cases. J. Clin. Med. 2020, 9, 523. https://doi.org/10.3390/jcm9020523

AMA Style

Jung S-m, Akhmetzhanov AR, Hayashi K, Linton NM, Yang Y, Yuan B, Kobayashi T, Kinoshita R, Nishiura H. Real-Time Estimation of the Risk of Death from Novel Coronavirus (COVID-19) Infection: Inference Using Exported Cases. Journal of Clinical Medicine. 2020; 9(2):523. https://doi.org/10.3390/jcm9020523

Chicago/Turabian Style

Jung, Sung-mok, Andrei R. Akhmetzhanov, Katsuma Hayashi, Natalie M. Linton, Yichi Yang, Baoyin Yuan, Tetsuro Kobayashi, Ryo Kinoshita, and Hiroshi Nishiura. 2020. "Real-Time Estimation of the Risk of Death from Novel Coronavirus (COVID-19) Infection: Inference Using Exported Cases" Journal of Clinical Medicine 9, no. 2: 523. https://doi.org/10.3390/jcm9020523

APA Style

Jung, S.-m., Akhmetzhanov, A. R., Hayashi, K., Linton, N. M., Yang, Y., Yuan, B., Kobayashi, T., Kinoshita, R., & Nishiura, H. (2020). Real-Time Estimation of the Risk of Death from Novel Coronavirus (COVID-19) Infection: Inference Using Exported Cases. Journal of Clinical Medicine, 9(2), 523. https://doi.org/10.3390/jcm9020523

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Real-Time Estimation of the Risk of Death from Novel Coronavirus (COVID-19) Infection: Inference Using Exported Cases

Abstract

1. Introduction

2. Methods

2.1. Epidemiological Data

2.2. Estimation of the Delay Distributions

2.3. Statistical Inference

3. Results

4. Discussion

Supplementary Materials

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI