Abstract
Air travel has a decisive role in the spread of infectious diseases at the global level. We present a methodology applied during the early stages of the COVID-19 pandemic that uses detailed aviation data at the final destination level in order to measure the risk of the disease spreading outside China. The approach proved to be successful in terms of identifying countries with a high risk of infected travellers and as a tool to monitor the evolution of the pandemic in different countries. The high number of undetected or asymptomatic cases of COVID-19, however, limits the capacity of the approach to model the full dynamics. As a result, the risk for countries with a low number of passengers from Hubei province appeared as low. Globalization and international aviation connectivity allow travel times that are much shorter than the incubation period of infectious diseases, a fact that raises the question of how to react in a potential new pandemic.
Keywords:
air transport; aviation; COVID-19; coronavirus; epidemic model; risk assessment; spatial analysis 1. Introduction
The recent pandemic caused by the novel coronavirus COVID-19 has once again brought epidemiological modeling into the limelight. Especially during the early stages of the virus propagation—when data were limited and highly uncertain—model projections were a major input for understanding the magnitude and potential risks of the outbreak. Apart from predicting the dynamics of transmission within a specific population, particular emphasis was put on quantifying the risk of the disease spreading beyond the area of its original focus in Wuhan, China.
The contribution of transport—and aviation in particular—has been shown to be significant in the spread of airborne diseases such as SARS, MERS [1,2] and the influenza A H1N1 [3], as well as other infectious such as Ebola [4] or even vector-borne diseases such as Dengue [5] or Malaria [6].
Spatio-temporal dynamics also have a major role. Social interactions and personal contact patterns at either the origin of the disease or areas connected to it affect the speed and range of prevalence across different geographic zones [7]. Budd et al. [8] made a historical review of journey times and the incubation period of selected infectious diseases. Travel times have decreased drastically and even intercontinental trips have a duration that is a fraction of the incubation period of several diseases. Coming to the case of the COVID-19 outbreak, no intercontinental flights from Wuhan were possible before 2012, but in 2020 direct flights were available to London, Paris, Rome, Moscow, New York City, San Francisco, Istanbul, Dubai, Sydney, and numerous other international destinations. As an example, a direct flight from Wuhan to Rome, Italy takes 12 h 15 m and is available three times weekly.
Several studies have utilized air travel data for the estimation of disease importation risks. The Vector-Borne Disease Airline Importation Risk (VBD-AIR) tool allows the combination of disease information and air travel networks to produce a user-defined global map of risk distribution [9]. A generic tool that allows the estimation of the median early disease arrival time from around the world using air transport schedules was proposed by [10]. Air travel patterns were also employed by Bajardi et al. [11] in order to theoretically model the potential impacts of travel restrictions on the spread of an epidemic. Nevertheless, while travel and trade have been shown as relevant for the international spread of the 2009 A/H1N1 influenza, the slow deployment of control measures in countries with lower healthcare capacities led to spatial imbalances [12]. Air transport data have been used indirectly to measure the effective distance and the relative arrival time of a disease outbreak, with promising results when tested on historical data [13]. Based on the same premises, producing near real-time or now-casting predictions on the international spread of COVID-19 appeared to be based on scientifically sound principles [14].
However, most expectations about the ability to predict the global spread of COVID-19 proved misleading or of limited use.
This includes our own work, which we present here in order to discuss the reasons why most projections missed important aspects of the disease propagation dynamics. In our opinion, and as a part of the scientific process, it is important to publish and discuss even the negative results of research activities. The lessons learned from unsuccessful applications can be valuable for future work, either by improving current practices or by re-examining risk assessment from a new perspective. Especially in the case of predictive modeling, the wide body of literature that presents successful ex-post examples of possible value may give the false impression that the spatial aspects of disease propagation are more easily predictable than what is actually possible in reality.
The question this paper is trying to answer is whether air passenger traffic alone can provide early information on the potential distribution of contagious diseases at the global level. Based on the experience from the SARS, MERS and Ebola epidemics—which suggests that travel patterns are a good predictor of spatial evolution of an epidemic—we mapped the expected distribution of COVID-19 at the global level using now-casted origin-destination estimates of aviation passengers and near real-time data on the spatial distribution of diagnosed infections. The number of reported cases for each possible destination outside the initial outburst area was compared to the estimated traffic intensity between Wuhan and the specific destination. The resulting ratio was used as an indicator of the expected rate of infection of aviation passengers and as a means of comparison of the evolution of the pandemic worldwide. The approach gave promising results during the initial phases of the pandemic, but underestimated the risks associated with local infections at several distant destinations and the development of numerous secondary foci of the disease.
3. Discussion of the Results
We estimated q, the ratio of infected passengers to total passengers, at the province and territory level within China, and at the country level for the rest of the world. As described in Section 2, we used the estimated number of aviation passengers for January 2020 and transformed them to an equivalent daily volume. The q indicator was calculated for four different points in time at weekly intervals: on the 3rd, 10th, 17th and 24th of February 2020, corresponding to t values of 34, 41, 48 and 54 days, respectively (assuming t = 0 as 31/12/2019, the date of the first reported case in the Johns Hopkins database). The number of aviation passengers to the provinces within less than 700 km from Wuhan is virtually null (Hubei province itself, as well as Anhui, Hebei, Henan, Hunan and Jiangxi), therefore the approach is not applicable in those cases.
Figure 3 shows the results for Chinese provinces between 700 and 1000 km from Wuhan. The q indicator ranges from 4 to 10 infected cases per one thousand passengers in most cases, except Zhejiang and especially Jiangsu (of which Nanjing, a major city, is only 3 h from Wuhan by train). Apart from the obvious possible differences in diagnosing and reporting, the main reason for discrepancies in the case of those provinces close to Wuhan and Hubei is their high share of train, bus and car travel to and from Wuhan.
Figure 3.
Estimated ratio of infected passengers (cases per 1000 passengers), China (700–1000 km from Wuhan).
On the other hand, the picture is more homogeneous for Chinese provinces and territories over 1000 km from Wuhan (Figure 4). For such distances, air travel is practically the only option for passengers and the q indicator has a range of between 2 and 6 cases per thousand passengers. The two outliers are Tibet (with a single case in the period, but also a very low number of estimated passengers) and Heilongjiang, where the number of cases was proportionally higher already on February 3, possibly as a result of the province having entered the local transmission phase early.
Figure 4.
Estimated ratio of infected passengers, (cases per 1000 passengers), China (over 1000 km from Wuhan).
As regards other countries apart from China (Figure 5), the indicator had a range of between 0 and 6 cases per thousand passengers until 17 February, with Germany being an outlier (but with only 16 reported cases). Up to that point, the comparison of the indicators for distant Chinese destinations (more than 1000 km from Wuhan) and countries outside China confirmed the hypothesis of diagnosed cases being proportional to the number of passengers arriving from Wuhan. Even though the absolute number of cases was very low and the ratios were unstable, the global distribution patterns seemed to follow the expected dynamics and to correspond to the air transport patterns.
Figure 5.
Estimated ratio of infected passengers outside China (cases per 1000 passengers).
Given that the majority of final destinations of passengers from Wuhan were in Mainland China, the results implied that the appearance of infected cases would follow the same pattern as that of aviation trips (Figure 1), with the risk for countries outside Asia being low. Additionally, since China had implemented an extended quarantine since the last week of January, these patterns suggested that the risk for additional cases in the rest of the world would be limited, as infections in other Chinese provinces and territories were kept under control. An additional observation that could be made at that moment was that the reporting quality in China was comparable to that of most neighboring countries, with indicator levels being at similar levels. However, surprisingly a number of countries—including Indonesia and the Philippines—report a very low number of cases compared to the expected number of passengers from Hubei. It should be noted, though, that the number of tests performed in several countries was very low and the actual number of symptomatic or asymptomatic cases may actually be much higher.
A dramatic change can be seen in the data for the 24th of February in a number of countries outside China (Figure 5). Suddenly, the number of cases in South Korea, Japan and Iran presented a pronounced spike. While South Korea and Japan were among the countries where the risk had already been identified, Iran was a case that the travel pattern analysis had completely missed. The model estimates on the number of travellers from Hubei to Iran were between 33 and 236 per month, a value too low to raise any alarm. These three countries were only the beginning of the pandemic at the worldwide level that in the course of the following 8 weeks infected more than 3 million people and caused more than 200 thousand deaths (as of 27 April 2020).
It is now known that the main reason for the rapid expansion in countries outside Asia was the appearance of local foci that were not detected initially, and the subsequent spread of the virus from them both locally and internationally. This evolution of the pandemic revealed a curious pattern: the countries that had high levels of interaction with China realized the level of risk early and applied strict control measures, resulting in a comparatively limited spread of the disease. Whereas countries that appeared to have a lower risk—since they were not in such direct contact with the area of the primary focus of the pandemic—eventually reached disease levels that exceeded those in the original focus in China (especially the USA, Spain and Italy). In that sense, our approach appears as sufficient to capture the true positives, but—as all approaches that we are aware of—may underestimate risk and lead to false negatives with a high impact. This is a general weakness of any risk assessment model that we try to explain and discuss in the following sections.
4. Limitations to Predictive Capacity
The capacity of aviation activity to predict the appearance of infected cases can be affected by various factors that may influence the ratio of infected passengers reaching a specific destination. In the approach presented here, we assumed—for lack of data—that the socio-economic and demographic profile of travellers to all destinations is the same. The assumed profile includes the ratio of local to visiting travellers and their age distribution, as well as their income or professional activity distribution.
Most probably, the infection rates of the local population in Hubei province are different than the one for visitors, mainly because of the difference in terms of the total time spent in the origin of the pandemic and—as a result—the period of exposure. The analysis of exportations of symptomatic cases with MERS-CoV in the period 2013–2015 [26] differentiated between locals and visitors to the Middle East, the center of the MERS outbreak. The exposure time of visitors was estimated using tourism statistics, while the exposure of locals was assumed to be equal to the upper bound of the disease incubation period (10.2 days). A model-based study of frequent flyers suggests that the probability that an asymptomatic infected person will make an international flight is higher for high-frequency travellers than for low-frequency ones [27]. One would therefore expect differences in the infection rates between destinations that serve mainly visitors to Wuhan (e.g., flights to Europe) and destinations that serve a higher proportion of locals (e.g., flights to destinations in China with employees returning to work).
The socio-economic profile of the travellers can also affect the probability of super-spreaders, individuals who due to the type or intensity of their social interactions can transmit the disease to a disproportionate number of others. A higher frequency of infections in health care settings has been observed for SARS and MERS [28]. To what concerns travel, frequent flyers have been shown to have a higher probability of being super-spreaders [1]. Visitors to Wuhan, especially ones for business purposes, probably had a higher number of social interactions at the origin than the average local inhabitant (as well as at the destination, increasing even more the probabilities of spreading the disease). A similar pattern may have been in place in the propagation from the secondary focus of Milan, where a large number of foreign visitors to commercial exhibitions or sport events may have caused the spread to other countries.
Infection during flights has been found to have a significant probability. Even though aircraft are equipped with High-Efficiency Particulate Air (HEPA) filters, which reduce the risk of transmission through the air circulation system, several studies have demonstrated that infection rates can be as high as 4.3% in a 9-h flight, and that the risk does not depend on the relative seating of contagious passengers and passengers infected in-flight [29]. Mangili and Gendreau already discussed the need for additional work on in-flight transmissions [30], but insufficient data have been made available in the meanwhile to allow a more robust conclusion.
The diagnostic capacity and protocols applied in airport controls have been discussed using the experience from SARS [31,32]. Grout et al. [33] identified several inconsistencies in infection control measures and major differences in how national and international legislation is enforced by airlines and airports. The differences in the diagnosing quality may affect the number of cases reported. However, while focusing on controls at the point of entry may improve the identification of travellers with symptoms, in the case of COVID-19 this strategy may have given a false sense of security.
Climate and local environmental conditions may also affect the speed of local transmission and can lead to deviations from the expected number of infections. Such an association has been suggested for SARS and MERS [34], but has not yet been demonstrated in the case of COVID-19. It is still not clear how the virus reacts to temperature or air pollution, especially particulates. Both aspects will probably be researched extensively in the future.
The low level of risks suggested by both aviation data and airport controls can distract attention from what—in hindsight—proved to be the main reason the pandemic reached such levels: asymptomatic cases. The number and transmission capacity of asymptomatic cases was a “known unknown” from the beginning of the outbreak. Since COVID-19 has a long incubation period (up to or even exceeding 14 days) and tests were not yet available, the number of asymptomatic carriers could not be measured or estimated even in later stages of the pandemic.
5. Conclusions
The frequently cited work by Colizza et al. (2007) explored the predictability of stochastic dynamic models on global disease distribution and concluded that:
“… stochastic fluctuations are less important than the constraints imposed by the transport network structure that imposes an overall pattern to the epidemic evolution. In this respect, global properties provide insights on the general features of the epidemic spreading in relation with the underlying network but do not provide adequate information on the predictability of the resulting pattern” [35].
Our empirical results from modeling the spread of COVID-19 suggest the opposite, at least as regards the first part of the statement. The stochastic fluctuations of the factors that affect the air transport-related spread of the disease introduce high levels of uncertainty and consequently limit the predictability of the resulting spatial pattern.
Aviation activity analyzed with the methodology we presented here can provide an early signal for risk. Such information is important for destinations with a high number of passengers from the origin of the pandemic, since it can correctly identify potential risks. The indicator used is also a convenient measure to compare the evolution across countries and identify changes in the direction of the trends.
Soon, however, if a large enough number of disease carriers has travelled undetected, the easiness of air travel can convert a local outburst into a global pandemic. In that case, the value of aviation-based distribution models is limited, due to their systematic underestimation of risk for destinations with low passenger volume from the origin or interactions with a secondary focus of infection.
Strict measures (quarantine, flight cancelations, travel restrictions) can reduce risks significantly but come with the cost of a large social, economic and political risk. Airport controls, especially in the case of novel diseases with unknown symptoms or high asymptomatic shares, can prove inadequate and may even give a false sense of security. International collaboration and exchange of information are important elements in managing risk. While the random effects of disease propagation cannot be controlled, monitoring the situation at the global level using reliable data can significantly improve the reaction of public authorities.
As with any risk assessment approach, the evaluation of the actual risk and the costs of the possible risk mitigation strategies can pose delicate trade-offs. However, keeping in mind that modern aviation permits connecting any two points on the planet in less than 48 h, the chances for new virulent diseases spreading around the world are high and the need for better predictive tools becomes even more urgent.
Disclaimers
The views expressed are purely those of the authors and may not in any circumstances be regarded as stating an official position of the European Commission. The territorial boundaries and names indicated in this work do not imply official endorsement or acceptance by the European Union.
Author Contributions
Conceptualization, P.C. and A.C.; Data curation, P.C.; Formal analysis, P.C. and A.C.; Methodology, P.C. and A.C. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A

Figure A1.
Predicted and real numbers of monthly aviation passenger per region.
References
- Wong, G.; Liu, W.; Liu, Y.; Bi, Y.; Gao, G.F. MERS, SARS, and Ebola: The Role of Super-Spreaders in Infectious Disease. Cell Host Microbe 2015, 18, 398–401. [Google Scholar] [CrossRef]
- Poletto, C.; Boëlle, P.-Y.; Colizza, V. Risk of MERS importation and onward transmission: A systematic review and analysis of cases reported to WHO. BMC Infect. Dis. 2016, 16, 448. [Google Scholar] [CrossRef]
- Khan, K.; Arino, J.; Hu, W.; Chan, A.; Gardam, M. Spread of a novel influenza A (H1N1) Virus via global airline transportation. N. Engl. J. Med. 2009, 361, 212–214. [Google Scholar] [CrossRef]
- Pigott, D.M.; Golding, N.; Mylne, A.; Khan, K.; Hay, S.I. Mapping the zoonotic niche of Ebola virus disease in Africa. eLife 2014, 3, e04395. [Google Scholar] [CrossRef]
- Semenza, J.C.; Sudre, B.; Miniota, J.; Van Bortel, W.; Khan, K. International Dispersal of Dengue through Air Travel: Importation Risk for Europe. PLoS Negl. Trop. Dis. 2014, 8, e3278. [Google Scholar]
- Tatem, A.J.; Hay, S.I.; Rogers, D.J. Global traffic and disease vector dispersal. Proc. Natl. Acad. Sci. USA 2006, 103, 6242–6247. [Google Scholar] [CrossRef]
- Wang, L.; Wang, Z.; Zhang, Y.; Li, X. How human location-specific contact patterns impact spatial transmission between populations? Sci. Rep. 2013, 3, 1468. [Google Scholar] [CrossRef] [PubMed]
- Budd, L.; Morag, B.; Brown, T. Of plagues, planes and politics: Controlling the global spread of infectious diseases by air. Political Geogr. 2009, 28, 426–435. [Google Scholar] [CrossRef] [PubMed]
- Huang, Z.; Das, A.; Qiu, Y.; Tatem, A.J. Web-based GIS: The vector-borne disease airline importation risk (VBD-AIR) tool. Int. J. Health Geogr. 2012, 11, 33. [Google Scholar] [CrossRef] [PubMed]
- Hwang, G.M.; Mahoney, P.J.; James, J.H.; Lin, G.C.; Berro, A.D.; Keybl, M.A.; Goedecke, D.M.; Mathieu, J.J.; Wilson, T. A model-based tool to predict the propagation of infectious disease via airports. Travel Med. Infect. Dis. 2012, 10, 32–42. [Google Scholar] [CrossRef] [PubMed]
- Bajardi, P.; Poletto, C.; Ramasco, J.J.; Tizzoni, M.; Colizza, V.; Vespignani, A. Human mobility networks, travel restrictions, and the global spread of 2009 H1N1 pandemic. PLoS ONE 2011, 6, e16591. [Google Scholar] [CrossRef] [PubMed]
- Hosseini, P.; Sokolow, S.H.; Vandegrift, K.J.; Kilpatrick, A.M.; Daszak, P. Predictive power of air travel and socio-economic data for early pandemic spread. PLoS ONE 2010, 5, e12763. [Google Scholar] [CrossRef] [PubMed]
- Brockmann, D.; Helbing, D. The hidden geometry of complex, network-driven contagion phenomena. Science 2013, 342, 1337–1342. [Google Scholar] [CrossRef]
- Wu, J.T.; Leung, K.; Leung, G.M. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: A modelling study. Lancet 2020, 395, 689–697. [Google Scholar] [CrossRef]
- Zhu, Z.; Zhang, A.; Zhang, Y. Connectivity of intercity passenger transportation in China: A multi-modal and network approach. J. Transp. Geogr. 2018, 71, 263–276. [Google Scholar] [CrossRef]
- Wang, J.; Du, D.; Huang, J. Inter-city connections in China: High-speed train vs. inter-city coach. J. Transp. Geogr. 2020, 82, 102619. [Google Scholar] [CrossRef]
- SABRE. Sabre Data & Analytics Market Intelligence 6.3, Origin-Destination Tables. 2020. Available online: https://emergo5.sabre.com/community/home (accessed on 14 April 2020).
- Christidis, P. Four shades of Open Skies: European Union and four main external partners. J. Transp. Geogr. 2016, 50, 105–114. [Google Scholar] [CrossRef]
- Abate, M.; Christidis, P. The impact of air transport market liberalization: Evidence from EU’s external aviation policy. Econ. Transp. 2020, 22, 100164. [Google Scholar] [CrossRef]
- Hyndman, R.; Khandakar, Y. Automatic Time Series Forecasting: The forecast Package for R. J. Stat. Softw. 2008, 27, 1–22. [Google Scholar] [CrossRef]
- Njoya, E.T.; Christidis, P.; Nikitas, A. Understanding the impact of liberalisation in the EU-Africa aviation market. J. Transp. Geogr. 2018, 71, 161–171. [Google Scholar] [CrossRef]
- Yang, H.; Dobruszkes, F.; Wang, J.; Dijst, M.; Witte, P. Comparing China’s urban systems in high-speed railway and airline networks. J. Transp. Geogr. 2018, 68, 233–244. [Google Scholar] [CrossRef]
- Dong, E.; Du, H.; Gardner, L. An Interactive Web-Based Dashboard to Track COVID-19 in Real Time, the Lancet Infectious Diseases. 2020. Available online: https://covid19.elsevierpure.com/en/publications/an-interactive-web-based-dashboard-to-track-covid-19-in-real-time (accessed on 28 April 2020).
- Pell, B.; Kuang, Y.; Viboud, C.; Chowell, G. Using phenomenological models for forecasting the 2015 Ebola challenge. Epidemics 2018, 22, 62–70. [Google Scholar] [CrossRef] [PubMed]
- Chowell, G.; Viboud, C.; Simonsen, L.; Moghadas, S.M. Characterizing the reproduction number of epidemics with early sub-exponential growth dynamics. J. R. Soc. Interface 2016, 13, pii20160659. [Google Scholar] [CrossRef] [PubMed]
- Carias, C.; O’Hagan, J.J.; Jewett, A.; Gambhir, M.; Cohen, N.J.; Haber, Y.; Pesik, N.; Swerdlow, D.L. Exportations of symptomatic cases of MERS-CoV infection to countries outside the Middle East. Emerg. Infect. Dis. 2016, 22, 723–725. [Google Scholar] [CrossRef] [PubMed]
- Hollingsworth, T.D.; Ferguson, N.M.; Anderson, R.M. Frequent travelers and rate of spread of epidemics. Emerg. Infect. Dis. 2007, 13, 1288–1294. [Google Scholar] [CrossRef]
- De Wit, E.; Van Doremalen, N.; Falzarano, D.; Munster, V.J. SARS and MERS: Recent insights into emerging coronaviruses. Nat. Rev. Microbiol. 2016, 14, 523–534. [Google Scholar] [CrossRef]
- Young, N.; Pebody, R.; Smith, G.; Olowokure, B.; Shankar, G.; Hoschler, K.; Galiano, M.; Green, H.; Wallensten, A.; Hogan, A.; et al. International flight-related transmission of pandemic influenza A(H1N1)pdm09: An historical cohort study of the first identified cases in the United Kingdom. Influenza Other Respir. Viruses 2014, 8, 66–73. [Google Scholar] [CrossRef]
- Mangili, A.; Gendreau, M.A. Transmission of infectious diseases during commercial air travel. Lancet 2005, 365, 989–996. [Google Scholar] [CrossRef]
- Gold, L.; Balal, E.; Horak, T.; Mehmetoglu, T.; Gurbuz, O. Health screening strategies for international air travelers during an epidemic or pandemic. J. Air Transp. Manag. 2019, 75, 27–38. [Google Scholar] [CrossRef]
- Mouchtouri, V.A.; Christoforidou, E.P.; der Heiden, M.A.; Swaan, C.; Hadjichristodoulou, C. Exit and entry screening practices for infectious diseases among travelers at points of entry: Looking for evidence on public health impact. Int. J. Environ. Res. Public Health 2019, 16, 4638. [Google Scholar] [CrossRef]
- Grout, A.; Howard, N.; Coker, R.; Speakman, E.M. Guidelines, law, and governance: Disconnects in the global control of airline-associated infectious diseases. Lancet Infect. Dis. 2017, 17, e118–e122. [Google Scholar] [CrossRef]
- Sun, Z.; Thilakavathy, K.; Kumar, S.S.; He, G.; Liu, S.V. Potential factors influencing repeated SARS outbreaks in China. Int. J. Environ. Res. Public Health 2020, 17, 1633. [Google Scholar] [CrossRef] [PubMed]
- Colizza, V.; Barrat, A.; Barthelemy, M.; Valleron, A.-J.; Vespignani, A. Modeling the worldwide spread of pandemic influenza: Baseline case and containment interventions. PLoS Med. 2007, 4, e13. [Google Scholar] [CrossRef] [PubMed]
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).