Next Article in Journal
Identification of a Novel Betacoronavirus (Merbecovirus) in Amur Hedgehogs from China
Previous Article in Journal
The Effect of a Commercially Available Bacteriophage and Bacteriocin on Listeria monocytogenes in Coleslaw
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Viral Metagenomics Revealed Sendai Virus and Coronavirus Infection of Malayan Pangolins (Manis javanica)

1
Guangdong Key Laboratory of Animal Conservation and Resource Utilization, Guangdong Public Laboratory of Wild Animal Conservation and Utilization, Guangdong Institute of Applied Biological Resources, Guangzhou 510260, China
2
Guangzhou Zoo, Guangzhou 510230, China
*
Author to whom correspondence should be addressed.
Viruses 2019, 11(11), 979; https://doi.org/10.3390/v11110979
Submission received: 30 September 2019 / Revised: 21 October 2019 / Accepted: 21 October 2019 / Published: 24 October 2019
(This article belongs to the Section Animal Viruses)

Abstract

:
Pangolins are endangered animals in urgent need of protection. Identifying and cataloguing the viruses carried by pangolins is a logical approach to evaluate the range of potential pathogens and help with conservation. This study provides insight into viral communities of Malayan Pangolins (Manis javanica) as well as the molecular epidemiology of dominant pathogenic viruses between Malayan Pangolin and other hosts. A total of 62,508 de novo assembled contigs were constructed, and a BLAST search revealed 3600 ones (≥300 nt) were related to viral sequences, of which 68 contigs had a high level of sequence similarity to known viruses, while dominant viruses were the Sendai virus and Coronavirus. This is the first report on the viral diversity of pangolins, expanding our understanding of the virome in endangered species, and providing insight into the overall diversity of viruses that may be capable of directly or indirectly crossing over into other mammals.

1. Introduction

The Malayan pangolin (Manis javanica), a representative mammal species of the order Pholidota, is one of the only eight pangolin species worldwide. Four of them are from Asia (M. javanica, M. pentadactyla, M. crassicaudata and M. culionensis), whereas another four from Africa (M. tricuspis, M. tetradactyla, M. gigantea and M. temminckii) [1]. Unlike other placental mammals, the skin of pangolins is covered by large and overlapping keratinized scales [2]. Because of the huge demand for their meat as a delicacy and their scales for use in traditional medicines, pangolins are the most poached and trafficked mammal in the world. That is why all the eight pangolin species are included in the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES). Concerted efforts have been made to conserve and rescue these species in captivity in China because of their threatened status and continuing decline of the population size in the wild. At the same time, poor health condition and low immunity are also important problems for the rescue of pangolins. A previous study reported a complete genome sequence of Parainfluenza Virus 5 (PIV5) from a Sunda Pangolin (the same as Malayan Pangolin) in China, which further broadens the PIV5 infection host spectrum [3], implicating that pangolins are not only confronted with the potential of great harm from humans, but are also facing the risk of infectious diseases. Recently, a large number of viral metagenomic studies have found pathogenic viruses carried by human, pig, cow, bat, cat, horse, chicken and other animals [4,5,6,7,8,9,10], some of which successfully isolated new virus strains. However, we still know little about the diseases and their etiologies of rare and threatened terrestrial vertebrate such as pangolins.
Viruses are infectious agents that replicate only inside living cells and have the ability to infect a variety of hosts [11]. There has been a lot of discussion within the virology community regarding the best method to determine viral infectivity, pathogenicity, and effects on the host microbiome. Virologists use a variety of methods to gain understanding of infection, replication, pathogenicity, and, more recently, the evolution of the viral genome. Unbiased sequencing of nucleic acids from environmental samples has great potential for the discovery and identification of diverse microorganisms [12,13,14,15]. We know this technique as metagenomics, or random, agnostic or shotgun high-throughput sequencing. In theory, metagenomics techniques enable the identification and genomic characterization of all microorganisms present in a sample with a generic laboratory procedure [16]. The approach has gained popularity with the introduction of next-generation sequencing (NGS) methods that provide more data in less time at a lower cost than previous sequencing techniques. While initially mainly applied to the analysis of the bacterial diversity, modifications in sample preparation protocols allowed characterization of viral genomes as well. Researchers have seized the opportunity to expand our knowledge in the fields of virus discovery and biodiversity characterization [12,13,15,17].
The Guangdong Wildlife Rescue Center received 21 live Malayan pangolins from the Anti-smuggling Customs Bureau on 24 March 2019; most individuals, including adults and subadults, were in poor health, and their bodies were covered with skin eruptions. All these Malayan pangolins were rescued by the Guangdong Wildlife Rescue Center, however, 16 died after extensive rescue efforts. Most of the dead pangolins had a swollen lung which contained a frothy liquid, as well as the symptom of pulmonary fibrosis, and in the minority of the dead ones, we observed hepatomegaly and splenomegaly. We collected 21 organ samples of lung, lymph, and spleen with obvious symptoms from 11 dead Malayan pangolins to uncover the virus diversity and molecular epidemiology of potential etiologies of viruses based on a viral metagenomic study. This study will be beneficial to pangolin disease research and subsequent rescue operation.

2. Materials and Methods

2.1. Ethics Statement

The study design was approved by the ethics committee for animal experiments at the Guangdong Institute of Applied Biological Resources (reference number: GIABR20170720, 20 July 2017) and followed basic principles outlined by this committee.

2.2. Library Preparation and Sequencing

In our study, organ samples of lung, lymph and spleen were collected from dead Malayan pangolins at the Guangdong Wildlife Rescue Center. Preparation of viral-like particles followed a previous published paper [18]. Total nucleic acid was extracted from viral-like particles using a MagPure Viral DNA/RNA Mini LQ Kit (R6662-02; Magen, Guangzhou, China). Double-stranded cDNA was synthesized by reverse transcription from single-stranded and double-stranded RNA viral nucleic acids using REPLI-g Cell WGA & WTA Kit (150052; Qiagen, Hilden, Germany), while single-stranded DNA viral nucleic acids were converted to double-stranded DNA and purified by a REPLI-g Cell WGA & WTA Kit (150052; Qiagen, Hilden, Germany). Amplified DNA was randomly sheared by ultrasound sonication (Covaris M220) to produce fragments of ≤ 800 bp, and sticky ends repaired and adapters added using T4 DNA polymerase (M4211, Promega, Madison, WI, USA), Klenow DNA Polymerase (KP810250, Epicentre), and T4 polynucleotide kinase (EK0031, Thermo scientific-fermentas, Glen Burnie, MD, USA). Fragments of approximately 350 bp were collected by beads after electrophoresis. After amplification, libraries were pooled and subjected to 150 bp paired-end sequencing using the Novaseq 6000 platform (Illumina, San Diego, CA, USA). High-throughput sequencing was conducted by the Magigene Company (Guangzhou, China). The data supporting this study are openly available on the NCBI sequence read archive (SRA) under Bio Project PRJNA573298.

2.3. Raw Read Filtering and Rapid Identification of Virus Species

2.3.1. Quality Control

As raw sequencing reads always include some low-quality data, it is necessary to perform processing to improve the accuracy of reads for follow-up analyses. To this end, we used SOAPnuke version 1.5.6 [19] to remove adapter sequences and reads (i) with more than 5% Ns; (ii) those with 20% base quality values less than 20; (iii) those arising from PCR duplications; as well as (iv) those with a polyA sequence.

2.3.2. Remove Host Contamination

To avoid the confusion cause by ribosomes and host sequences, all clean reads that passed quality control were mapped to the ribosomal database (silva) and the host reference genome of M. javanica (NCBI Project ID: PRJNA256023) utilizing BWA version 0.7.17 [20]; only the unmapped sequences were used in subsequent analysis.

2.3.3. Rapid Identification of Virus Species

Clean reads without ribosomes and host sequences were mapped to an in-house virus reference data separate from the GenBank non-redundant nucleotide (NT) database to primarily identify virus reads. According to the NCBI taxonomy database annotation information, reads were classified into different virus families. To improve the accuracy, we removed the alignment results with a coverage below 5 reads.

2.4. Read Assembly and Species Identification

Clean reads were de novo assembled using MEGAHIT version 1.0 [21]. BWA version 0.7.17 [20] was used to align clean reads to assembled contigs. A host sequence was determined based on BLAST version 2.7.1 and was removed by satisfying one of the following conditions: (1) Length of matched area ≥ 500 bp, alignment similarity ≥ 90%; (2) Length of matched area accounts for more than 80% of the total length of contigs, and alignment similarity ≥ 90%. Then, Cdhit version 4.6 [22] was used to cluster the assembled virus contigs from each Malayan pangolin sample. Contigs were then classified by BLASTx against the NT database using alignment similarity ≥ 80%, length of matched area ≥ 500 bp and e-value ≤ 10−5. Contigs with significant BLASTx hits were confirmed as virus sequences.

2.5. Phylogenetic Analysis

Whole genome sequences of virus strains, the same species as the dominant viruses in Malayan pangolins, from different hosts were downloaded from ViPR database (https://www.viprbrc.org/brc/home.spg?decorator=vipr). Virus sequences from Malayan pangolins and other hosts were aligned using MAFFT version 7.427 [23] with the auto alignment strategy. The best substitution models, as well as maximum likelihood (ML) trees were then evaluated with the iqtree version 1.6.9 [24] with 1000 bootstrap replicates. Then, all the ML trees were visualized and exported as vector diagrams with FigTree version 1.4.3 (http://tree.bio.ed.ac.uk/software/figtree/).

3. Results

3.1. Viral Metagenomics

A total of 21 organ samples of lung, lymph and spleen from 11 dead Malayan pangolins that could not be rescued by the Guangdong Wildlife Rescue Center were used to reveal viral diversity of pangolins. Viral nucleic acids were deep sequenced and then we obtained a total of 227.32 GB data (757,729,773 valid reads, 150 bp in length). In total, 233,587 reads were best matched with viral proteins available in the NCBI NR database (~0.03% of the total sequence reads). The number of viral-associated reads in each sample varied from 2856 to 78,052 (Table 1). In the aggregate, 28 families of viruses were parsed (Table S1). The most widely distributed virus families were Herpesviridae and Paramyxoviridae, and the diverse reads related to these families occupied ~85% of the total viral sequence reads (Figure 1).
Contig sequences were then generated by de novo assembly using MEGAHIT version 1.0 [21], generating 62,508 unique contigs with a max. length of 13,503 bp (Table 2, Figure S1). A taxonomic assignment of these contigs was performed on the basis of BLAST analysis. At this stage, 68 contigs were confirmed for virus species, accounting for about 0.1% of the total number of contigs (Table 2). An assignment of these contigs to different types of viral genomes identified 20.59% DNA viruses and 79.41% RNA viruses, among which 14.71% were assigned to Phages. Another 3532 contigs were suspected to be assigned to virus species (Table S2). DNA viruses accounted for 66.53% while RNA viruses accounted for 33.47%, and 37.06% of these contigs were assigned to Phages. For all the unique contigs, the top 30 ones with the most reads abundance were assigned to families Paramyxoviridae, Flaviviridae and Caudovirales (Figure 2).

3.2. Sendai Virus

Sendai virus was identified in 6 of the 11 Malayan pangolin individuals, which was the common identified virus. For several of these pangolin samples, larger Sendai virus contigs were produced (Table 2). In one case, a contig of 13,232 base pairs isolated from the lung tissue of individual 19 was identified, which is about 86% of the whole genome sequence length (15,384). This contig showed relatively high sequence identity (89.76%) to the whole genome sequence of a Sendai virus strain isolated from humans (GenBank accession: AB005795). The length of other contigs conformed as Sendai virus was in the range from 608 to 7027 bp (Table 2).
Whole genome sequences of Sendai virus from human beings, mouse and monkey were downloaded from the ViPR database (https://www.viprbrc.org/brc/home.spg?decorator=vipr). After sequence alignment conducted by MAFFT version 7.427 (Katoh & Standley, 2013), the best substitution model analyzed by iqtree was GTR+F+I. Then phylogenetic analysis revealed the closest relationship between the 13,232 bp length contig from Malayan pangolin and Sendai virus strains isolated from humans (AB005795.1), but distant from strains isolated from the mouse (Figure 3). Then, we generated the phylogenetic relationships of each gene sequence. Six trees had slight differences, but the genetic distance between the Sendai virus from Malayan pangolin and humans (AB005795.1) was the closest (Figure 4); the same as the relationship between them generated based on whole genome sequences.

3.3. Coronavirus

One or several members of the Coronaviridae families were identified in 2 out of the 11 M. javanica individuals (individual 07 and 08). For several of these pangolin samples, larger contigs were produced, and the length ranged from 503 to 2330 bp (Table 3). Though there was high species variety of Coronavirus detected, SARS-CoV was the most widely distributed (Table 3). Whole genome sequences of strains belonging to four genera (Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronaviruses) isolated from different hosts were downloaded from the ViPR database (https://www.viprbrc.org/brc/home.spg?decorator=vipr). Together with 16 contigs confirmed as Coronavirus in this study, all the sequences were aligned utilizing MAFFT version 7.427 (Katoh & Standley, 2013). The best substitution model analyzed by iqtree was GTR+F+R7, and the phylogenetic analysis therefore showed multiple relationships between Coronavirus contigs and the four Coronavirus genera (Figure 5).

4. Discussion

Pangolins are important wildlife resources in imminent danger of extinction. Great efforts have been made to rescue trafficked pangolins; however, most of the pangolin individuals intercepted by customs were in a poor health condition, and then dead in a few days. Investigating the potential pathogens carried by pangolins may help to rescue them. Our viral metagenomics analysis revealed a high diversity of viruses carried by dead Malayan pangolins. The Sendai virus and Coronaviruses were dominant virus species conformed by assembled contigs, which might have some relationship with the death of Malayan pangolins. Recently, the prediction of viral zoonosis epidemics has become a major public health issue. A profound understanding of the viral population in key animal species acting as reservoirs represents an important step towards this goal. Bats are natural hosts for a large variety of zoonotic viruses. In a recently study, up to 47 different virus families were detected from bat fecal samples [25]. Over 130 virus species have been detected in bats as of 2017 [26], including several emergent human pathogens [27,28,29,30,31,32,33,34,35,36,37,38]. For domesticated animals, virome analysis between sick and health ones could help to find out the pathogens or virus diversity [8,9,39,40,41]. Our study showed that viral metagenomics analysis could also work in revealing viral diversity and potential pathogens of rare and threatened terrestrial vertebrates such as pangolins.
The Sendai virus was the most widely distributed pathogens in 11 dead Malayan pangolins, which was one of the potential causes of their death. The whole genome and individual gene phylogenies for Sendai virus sequences assembled in this study all showed that the Sendai virus from Malayan pangolin had the closest relationship with the strain isolated from humans (AB005795.1), which strongly suggests the possibility that the Sendai virus is transmitted between pangolins and humans. Sendai virus is a member of the paramyxovirus subfamily Paramyxovirinae, genus Respirovirus, members of which primarily infect mammals. The scientific community considers the Sendai virus as the archetype organism of the Paramyxoviridae family because most of the basic biochemical, molecular and biologic properties of the whole family were derived from its own characteristics [42]. Sendai virus-associated disease has a worldwide distribution and has been found in mouse colonies in Asia [43], North America [44] and Europe [45], and is responsible for a highly transmissible respiratory tract infection in mice, hamsters, guinea pigs, rats, and occasionally pigs and bats [46,47], with infection through both air and direct contact routes. Epizootic infections of mice are usually associated with a high mortality rate, while enzootic disease patterns suggest that the virus is latent and can be cleared over the course of a year. This is the first report of a wild pangolin dying possibly due to Sendai virus infection, which further broadens the Sendai virus infection host spectrum. Because of the lack of healthy individuals as a control, we could not figure out whether the Sendai virus carried by pangolins was caused by infection from other hosts or was inherited.
Besides the Sendai virus, Coronaviruses were also detected as potential pathogens of Malayan pangolins. The phylogeny of Coronavirus sequences assembled and strains from four Coronavirus genera demonstrated complex genetic relationships and high species diversity of the Coronavirus in Malayan pangolins. Coronaviruses can cause a variety of severe diseases including gastroenteritis and respiratory tract diseases, and have been identified in mice, rats, chickens, turkeys, swine, dogs, cats, rabbits, horses, cattle and humans [48,49]. Sometimes, but not often, a coronavirus can infect both animals and humans. Human coronaviruses were first described in the 1960s for patients with the common cold. Since then, more have been discovered, including those that cause severe acute respiratory syndrome (SARS) and Middle East respiratory syndrome (MERS), two pathogens that can cause fatal respiratory disease in humans [50,51]. It was recently discovered that dromedary camels in Saudi Arabia harbor three different human coronaviruses species, including a dominant MERS HCoV lineage that was responsible for the outbreaks in the Middle East and South Korea during 2015 [52]. The detection of different types of SARS-CoV in this study may also be related to the death of the Malayan pangolins. Considering the outbreak of SARS which was transmitted by masked palm civet from the natural reservoir of bats [29,53,54], Malayan pangolins could be another host with the potential of transmitting the SARS coronavirus to humans. As a consequence, the viral metagenomic study of Malayan pangolin is meaningful both for the conservation of rare wild animals and public health.

5. Conclusions

We found high viral diversity of dead Malayan pangolins, and the Sendai virus and Coronavirus may be the dominant pathogens responsible for their death. The Sendai virus showed a close relationship between the Malayan pangolin and the strain isolated from humans, whereas Coronavirus sequences showed a high species diversity. Further investigations are required to compare the incidence of these viruses in healthy and diseased pangolin individuals in order to better elucidate their pathogenic role. To date, this is the first metagenomic study of virus diversity in pangolins in China. This study expands our understanding of the viral diversity in endangered species and the capability of directly or indirectly crossing over into other mammals.

Supplementary Materials

The following are available online at https://www.mdpi.com/1999-4915/11/11/979/s1, Figure S1: Length distribution of all the unique contigs generated by de novo assembly. The horizontal axis indicates length of contigs, while the vertical axis indicates number of contigs at different stages of length, Table S1: Number of sequencing reads assigned to different virus families in each Malayan pangolin sample, Table S2: Information of contigs with moderate level of sequence similarity to known viruses. Query ID: query sequence ID; Subject ID: the target sequence ID; identity (%): the percentage of consistency of sequence alignment; Alignment Length: the length of the matched area; Mismatches: the number of mismatch bases in the matched area; Gap Openings: the number of gaps in the matched area; q Start: the starting point of the comparison region on the query sequence; q End: the termination point of the matched area on the query sequence; s Start: the starting position of the matching region on the subject sequence; s End: the termination site of the matched area on the subject sequence; e-Value: the expected value of the comparison result; Bit Score: the bit score value of the comparison result; Taxonomy: Annotation information.

Author Contributions

Funding acquisition, P.L. and J.-P.C.; Project administration, P.L.; Resources, W.C.; Supervision, J.-P.C.; Writing—original draft, P.L.; Writing—review and editing, P.L. and J.-P.C.

Funding

This project was supported by the GDAS’ Project of Science and Technology Development (grant number 2019GDASYL-0302007, 2018GDASCX-0107), the Guangzhou Science Technology and Innovation Commission (grant number 201804020080), as well as the Special Funds for Forestry Development and Protection of Guangdong Province (2019).

Conflicts of Interest

The authors declare that the research described herein was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Gaudin, T.J.; Emry, R.J.; Wible, J.R. The phylogeny of living and extinct pangolins (mammalia, pholidota) and associated taxa: A morphology based analysis. J. Mamm. Evol. 2009, 16, 235–305. [Google Scholar] [CrossRef]
  2. Meyer, W.; Liumsiricharoen, M.; Suprasert, A.; Fleischer, L.G.; Hewicker-Trautwein, M. Immunohistochemical demonstration of keratins in the epidermal layers of the Malayan pangolin (Manis javanica), with remarks on the evolution of the integumental scale armour. Eur. J. Histochem. 2013, 57, 172–177. [Google Scholar] [CrossRef] [PubMed]
  3. Wang, X.; Chen, W.; Xiang, R.; Li, L.; Chen, J.; Zhong, R.; Xiang, H.; Chen, J. Complete Genome Sequence of Parainfluenza Virus 5 (PIV5) from a Sunda Pangolin (Manis javanica) in China. J. Wildl. Dis. 2019, 55, 5–8. [Google Scholar] [CrossRef] [PubMed]
  4. Wu, Z.; Ren, X.; Yang, L.; Hu, Y.; Yang, J.; He, G.; Zhang, J.; Dong, J.; Sun, L.; Du, J.; et al. Virome Analysis for Identification of Novel Mammalian Viruses in Bat Species from Chinese Provinces. J. Virol. 2012, 86, 10999–11012. [Google Scholar] [CrossRef] [Green Version]
  5. Zhang, W.; Li, L.; Deng, X.; Kapusinszky, B.; Pesavento, P.A.; Delwart, E. Faecal virome of cats in an animal shelter. J. Gen. Virol. 2014, 95, 2553–2564. [Google Scholar] [CrossRef]
  6. Li, L.; Giannitti, F.; Low, J.; Keyes, C.; Ullmann, L.S.; Deng, X.; Aleman, M.; Pesavento, P.A.; Pusterla, N.; Delwart, E. Exploring the virome of diseased horses. J. Gen. Virol. 2015, 96, 2721–2733. [Google Scholar] [CrossRef] [Green Version]
  7. Ng, T.F.F.; Kondov, N.O.; Deng, X.; Van Eenennaam, A.; Neibergs, H.L.; Delwart, E. A Metagenomics and Case-Control Study to Identify Viruses Associated with Bovine Respiratory Disease. J. Virol. 2015, 89, 5340–5349. [Google Scholar] [CrossRef]
  8. Lima, D.A.; Cibulski, S.P.; Finkler, F.; Teixeira, T.F.; Varela, A.P.M.; Cerva, C.; Loiko, M.R.; Scheffer, C.M.; Dos Santos, H.F.; Mayer, F.Q.; et al. Faecal virome of healthy chickens reveals a large diversity of the eukaryote viral community, including novel circular ssDNA viruses. J. Gen. Virol. 2017, 98, 690–703. [Google Scholar] [CrossRef]
  9. Blomström, A.L.; Ye, X.; Fossum, C.; Wallgren, P.; Berg, M. Characterisation of the virome of tonsils from conventional pigs and from specific pathogen-free pigs. Viruses 2018, 10, 382. [Google Scholar] [CrossRef]
  10. Moreno-Gallego, J.L.; Chou, S.P.; Di Rienzi, S.C.; Goodrich, J.K.; Spector, T.D.; Bell, J.T.; Youngblut, N.D.; Hewson, I.; Reyes, A.; Ley, R.E. Virome Diversity Correlates with Intestinal Microbiome Diversity in Adult Monozygotic Twins. Cell Host Microbe 2019, 25, 261–272. [Google Scholar] [CrossRef]
  11. Noell, K.; Kolls, J.K. Further Defining the Human Virome using NGS: Identification of Redondoviridae. Cell Host Microbe 2019, 25, 634–635. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Tang, P.; Chiu, C. Metagenomics for the discovery of novel human viruses. Future Microbiol. 2010, 5, 177–189. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Chiu, C.Y. Viral pathogen discovery. Curr. Opin. Microbiol. 2013, 16, 468–478. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Culligan, E.P.; Sleator, R.D.; Marchesi, J.R.; Hill, C. Metagenomics and novel gene discovery: Promise and potential for novel therapeutics. Virulence 2014, 5, 37–41. [Google Scholar] [CrossRef]
  15. Pallen, M.J. Diagnostic metagenomics: Potential applications to bacterial, viral and parasitic infections. Parasitology 2014, 141, 1856–1862. [Google Scholar] [CrossRef]
  16. Wooley, J.C.; Ye, Y. Metagenomics: Facts and artifacts, and computational challenges. J. Comput. Sci. Technol. 2010, 25, 71–81. [Google Scholar] [CrossRef]
  17. Cardenas, E.; Tiedje, J.M. New tools for discovering and characterizing microbial diversity. Curr. Opin. Biotechnol. 2008, 19, 544–549. [Google Scholar] [CrossRef]
  18. Wei, H.Y.; Huang, S.; Yao, T.; Gao, F.; Jiang, J.Z.; Wang, J.Y. Detection of viruses in abalone tissue using metagenomics technology. Aquac. Res. 2018, 49, 2704–2713. [Google Scholar] [CrossRef]
  19. Chen, Y.; Chen, Y.; Shi, C.; Huang, Z.; Zhang, Y.; Li, S.; Li, Y.; Ye, J.; Yu, C.; Li, Z.; et al. SOAPnuke: A MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience 2018, 7, 1–6. [Google Scholar] [CrossRef]
  20. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef]
  21. Li, D.; Luo, R.; Liu, C.M.; Leung, C.M.; Ting, H.F.; Sadakane, K.; Yamashita, H.; Lam, T.W. MEGAHIT v1.0: A fast and scalable metagenome assembler driven by advanced methodologies and community practices. Methods 2016, 102, 3–11. [Google Scholar] [CrossRef] [PubMed]
  22. Li, W.; Godzik, A. Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 2006, 22, 1658–1659. [Google Scholar] [CrossRef] [PubMed]
  23. Katoh, K.; Standley, D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013, 30, 772–780. [Google Scholar] [CrossRef] [PubMed]
  24. Trifinopoulos, J.; Nguyen, L.T.; von Haeseler, A.; Minh, B.Q. W-IQ-TREE: A fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res. 2016, 44, W232–W235. [Google Scholar] [CrossRef] [PubMed]
  25. Mendenhall, I.H.; Wen, D.L.H.; Jayakumar, J.; Gunalan, V.; Wang, L.; Mauer-Stroh, S.; Su, Y.C.F.; Smith, G.J.D. Diversity and Evolution of Viral Pathogen Community in Cave Nectar Bats (Eonycteris spelaea). Viruses 2019, 11, 250. [Google Scholar] [CrossRef]
  26. Hu, D.; Zhu, C.; Wang, Y.; Ai, L.; Yang, L.; Ye, F.; Ding, C.; Chen, J.; He, B.; Zhu, J.; et al. Virome analysis for identification of novel mammalian viruses in bats from Southeast China. Sci. Rep. 2017, 7, 10917. [Google Scholar] [CrossRef]
  27. Leroy, E.M.; Kumulungui, B.; Pourrut, X.; Rouquet, P.; Hassanin, A.; Yaba, P.; Délicat, A.; Paweska, J.T.; Gonzalez, J.P.; Swanepoel, R. Fruit bats as reservoirs of Ebola virus. Nature 2005, 438, 575–576. [Google Scholar] [CrossRef]
  28. Calisher, C.H.; Childs, J.E.; Field, H.E.; Holmes, K.V.; Schountz, T. Bats: Important reservoir hosts of emerging viruses. Clin. Microbiol. Rev. 2006, 19, 531–545. [Google Scholar] [CrossRef]
  29. Wang, L.F.; Shi, Z.; Zhang, S.; Field, H.; Daszak, P.; Eaton, B.T. Review of bats and SARS. Emerg. Infect. Dis. 2006, 12, 1834–1840. [Google Scholar] [CrossRef]
  30. Swanepoel, R.; Smit, S.B.; Rollin, P.E.; Formenty, P.; Leman, P.A.; Kemp, A.; Burt, F.J.; Grobbelaar, A.A.; Croft, J.; Bausch, D.G.; et al. Studies of reservoir hosts for Marburg virus. Emerg. Infect. Dis. 2007, 13, 1847–1851. [Google Scholar] [CrossRef]
  31. Chu, D.K.W.; Poon, L.L.M.; Guan, Y.; Peiris, J.S.M. Novel Astroviruses in Insectivorous Bats. J. Virol. 2008, 82, 9107–9114. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Li, Y.; Ge, X.; Zhang, H.; Zhou, P.; Zhu, Y.; Zhang, Y.; Yuan, J.; Wang, L.-F.; Shi, Z. Host Range, Prevalence, and Genetic Diversity of Adenoviruses in Bats. J. Virol. 2010, 84, 3889–3897. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Wong, S.; Lau, S.; Woo, P.; Yuen, K.Y. Bats as a continuing source of emerging infections in humans. Rev. Med. Virol. 2007, 17, 67–91. [Google Scholar] [CrossRef] [PubMed]
  34. Freuling, C.M.; Beer, M.; Conraths, F.J.; Finke, S.; Hoffmann, B.; Keller, B.; Kliemt, J.; Mettenleiter, T.C.; Mühlbach, E.; Teifke, J.P.; et al. Novel lyssavirus in Natterer’s bat, Germany. Emerg. Infect. Dis. 2011, 17, 1519–1522. [Google Scholar] [CrossRef] [PubMed]
  35. Lau, S.K.P.; Woo, P.C.Y.; Lai, K.K.Y.; Huang, Y.; Yip, C.C.Y.; Shek, C.-T.; Lee, P.; Lam, C.S.F.; Chan, K.-H.; Yuen, K.-Y. Complete Genome Analysis of Three Novel Picornaviruses from Diverse Bat Species. J. Virol. 2011, 85, 8819–8828. [Google Scholar] [CrossRef] [Green Version]
  36. Lau, S.K.P.; Woo, P.C.Y.; Li, K.S.M.; Huang, Y.; Tsoi, H.W.; Wong, B.H.L.; Wong, S.S.Y.; Leung, S.Y.; Chan, K.H.; Yuen, K.Y. Severe acute respiratory syndrome coronavirus-like virus in Chinese horseshoe bats. Proc. Natl. Acad. Sci. USA 2005, 102, 14040–14045. [Google Scholar] [CrossRef] [Green Version]
  37. Smith, I.; Broos, A.; de Jong, C.; Zeddeman, A.; Smith, C.; Smith, G.; Moore, F.; Barr, J.; Crameri, G.; Marsh, G.; et al. Identifying hendra virus diversity in pteropid bats. PLoS ONE 2011, 6, e25275. [Google Scholar] [CrossRef]
  38. Luis, A.D.; Hayman, D.T.S.; O’Shea, T.J.; Cryan, P.M.; Gilbert, A.T.; Pulliam, J.R.C.; Mills, J.N.; Timonin, M.E.; Willis, C.K.R.; Cunningham, A.A.; et al. A comparison of bats and rodents as reservoirs of zoonotic viruses: Are bats special? Proc. R. Soc. B Biol. Sci. 2013, 280, 20122753. [Google Scholar] [CrossRef]
  39. Shan, T.; Li, L.; Simmonds, P.; Wang, C.; Moeser, A.; Delwart, E. The Fecal Virome of Pigs on a High-Density Farm. J. Virol. 2011, 85, 11697–11708. [Google Scholar] [CrossRef] [Green Version]
  40. Lager, K.M.; Ng, T.F.; Bayles, D.O.; Alt, D.P.; Delwart, E.L.; Cheung, A.K. Diversity of viruses detected by deep sequencing in pigs from a common background. J. Vet. Diagn. Investig. 2012, 24, 1177–1179. [Google Scholar] [CrossRef] [Green Version]
  41. Amimo, J.O.; El Zowalaty, M.E.; Githae, D.; Wamalwa, M.; Djikeng, A.; Nasrallah, G.K. Metagenomic analysis demonstrates the diversity of the fecal virome in asymptomatic pigs in East Africa. Arch. Virol. 2016, 161, 887–897. [Google Scholar] [CrossRef] [PubMed]
  42. Faísca, P.; Desmecht, D. Sendai virus, the mouse parainfluenza type 1: A longstanding pathogen that remains up-to-date. Res. Vet. Sci. 2007, 82, 115–125. [Google Scholar] [CrossRef] [PubMed]
  43. Fukumi, H.; Fumio, T.; Nishikawa, T. A pneumotropic virus from mice causing hemagglutination. J. Med. Sci. Biol. 1954, 7, 345–363. [Google Scholar] [CrossRef] [PubMed]
  44. Parker, J.C.; Tennant, R.W.; Ward, T.G.; Rowe, W.P. Enzootic sendai virus infections in mouse breeder colonies within the United States. Science 1964, 146, 936–938. [Google Scholar] [CrossRef] [PubMed]
  45. Zurcher, C.; Burek, J.D.; van, M.N.; Meihuizen, S.P. A naturally occurring epizootic caused by Sendai virus in breeding and aging rodent colonies. I. Infection in the mouse. Lab. Anim. Sci. 1977, 27, 955–962. [Google Scholar]
  46. Parker, J.C.; Reynolds, R.K. Natural history of sendai virus infection in mice. Am. J. Epidemiol. 1968, 88, 112–125. [Google Scholar] [CrossRef]
  47. Noh, J.Y.; Jeong, D.G.; Yoon, S.W.; Kim, J.H.; Choi, Y.G.; Kang, S.Y.; Kim, H.K. Isolation and characterization of novel bat paramyxovirus B16-40 potentially belonging to the proposed genus Shaanvirus. Sci. Rep. 2018, 8, 12533. [Google Scholar] [CrossRef]
  48. Weiss, S.R.; Navas-Martin, S. Coronavirus pathogenesis and the emerging pathogen severe acute respiratory syndrome coronavirus. Microbiol. Mol. Biol. Rev. 2005, 69, 635–664. [Google Scholar] [CrossRef]
  49. Holmes, K.V.; Lai, M.M.C. Coronaviridae: The viruses and their replication. Fields Virol. 1996, 1, 1075–1093. [Google Scholar]
  50. Guy, J.S.; Breslin, J.J.; Breuhaus, B.; Vivrette, S.; Smith, L.G. Characterization of a coronavirus isolated from a diarrheic foal. J. Clin. Microbiol. 2000, 38, 4523–4526. [Google Scholar]
  51. Assiri, A.; McGeer, A.; Perl, T.M.; Price, C.S.; Al Rabeeah, A.A.; Cummings, D.A.T.; Alabdullatif, Z.N.; Assad, M.; Almulhim, A.; Makhdoom, H.; et al. Hospital outbreak of middle east respiratory syndrome coronavirus. N. Engl. J. Med. 2013, 369, 407–416. [Google Scholar] [CrossRef] [PubMed]
  52. Sabir, J.S.M.; Lam, T.T.; Mohamed, M.M.; Li, L.; Shen, Y.; Abo-aba, S.E.M.; Qureshi, M.I.; Abu-zeid, M.; Zhang, Y.; Smith, D.K.; et al. Co-circulation of three camel coronavirus species and recombination of MERS-CoVs in Saudi Arabia. Science 2015, 351, 81–84. [Google Scholar] [CrossRef] [PubMed]
  53. Epstein, J.H.; McEachern, J.; Zhang, J.; Daszak, P.; Wang, H.; Field, H.; Li, W.; Eaton, B.T.; Wang, L.-F.; Yu, M.; et al. Bats Are Natural Reservoirs of SARS-Like Coronaviruses. Science 2005, 310, 676–679. [Google Scholar]
  54. Wang, L.F.; Eaton, B.T. Bats, civets and the emergence of SARS. Curr. Top. Microbiol. Immunol. 2007, 315, 325–344. [Google Scholar]
Figure 1. The percentage of sequences related to the most abundant viral families among all virus reads, indicated in the same colors for each main viral category. Taxonomic classification of viruses is consistent across samples. Samples are characterized according to the number of sequences from each sample classified by taxonomic family. Virus families are indicated by the color code on the bottom.
Figure 1. The percentage of sequences related to the most abundant viral families among all virus reads, indicated in the same colors for each main viral category. Taxonomic classification of viruses is consistent across samples. Samples are characterized according to the number of sequences from each sample classified by taxonomic family. Virus families are indicated by the color code on the bottom.
Viruses 11 00979 g001
Figure 2. Heatmap of contigs with the top 30 abundance of sequence reads in each sample. The pangolin samples are listed below the heatmap. Information of contigs and the virus families they belong to is provided in the right text column. The boxes colored from blue to red represent the abundance of virus reads aligned to each contig.
Figure 2. Heatmap of contigs with the top 30 abundance of sequence reads in each sample. The pangolin samples are listed below the heatmap. Information of contigs and the virus families they belong to is provided in the right text column. The boxes colored from blue to red represent the abundance of virus reads aligned to each contig.
Viruses 11 00979 g002
Figure 3. The phylogenetic tree of the Sendai virus from Malayan pangolin and other hosts. The analysis was inferred using the Maximum Likelihood method based on iqtree [24]. Branch bootstrap values are shown and were based on 1000 replicates. The black star indicates a contig of the Sendai virus from M. javanica in 2019.
Figure 3. The phylogenetic tree of the Sendai virus from Malayan pangolin and other hosts. The analysis was inferred using the Maximum Likelihood method based on iqtree [24]. Branch bootstrap values are shown and were based on 1000 replicates. The black star indicates a contig of the Sendai virus from M. javanica in 2019.
Viruses 11 00979 g003
Figure 4. The phylogenetic tree of each gene of the Sendai virus from Malayan pangolin and other hosts. The analysis was inferred using the Maximum Likelihood method based on iqtree [24]. Branch bootstrap values are shown and were based on 1000 replicates. The black star indicates a contig of the Sendai virus from M. javanica in 2019.
Figure 4. The phylogenetic tree of each gene of the Sendai virus from Malayan pangolin and other hosts. The analysis was inferred using the Maximum Likelihood method based on iqtree [24]. Branch bootstrap values are shown and were based on 1000 replicates. The black star indicates a contig of the Sendai virus from M. javanica in 2019.
Viruses 11 00979 g004
Figure 5. The phylogenetic tree of Conronavirus from Malayan pangolin and other hosts. The analysis was inferred using the Maximum Likelihood method based on iqtree [24]. Branch bootstrap values are shown and were based on 1000 replicates. The black star indicates a contig of the Sendai virus from M. javanica in 2019.
Figure 5. The phylogenetic tree of Conronavirus from Malayan pangolin and other hosts. The analysis was inferred using the Maximum Likelihood method based on iqtree [24]. Branch bootstrap values are shown and were based on 1000 replicates. The black star indicates a contig of the Sendai virus from M. javanica in 2019.
Viruses 11 00979 g005
Table 1. Overview of reads and contig sequences of lung, lymph, and spleen tissues from 11 dead Malayan pangolins.
Table 1. Overview of reads and contig sequences of lung, lymph, and spleen tissues from 11 dead Malayan pangolins.
Sample IDRaw Reads
(PE)
Number of Reads Remaining after Filtering (%)Assembly Data on Filtered Reads
Clean Reads (PE)Rm. rRNA Clean (PE)Rm. Host Clean (PE)Virus Reads (PE)Total No.Max Len.Min Len.N50GC (%)
lung0153,970,68522,900,426 (42.43)13,929,751 (60.83)13,784,503(60.19)8945 (0.04)2395705430047151.08
lung0239,738,67916,573,376 (41.71)10,760,690 (64.93)10,580,567 (63.84)10,242 (0.06)3828763830053451.02
lung0329,005,76112,967,281 (44.71)7,511,236 (57.92)7,427,749 (57.28)7456 (0.06)3380519230049047.60
lung0432,420,34313,527,964 (41.73)8,156,824 (60.30)7,838,436 (57.94)15,539 (0.11)6047739230055950.72
lung0744,500,92819,045,923 (42.80)12,466,935 (65.46)12,339,084 (64.79)6056 (0.03)2539254130042947.50
lung0839,624,36816,414,925 (41.43)10,655,020 (64.91)10,555,677 (64.31)9139 (0.06)2196696930051450.72
lung0942,219,25318,067,615 (42.79)11,552,994 (63.94)11,442,175 (63.33)13,146 (0.07)490313,50330062346.97
lung1152,714,79022,220,187 (42.15)15,402,765 (69.32)14,227,635 (64.03)11,877 (0.05)9668456030039449.82
lung1217,630,0929,275,501 (52.61)5,425,644 (58.49)5,368,963 (57.88)2856 (0.03)638286630042252.31
lung1325,571,23016,491,648 (64.49)14,588,679 (88.46)10,591,839 (64.23)7447 (0.05)7557816430049562.37
lung1939,314,71519,986,780 (50.84)9,028,100 (45.17)8,889,856 (44.48)78,052 (0.39)2,46913,23230050951.16
lymph0140,842,45218,903,834 (46.28)11,243,800 (59.48)11,117,284 (58.81)7158 (0.04)1575344230047746.47
lymphA0144,848,97320,045,443 (44.70)12,675,354 (63.23)12,580,282 (62.76)6081 (0.03)2373365130042148.74
spleen0120,058,02611,527,782 (57.47)7,566,895 (65.64)7,422,262 (64.39)3161 (0.03)945144530038249.79
spleen0235,359,89915,350,468 (43.41)9,888,746 (64.42)9,739,169 (63.45)7955 (0.05)1857611930043647.94
spleen0334,350,84819,055,973 (55.47)11,356,082 (59.59)11,244,710 (59.01)5405 (0.03)1194429030035351.60
spleen0442,861,27619,038,817 (44.42)12,498,406 (65.65)12,394,988 (65.10)7616 (0.04)1442416230048151.78
spleen0837,544,02915,975,904 (42.55)10,761,939 (67.36)10,516,975 (65.83)7191 (0.05)3516517630038647.06
spleen1135,405,98015,273,939 (43.14)9,877,753 (64.67)9,726,051 (63.68)6596 (0.04)1351526630048051.27
spleen1221,926,55412,590,769 (57.42)8,383,040 (66.58)8,298,012 (65.91)5381 (0.04)129898930041558.16
spleen1927,820,89216,068,654 (57.76)11,459,934 (71.32)10,570,867 (65.79)6288 (0.04)6367155330038142.60
Mean36,082,37016,728,72410,723,36110,317,00411,1233216548630046050.32
Standard Deviation9,831,0293,477,2182,487,2252,214,37715,630240833480674.11
Rm. rRNA clean: number and percentage of reads after removing ribosome sequence; Rm. Host clean: number and percentage of reads after removing host sequence; Virus reads: number of reads mapped to the virus database.
Table 2. Information of contigs with a high level of sequence similarity and then conformed as the Sendai virus. See table legend of Table S2 for detailed explanation of the table header.
Table 2. Information of contigs with a high level of sequence similarity and then conformed as the Sendai virus. See table legend of Table S2 for detailed explanation of the table header.
Query IDSubject IDIdentity
(%)
Alignment
Length
MismatchesGap
Openings
q Startq Ends Starts Ende-ValueBit Score
lung01|contig_245AB005795.190.152072204010603131437864490.02818
lung01|contig_302AB005795.186.03594830207800145120440.0697
lung01|contig_307DQ219803.190.0651350122534931588040.0691
lung01|contig_507DQ219803.192.271747135099184511,96713,7130.02542
lung01|contig_1156DQ219803.187.6050062099598714966501.5 × 10−176623
lung01|contig_2161AB005795.190.4613621300101146214715080.01871
lung02|contig_1124DQ219803.191.417028602217027830615,3320.09945
lung07|contig_34DQ219803.191.445965101596537659710.0845
lung07|contig_45DQ219803.191.465154401515457640620.0731
lung07|contig_444DQ219803.189.0111281240911218275038770.01476
lung07|contig_506DQ219803.190.6879474027959761830.01099
lung07|contig_798DQ219803.192.431202910104130514,13115,3320.01757
lung07|contig_1426AB005795.191.6060751013173710,16395570.0865
lung09|contig_2947DQ219803.191.74155012241154713,83515,3810.02206
lung11|contig_5506DQ219803.191.3165657072727518058350.0926
lung19|contig_164AB005795.189.7613,23113550113,2311913,2490.017,751
Table 3. Information of contigs with a high level of sequence similarity and then confirmed as Coronavirinae. See table legend of Table S2 for a detailed explanation of the table header.
Table 3. Information of contigs with a high level of sequence similarity and then confirmed as Coronavirinae. See table legend of Table S2 for a detailed explanation of the table header.
Query IDSubject IDIdentity (%)Alignment
Length
MismatchesGap
Openings
q Startq Ends Starts Ende-ValueBit
Score
Taxonomy
lung07|contig_47JX993987.180.2450610002507761171062.28 × 10−128462Bat coronavirus Rp/Shaanxi2011
lung07|contig_174KJ473815.187.161262162019128015,06916,3300.01546BtRs-BetaCoV/GX2013
lung07|contig_368KC881006.188.93105711701105728,20429,2600.01379Bat SARS-like coronavirus Rs3367
lung07|contig_715AY394981.188.685215909661617,93717,4170.0673SARS coronavirus HGZ8L1-A
lung07|contig_1748DQ412042.187.845847104662911,91912,5020.0733Bat SARS CoV Rf1/2004
lung08|contig_223KJ473814.185.522023293098212014,50912,4870.02327BtRs-BetaCoV/HuB2013
lung08|contig_286KY417145.182.29559990155918,77118,2133.95 × 10−158562Bat SARS-like coronavirus
lung08|contig_330FJ588686.180.4611672242153131713742100.01072SARS coronavirus Rs_672/2006
lung08|contig_424DQ412042.187.99608730361012,05412,6610.0767Bat SARS CoV Rf1/2004
lung08|contig_729KY417145.188.06113913601113917,32616,1880.01442Bat SARS-like coronavirus
lung08|contig_731KF569996.183.967671230276811,97811,2120.0829Rhinolophus affinis coronavirus
lung08|contig_785KF294457.183.4517222850608232921,46319,7420.01820Bat SARS-like coronavirus
lung08|contig_1292JX993988.182.616441120364624,13324,7760.0657Bat coronavirus Cp/Yunnan2011
lung08|contig_1420AY394981.188.39646750164617,33317,9780.0827SARS coronavirus HGZ8L1-A
lung08|contig_1528DQ412043.184.29681107013881819,33918,6590.0746Bat SARS CoV Rm1/2004
lung08|contig_1551GQ153548.182.60500870250124,20223,7031.73 × 10−142509Bat SARS coronavirus HKU3-13

Share and Cite

MDPI and ACS Style

Liu, P.; Chen, W.; Chen, J.-P. Viral Metagenomics Revealed Sendai Virus and Coronavirus Infection of Malayan Pangolins (Manis javanica). Viruses 2019, 11, 979. https://doi.org/10.3390/v11110979

AMA Style

Liu P, Chen W, Chen J-P. Viral Metagenomics Revealed Sendai Virus and Coronavirus Infection of Malayan Pangolins (Manis javanica). Viruses. 2019; 11(11):979. https://doi.org/10.3390/v11110979

Chicago/Turabian Style

Liu, Ping, Wu Chen, and Jin-Ping Chen. 2019. "Viral Metagenomics Revealed Sendai Virus and Coronavirus Infection of Malayan Pangolins (Manis javanica)" Viruses 11, no. 11: 979. https://doi.org/10.3390/v11110979

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop