Viruses 2014, 6(5), 2138-2154; doi:10.3390/v6052138

Evidence for Retrovirus and Paramyxovirus Infection of Multiple Bat Species in China
Lihong Yuan 1, Min Li 1, Linmiao Li 1, Corina Monagin 2, Aleksei A. Chmura 3, Bradley S. Schneider 2, Jonathan H. Epstein 3, Xiaolin Mei 1, Zhengli Shi 4, Peter Daszak 3 and Jinping Chen 1,*
Guangdong Entomological Institute, South China Institute of Endangered Animals, Guangzhou 510260, China; E-Mails: (L.Y.); (M.L.); (L.L.); (X.M.); (J.C.)
Metabiota, San Francisco, CA 94104, USA; E-Mails: (C.M.); (B.S.S.)
EcoHealth Alliance, New York, NY 10001, USA; E-Mails: (A.A.C.); (J.H.E.); (P.D.)
Wuhan Institute of Virology of Chinese Academy of Sciences, Wuhan 430071, China; E-Mail:
Author to whom correspondence should be addressed; E-Mail:; Tel.: +86-20-8910-0920; Fax: +86-20-8418-3704.
Received: 19 February 2014; in revised form: 27 April 2014 / Accepted: 6 May 2014 /
Published: 16 May 2014


: Bats are recognized reservoirs for many emerging zoonotic viruses of public health importance. Identifying and cataloguing the viruses of bats is a logical approach to evaluate the range of potential zoonoses of bat origin. We characterized the fecal pathogen microbiome of both insectivorous and frugivorous bats, incorporating 281 individual bats comprising 20 common species, which were sampled in three locations of Yunnan province, by combining reverse transcription polymerase chain reaction (RT-PCR) assays and next-generation sequencing. Seven individual bats were paramyxovirus-positive by RT-PCR using degenerate primers, and these paramyxoviruses were mainly classified into three genera (Rubulavirus, Henipavirus and Jeilongvirus). Various additional novel pathogens were detected in the paramyxovirus-positive bats using Illumina sequencing. A total of 7066 assembled contigs (≥200 bp) were constructed, and 105 contigs matched eukaryotic viruses (of them 103 belong to 2 vertebrate virus families, 1 insect virus, and 1 mycovirus), 17 were parasites, and 4913 were homologous to prokaryotic microorganisms. Among the 103 vertebrate viral contigs, 79 displayed low identity (<70%) to known viruses including human viruses at the amino acid level, suggesting that these belong to novel and genetically divergent viruses. Overall, the most frequently identified viruses, particularly in bats from the family Hipposideridae, were retroviruses. The present study expands our understanding of the bat virome in species commonly found in Yunnan, China, and provides insight into the overall diversity of viruses that may be capable of directly or indirectly crossing over into humans.
bat; metagenomics; hipposideridae; pathogens; viral emergence; virome; zoonoses

1. Introduction

Bats (order Chiroptera) comprise the second largest mammalian group in the world and are widely distributed across six continents [1]. Numerous bat species have been implicated as natural reservoirs for significant zoonotic viruses, including Ebola and, Marburg viruses; Nipah and Hendra viruses; SARS coronavirus; and most recently [2,3,4,5]. Collectively, bats harbor a broad diversity of viruses, and it has been proposed that entire viral families and genera, such as pararmyxoviruses, hepaciviruses, and pegiviruses, may have originated from bats [6,7]. More than 170 viruses have been detected in bats, and many of these viruses are highly pathogenic to humans [8], including, Nipah and Hendra viruses [9], rabies virus [10], Australian Bat Lyssavirus [11], Marburg virus [12], and the SARS-like coronavirus [13]. Notably, the outbreak of SARS coronavirus in 2002, which originated in horseshoe bats in China and infected more than 8000 people globally, has spurred increased efforts from the international research community to characterize the diversity and distribution of bat viruses around the world [4,14].

Over 120 bat species have been identified in China, and many are widely distributed throughout the southern provinces of Yunnan, Guangdong, Guangxi, and Fujian [15]. These more ubiquitous bat species, which are in the genera of Rousettus, Myotis, Miniopterus and Hipposideros, naturally reside in close proximity to humans, thus increasing the potential of transmission of zoonotic pathogens to humans. Highly pathogenic paramyxoviruses have been identified in a variety of vertebrates, including humans, and phylogenetic reconstruction of host associations suggests a predominance of host switches from bats to other mammals and birds [6], and often cause serious outbreaks of diseases [16,17]. Thus, assessing the variety of paramyxoviruses circulating in bats in China could provide important guidance for the control and prevention of future epidemics.

Recently, high-throughput sequencing (e.g., Illumina, Solexa, etc.)—a rapid and efficient technique using sequence-independent amplification of nucleic acids followed by shotgun sequencing—has been employed with great success to discover an enormous diversity of viruses in a range of samples, including marine and fresh water [18,19], animals tissues [20,21], human feces, and bat fecal, urine and oral samples [22,23,24,25,26,27,28,29]. In this study, we used reverse transcription polymerase chain reaction (RT-PCR) to survey the prevalence of paramyxoviruses in bats from Yunnan China, and then conducted Illumina sequencing to characterize paramyxoviruses from bat samples. To date, this is the first reported metagenomic analysis in fecal samples of both insectivorous and frugivorous bats in China.

2. Experimental

2.1. Ethics Statement

This study was approved by the Guangdong Entomological Institute Administrative Panel on Laboratory Animal Care (Guangzhou, China). All bats were released unharmed immediately after sampling.

2.2. Sample Collection and Viral Nucleic Acids Preparation

Between November 2011 and March 2012, a total of 562 oral and rectal swab samples from 281 individual bats of 20 species were collected at multiple sites in the Yunnan Province of China. Of the samples collected, 252 samples (44.84%) were from 126 insectivorous bats and 310 samples (55.16%) from 155 frugivorous bats. The sampling locations include Yuanjiang Gulong hole (N23°35.180', E101°57.540'; H: 405 m), Xishuangbanna Botanical Garden (N: 21°55.368', E: 101°15.235', H: 535 m), and the Natural Arch (N: 21°59.235', E: 101°21.418', H: 919 m). Samples were suspended in a phosphate buffer solution (PBS) with antibiotics and stored at −80 °C until nucleic acid extraction. For the sampling details, such as numbers, locations and specific bat species, see Table S1.

To extract viral nucleic acid, the swab suspensions were centrifuged at 5000× g for 10 min. Then, the supernatant was transferred to fresh tubes and centrifuged at 12,000× g for 20 min. The viral DNA and RNA were simultaneously extracted from a 140 μL sample with the QIAamp Viral RNA Mini Kit (QIAgen, Düsseldorf, Germany) according to the manufacturer’s protocol and eluted into 60 μL AVE buffer.

2.3. Screening of Paramyxoviruses and Phylogenetic Analysis

Sensitive and broadly reactive RT-PCR assays were performed at Wuhan Institute of Virology, Chinese Academy of Sciecnces. In this study, the primers, which were designed according to the conserved motifs of the RNA polymerase (L)-coding sequence, were from the reference Tong et al. [30], and could be used for identification of novel paramyxoviruses. The 560 bp fragments of the RNA polymerase (L)-coding sequence conserved in the Paramyxovirinae subfamily were amplified, according to the reference Tong et al. [30] with no modification. The PCR productions were sequenced using Big Dye Terminator kits (Life Technologies, Carlsbad, CA, USA) on an ABI 3730 automated sequencer with primer PAR-R.

To assess the phylogenetic relationship of paramyxoviruses identified in this study, 79 partial and full L-gene nucleotide sequences were downloaded from NCBI with the GenBank accession numbers listed in Table S2. Nucleotide sequences were then aligned and translated by using ClustalX 1.83 [31]. The best-fit model (GTR + I + G) of the phylogenetic relationship was determined by Modeltest 3.7 [32] and the phylogenetic trees of L-gene were constructed by MrBayes 3.1.1 [33], with the sequences of Pneumovirinae subfamily as an outgroup.

2.4. High-throughput Sequencing and Pathogen Analysis

To further characterize the co-infected viruses in bat paramyxovirus-positive samples, Illumina high-throughput sequencing was conducted. Total nucleic acids were extracted as described above. Viral nucleic acid samples were then pooled, and a 3 μg pooled sample was used for sequencing library preparation. The synthesis of first and second-strand cDNA from the viral RNA was performed using random oligonucleotides/SuperScript II and DNA Polymerase I/RNase H, respectively. After the 3'-end adenylation, DNA fragments were ligated with Illumina PE adapters on both sides to purify and selectively enrich the 200 bp fragments using the AMPure XP system (Beckman Coulter, Fullerton, CA, USA) and Illumina PCR Primer Cocktail in a 10 cycle PCR reaction. Finally, a sequencing library was generated by Illumina TruSeqTM RNA Sample Preparation Kit (Illumina, San Diego, CA, USA) following the manufacturer’s recommendations and four index codes were added to attribute sequences. The clustering of the index-coded samples was performed on a cBot Cluster Generation System using TruSeq PE Cluster Kit v3-cBot-HS (Illumina, Inc.) according to the manufacturer’s instructions. After cluster generation, the library preparations were sequenced on an Illumina Hiseq 2000 platform and 100 bp paired-end reads were generated. Reads that were contaminated with adapter or of low quality, as well as poly-N reads, were removed from raw data and clean reads were obtained.

2.5. Identification of Viral Homologous Sequences

Clean reads from Illumina were compiled with de novo assembly, using de Bruijn graphs assembly algorithms [34], and contigs <200 bp in length were not analyzed further. Remaining sequences underwent sequential BLASTx searching against GenBank database to eliminate the bacteria and eukaryotes contigs and identify suspect-viral sequences, which were longer than 200 bp, and taxonomic classification was queried from the NCBI taxonomy web service. The reference sequence of the source organism with the best contig alignment (i.e., the alignment with the lowest e-value ≤ 0.0001) was retrieved. Contigs of bacteria and eukaryotes were eliminated. If the reference sequence taxonomy was viral and aligned with the contig with an e-value of ≤0.0001, the sequence was flagged as suspect-viral and retained for further analysis.

2.6. Phylogenetic Analysis of Viral Sequences

Suspect viral sequences related to invertebrate virus families were excluded. Reference sequences were downloaded from NCBI. Global alignments with contigs were generated using ClustalX 1.83 [31], translated in MEGA 5 [35] and edited with BIOEDIT v7.0 [36]. Gap-stripped alignments were then used to construct the phylogenetic trees by MrBayes 3.1.1 [33].

2.7. Nucleotide Sequence Accession Numbers

The Illumina sequence data obtained in this study have been deposited in the Sequence Read Archive (SRA) database (Accession No. SRX368740). The trimmed and binned vertebrate viral contigs (≥200 bp) used for phylogenetic analysis in this study were deposited in GenBank (Accession No. KF547868-KF547871).

3. Results

3.1. Detection of Paramyxoviruses

Among the samples, seven fecal samples of seven individual bats were identified as paramyxovirus-positive, and in these bats, two individuals belong to Hipposideros cineraceus (sampled in the Yuanjiang Gulong hole), one of Rousettus leschenaultii, one of Eonycteris spelaea, one of Hipposideros armiger (sampled from the Xishuangbanna Botanical Garden, Mengla, China), and three Taphozous melanopogon (sampled in Natural Arch, Mengla, China) (Table S1). The ~560-bp L-gene sequences were submitted to GenBank (Accession No. KC599255, KC599257- KC599261, KC599263).

The phylogenetic tree of the L-gene, based on a 529 bp alignment is shown in Figure 1. The phylogenetic analysis indicates that the paramyxovirus sequences identified in this study are separated into three distinct genera. Of them, KC599259 identified in R. leschenaultii was clustered with Rubulavirus. KC599257 detected in E. spelaea showed a close relationship with paramyxoviruses from Eidolon helvum in urban Africa, which formed an unclassified sister clade to the genus Henipavirus. The third cluster is that of five novel paramyxoviruses sharing a common ancestor and forming a phylogenetically diverse subgroup, closely related to genus Jeilongvirus, comprising Jeilongvirus (J-virus) and Beilong virus [37]. Moreover, this cluster can be further divided into two clades, KC599255 and KC599258 from Hipposideridae; and KC599260, KC599261, and KC599263 from T. melanopogon (family Emballonuridae). The deduced amino acid identity analysis supported the phylogenetic relationship observed using the nucleotide sequences (Table S3). KC599259 had the lowest homology (37.58%–40.61%) with other paramyxovirus sequences identified in this study and had a higher identity (61.49%–78.74%) with known Rubulavirus and Rubulavirus-related virus. The novel paramyxoviruses (KC599255, KC599258, KC599260, KC599261 and KC599263) from insectivorous bats shared the highest amino acid identities (74.55%–100%), and had a relatively high homology with Jeilongvirus (74.25%–78.44%) and KC599257 (59.39%–64.85%). On the other hand, KC599257 was most closely related to Henipaviruses (92.94%).

3.2. In-Depth Analysis of Pathogens by Illumina High-throughput Sequencing

Individual sequence reads with base quality scores were produced by Illumina. After removing the contaminant reads (0.01% the adapter reads, 2.43% low quality reads and 0.38% containing N reads), a total of 7,963,701 clean reads (97.15%) were obtained in this study (Figure 2A).

3.2.1. Assembled Contigs and BLASTx Analysis

A total of 7066 assembled contigs (≥200 bp) were constructed and 6880 contigs with e-value ≤ 0.0001 were obtained. Data showed that the longest contig was 6481 bp and 310 contigs were longer than 1000 bp. A total of 73.1% (5029/6880) of contigs associated with microorganisms. 118 contigs that were homologous to eukaryotic viruses and phage. Moreover, we found 17 contigs that were homologs of parasites (Table 1, Figure 2, Table S4 and Table S5).

Viruses 06 02138 g001 200
Figure 1. Phylogenetic analysis of paramyxovirus L-gene identified by RT-PCR. Phylogenetic tree of paramyxoviruse L-gene was constructed based on a 529 bp sequence alignment with Pneumovirinae subfamily as an outgroup. The accession numbers of sequences identified in this study are red and posterior probability values are shown next to the tree nodes. Posterior probability values are shown at each node (>70%) and the bar represents the expected number of amino acid substitutions per site.

Click here to enlarge figure

Figure 1. Phylogenetic analysis of paramyxovirus L-gene identified by RT-PCR. Phylogenetic tree of paramyxoviruse L-gene was constructed based on a 529 bp sequence alignment with Pneumovirinae subfamily as an outgroup. The accession numbers of sequences identified in this study are red and posterior probability values are shown next to the tree nodes. Posterior probability values are shown at each node (>70%) and the bar represents the expected number of amino acid substitutions per site.
Viruses 06 02138 g001 1024
Table Table 1. Details of assembled contigs related to eukaryotic viruses and phages as determined with Blastx and the GenBank database. Detailed information is shown in Table S6.

Click here to display table

Table 1. Details of assembled contigs related to eukaryotic viruses and phages as determined with Blastx and the GenBank database. Detailed information is shown in Table S6.
CladeFamilyGenusVirus NameContigs
Vertebrate virusRetroviridaeGammaretrovirusHuman endogenous retrovirus24
Murine leukemia virus5
Spleen necrosis virus4
Moloney murine leukemia virus3
Porcine endogenous retrovirus2
Woolly monkey sarcoma virus2
Friend murine leukemia virus1
Feline leukemia virus1
Xenotropic Murine Leukemia Virus1
Chick syncytial virus1
Gammaretrovirus RfRV/China/20111
Unknown gammaretrovirusReticuloendotheliosis virus40
Bat gammaretrovirus1
Baboon endogenous virus strain M71
Rousettus leschenaultii retrovirus1
BetaretrovirusOvine enzootic nasal tumor virus (ENTV)2
Jaagsiekte sheep retrovirus2
Squirrel monkey retrovirus2
Simian endogenous retrovirus1
UnclassifiedSimian retrovirus4
Multiple sclerosis associated retrovirus1
PolyomaviridaePolyomavirusSTL polyomavirus1
Hamster polyomavirus1
Merkel cell polyomavirus1
Insect virus Solenopsis invicta virus 31
Mycovirus Grapevine partitivirus1
PhageMyoviridaeT4-like virusAcinetobacter phage Ac421
MyoviridaeT4-like virusAcinetobacter phage Acj611
MyoviridaeT4-like virusAcinetobacter phage Acj92
MyoviridaeT4-like virusAeromonas phage 44RR2.8t1
MyoviridaeT4-like virusAeromonas phage Aeh11
MyoviridaeT4-like virusAeromonas phage Aes5081
MyoviridaeT4-like virusEnterobacteria phage Bp71
MyoviridaeT4-like virusEnterobacteria phage JSE1
MyoviridaeT4-like virusEnterobacteria phage Phi11
MyoviridaeT4-like virusEnterobacteria phage RB691
Myoviridaeunclassified MyoviridaeSalmonella phage STML-1981
Myoviridaeunclassified MyoviridaeYersinia phage phiR1-RT1
Viruses 06 02138 g002 200
Figure 2. Schematic summary of the number of Illumina high-throughput sequencing reads. (A) Classification of raw reads, and contigs (≥200 bp) compared with Genbank using BLASTx searches (e-value < 0.0001); (B) Reads related to eukaryotic viruses and phages.

Click here to enlarge figure

Figure 2. Schematic summary of the number of Illumina high-throughput sequencing reads. (A) Classification of raw reads, and contigs (≥200 bp) compared with Genbank using BLASTx searches (e-value < 0.0001); (B) Reads related to eukaryotic viruses and phages.
Viruses 06 02138 g002 1024

3.2.2. Identification of Bacteriophage Sequences

There were 13 contigs related to the sequences of known phages with a similarity of 61%–89% and an e-value of 6e−71–2.12e−5. All of them were members of Myoviridae and 11 contigs were related to T4-like entero-bacterial phages (Table 1, Figure 2B and Table S5).

3.2.3. Analysis of Vertebrate Viral Sequences by Family

Thirteen contigs of phage, 1 insect-virus, and 1 mycovirus contig were removed, and 103 contigs belonging to two mammalian-infecting viral families were left. Of them, 97% (100/103) were related to retroviruses and three contigs were related to a polyomavirus of double-stranded DNA (dsDNA). For the 103 contigs of vertebrate viruses, the amino acid sequences were 35% to 90% homologous with viral protein sequences (e-value ≤ 0.0001). In these, 79 cotigs displayed low identity (<70%) to known viruses at the amino acid level, suggesting that these belong to novel and genetically divergent viruses. The nucleotide BLASTx searches against the GenBank database are listed in Table S6.

3.2.4. Retroviridae

We identified 100 contigs, primarily related to three genera of retroviruses (Gamma, beta and unclassified retroviruses), and with e-values of 5e−117–1e−4 (Figure 3A, also see Table S6). These retroviral sequences related to all three canonical genes of retrovirus, including 28 Gag proteins, 66 protease/polymerase (Pol) and 6 envelope glycoproteins (Env), and some of them are highly closed to known bat tretroviruses [38,39,40]. Three retrovirual contigs, which code Pol, Gag and Env of gammaretrovirus respectively and have the longest CDS, were used for phylogenetic analysis. Due to the amplification of the short retroviruses sequences, we have adjusted the tree according to the published bat retrovirus trees [39,40] by removing some of the reference sequences. The phylogenetic trees indicated that Pol (KF547868) and Gag (KF547870) showed high homologous with bat gammaretroviruses, whereas Env (KF547869) couldn’t clade with those of bat gammaretroviruses indicating the possibility of new viruses (Figure 3B–D). Moreover, sequence analysis indicated that the translations of 55 of the 100 retrovirus contigs contained stop codons within the region of BLASTx alignments, suggesting that they originated from non-functional, endogenous/exogeneous retroviruses. In addition, BLASTx searching indicated that many contigs (such as comp6291_c0_seq1, comp48905_c0_seq1, comp8098_c0_seq1, comp6984_c0_seq1, comp92776_c0_seq1, comp71626_c0_seq1 and comp36376_c0_seq1) shared equivalent or relatively higher amino acid identity (52%–85%, e-values of 7e−12–4e−35) with poxvirus rather than retroviruses, with Fowlpox virus (FPV) being the closest relative.

3.2.5. Polyomaviridae

We assembled two sequences related to the VP1 capsid protein and a sequence related to the large T antigen protein of polyomaviruses (Table S6). They are phylogenetically clustered with hamster polyomavirus, merker cell polyomavirus and STL polyomavirus respectively, and with high confidence of e-values of 2e−15–2e−53 (Table S6 and Figure 4).

Viruses 06 02138 g003 200
Figure 3. Classification and phylogenetic analysis of retroviral contigs. (A) The chart shows proportions of retroviral sequences related to different retroviral genera. The number of sequences related to each genus is shown in parentheses; (B)–(D) The midpoint-rooted phylogenetic tree based on alignments of 83 aa Pol (starred), 142 aa Gag (starred) and 150 aa Env (starred) of bat retroviruses, respectively. Posterior probability values are shown at each node and the bar represents the expected number of amino acid substitutions per site. Sequence information is shown in Table S7.

Click here to enlarge figure

Figure 3. Classification and phylogenetic analysis of retroviral contigs. (A) The chart shows proportions of retroviral sequences related to different retroviral genera. The number of sequences related to each genus is shown in parentheses; (B)–(D) The midpoint-rooted phylogenetic tree based on alignments of 83 aa Pol (starred), 142 aa Gag (starred) and 150 aa Env (starred) of bat retroviruses, respectively. Posterior probability values are shown at each node and the bar represents the expected number of amino acid substitutions per site. Sequence information is shown in Table S7.
Viruses 06 02138 g003 1024
Viruses 06 02138 g004 200
Figure 4. Phylogenetic analysis of polyomaviral contig. The midpoint-rooted phylogenetic tree based on 111 aa VP1 protein of polyomavirus (starred). Posterior probability values are shown at each node and the bar represents the expected number of amino acid substitutions per site.

Click here to enlarge figure

Figure 4. Phylogenetic analysis of polyomaviral contig. The midpoint-rooted phylogenetic tree based on 111 aa VP1 protein of polyomavirus (starred). Posterior probability values are shown at each node and the bar represents the expected number of amino acid substitutions per site.
Viruses 06 02138 g004 1024

4. Discussion

With the increase of human activities, habitat and survival of bats are severely reduced, thus causing habitat shifts in bat populations and increased opportunity for contact with humans. The close contact between humans and bats expands the opportunity for cross-species transmission of new zoonotic viruses to humans [41]. In the current study, we identified seven bats [including three insectivorous bats (Hipposideros cineraceus, H. armiger and Taphozous melanopogon) and two frugivorous bats (Eonycteris spelaea and Rousettus leschenaultii)] positive for paramyxovirus by RT-PCR. Phylogenetic analysis indicated that paramyxoviruses carried by insectivorous bats were distinct from those found in frugivorous bats (Figure 1). Virus sequences from insectivorous bats could not be definitely assigned to any existing paramyxovirus genus, however, they appeared to cluster within the relatively new genus of the Jeilongvirus [37]. Moreover, we found that KC599257 (E. spelaea) and KC599259 (R. leschenaulti) from fruit bats belonged to henipavirus-related paramyxovirus and Rubulavirus genus, respectively. The finding of a henipavirus-related agent in E. spelaea provides further evidence that there is a large spectrum of henipa-like viruses among bats, including insectivorous species. Li et al. [42] previously found serological evidence of henipa-like viruses in bats, including Rousettus spp., Myotis spp., Hipposideros spp., and Miniopterus spp. in China. Henipaviruses have thus far overwhelmingly been identified in Pteropus and Eidolon species, though this may, in part, be due to a lack of surveillance in non-pteropodid bats [2,17,43]. The phylogenetic distance of species may constrain both cross-species transmission and host shifts and thus result in the separation of viruses [44]. Previous studies demonstrate that humans can be infected by paramyxoviruses through a secondary host species, such as horses and pigs, or directly from bats [9,45,46,47]. Although it is currently unknown whether paramyxoviruses, such as Jeilongvirus, are capable of infecting people, the potential risks of bat borne paramyxoviruses to human health should be considered and surveys of human populations highly exposed to bats for evidence of spillover of bat paramyxoviruses would help establish the public health risk.

High-throughput sequencing-based viral metagenomics is a powerful tool to explore known and unknown viruses existing in host animals. Recently, it has been successfully used to detect viruses in bats in North America, Africa and Asia [22,23,26,27,28], where the viromes of one frugivorous bat species, E. helvum in Africa, and several insectivorous bats were described. From our metagenomic study of paramyxovirus-positive bat samples, full-length viral genomic sequences were unable to be assembled from the clean reads. This result is similar with that of previous studies and may be due to the low quantity of the virus within samples, the short reads, and the lack of known reference sequences [22,23,26]. Previous studies demonstrated that protein-based comparisons are more effective than those based on nucleotides, and thus BLASTx usually identifies more suspect-viral sequences than BLASTn [28,48]. Among the viral contigs, most of the bacterioviruses detected in this study are enterobacterial phages, suggesting the similarity between the intestinal bacterial population of bats and that of humans. Five contigs are highly similar to Moloney Murine Leukemia Virus (M-MLV), which may have resulted from contamination of RT reagents and also have been detected in previous virome analysis of insectivorous bats in China [26]. Many contigs are closed to bat retrovirues, which were identified by previous studies [38,39,40]. Furthermore, we found that many viral contigs showed equivalent or higher identity with Fowlpox virus (FPV) more than Reticuloendotheliosis virus (REV). As a prototypical member of Avipoxvirus, FPV only infects non-mammalian hosts [49], and there is no previous data available to indicate that bats are natural reservoirs or vectors of FPV. Previous studies show that REVs derive directly from mammalian retroviruses and integrate into FPV genomes, which become endogenous in the genome of larger and more complex DNA viruses [50,51,52]. Thus, these FPV-like contigs were deemed as REVs and classified into unknown gammaretroviruses temporarily, and the source of FPV-like contigs in this study should be explored further.

Our sequence analysis indicates that majority of the vertebrate viral sequences were distinct from those previously identified in bats and were often diverse within the viral family. The novel viruses identified were classified into two viral families, Retroviridae and Polyomaviridae, which were also detected in previous studies [26,27,28]. Retroviruses have both endogenous and exogenous forms in nature, comparing the results from this study with previous studies, metagenomic virome analysis indicated that retroviruses are the most common viral sequences found in bats, especially in Hipposideridae [26,29]. This finding may be due to the frequent integration of retroviral genomic material into host genomes, rather than an indication of viral infection. Interestingly, the Illumina screening did not detect paramyxovirus nucleic acid, which had been detected by RT-PCR, suggesting that these viruses may be present in samples in relatively small amounts and below the sensitivity of Illumina detection. These findings highlight an important limitation of high throughput sequencing as a tool for virome characterization. Furthermore, the limited size or number of contigs generated by Illumina sequencing may necessitate follow-up with conventional PCR assays in order to obtain enough sequence to perform meaningful phylogenetic analyses [6,28]. The sensitivity of high-throughput sequencing to detect viral nucleic acid is also limited if samples are not properly filtered and treated to remove host genome and other non-target nucleic acid and enrich for viral sequences. Host nucleic acid are typically present in significantly higher abundance relative to viral nucleic acid and can create background noise in the results [26]. As the cost of high-throughput sequencing decreases and methods of sample preparation improve to optimize results, Illumina sequencing can be an effective tool for characterizing virus diversity in host organisms. Bats represent a massively diverse group of mammals, and as a group they have been increasingly identified as natural reservoirs for virus families such as paramyxoviruses, coronaviruses, and retroviruses, that contain zoonotic members. Emerging zoonotic bat-borne viruses, such as SARS coronavirus; Nipah and Hendra virus; and Marburg and Ebola virus; illustrate that bat viruses are capable of infecting people through various routes of transmission and in various contexts such as through domestic animals, food-borne routes, or the wildlife trade. Understanding the underlying diversity of viruses within common bat species and the types of interactions they have with people, wildlife and livestock in China may provide further insight into the risk of viral spillover, and may ultimately allow for interventions that reduce the risk of outbreaks.

5. Conclusions

We, for the first time, carry out the metagenomic analysis in fecal samples of both insectivorous and frugivorous bats in China. The present study contributes to the understanding of the bat virome, and sheds light on the overall diversity of zoonotic viruses.

Supplementary Files

  • Supplementary File 1:

    Supplementary Information (PDF, 928 KB)

  • Supplementary File 1:

    Supplementary Table (XLS, 62 KB)

  • Acknowledgment

    This work was supported by CAS Cross-disciplinary Collaborative Teams Program for Science, Technology and Innovationand was made possible by the generous support of the American people through the United States Agency for International Development (USAID) Emerging Pandemic Threats PREDICT. The contents are the responsibility of the authors and do not necessarily reflect the views of USAID or the United States Government. LH Yuan is supported by the NSFC (31301012), the Guangzhou Pearl River Scientific and Technological New Star Project (2012J2200003), and the Outstanding Youth Science and Technology Talent Fund of Guangdong Academy of Sciences (rcjj201402).

    Author Contributions

    Lihong Yuan and Jinping Chen conceived and designed the experiments. Lihong Yuan, Min Li, Linmiao Li and Xiaolin Mei carried out the lab work. Lihong Yuan and Min Li participated in data analysis. Lihong Yuan, Jinping Chen, Corina Monagin, Aleksei Chmura, Bradley S. Schneider, Jonathan H. Epstein, Zhengli Shi and Peter Daszak wrote the paper.

    Conflicts of Interest

    The authors declare no conflict of interest.

    References and Notes

    1. Jones, K.E.; Patel, N.G.; Levy, M.A.; Storeygard, A.; Balk, D.; Gittleman, J.L.; Daszak, P. Global trends in emerging infectious diseases. Nature 2008, 451, 990–993, doi:10.1038/nature06536.
    2. Halpin, K.; Hyatt, A.D.; Fogarty, R.; Middleton, D.; Bingham, J.; Epstein, J.H.; Rahman, S.A.; Hughes, T.; Smith, C.; Field, H.E.; et al. Pteropid bats are confirmed as the reservoir hosts of henipaviruses: A comprehensive experimental study of virus transmission. Am. J. Trop. Med. Hyg. 2011, 85, 946–951, doi:10.4269/ajtmh.2011.10-0567.
    3. Leroy, E.M.; Kumulungui, B.; Pourrut, X.; Rouquet, P.; Hassanin, A.; Yaba, P.; Delicat, A.; Paweska, J.T.; Gonzalez, J.P.; Swanepoel, R. Fruit bats as reservoirs of Ebola virus. Nature 2005, 438, 575–576, doi:10.1038/438575a.
    4. Li, W.; Shi, Z.; Yu, M.; Ren, W.; Smith, C.; Epstein, J.H.; Wang, H.; Crameri, G.; Hu, Z.; Zhang, H.; et al. Bats are natural reservoirs of SARS-like coronaviruses. Science 2005, 310, 676–679, doi:10.1126/science.1118391.
    5. Towner, J.S.; Amman, B.R.; Sealy, T.K.; Carroll, S.A.R.; Comer, J.A.; Kemp, A.; Swanepoel, R.; Paddock, C.D.; Balinandi, S.; Khristova, M.L. Isolation of genetically diverse Marburg viruses from Egyptian fruit bats. PLoS Pathog. 2009, 5, e1000536, doi:10.1371/journal.ppat.1000536.
    6. Drexler, J.F.; Corman, V.M.; Muller, M.A.; Maganga, G.D.; Vallo, P.; Binger, T.; Gloza-Rausch, F.; Rasche, A.; Yordanov, S.; Seebens, A. Bats host major mammalian paramyxoviruses. Nat. Commun. 2012, 3, doi:10.1038/ncomms1796.
    7. Quan, P.-L.; Firth, C.; Conte, J.M.; Williams, S.H.; Zambrana-Torrelio, C.M.; Anthony, S.J.; Ellison, J.A.; Gilbert, A.T.; Kuzmin, I.V.; Niezgoda, M. Bats are a major natural reservoir for hepaciviruses and pegiviruses. Proc. Natl. Acad. Sci. USA 2013, 110, 8194–8199, doi:10.1073/pnas.1303037110.
    8. Calisher, C.H.; Childs, J.E.; Field, H.E.; Holmes, K.V.; Schountz, T. Bats: Important reservoir hosts of emerging viruses. Clin. Microbiol. Rev. 2006, 19, 531–545, doi:10.1128/CMR.00017-06.
    9. Epstein, J.H.; Prakash, V.; Smith, C.S.; Daszak, P.; McLaughlin, A.B.; Meehan, G.; Field, H.E.; Cunningham, A.A. Henipavirus infection in fruit bats (Pteropus giganteus), India. Emerg. Infect. Dis. 2008, 14, 1309–1311, doi:10.3201/eid1408.071492.
    10. King, A.; Davies, P.; Lawrie, A. The rabies viruses of bats. Vet. Microbiol. 1990, 23, 165–174, doi:10.1016/0378-1135(90)90146-M.
    11. McCall, B.J.; Epstein, J.H.; Neill, A.S.; Heel, K.; Field, H.; Barrett, J.; Smith, G.A.; Selvey, L.A.; Rodwell, B.; Lunt, R. Potential exposure to Australian bat lyssavirus, Queensland, 1996–1999. Emerg. Infect. Dis. 2000, 6, 259–264, doi:10.3201/eid0603.000305.
    12. Slenczka, W.G. The Marburg virus outbreak of 1967 and subsequent episodes. Curr. Top. Microbiol. Immunol. 1999, 235, 49–75.
    13. Guan, Y.; Zheng, B.J.; He, Y.Q.; Liu, X.L.; Zhuang, Z.X.; Cheung, C.L.; Luo, S.W.; Li, P.H.; Zhang, L.J.; Guan, Y.J.; et al. Isolation and characterization of viruses related to the SARS coronavirus from animals in southern China. Science 2003, 302, 276–278, doi:10.1126/science.1087139.
    14. Lau, S.K.; Woo, P.C.; Li, K.S.; Huang, Y.; Tsoi, H.W.; Wong, B.H.; Wong, S.S.; Leung, S.Y.; Chan, K.H.; Yuen, K.Y. Severe acute respiratory syndrome coronavirus-like virus in Chinese horseshoe bats. Proc. Natl. Acad. Sci. USA 2005, 102, 14040–14045, doi:10.1073/pnas.0506735102.
    15. Wang, Y.X. A Complete Checklist of Mammal Species and Subspecies in China: A Taxonomic and Geographic Reference; China Forestry Press: Beijing, China, 2003.
    16. Parrish, C.R.; Holmes, E.C.; Morens, D.M.; Park, E.C.; Burke, D.S.; Calisher, C.H.; Laughlin, C.A.; Saif, L.J.; Daszak, P. Cross-species virus transmission and the emergence of new epidemic diseases. Microbiol. Mol. Biol. Rev. 2008, 72, 457–470, doi:10.1128/MMBR.00004-08.
    17. Baker, K.S.; Todd, S.; Marsh, G.A.; Crameri, G.; Barr, J.; Kamins, A.O.; Peel, A.J.; Yu, M.; Hayman, D.T.; Nadjm, B.; et al. Novel, potentially zoonotic paramyxoviruses from the African straw-colored fruit bat Eidolon helvum. J. Virol. 2013, 87, 1348–1358, doi:10.1128/JVI.01202-12.
    18. Breitbart, M.; Salamon, P.; Andresen, B.; Mahaffy, J.M.; Segall, A.M.; Mead, D.; Azam, F.; Rohwer, F. Genomic analysis of uncultured marine viral communities. Proc. Natl. Acad. Sci. USA 2002, 99, 14250–14255, doi:10.1073/pnas.202488399.
    19. Djikeng, A.; Kuzmickas, R.; Anderson, N.G.; Spiro, D.J. Metagenomic analysis of RNA viruses in a fresh water lake. PLoS One 2009, 4, e7264, doi:10.1371/journal.pone.0007264.
    20. Ng, T.F.; Manire, C.; Borrowman, K.; Langer, T.; Ehrhart, L.; Breitbart, M. Discovery of a novel single-stranded DNA virus from a sea turtle fibropapilloma by using viral metagenomics. J. Virol. 2009, 83, 2500–2509, doi:10.1128/JVI.01946-08.
    21. Victoria, J.G.; Kapoor, A.; Dupuis, K.; Schnurr, D.P.; Delwart, E.L. Rapid identification of known and new RNA viruses from animal tissues. PLoS Pathog. 2008, 4, e1000163, doi:10.1371/journal.ppat.1000163.
    22. Donaldson, E.F.; Haskew, A.N.; Gates, J.E.; Huynh, J.; Moore, C.J.; Frieman, M.B. Metagenomic analysis of the viromes of three North American bat species: Viral diversity among different bat species that share a common habitat. J. Virol. 2010, 84, 13004–13018, doi:10.1128/JVI.01255-10.
    23. Li, L.; Victoria, J.G.; Wang, C.; Jones, M.; Fellers, G.M.; Kunz, T.H.; Delwart, E. Bat guano virome: Predominance of dietary viruses from insects and plants plus novel mammalian viruses. J. Virol. 2010, 84, 6955–6965, doi:10.1128/JVI.00501-10.
    24. Blinkova, O.; Kapoor, A.; Victoria, J.; Jones, M.; Wolfe, N.; Naeem, A.; Shaukat, S.; Sharif, S.; Alam, M.M.; Angez, M.; et al. Cardioviruses are genetically diverse and cause common enteric infections in South Asian children. J. Virol. 2009, 83, 4631–4641, doi:10.1128/JVI.02085-08.
    25. Breitbart, M.; Hewson, I.; Felts, B.; Mahaffy, J.M.; Nulton, J.; Salamon, P.; Rohwer, F. Metagenomic analyses of an uncultured viral community from human feces. J. Bacteriol. 2003, 185, 6220–6223, doi:10.1128/JB.185.20.6220-6223.2003.
    26. Ge, X.; Li, Y.; Yang, X.; Zhang, H.; Zhou, P.; Zhang, Y.; Shi, Z. Metagenomic analysis of viruses from bat fecal samples reveals many novel viruses in insectivorous bats in China. J. Virol. 2012, 86, 4620–4630, doi:10.1128/JVI.06671-11.
    27. Wu, Z.; Ren, X.; Yang, L.; Hu, Y.; Yang, J.; He, G.; Zhang, J.; Dong, J.; Sun, L.; Du, J.; et al. Virome analysis for identification of novel mammalian viruses in bat species from Chinese provinces. J. Virol. 2012, 86, 10999–11012.
    28. Baker, K.S.; Leggett, R.M.; Bexfield, N.H.; Alston, M.; Daly, G.; Todd, S.; Tachedjian, M.; Holmes, C.E.G.; Crameri, S.; Wang, L.-F. Metagenomic study of the viruses of African straw-colored fruit bats: Detection of a chiropteran poxvirus and isolation of a novel adenovirus. Virology 2013, 441, 95–106, doi:10.1016/j.virol.2013.03.014.
    29. He, B.; Li, Z.; Yang, F.; Zheng, J.; Feng, Y.; Guo, H.; Li, Y.; Wang, Y.; Su, N.; Zhang, F.; et al. Virome profiling of bats from myanmar by metagenomic analysis of tissue samples reveals more novel Mammalian viruses. PLoS One 2013, 8, e61950.
    30. Tong, S.; Chern, S.W.; Li, Y.; Pallansch, M.A.; Anderson, L.J. Sensitive and broadly reactive reverse transcription-PCR assays to detect novel paramyxoviruses. J. Clin. Microbiol. 2008, 46, 2652–2658, doi:10.1128/JCM.00192-08.
    31. Thompson, J.D.; Gibson, T.J.; Plewniak, F.; Jeanmougin, F.; Higgins, D.G. The CLUSTAL_X windows interface: Flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997, 25, 4876–4882, doi:10.1093/nar/25.24.4876.
    32. Posada, D.; Crandall, K.A. MODELTEST: Testing the model of DNA substitution. Bioinformatics 1998, 14, 817–818, doi:10.1093/bioinformatics/14.9.817.
    33. Huelsenbeck, J.P.; Ronquist, F. MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics 2001, 17, 754–755, doi:10.1093/bioinformatics/17.8.754.
    34. Grabherr, M.G.; Haas, B.J.; Yassour, M.; Levin, J.Z.; Thompson, D.A.; Amit, I.; Adiconis, X.; Fan, L.; Raychowdhury, R.; Zeng, Q. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 2011, 29, 644–652, doi:10.1038/nbt.1883.
    35. Tamura, K.; Peterson, D.; Peterson, N.; Stecher, G.; Nei, M.; Kumar, S. MEGA5, molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 2011, 28, 2731–2739, doi:10.1093/molbev/msr121.
    36. Hall, T.A. BioEdit: A user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp. Ser. 1999, 41, 95–98.
    37. Li, Z.; Yu, M.; Zhang, H.; Magoffin, D.E.; Jack, P.J.; Hyatt, A.; Wang, H.Y.; Wang, L.F. Beilong virus, a novel paramyxovirus with the largest genome of non-segmented negative-stranded RNA viruses. Virology 2006, 346, 219–228, doi:10.1016/j.virol.2005.10.039.
    38. Hayward, J.A.; Tachedjian, M.; Cui, J.; Field, H.; Holmes, E.C.; Wang, L.F.; Tachedjian, G. Identification of diverse full-length endogenous betaretroviruses in megabats and microbats. Retrovirology 2013, 10, doi:10.1186/1742-4690-10-35.
    39. Cui, J.; Tachedjian, M.; Wang, L.; Tachedjian, G.; Wang, L.-F.; Zhang, S. Discovery of retroviral homologs in bats: Implications for the origin of mammalian gammaretroviruses. J. Virol. 2012, 86, 4288–4293, doi:10.1128/JVI.06624-11.
    40. Cui, J.; Tachedjian, G.; Tachedjian, M.; Holmes, E.C.; Zhang, S.; Wang, L.-F. Identification of diverse groups of endogenous gammaretroviruses in mega-and microbats. J. Gen. Virol. 2012, 93, 2037–2045, doi:10.1099/vir.0.043760-0.
    41. Halpin, K.; Hyatt, A.D.; Plowright, R.K.; Epstein, J.H.; Daszak, P.; Field, H.E.; Wang, L.; Daniels, P.W. Emerging viruses: Coming in on a wrinkled wing and a prayer. Clin. Infect. Dis. 2007, 44, 711–717, doi:10.1086/511078.
    42. Li, Y.; Wang, J.; Hickey, A.C.; Zhang, Y.; Li, Y.; Wu, Y.; Zhang, H.; Yuan, J.; Han, Z.; McEachern, J.; et al. Antibodies to Nipah or Nipah-like viruses in bats, China. Emerg. Infect. Dis. 2008, 14, 1974–1976, doi:10.3201/eid1412.080359.
    43. Baker, K.S.; Todd, S.; Marsh, G.; Fernandez-Loras, A.; Suu-Ire, R.; Wood, J.L.N.; Wang, L.-F.; Murcia, P.R.; Cunningham, A.A. Co-circulation of diverse paramyxoviruses in an urban African fruit bat population. J. Gen. Virol. 2012, 93, 850–856, doi:10.1099/vir.0.039339-0.
    44. Streicker, D.G.; Turmelle, A.S.; Vonhof, M.J.; Kuzmin, I.V.; McCracken, G.F.; Rupprecht, C.E. Host phylogeny constrains cross-species emergence and establishment of rabies virus in bats. Science 2010, 329, 676–679, doi:10.1126/science.1188836.
    45. Chua, K.B.; Goh, K.J.; Wong, K.T.; Kamarulzaman, A.; Tan, P.S.; Ksiazek, T.G.; Zaki, S.R.; Paul, G.; Lam, S.K.; Tan, C.T. Fatal encephalitis due to Nipah virus among pig-farmers in Malaysia. Lancet 1999, 354, 1257–1259, doi:10.1016/S0140-6736(99)04299-3.
    46. Luby, S.P.; Hossain, M.J.; Gurley, E.S.; Ahmed, B.-N.; Banu, S.; Khan, S.U.; Homaira, N.; Rota, P.A.; Rollin, P.E.; Comer, J.A. Recurrent zoonotic transmission of Nipah virus into humans, Bangladesh, 2001–2007. Emerg. Infect. Dis. 2009, 15, doi:10.3201/eid1508.081237.
    47. Luby, S.P.; Rahman, M.; Hossain, M.J.; Blum, L.S.; Husain, M.M.; Gurley, E.; Khan, R.; Ahmed, B.-N.; Rahman, S.; Nahar, N. Foodborne transmission of Nipah virus, Bangladesh. Emerg. Infect. Dis. 2006, 12, doi:10.3201/eid1212.060732.
    48. Kunin, V.; Copeland, A.; Lapidus, A.; Mavromatis, K.; Hugenholtz, P. A bioinformatician’s guide to metagenomics. Microbiol. Mol. Biol. Rev. 2008, 72, 557–578, doi:10.1128/MMBR.00009-08.
    49. Afonso, C.L.; Tulman, E.R.; Lu, Z.; Zsak, L.; Kutish, G.F.; Rock, D.L. The genome of fowlpox virus. J. Virol. 2000, 74, 3815–3831, doi:10.1128/JVI.74.8.3815-3831.2000.
    50. Singh, P.; Schnitzlein, W.M.; Tripathy, D.N. Reticuloendotheliosis virus sequences within the genomes of field strains of fowlpox virus display variability. J. Virol. 2003, 77, 5855–5862, doi:10.1128/JVI.77.10.5855-5862.2003.
    51. Weiss, R.A. The discovery of endogenous retroviruses. Retrovirology 2006, 3, e67, doi:10.1186/1742-4690-3-67.
    52. Niewiadomska, A.M.; Gifford, R.J. The extraordinary evolutionary history of the reticuloendotheliosis viruses. PLoS Biol. 2013, 11, e1001642, doi:10.1371/journal.pbio.1001642.
    Viruses EISSN 1999-4915 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert