The Influence of Habitat on Viral Diversity in Neotropical Rodent Hosts

Rodents are important reservoirs of numerous viruses, some of which have significant impacts on public health. Ecosystem disturbances and decreased host species richness have been associated with the emergence of zoonotic diseases. In this study, we aimed at (a) characterizing the viral diversity in seven neotropical rodent species living in four types of habitats and (b) exploring how the extent of environmental disturbance influences this diversity. Through a metagenomic approach, we identified 77,767 viral sequences from spleen, kidney, and serum samples. These viral sequences were attributed to 27 viral families known to infect vertebrates, invertebrates, plants, and amoeba. Viral diversities were greater in pristine habitats compared with disturbed ones, and lowest in peri-urban areas. High viral richness was observed in savannah areas. Differences in these diversities were explained by rare viruses that were generally more frequent in pristine forest and savannah habitats. Moreover, changes in the ecology and behavior of rodent hosts, in a given habitat, such as modifications to the diet in disturbed vs. pristine forests, are major determinants of viral composition. Lastly, the phylogenetic relationships of four vertebrate-related viral families (Polyomaviridae, Flaviviridae, Togaviridae, and Phenuiviridae) highlighted the wide diversity of these viral families, and in some cases, a potential risk of transmission to humans. All these findings provide significant insights into the diversity of rodent viruses in Amazonia, and emphasize that habitats and the host’s dietary ecology may drive viral diversity. Linking viral richness and abundance to the ecology of their hosts and their responses to habitat disturbance could be the starting point for a better understanding of viral emergence and for future management of ecosystems.


Introduction
Viruses have conquered all living systems, infecting other microbes (bacteria, fungi, and parasites) and more complex organisms, such as plants, invertebrates, and vertebrates. Most viruses have small genomes with high mutation rates [1], giving them the ability to evolve and adapt quickly to new environments and potentially the ability to infect new hosts.
The development of metagenomic approaches applied to viruses (viromics) [2,3] has improved our knowledge of the extent of viral diversity and of the host spectra of several viral families. This is, for instance, the case for hepaciviruses, with both the descriptions of novel viral species in mammals and the evidence of infection in non-mammal species [4,5]. Viromic studies have also led to the discovery of new viral genotypes, helping us understand their evolutionary history [6] and providing new insights into the roles of viruses in Prior to processing, samples from the same species, the same organs, and the same environment were pooled (e.g., the sample k_Pguy_PF is a pool of kidneys from P. guyannensis collected in pristine forest), resulting in 36 different pools, i.e., 36 different viromes. Pools, at the organ level, included 2-31 individuals according to sample availability. Overall, 442 organs and sera from 187 individuals were included in this study (Table 1).
For serum samples, 50 µL from each collecting tube was used to constitute the pools. For kidney and spleen samples, 100 mg of organs was crushed in 400 µL of DMEM, and 200 µL of suspension from each collecting tube was used to constitute the pools. All pools were processed as previously described [41]: Pools were cleared of debris by low-speed centrifugation (5 min, 10,000× g, 4 • C). Eukaryotic and prokaryotic cell-sized particles were removed from supernatants through three successive filtrations (0.8, 0.45, and 0.22 µm), using cellulose acetate membrane filters (Nalgene). The filtrates were cleared of persistent high-density particles with low-speed centrifugation (15 min, 10,000× g, 4 • C); then, viral particles were pelleted with a 1-h ultracentrifugation step (100,000× g, 4 • C). All viral pellets were resuspended in 40 µL of nuclease-free water. Table 1. Sampling data of individuals by organ, species, and habitat. The composition of each pool, including organ, species, habitat, and number of individuals, is given. The last column corresponds to the total number of different individuals used to constitute each pool of each species in a given habitat. The last row corresponds to the totals of organs and individual samples for this study. All resuspended viral pellets were treated with a mixture of DNases (Turbo DNase from Ambion and Benzonase from Novagen) and RNase One (Promega) to digest nonenveloped nucleic acids (i.e., those not in viral capsids) [53]. All viral nucleic acids were then extracted using the NucliSENS easyMAG ®® bio-robot (bioMérieux).

Reverse Transcription and Amplification
For each pool, the RNA virus-only and DNA virus-only libraries were constructed using a whole transcriptome (WTA) or a whole genome (WGA) amplification method, respectively, as previously described [41].

Next-Generation Sequencing
For each pool, 1 µg from each library was pooled together, whenever possible, to construct RNA plus DNA viral libraries. High-throughput sequencing was carried out at the genomics center of the Institut Pasteur, Paris. Shotgun libraries were prepared by standard Illumina protocols using 1 µg of total genomic DNA. Each sample (sera, kidney, and spleen) was tagged according to its provenance (species, organs, and habitats) using Illumina adaptor-specific primers. High-throughput shotgun metagenomic sequencing was carried out in two different sessions. The first round was completed using an Illumina MiSeq platform with 300-bp paired-end reads (eight samples, see Table 2). The second round was performed with an Illumina HiSeq 2500 platform with 250-bp paired-end reads (28 samples, see Table 2).

Bioinformatic Analyses
Globally, after a first cleaning step, reads were assembled de novo (Step 1, Figure 1). Then, clean reads were mapped back to contigs in order to obtain the number of reads aligned to each contig (Step 2, Figure 1). The taxonomic assignment of contigs was achieved through BLASTn and BLASTx (Step 3, Figure 1). Finally, a matrix corresponding to the number of viral reads at the genus/subfamily level for each species-habitat was built for statistical analysis (Step 4, Figure 1).
All sequences were submitted to FaQCs [54] with an automated search for PhiX sequences, and quality filtering, after removal of adapters and poly-A tails ("-phiX yes -adapter yes -polyA yes"). The resulting clean sequence files were submitted to de novo assembly with MEGAHIT [55,56] using default parameters, which provide a set of k-mers from 21 to 141 with a step of 12 used in the assembly process, and the minimum contig length was set at 200 nucleotides (Step 1, Figure 1). Then cleaned reads were mapped back to the contigs using the BWA-MEM mapper [57] and Samtools [58] in order to obtain the number of reads for each contig (Step 2, Figure 1). Table 2. Sequencing and bioinformatic processing data by organ, species, and habitat. It shows the number of raw reads, the corresponding Illumina Platform, the percentage of reads kept after trimming and cleansing, the number of assembled contigs, the percentage of cleaned reads mapped back to contigs, and the number of viral contigs. We used BLAST [59,60] for the taxonomic assignment of contigs (Step 3, Figure 1). All contigs were submitted to DISCONTIGOUS MEGABLAST BLASTn (e-value ≤ 10 −1 ). Those without results in BLASTn were submitted to BLASTx. The BLASTx process comprised two steps: (1) using an in-house viral protein database, which was created by clustering (CD-HIT, 100% homology) the NCBI-nr database viral protein sequences (August 2018);

Sample ID
(2) the set of positive contigs was subsequently submitted to BLASTx (e-value ≤ 10 −1 ) against the whole NCBI-nr database. Both BLASTn and BLASTx results were filtered using in-house python scripts, which selected the best scoring match for each contig (max e-value of 10 −1 and coverage of, respectively, >50 nucleotides and 17 amino acids in length for BLASTn and BLASTx) (Filter 1). Taxonomies were deduced from the BLAST results and fragments were assigned to selected viruses matching Taxids against the full name lineages file from NCBI (f) [61]. These two data sets were filtered again (Filter 2) according to the e-values and coverage according to both BLASTn and BLASTx (e-value = 10 −5 ; coverage ≥ 250 nt or 83 amino acids in length) so as to consolidate the results. The remaining contigs were used for counting in the results. To provide a more accurate definition of the virus's taxonomic status, each viral taxonomic identification was associated with host types, such as bacteria, vertebrates, invertebrates, amoeba, and plants, by consulting the ICTV [62], ViralZone, and virus host database websites [63,64]. As the presence of bacteriophages is unexpected in compartments such as the kidney, spleen, and blood, and it is difficult to attribute such bacteriophages to bacterial infections, contigs assigned to bacteriophages were discarded from the data set. In addition, a manual  As the presence of bacteriophages is unexpected in compartments such as the kidney, spleen, and blood, and it is difficult to attribute such bacteriophages to bacterial infections, contigs assigned to bacteriophages were discarded from the data set. In addition, a manual inspection was carried out and viruses known to be amplified in the laboratory (Herpesviridae and Papillomaviridae) and endogenous virus (filovirus) were discarded from the data set. Finally, based on mapping data, viral taxa (at the genus/subfamily level) identified with ≤10 reads were discarded from the data set in order to avoid the spurious presence of contigs due to potential contamination.
From these data, a data set associating the number of contigs with viral families was created. The number of viral families (categorized according to host type) associated with each species and its habitats was plotted and a heatmap was developed (Rstudio, "pheatmap" library). The heatmap represents the number of contigs associated with each viral family by species-habitats. The viral genomes' completeness of assigned contigs was tested using CHECKV (version v0.7.0) and its associated database [65].
Lastly, a quantitative data set associating each contig with its number of reads was constructed with the number of reads associated with each taxonomic category at the subfamily or genus level.
The main steps are shown in solid-line boxes in green font characters, with the number in brackets representing the corresponding step in order of execution (de novo assembly, read mapping to contigs, and taxonomic assignment).
BLASTn and BLASTx, as sub-steps of taxonomic assignment, are shown in solid-line boxes with purple font characters. For each output, (+) and (−) stand for positive/kept and negative/discarded results, respectively, where they appear; taxonomic information acquisition and the filter-1 step are represented by red stars.
Subsequent filtering of contigs based on host (phages/human), endogenous status, and number of reads was carried out to obtain the final data set of both contigs and the corresponding reads matrix.
The taxonomic categories (Eukaryotes, Mammalia, Bacteria, Archaea, Viruses) to which contigs were assigned are shown as pie charts separately for BLASTn and BLASTx, and they were merged (BLASTn and x).

Statistical Analysis
To assess whether or not viral diversity is related to habitat type, diversity analyses were run for the four species present in at least two different habitats (pristine and disturbed forest for P. guyannensis, P. cuvieri, and H. megacephalus; disturbed forest, savannah, and peri-urban areas for Z. brevicauda) using the number of mapped reads on contigs assigned to each viral genus/subfamily (Supplemental data, Table S1).
As a prerequisite, before assessing local alpha diversities for the nine species-habitat combinations, we tested that (i) richness was not related to sequencing type using the Welch two-sample t-test); (ii) the number of viruses detected (genus/subfamily level) was not impacted by the number of individuals in each sequencing pool using a Pearson correlation test (R standard library) (Supplemental data, Table S2); (iii) each pool had been sufficiently sequenced to represent its diversity using rarefaction curves with the rarefy and rarecurve functions (Vegan R package) [66] with sampling at 231,725 reads (minimum number of viral reads across samples) and a step of 2000 reads (Supplemental data, Figure S1).
We then computed diversity indices for each of the nine species and habitat combinations and compared viral diversities, species by species, in their respective habitats. A synthesis between the most commonly accepted indices has been proposed [67] as a family of indices inspired in statistical physics [68], extending the link between diversity and entropy [69]. Based on α, related to Rényi's entropy, H α by N α = exp(H α ), three measures of diversity can be recovered: The total number of species (richness, α = 0), Shannon's entropy (α = 1), and the inverse of Simpson's dominance index (α = 2) [67,70]. The lower the α value is, the higher the weight given to rare species. The Hill α-diversity index values were generated with α in [0, 0.25, 0.5, 0.75, 1, 2], using the Vegan R package. A community A can be considered as more diverse than a community B if all Rényi's entropy values for A are higher than for B when α runs over a given range (here 0 < α < 2) [71]. Therefore, to enhance comparisons between samples, we plotted Rényi's entropy instead of Hill's diversity for a given value of α followed by comparisons according to the parameters above [71].
Furthermore, to highlight the importance of the abundance of viral taxa (i.e., the number of reads per viral genus/subfamily) and to explore how rare species have contributed to diversity according to the habitat, we recomputed richness by progressive deletion of taxa with the lowest number of reads (i.e., setting their counts to 0 if they were under a defined threshold T). Thereafter, we quantified the impacts of these deletions on richness by calculating, for each case, the difference between the value of richness without deletion (R 0 ,) and the richness value after suppression (R T, richness value after deletion under T threshold): The higher the difference (R 0 -R T ), the more numerous the rare species. The thresholds for deletion of a taxon were selected as fractions of the sample of the smallest size, i.e., fractions of 200,000 reads. We deleted all taxa with numbers of reads smaller than 0.01, 0.1, 1, 2, and 5% of this value, successively (i.e., 20,200,2000, 4000, 10,000) in all nine species-habitat combinations and calculated richness at each step.

Phylogenetic Analyses
Contigs assigned to four viral families (Polyomaviridae, Flaviviridae, Togaviridae, and Phenuiviridae) were selected from assembly files. Contigs were checked manually using the NCBI BLASTx web tool for the presence of stop codons. Then sequences presenting stop codons were deleted from further analysis. In order to infer phylogenetic relationships, we selected the closest sequences provided by BLAST during the taxonomic assignment process and some representatives of the family to be analyzed. In addition, for the Polyomaviridae and Flaviviridae analyses, sequences detected in rodents worldwide were also included. For the Alphavirus and Phlebovirus analysis, the data set including the closest sequences provided by BLAST was completed by sequences representative of the known antigenic complexes. All reference sequences were downloaded from the NCBI-nt database.
Accession numbers of viral sequences used to infer the phylogenetic trees are given in the respective phylogenetic reconstructions. After selecting the best-suited region for phylogenetic analyses, the Muscle algorithm [72] was used for multiple sequence alignments with default parameters. Pairwise sequence identity (at the nucleotide and amino acid levels) for each selected region were calculated using uncorrected p-distances. The best-fitted model of nucleotide or amino acid substitution for each analysis was selected using jModelTest 2 [73] and ProtTest 3 [74], respectively, under corrected Akaike information criteria (AICc). Bayesian phylogenetic analyses were performed using MrBayes 3.2 [75,76]. The Markov chain Monte Carlo (MCMC) algorithm was run with four chains with 2 million generations each, with trees sampled every 500 generations and a 25%burnin. Validation of the inference was assessed based on the standard deviation of split frequencies, less than the expected threshold value of 0.01 in MrBayes and by inspecting the effective sampling size (ESS > 500) criterion in Tracer version 1.6 [77].

Nucleotide Sequence Accession Numbers
All virus sequences reported in this study were deposited in the GenBank nucleotide database under accession numbers MT732099 to MT732117. The data from Illumina sequencing were deposited in the GenBank Sequence Reads Archive under accession numbers SAMN15496919 to SAMN15496954.

Illumina Sequencing and Bioinformatics Analyses
Overall, 453,865,988 paired-end raw reads (907,731,976 individual reads) were obtained that were 250-300 bp in length. After trimming by FaQcs, 96.62-99.56% of the reads were kept, totaling 894,996,464 reads.
These cleaned reads were assembled de novo for a total of 5,268,112 contigs using MEGAHIT ( Figure 1 and Supplemental data, Table S3 for assembly statistics). The number of contigs ranged from 25,215 to 347,957. The mean number was 146,336 contigs/sample ( Table 2). All these contigs were submitted to taxonomic assignment.
After megaBLASTn, 4,411,189 (83.73%) contigs were assigned to an organism, among which 50,431 were attributed to viruses. The remaining 856,923 (16.27%) unassigned contigs were first submitted to a BLASTx search against the in-house viral protein database (BLASTx1) (Figure 1). Only 274,991 contigs matched viral proteins, and they were all put through a second BLASTx step against the entire nr protein database (BLASTx2) (Figure 1). Overall, after this second BLASTx step (BLASTx2), 257,152 contigs were re-assigned to viruses, 3776 were assigned to other types of organisms, and 17,839 were not assigned at all (Figure 1).
To further avoid artifacts and false-positive results, the virus-assigned contigs were filtered at a coverage of ≥250 bp for BLASTn or ≥83 amino acids for BLASTx results and an e-value of ≤e−5 (Filter 2), resulting in 101,867 viral contigs, accounting for 1.93% of the total initial number of contigs and 15% of the assigned contigs ( Figure 1).

Viral Diversity Detected through Species and Environments
For the description of viral diversity, the results of viromes of different organs from a given species and in a given habitat were pooled together. The 77,767 viral contigs obtained were assigned to 27 families known to infect vertebrates, invertebrates, plants, and amoeba ( Figure 2 and Table S4). Viral family presence varied largely across species-habitat categories. Indeed, we observed a pattern of ubiquity for some vertebrate viruses, along with those of the Genomoviridae family (which are commonly found in vertebrates, invertebrates, and fungi). On the other hand, some viruses, especially plant and invertebrate viruses, seemed more specific to the few species-habitat categories that they were found in. The patterns of ubiquity/specificity across species-habitat categories seemed to follow the patterns of the hosts (vertebrates, invertebrates, plants).

Plant Viruses
Six plant-infecting viral families were detected from eight species-habitats, accounting for 49 contigs. Disturbed forest-originating samples from P. guyannensis, H. megacephalus, and Z. brevicauda, along with those from Z. brevicauda from savannah, contained no plantinfecting viruses. The most common plant-infecting viral family was the Tombusviridae family found in five species-habitats, followed by the Luteoviridae, Partitiviridae, and Phycodnaviridae families present in two species-habitats each. Lastly, Alphaflexiviridae and Caulimoviridae were all found in a single pool ( Figure 2 and Table S2).

Invertebrate Viruses
Five viral families of insect and invertebrate tropism were detected in six different species-habitats, totaling 106 contigs. The most common families were Polycipiviridae and Chuviridae present in four and two species-habitats, respectively. The remaining families (Iflaviridae, Nudiviridae, Picornaviridae) were detected in a single species-habitat ( Figure 2 and Table S4). P. cuvieri from pristine forest; P. guyannensis, O. auyantepui, H. yunganus, and H. megacephalus, all from disturbed forest; and Z. brevicauda from peri-urban habitat contained no invertebrate viruses. Viruses 2021, 13, x FOR PEER REVIEW 11 of 31

Vertebrate Viruses
A total of 11 viral families strictly associated with vertebrates were detected, accounting for 77,438 contigs. ssDNA virus families such as Anelloviridae, Circoviridae, Parvoviridae, and Polyomaviridae were detected in 12, nine, eight, and five species-habitats, respectively. Adenoviridae (dsDNA virus)-assigned contigs were found only in P. cuvieri from disturbed forest. RNA viruses (Riboviria) accounted for six families. Several positive-sense RNA viral families were also detected: Astroviridae, Arteriviridae, Flaviviridae (Hepacivirus), and Matonaviridae. Matonaviridae-attributed sequences were found in only one species-habitat (P. cuvieri in pristine forest), just as Arteriviridae sequences were detected only in P. guyannensis from disturbed forest. By contrast, Flaviviridae (Hepacivirus) and Astroviridae had a greater presence across species-habitats, respectively, 12 of 12 and nine of 12 specieshabitats. An Arenaviridae, an assigned sequence close to "Patawa virus" [46], was detected in O. auyantepui from disturbed forest ( Figure 2 and Table S4).

Potential Vector-Borne Viruses
Viral genera such as Alphavirus (Togaviridae) and Phlebovirus (Phenuiviridae), and the family Rhabdoviridae, are recognized as potential vector-borne viruses, since they infect both vertebrate and invertebrate hosts, were detected in one species-habitat. All contigs (19) attributed to vector-borne viruses were detected in P. cuvieri from disturbed forest ( Figure 2 and Table S4).

Sampling Effort on Viral Diversity
The Pearson correlation between the number of individuals per sample in a pool and the number of viral genera detected showed no significant association (R 2 = −0.07, p = 0.694). The sequencing type had no impact on the viral diversity detected (two-sided unpaired Student's t-test, t = −1.76, p = 0.105).
The rarefaction curves on the number of reads per sample, established for the nine species-habitat combinations, reached their asymptotes or started to plateau, suggesting that saturation was almost achieved if not in viral sequencing ( Figure S1, Supplementary data). Hence, the sampling effort for our data set, both regarding the number of rodent individuals and the number of reads per sample, was adequate for diversity comparisons.

Diversity across Species-Habitats
Viral richness oscillated between 10 for Zygodontomys in peri-urban habitats and 21 for H. megacephalus in disturbed forests (Table 3). On the other hand, Rényi's entropy tended to converge between species and habitats for α = 2 ( Figure 3). Most of the differentiation was between 0.5 < α < 1.0. Regarding the habitats of a given species, we observed that for P. guyannensis the diversity in pristine forest was higher than in disturbed forest ( Figure 3). For Z. brevicauda, the diversity was higher in savannahs, followed by disturbed forests, and was lowest in peri-urban habitats. For H. megacephalus and P. cuvieri, the diversity was higher in disturbed forests for α < 0.25 and α < 0.75, respectively. For these species, diversity was higher in pristine forests for α > 0.25 and α > 0.75 ( Figure 3). Thus, for two of the four rodent species trapped in two or more habitats, the diversity trend associated with different habitats was preserved over all α values between 0 (genera richness) and 2 (Simpson index).
The richness loss, calculated for the nine species-habitat combinations with different removal thresholds of rare genera, is shown in Figure 4. P. guyannensis and P. cuvieri showed, at all threshold values, greater richness loss in pristine forest compared with disturbed forest, revealing that viromes from pristine habitats possessed higher numbers of rare viral entities (at the genus and subfamily levels according to their taxonomic classification) and fewer dominant ones than disturbed environments (Figure 4). Z. brevicauda showed higher richness loss in savannah compared with disturbed forest and peri-urban areas, showing that rare viral entities are more frequent in savannah than in the two other types of habitats. On the other hand, greater richness loss was observed for H. megacephalus in disturbed forest compared with pristine forest, highlighting the importance of rare entities in disturbed environments for this species (Figure 4). We observed that the hierarchy of richness loss between habitats was similar over all threshold values for each rodent species. Some of the differences in viral diversity between habitats for a given species are predominantly due to rare viral entities, rather than fractions of abundant ones.

Phylogenetic Relationships of Selected Viruses
For phylogenetic analyses, we chose viral families for their frequent presence in the samples (Polyomaviridae and Flaviviridae-Hepacivirus) and their interest as potential EID agents because they are arthropod-borne (Phlebovirus and Alphavirus). associated with different habitats was preserved over all α values between 0 (genera richness) and 2 (Simpson index).    entities in disturbed environments for this species (Figure 4). We observed that the hierarchy of richness loss between habitats was similar over all threshold values for each rodent species. Some of the differences in viral diversity between habitats for a given species are predominantly due to rare viral entities, rather than fractions of abundant ones.

Phylogenetic Relationships of Selected Viruses
For phylogenetic analyses, we chose viral families for their frequent presence in the samples (Polyomaviridae and Flaviviridae-Hepacivirus) and their interest as potential EID agents because they are arthropod-borne (Phlebovirus and Alphavirus).

Rodent Polyomaviruses
The Polyomaviridae family is composed of four genera: Alphapolyomavirus, Betapolyomavirus, Gammapolyomavirus, and Deltapolyomavirus [78]. Each genome is composed of a circular dsDNA of approximately 5 kb. Polyomaviruses (PyVs) may be transmitted through either direct contact or by aerial or fecal-oral routes.
Overall, 54 contigs were assigned to the Polyomaviridae family and detected in four species: O. bicolor, Z. brevicauda, H. megacephalus, and P. guyannensis (Table S4). In kidney samples, one contig was detected in O. bicolor (disturbed forest); six were detected in Z. brevicauda (three from savannah and three from disturbed forest); 139 were detected in H. megacephalus (disturbed forest); and five were detected in P. guyannensis (disturbed forest). In spleen samples, two contigs were identified in O. bicolor (disturbed forest), two in Z. brevicauda (savannah and disturbed forest), and one in H. megacephalus (disturbed forest).
After alignment, the longest common sequences identified from the four species cov-

Rodent Polyomaviruses
The Polyomaviridae family is composed of four genera: Alphapolyomavirus, Betapolyomavirus, Gammapolyomavirus, and Deltapolyomavirus [78]. Each genome is composed of a circular dsDNA of approximately 5 kb. Polyomaviruses (PyVs) may be transmitted through either direct contact or by aerial or fecal-oral routes.
Overall, 54 contigs were assigned to the Polyomaviridae family and detected in four species: O. bicolor, Z. brevicauda, H. megacephalus, and P. guyannensis (Table S4). In kidney samples, one contig was detected in O. bicolor (disturbed forest); six were detected in Z. brevicauda (three from savannah and three from disturbed forest); 139 were detected in H. megacephalus (disturbed forest); and five were detected in P. guyannensis (disturbed forest). In spleen samples, two contigs were identified in O. bicolor (disturbed forest), two in Z. brevicauda (savannah and disturbed forest), and one in H. megacephalus (disturbed forest).
The phylogenetic analysis carried out on 166 amino acid-long sequences of the LTAg identified the two monophyletic clades corresponding to the Alphapolyomavirus (posterior probability = 1) and Gammapolyomavirus (posterior probability = 1) genera, whereas Betapolyomavirus genus was not supported. ObicPyV-1, ZbrePyV-1, and HmegPyV-1 polyomavirus sequences were clustered with polyomavirus sequences derived from Sciuridae (Sciurus corolinensis) and Gliridae (Glis glis and Callosciurus prevostii), a group of sequences that belongs to the Betapolyomavirus genus. These sequences are also associated with Miniopterus schreibersii polyomavirus 3 (posterior probability = 0.99). Among alphapolyomaviruses, PguyPyV-1 was not related to any other PyVs from rodents. It possessed a basal position of a clade consisting of viruses originating from primates and bats (posterior probability = 0.99) and to a lesser extent to PyVs from Artiodactyla and Scandentia ( Figure 5).

Rodent Hepaciviruses
Hepaciviruses (HVs) constitute a genus that belongs to the ssRNA+ family of Flaviviridae. Their genome is approximately 10 kb long and encodes a single ORF translated

Rodent Hepaciviruses
Hepaciviruses (HVs) constitute a genus that belongs to the ssRNA+ family of Flaviviridae. Their genome is approximately 10 kb long and encodes a single ORF translated into a polyprotein, which is processed by viral and cellular proteases, giving mature proteins. According to Smith and colleagues [79], there are 14 HV species in mammals (HV-A to HV-N), among which six are hosted by rodents (HV-E to HV-J). The main mode of transmission of HVs is vertical, but the fecal route for cross-species transmissions has been suggested.
A total of 16,686 contigs were assigned to HVs (Table S4). These sequences were found in 18 of 36 pools (seven sera, six kidneys, and five spleens) and in all species included in the study. After re-examination of contigs and progressive multiple alignments, an NS5B (i.e., RDRP) fragment of 384 nucleotides was chosen for subsequent phylogenetic analyses. This region included the main known mammal HVs and rodent HVs (RHVs) identified in five of the seven species analyzed (i.e., P. cuvieri with four contigs, P. guyannensis with one contig, Z. brevicauda with two contigs, H. megacephalus with one contig, H. yunganus with three contigs). We attributed arbitrary names by contracting rodent species names with "HV" for Hepacivirus with an incremental number to differentiate sequences from the same host species.
Three groups (A, B, C) of sequences regrouping rodent HVs were identified, highly supported with posterior probabilities of 1 ( Figure 6). Group A was subdivided into three subgroups. The first one comprised seven sequences identified here (PguyHV1, PcuvHV1-3, ZbreHV1-2, HyunHV1) and the previously identified RHV characterized in P. semispinosus. This subgroup was supported with a posterior probability of 1 and represented a group comprising Cricetidae and Echimyidae. It was related, however, with low support to a second subgroup composed of HV sequences identified in Dipus sagitta, Peromyscus maniculatus, and Rattus norvegicus, with these two last species hosting RHVs species E, G, and H. The third subgroup was composed two RHV sequences (Neodon clarkei HV and Myodes glareolus RHV-F). Group B contained the four remaining sequences from our samples (PcuvHV4, HmegHV1, HyunHV2-3), along with Oligoryzomys, Meriones, and Rhabdomys RHVs. PcuvHV4, HmegHV1, and HyunHV3 were grouped together (posterior probability = 1). The HyunHV2 sequence was clustered with the RHV sequence of Oligoryzomys nigripes (posterior probability = 1). This last group is related to RHV-I species hosted by Rhabdomys pumilio and to a lesser extent to RHV hosted by Meriones meridianus. Group C was composed of three RHVs sequences among which was the RHV-J species identified in Myodes glareolus ( Figure 6). All these groups did not seem to follow a co-evolution model with their rodent hosts since HVs identified in different rodent families were found independently in all groups. In addition, for a given rodent species, diverse and distantly related HV sequences were identified.
A total of 14 contigs were identified, all from P. cuvieri sera from disturbed forest. They covered 7682 nucleotides and both the structural and non-structural parts of the genome. We used the structural part of the genome (1989 nucleotides) for phylogenetic analyses including representatives of the main antigenic complexes (VEE, WEE, Semliki Forest, etc.). P. cuvieri VEE showed the highest levels of nucleotide and amino acid identity (94.83% and 98.69%, respectively) with the VEE strain Cabassou, a member of the antigenic complex of the same name (supplemental data, Table S7).
The phylogenetic tree showed that P. cuvieri VEE clustered together with VEE Cabassou (posterior probability = 1), forming a clade with a basal position of the VEE complex. The clade was highly supported (posterior probability = 1) (Figure 7). The other major alphavirus complexes, EEE and WEE, were also supported with posterior probability values of 1 (Figure 7).

Alphaviruses (Togaviridae)
The Togaviridae family is composed of ssRNA+ viruses. Their genome, 10-12 kb in size, is composed of nonstructural and structural parts. The Togaviridae family has recently become a monogenus family exclusively composed of alphaviruses since its former sibling genus Rubivirus was removed to its own family (Matonaviridae). Alphaviruses are classified as antigenic complexes, such as VEE (Venezuelan equine encephalitis), EEE (Eastern equine encephalitis), WEE (Western equine encephalitis), etc. Alphaviruses are mainly transmitted by mosquitoes.
A total of 14 contigs were identified, all from P. cuvieri sera from disturbed forest. They covered 7682 nucleotides and both the structural and non-structural parts of the genome. We used the structural part of the genome (1989 nucleotides) for phylogenetic analyses including representatives of the main antigenic complexes (VEE, WEE, Semliki Forest, etc.). P. cuvieri VEE showed the highest levels of nucleotide and amino acid identity (94.83% and 98.69%, respectively) with the VEE strain Cabassou, a member of the antigenic complex of the same name (Supplemental data, Table S7).
The phylogenetic tree showed that P. cuvieri VEE clustered together with VEE Cabassou (posterior probability = 1), forming a clade with a basal position of the VEE complex. The clade was highly supported (posterior probability = 1) (Figure 7). The other major alphavirus complexes, EEE and WEE, were also supported with posterior probability values of 1 (Figure 7).

Rodent Phleboviruses
Phleboviruses are members of the Bunyavirales order and belong to the Phenuiviridae family. They possess a segmented (three segments: small, medium, large) negative-sense ssRNA genome. Phleboviruses are arthropod-borne viruses frequently hosted by phlebotomin species (sandflies), but also by mosquitoes, ticks, and culicoides from which they are transmitted to humans and other vertebrates. Currently, 66 species have been recognized by the ICTV [80] and other species remain to be described. Phleboviruses have been classified in two antigenic groups including ten different species complexes [81,82].
Phlebovirus-attributed sequences were found in P. cuvieri samples from disturbed forest only, one from sera and three from spleen samples. The phylogenetic analysis was based on a 179-aa segment of the nucleocapsid. The P cuvieri phlebovirus sequence showed 79.80% and 95.78% nucleotide and amino acid identity, respectively, with Bujaru phlebovirus, which was identified from a P. guyannensis rodent in Brazil (supplemental data, Table S8).

Rodent Phleboviruses
Phleboviruses are members of the Bunyavirales order and belong to the Phenuiviridae family. They possess a segmented (three segments: small, medium, large) negative-sense ssRNA genome. Phleboviruses are arthropod-borne viruses frequently hosted by phlebotomin species (sandflies), but also by mosquitoes, ticks, and culicoides from which they are transmitted to humans and other vertebrates. Currently, 66 species have been recognized by the ICTV [80] and other species remain to be described. Phleboviruses have been classified in two antigenic groups including ten different species complexes [81,82].
Phlebovirus-attributed sequences were found in P. cuvieri samples from disturbed forest only, one from sera and three from spleen samples. The phylogenetic analysis was based on a 179-aa segment of the nucleocapsid. The P cuvieri phlebovirus sequence showed 79.80% and 95.78% nucleotide and amino acid identity, respectively, with Bujaru phlebovirus, which was identified from a P. guyannensis rodent in Brazil (Supplemental data, Table S8).
P. cuvieri phlebovirus were clustered phylogenetically with Bujaru virus with high support (posterior probability = 1). These Proechimys-originating sequences were grouped under the Bujaru serogroup with Munguba and Peña Blanca, both isolated from sandflies. The ancestral node of these four viruses was supported with a high posterior probability (pp = 1) (Figure 8). Nevertheless, phylogenetic relationships between all species-complexes remained unresolved with low support observed for basal nodes.
Viruses 2021, 13, x FOR PEER REVIEW 20 of 31 P. cuvieri phlebovirus were clustered phylogenetically with Bujaru virus with high support (posterior probability = 1). These Proechimys-originating sequences were grouped under the Bujaru serogroup with Munguba and Peña Blanca, both isolated from sandflies. The ancestral node of these four viruses was supported with a high posterior probability (pp = 1) (Figure 8). Nevertheless, phylogenetic relationships between all species-complexes remained unresolved with low support observed for basal nodes.  WAG model. Sequence identifiers include the NCBI accession number and the isolate name. Posterior probabilities of the Bayesian analysis (>70%) are shown next to the nodes. The scale bar indicates amino acid substitutions per site. The sequence of P. cuvieri phlebovirus is indicated in red. The species-complexes (established by ICTV or suggested in [81,82]) are indicated with colored bars (if many members are present) and dots (if one member is represented). Names of established speciescomplexes are indicated in bold and italics. Animal pictures were downloaded from phylopic web site (http://phylopic.org/) [83].

Discussion
Over the past decade, virome studies exploring the roles of wild species as reservoirs of infectious diseases have become more common thanks to the technological breakthrough of high-throughput sequencing. Considering that some species are reservoirs of numerous viruses, some of which have large impacts on human health, studies on viral diversity in rodents have recently increased [3,9,[84][85][86]. Hence, 173 viral species belonging to more than 65 genera have been described in rodents to date, among which 53 are zoonotic, such as mammarenaviruses and hantaviruses [30,87]. However, few studies have explored the links among viral diversity, host ecology, and habitats [8,88,89]. Here, we presented the viral diversity identified in three different organs of seven rodent species from French Guiana, according to their natural hosts and habitats, and further explored the phylogenetic relationships of several viruses of interest for human health.
In order to ascertain the viral infection status in natural reservoirs and to identify a large number of vertebrate-related viruses, we chose to study three types of organs representing different tropisms of viruses. The kidney is the target organ of viruses that use the urinary tract to disseminate, such as hantavirus, arenaviruses, and paramyxoviruses, whereas viruses such as dengue or West-Nile have been detected in the spleen, a blood reservoir. Finally, serum is one of the most important media for the transmission of arboviruses. Furthermore, the analysis of such organ samples also limits the potential errors in the taxonomic assignment of new viruses compared to those that may be detected in respiratory or fecal samples. The latter viruses could indeed be from environment plants, insects, or fungi, and only incidentally found in rodents. Together, the use of organs should give a good representation of vertebrate viruses hosted by rodents [84].
Overall, this study identified 77,767 viral-associated contigs distributed within 27 viral families known to infect vertebrates, invertebrates, plants, and amoeba. The viromes were quantitatively dominated by vertebrate viral sequences (>99% of both contigs and reads were assigned to 11 viral families known to strictly infect vertebrates) and to a lesser extent to viral sequences from invertebrates, plants, and amoeba. Nevertheless, the smaller number of invertebrate and plant virus sequences indicates non-negligible diversity, accounting for 12 families.
The different viral families, whether originating from invertebrates, plants, or vertebrates, were not evenly distributed within the different species and habitats. Viruses from Parvoviridae, Circoviridae, Astroviridae, and Anelloviridae from vertebrates were found in most species and habitats and can be considered as generalists. These ubiquitous viruses were already reported in wild rodents in the United States where the Circoviridae family was the most abundant among the 24 families described [86], and in wild brown rats in Germany, with viruses of the Parvoviridae family [90]. On the other hand, viruses belonging to the Caulimoviridae (from plants), Iflaviridae (from invertebrates), or Arteriviridae (from vertebrates) families were rare and only present in some species and/or habitats. These differences in the distribution of viral families can be put in perspective by hypothesizing a rare biosphere for microbial communities in oceanic waters [91], with a portion of a few dominant microbial species and a second large, unexplored fraction with rare species. Accordingly, viromes in rodents could be dominated by a few dominant families, and a long distribution tail shaping a rare virosphere. Such differences in virus abundance could be related to the ecology of the viruses (i.e., their ability to infect host cells and to persist and replicate) and to the ecology and behavior of their rodent hosts in a given habitat, such as a modified diet in a disturbed environment. The role of vectors in viral transmission and their diversity according to the environment can also have an impact on viral diversity. Indeed, for P. cuvieri and H. megacephalus, fourfold more viral families of invertebrate and vertebrate viruses have been detected in disturbed forest compared with pristine forest. In these two opportunistic species, diet can be supplemented by invertebrates when fruits and seeds are lacking [92], with subsequent impacts on their virome structures. On the other hand, a more specialized diet should restrict the range of viral diversity. Similar virome compositions were previously observed in house mice [3] and brown rats [85] in New York City, suggestive of an adaptive diet. Viral diversity indices and the relative dominance levels of viral species were also impacted by the level of disturbance and the type of habitat. In this study, the highest viral diversity index values were mainly observed in pristine habitats where the highest diversity of hosts was also recorded. The viromes of P. guyannensis and H. megacephalus in pristine forest showed the highest diversities (mainly driven by viruses originating from plants) compared with their counterparts from disturbed forest. This trend was nevertheless not found in P. cuvieri, for which viral diversities were comparable between habitats (pristine vs. disturbed forest), but a higher number of rare viral entities were in pristine forest. In contrast, H. megacephalus presented a high number of rare viruses in disturbed forests. Z. brevicauda, the only species also sampled in the savannah, showed the highest viral diversities in this habitat, also reflecting the richness of the savannah ecosystem [93,94]. Peri-urban areas had the lowest viral diversity, which may be related to overall low biodiversity.
Among vertebrate hosts, rodents have been described as major reservoirs of arboviruses such as Togaviridae, Flaviviridae, and Bunyaviridae [95], and can serve as amplifiers of viruses that can be transmitted to humans. For instance, Cabassou virus (genus Alphavirus) was detected in P. cuvieri in disturbed forest. The circulation of arboviruses in disturbed habitats could be the result of increased contacts with vectors and may also reflect the lowest diversity of hosts available for arthropods to feed on.
The likelihood of disease emergence is indeed commonly accepted to increase in disturbed habitats [96]. The transmission of viruses from forest species to humans may result from two mechanisms. First, anthropic activities can increase contact between wildlife and humans and thereby the risk of infection [36] when humans enter slightly modified habitats and come into contact with a pristine viral cycle. Secondly, in more degraded forests, environmental changes may disrupt some ecological barriers and impact the structure and dynamics of rodent and arthropod communities, species richness, and ecological functions [97]. This may favor generalist over specialist species and ultimately the dominance of more synanthropic ones. Feeding networks between hosts and hematophagous vectors consequently change, influencing the transmission of viruses and potentially increasing cross-species transmission events.
From a theoretical point of view, the dilution effect hypothesis explores how the decrease of biodiversity may increase the amplification of zoonotic diseases. Briefly, the dilution effect proposes that a high diversity of putative hosts and vector species dilutes the more efficient carriers and amplifiers of viruses in a community of less efficient species, consequently reducing the circulation of the harmful ones and lowering the likelihood of infection [98]. The dilution effect may affect cycles involving a single animal host (i.e., reservoir) and those with two host compartments, i.e., reservoirs and vectors. In the latter case, a decrease in vertebrate diversity may concentrate blood meals taken by arthropods on a lower number of species, resulting in a higher viral circulation as soon as those resilient vertebrate species are also efficient carriers. The dilution effect can be suggested to illustrate the links between the diversity of rodent hosts and the spread of some zoonotic viruses. A higher probability of hepacivirus infection in P. semispinosus has been related to a loss of diversity in hosts due to land-use change [99]. Additionally, hantavirus outbreaks in the Americas are related to environmental disturbances that result in a decrease in specific richness of non-murine rodents and in the dominance of a few Muridae species known to be more efficient reservoirs [28,100]. In French Guiana, all known human hantavirus cases occurred in agricultural and peri-urban areas, where rodent diversity is much lower than in forest habitats [33], likely favoring hantavirus circulation in most efficient reservoirs.
In this study, 14 viral families from Rodentia were detected of the 31 currently described. We established the phylogenetic relationships of viral sequences related to four viral families known to infect vertebrates including arthropod-borne viruses (Polyomaviridae, Flaviviridae, Togaviridae, and Phenuiviridae). Even if the sequences obtained were incomplete, their analysis added knowledge on viral evolution among rodent species in South America, a group of species with very few data available to date.
Most viral sequences were related to sequences previously detected in rodents from multiple geographic areas (Africa, Asia, and North and South America), suggesting common evolutionary processes. Nevertheless, cross-species transmission and spill-over events were also detected, emphasizing the importance of these mechanisms in their evolution. These events took place early during the evolution of mammals or could be linked to recent interactions between sympatric species, as suggested for polyomaviruses. Indeed, PyVs have been described in a wide range of hosts, including mammals, birds, amphibians, reptiles, fish, and invertebrates. Some PyVs are pathogenic for humans and animals [78]. In rodents, 45 PyVs have been described from 11 species originating from Europe, Asia, and Africa [30]. In South America, betapolyomaviruses were recently described in two Sigmodontinae (Cricetidae) species (Akodon montensis and Calomys tener), and an alphapolyomavirus in a Myocastoridae species (Myocastor coypus) [101,102]. We identified four PyV sequences in four species (Figure 4). The three Sigmodontinae PyV sequences (ObicPyV1, ZbrePyV1, and HmegPyV1) belong to the Betapolyomavirus genus, and the Echimyidae PyV sequence (PguyPyV1) to Alphapolyomavirus. The sequence identified in P. guyannensis did not cluster with the other alphaPyVs sequences from rodents and showed a basal position to a clade constituted of PyVs detected in primates and Chiroptera, suggesting duplication events [103]. Polyomaviruses were initially considered to be host-specific, with codivergence and lineage duplication being the main drivers of their diversification [6,[104][105][106]. PyV cross-species transmissions were also identified, but they do not seem to play a major role in the diversification processes of PyVs [105][106][107]. In the present study, a duplication event can be suggested to explain the position of PguyPyV1, and host-switching events could explain the closeness of PyVs detected in the Sigmodontinae subfamily. Indeed, the high PyV sequence identity values observed between O. bicolor, Z. brevicauda, and H. megacephalus may reflect a geographical signature related to the sympatry of these three taxonomically related species, favoring PyV host-switching events [101,106]. Further work is needed to confirm this hypothesis so as to ascertain whether the novel PyVs detected in the present study show evidence of host-switching in the Sigmodontinae subfamily, and whether PyV host-switching is more common in rodents than in other mammalian orders.
We also identified a large number of hepacivirus sequences (HVs) in the seven species, suggesting a high prevalence of HVs in neotropical rodents. The hepacivirus genus prototype is the human-infecting hepatitis C virus (HCV). After its identification at the end of the 1980s, HCV remained, along with GB-virus B (GBV-B), the only known HV for years. Homologues were then described within a wide range of hosts, such as horses, bats, rodents, cows, dogs, and even sharks, thanks to high-throughput sequencing and extensive investigations [4,9,[108][109][110][111][112][113]. Like polyomaviruses, HVs are considered to co-evolve with their hosts [4]. Nevertheless, their descriptions in a wide range of species, and more particularly in primates and rodents (RHVs), have demonstrated that they do not fully follow a co-speciation pattern [114]. The phylogenetic analysis of HVs, including those detected in this study, revealed the presence of genetically distinct RHVs in P. cuvieri, H. yunganus, and Z. brevicauda, and of two distantly related RHVs in P. guyannensis and H. megacephalus ( Figure 5). All RHV clades did not seem to co-evolve with their hosts, since RHVs identified in Muridae, Dipodidae, Echimyidae, and Cricetidae rodents were independently found in different clades. In addition, the identification of different RHVs in a single rodent species demonstrated the high level of genetic diversity among RHV species, reinforcing the idea that strict co-evolution is unlikely [114]. This high diversity reflects the idea that host shifts seem to be the main driver of RHV evolution [4,114,115]. The same evolutionary pattern was already suggested for RHVs from rodents in China [9]. Thus, the evolutionary history of the Hepacivirus genus remains to be deciphered along with the role of their rodent hosts as zoonotic transmitters, given their basal phylogenetic position in relation to other mammalian hepaciviruses [4,85,116].
Alphaviruses are arboviruses infecting a large number of vertebrates [5]. They are maintained in enzootic cycles involving arthropods as vectors and small mammals and/or birds as amplifier hosts. Occasionally, spill-overs into humans and domesticated animals lead to disease [117]. Among alphaviruses, VEEV have caused epizootics and epidemics in South America and the southern United States in the past few decades [118][119][120]. The VEEV complex is divided into six subtypes, some of which are known to cause diseases in humans and horses while others are enzootic, but can potentially be transmitted to humans [121][122][123]. In French Guiana, two viruses belonging to the VEEV complex were previously reported: Cabassou virus (CABV) isolated from Culex portesi, belonging to the subtype V, and Tonate virus (TONV), isolated from a bird and associated with the subtype IIIB [117,124]. Only a few cases of TONV infection were detected in French Guiana, Suriname, and North America, and most patients showed febrile illness. However, severe cases of encephalitis have been described [124,125]. To date, no human case of CABV infection has been reported.
Here, we identified sequences related to CABV (98.69% amino acid identity) in P. cuvieri samples in disturbed environments. Given that rodents belonging to the genera Proechimys, Sigmodon, Oligoryzomys, and Oryzomys have already been described as the main reservoirs for enzootic VEEV strains, CABV may circulate between mosquitoes and rodents, such as Culex (melanoconion) portesi (where it was previously isolated) and Proechimys [31,117,122,126,127]. VEEV strains are important candidates for future emergence in South America given that their reservoir hosts and vectors are facing an increasing number of anthropogenic disturbances. The potential circulation of CABV in the human population should be further investigated because its symptoms resemble those of dengue fever and it could therefore, be mis-or undiagnosed [128,129].
The Phenuiviridae family comprises 19 genera, including the Phlebovirus genus, which currently has ten recognized species [80] distributed worldwide. Among them, 14 phleboviruses were identified in rodents and 21 remain to be classified [30]. In South America, four species have been detected in rodents: Icoaraci, Itaporanga, Jacunda (Candiru complex), and Bujaru. The phylogenetic analysis of the sequence identified in P. cuvieri suggests that it belongs to the Bujaru complex composed of Munguba, Peña blanca, and Bujaru viruses [82]. Previous studies have already identified phleboviruses belonging to the Bujaru complex in P. guyannensis in Brazil and in sandflies (Phlebotominae spp.) [82,130], suggesting that viruses of the complex circulate in wild fauna and potentially in humans. Nevertheless, the pathogenicity of viruses of the Bujaru complex in humans is not known since most of the human cases described to date were due to viruses of the Candiru species complex. As for CABV, the fact that no human case has been reported for Bujaru viruses can be related to under-diagnosis given the similarity of the associated symptoms with other arboviral diseases [131,132]. Further studies are needed to investigate the ability of phlebotomine to spread the virus from their natural hosts and to clarify the real impact of phlebovirus infections on human health.

Conclusions
Only a few studies on the viral diversity in rodents have been conducted, even though they comprise the first order of mammals in terms of the number of species and are considered an important source of viral zoonotic pathogens. In addition, while Amazonia is considered a hotspot of diversity for hosts and pathogens [133], most virome studies have been conducted in Asia and North America. In French Guiana, north of the Amazonian region, the description of the virome of seven neotropical rodent species allowed us to identify a large number of new viruses, most of which correspond to vertebrate viruses. These findings extend our knowledge on the host range and evolution of these viruses. We identified previously known viruses belonging to Togaviridae and Phenuiviridae in the spiny rat P. cuvieri, highlighting its role in the maintenance and circulation of these arthropodborne viruses in disturbed areas. Further research is needed to better understand the transmission cycles and the ecology of the hosts and vectors involved. In addition, we showed that the diversity of rodent viromes varies according to the types of habitat, with higher viral diversity in pristine forests compared with disturbed forests for most rodent species. As well as the environment, the significance of species characteristics (including distribution, ecology, demography, and phylogenetic relationships), the importance of host switch throughout virus evolution, and the potential for local cross-species transmission should be studied to gain a better understanding of how viral diversity is shaped.
Environmental pressures on wild animal populations continue to grow, leading to increasing risks of contact between human and rodent populations. This could favor the emergence or re-emergence of viral diseases, including from viruses yet unknown or with undocumented roles on human health.

Supplementary Materials:
The following are available online at https://www.mdpi.com/article/ 10.3390/v13091690/s1, Figure S1: Rarefaction curves for the four species present in different types of habitats (nine species-habitat combinations), Table S1: Number of viral read of each viral genus/subfamily according rodent species and habitats, Table S2: Sampling variables and viral richness for the nine species-habitat at organ level (Organ level pool name, Number of rodents individuals, Number of viral genera/subfamilies detected, Sequencing platform, Organ type), Table S3: Assembly statistics of each sample by organ, species-and habitat, Table S4: Number of viral contigs by rodent species/habitats and viral families, Tables S5-S8: Pairwise sequence identities precents in nucleotides and amino-acids for respectively partial LTAg (Polyomaviridae), partial ns5b (Flaviviridae, Hepacivirus), partial Structural protein (Togavirida) and nucleocapsid protein (Phenuiviridae; Phlebovirus). Funding: S. Tirera was funded by the RESERVOIRS program, supported by European funds (PO FSE 2014-2020), an "Investissement d'Avenir" grant managed by the Agence Nationale de la Recherche (CEBA, ref. ANR-10-LABX-25-01), a European Commission "REGPOT-CT-2011-285837-STRonGer" grant within the FP7, and the Institut Pasteur de la Guyane. This study was conducted within the "BioViRo" program, supported by an "Investissement d'Avenir" grant managed by the Agence Nationale de la Recherche (CEBA, reference ANR-10-LABX-25-01). Field work conducted by BdT was funded by the ViRUSES program, supported by European funds (FEDER) and assistance from Région Guyane and Direction Régionale pour la Recherche et la Technologie, the ZNIEFF Guyane (DEAL Guyane) program, the GUYAMAZON II program, and Réseau des Observatoires Hommes-Milieux (OHM-Oyapok APR 2013, François Catzeflis). High-throughput sequencing was performed on the Biomics Platform, C2RT, Institut Pasteur, Paris, France, supported by France Génomique (ANR-10-INBS-09-09) and IBISA. The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Institutional Review Board Statement: Not applicable.
Data Availability Statement: All in-house scripts used in this study are available at the github repository under the link: github.com/stirera/rodentsvirome_filter1 (accessed on 9 August 2021).