Profile of Small RNAs, vDNA Forms and Viral Integrations in Late Chikungunya Virus Infection of Aedes albopictus Mosquitoes

The Asian tiger mosquito Aedes albopictus is contributing to the (re)-emergence of Chikungunya virus (CHIKV). To gain insights into the molecular underpinning of viral persistence, which renders a mosquito a life-long vector, we coupled small RNA and whole genome sequencing approaches on carcasses and ovaries of mosquitoes sampled 14 days post CHIKV infection and investigated the profile of small RNAs and the presence of vDNA fragments. Since Aedes genomes harbor nonretroviral Endogenous Viral Elements (nrEVEs) which confers tolerance to cognate viral infections in ovaries, we also tested whether nrEVEs are formed after CHIKV infection. We show that while small interfering (si)RNAs are evenly distributed along the full viral genome, PIWI-interacting (pi)RNAs mostly arise from a ~1000 bp window, from which a unique vDNA fragment is identified. CHIKV infection does not result in the formation of new nrEVEs, but piRNAs derived from existing nrEVEs correlate with differential expression of an endogenous transcript. These results demonstrate that all three RNAi pathways contribute to the homeostasis during the late stage of CHIKV infection, but in different ways, ranging from directly targeting the viral sequence to regulating the expression of mosquito transcripts and expand the role of nrEVEs beyond immunity against cognate viruses.


Introduction
Chikungunya virus (CHIKV) was first isolated in Tanzania in 1952 [1] causing sporadic outbreaks. In 2005, it re-emerged in costal Kenya and spread to the islands of the Indian Ocean and India [2]. After 2005, CHIKV moved globally, causing outbreaks not only in tropical regions of the world such as South-East Asia, Central Africa, and Central America, but also in temperate Europe and North America [3][4][5]. The increasing public heath significance of CHIKV is tightly linked with the global invasion of Asian tiger mosquito Aedes albopictus which established in temperate areas of the world [6]. In response to this trend, chikungunya was included in the WHO list of the most relevant neglected tropical diseases in 2017 [7].
Adult female mosquitoes acquire viral pathogens through an infectious blood meal. Ingested viral particles must replicate in the midgut and further disseminate throughout the mosquito general cavity to reach salivary glands. Only if/when infection is established in the salivary glands, mosquitoes can transmit viruses to a new host during a subsequent blood meal, thus continuing the cycle [8,9]. In Ae. albopictus, CHIKV infection progresses rapidly and viral particles are detected in the saliva as early as 2 days post infection (dpi) [10]. During the early stages of infection, viral titer increases and mosquitoes activate genes, leading to regulation of virus replication [46][47][48]. The role of vpiRNAs following arboviral infection has not been elucidated yet [45].
Besides changes in the abundance of small RNA molecules following arboviral infections, transcriptional changes occur following infections with CHIKV and dengue virus (DENV) [49].
Here, we combined the analysis of high throughput sequencing data (whole genome sequencing, WGS, and small RNA-seq) and molecular biology techniques to test for vDNA presence and new nrEVE formation and analyze the production of small RNAs, including those from nrEVEs existing in the Ae. albopictus genome, 14 days post CHIKV infection, when infection has become persistent [50]. We show that CHIKV infection does not result in new nrEVE formation, but the differential accumulation of piRNAs derived from existing nrEVEs correlates with differential expression of an endogenous transcript. The profile of siRNAs was even along the CHIKV genome. On the contrary, piRNAs mostly arise from a~1000 bp window, which corresponds to the region encoding the C and E2 proteins. A unique vDNA fragment matching this region was also identified.
Statistically significant differentially accumulated siRNAs were also identified and correlated with differential abundance by qRT-PCR of selected, bioinformatically-deduced, target transcripts. Our results clearly show that RNAi pathways not only target the viral sequence, but host miRNAs and piRNAs are also modulated and contribute to cellular homeostasis during persistent CHIKV infection. The identified correlation between the differential accumulation of nrEVE-piRNAs and the decreased expression of the nrEVE-piRNA target expands the possible role of nrEVEs beyond immunity against cognate viruses.

Mosquito Infections
Aedes albopictus mosquitoes of the Foshan strain were used in this study. Mosquitoes were reared under constant conditions, at 28 • C and 70-80% relative humidity with a 12/12 h light/dark cycle. One-week old females were infected with CHIKV 06.21 as previously described [51]. CHIKV 06.21 was isolated in La Reunion island and was kindly provided by the French National Reference Center for Arboviruses. Thirty CHIKV-infected mosquitoes were collected 14 days post infection (dpi), along with un-infected controls. Ovaries were dissected from carcasses and the virus was titrated by fluorescence focus assay on C6/36 Ae. albopictus cells as previously described [51]. At 14 dpi, mean viral titer was 1515 ± 628 FFU/mL. Dissected ovaries and carcasses were pooled into groups of 15 giving a total of 8 samples, 2 pools for each condition ( Figure S1A). Each sample was used for DNA and RNA extraction using the Nucleospin tissue or the Nucleospin miRNA kit (Macherey-Nagel), respectively, following manufacturer's instructions. A second infection experiment was performed as before, resulting in mean viral titer of 1589 ± 50 FFU/mL. Samples from this infection experiment were collected as before and were used for RNA extraction using the standard Trizol protocol.

Whole Genome and Small RNA Sequencing
After quality control, DNA of the first infection experiment was sent to the Beijing Genomics Institute (BGI) for Illumina library preparation. Libraries were run on the Illumina Novaseq 6000 by to obtain 150 bp paired end reads. Small RNA libraries were prepared by BGI from the same samples as for WGS and sequenced on the BGI-Seq50 to obtain 40 million reads per sample.

Detection of vDNA Fragments
The presence of vDNA fragments was investigated in CHIKV infected ovaries and carcasses by PCR using primers designed to cover the whole viral genome and producing overlapping amplicons of~300-400 bp each (Table S1). PCR-amplified bands were cloned using the TA cloning kit (Invitrogen) following manufacturer's instruction and sequenced by Macrogen Europe (Amsterdam, The Netherlands).

Bioinformatic Analyses of WGS Data
For all genomic analyses regarding the Ae. albopictus genome, we used the latest genome assembly (Aalbo_primary.1, Refseq assembly GCF_006496715.1) [34]. WGS data provided by BGI were aligned to the genome of Ae. albopictus using BWA-MEM algorithm [52]. Because the landscape of nrEVEs in the genome of Ae. albopictus is not fixed, but mosquitoes may have different nrEVE patterns [40], we first assessed which nrEVEs were present in each sample using Genome Analysis ToolKit (GATK) [53] as previously described [40,53] and then assessed an unbiased small RNA profile of nrEVEs after CHIKV infection. Due to sequence similarity among nrEVEs, a nrEVE was considered absent in a sample when the average coverage for that nrEVE was less than 1 read with zero mapping quality. To identify possible viral integrations formed after CHIKV infection, we used the VyPer and ViR pipelines [54,55]. The similarity of nrEVEs sequences to CHIKV 06.21 was calculated by grouping nrEVEs depending on their viral family and then aligning them to the genome of CHIKV 06.21 (NC_004162.2) with MAFFT [56]. We retrieved the identity matrix as implemented by the EMBL-EBI search and sequence analysis tools [57] and plotted the results with custom scripts.

Bioinformatic Analyses of sRNA Data
For the following analyses, sRNA data were aligned using sRNAmapper [58] to either the genome of CHIKV 06.21 (NC_004162.2) or Ae. albopictus transcripts (Aalbo_primary.1, Refseq assembly GCF_006496715.1) using the "-best" option. Sequences between 20-22 nt were identified as siRNAs/miRNAs and sequences between 25-30 nt were identified as piRNAs. This classification of sRNAs was based on available literature [58][59][60][61][62], and through bioinformatics analysis as follows. First, we plotted the length distribution of all mature miRNAs reported for Ae. aegypti, downloaded from miRBase database v.22 [63], and corroborate that the major range for miRNAs occurs between 20 to 22 nt ( Figure S2). Second, we discarded ranges of nts where miRNA and piRNA definitions overlap, as the tail distribution of miRNAs (size range of 23-24 nt) shows ( Figure S2). Finally, taking into account that smaller peaks at 20 and 22 nt are often observed as Argonaute-2 products [64], and that thousands of 20 and 22 nt unique reads perfectly mapped to the viral genome ( Figure 1), we decided that the broader range of 20-22 nt would provide less biased results than limiting the analyses to 21 nt reads (i.e., exo-siRNA consensus size).

Detection of vDNA Fragments
The presence of vDNA fragments was investigated in CHIKV infected ovaries and carcasses by PCR using primers designed to cover the whole viral genome and producing overlapping amplicons of ~300-400 bp each (Table S1). PCR-amplified bands were cloned using the TA cloning kit (Invitrogen) following manufacturer's instruction and sequenced by Macrogen Europe (Amsterdam, The Netherlands).

Bioinformatic Analyses of WGS Data
For all genomic analyses regarding the Ae. albopictus genome, we used the latest genome assembly (Aalbo_primary.1, Refseq assembly GCF_006496715.1) [34]. WGS data provided by BGI were aligned to the genome of Ae. albopictus using BWA-MEM algorithm [52]. Because the landscape of nrEVEs in the genome of Ae. albopictus is not fixed, but mosquitoes may have different nrEVE patterns [40], we first assessed which nrEVEs were present in each sample using Genome Analysis ToolKit (GATK) [53] as previously described [40,53] and then assessed an unbiased small RNA profile of nrEVEs after CHIKV infection. Due to sequence similarity among nrEVEs, a nrEVE was considered absent in a sample when the average coverage for that nrEVE was less than 1 read with zero mapping quality. To identify possible viral integrations formed after CHIKV infection, we used the VyPer and ViR pipelines [54,55]. The similarity of nrEVEs sequences to CHIKV 06.21 was calculated by grouping nrEVEs depending on their viral family and then aligning them to the genome of CHIKV 06.21 (NC_004162.2) with MAFFT [56]. We retrieved the identity matrix as implemented by the EMBL-EBI search and sequence analysis tools [57] and plotted the results with custom scripts.

Bioinformatic Analyses of sRNA Data
For the following analyses, sRNA data were aligned using sRNAmapper [58] to either the genome of CHIKV 06.21 (NC_004162.2) or Ae. albopictus transcripts (Aalbo_primary.1, Refseq assembly GCF_006496715.1) using the "-best" option. Sequences between 20-22 nt were identified as siRNAs/miRNAs and sequences between 25-30 nt were identified as piRNAs. This classification of sRNAs was based on available literature [58][59][60][61][62], and through bioinformatics analysis as follows. First, we plotted the length distribution of all mature miRNAs reported for Ae. aegypti, downloaded from miRBase database v.22 [63], and corroborate that the major range for miRNAs occurs between 20 to 22 nt ( Figure  S2). Second, we discarded ranges of nts where miRNA and piRNA definitions overlap, as the tail distribution of miRNAs (size range of 23-24 nt) shows ( Figure S2). Finally, taking into account that smaller peaks at 20 and 22 nt are often observed as Argonaute-2 products [64], and that thousands of 20 and 22 nt unique reads perfectly mapped to the viral genome ( Figure 1), we decided that the broader range of 20-22 nt would provide less biased results than limiting the analyses to 21 nt reads (i.e., exo-siRNA consensus size).  To visualize the coverage along the virus genome, alignment (bam) files of biological replicates were merged using bamCompare as implemented in DeepTools2 [59], normalizing the data to reads per kilobase of transcripts (RPKM) values based on total library size, setting bin size to 1 nt to improve resolution, and computing the mean of the number of reads between the files. Alignment results were exported as bedgraph files and visualized in Integrative Genomics Viewer (IGV) [60]. sRNAs were filtered by length using BBMap reformat.sh [65]. PingPongPro and the small RNA signature tool [61] were implemented in Galaxy [66] to check for ping-pong signature of reads identified as piRNAs; 1U and 10A biases were plotted with weblogo 2.8.2 [64]. siRNAs that had mapped to CHIKV (hereafter called viral-sRNA) were re-mapped to Ae. albopictus transcripts and visualized in IGV [60], after merging alignment (bam) files with samtools merge [67]. Candidate sequences of viral miRNA-like precursors were identified using ViralMir [68] and miRNAFold [69] with default parameters by analyzing the secondary structure of the 150 bp viral RNA sequence including candidate viral miRNA-like sequences.
2.5.2. Small RNA Profile of Mosquito Transcripts, piRNA Clusters and nrEVEs sRNA data were aligned to Ae. albopictus transcripts. Only sRNAs mapping in reverse orientation with respect to the predicted transcripts were retained. sRNAs were filtered by length using BBMap reformat.sh [65] as indicated above. Counts for each transcript were performed using the subread featureCounts.sh [70]. Transcripts displaying differential abundance of small RNAs between CHIKV infected and control samples were identified and checked against the list of Ae. albopictus immunity genes [34] to identify possible regulation of immunity pathways by sRNAs. sRNA data were aligned to the Ae. albopictus genome following the same criteria as above and their abundance in each piRNA-cluster was estimated using the subread featureCounts.sh [70]. Comparison between infected vs. non-infected samples in the abundance of sRNA mapping to transcripts (hereafter called T-sRNA) was established based on read counts in edgeR using the weighted trimmed mean of M-values with zero pairing (TMMwzp) normalization method and the likelihood ratio test to assess statistical significance [71]. As implemented in edgeR, genomic regions, or transcripts that had a minimum total read count of 15 across all samples were discarded [72]. Statistical significance was based on a 0.05 FDR threshold. Three normalization strategies were implemented to minimize bias in analysis of differential abundance. Normalizations were based on: (1) the total number of sRNA reads that mapped on the reference genome, (2) the total number of reads analyzed in edgeR, following size-filtering and counts, (3) the fraction of miRNA sized molecules that mapped onto a list of previously identified miRNAs [34], as performed in [73][74][75]. Results from the three normalization methods were compared and visualized in Venn diagrams using Venny 2.1.0 [76] showing that the normalization based on the total number of sRNA reads has fewer outliers than the other methods and fewer outliers ( Figure S3). Transcripts displaying significant differential abundance of small RNAs between infected or uninfected were analyzed using Blast2GO [77]. Deduplicated target transcripts were blasted using default parameters against all currently available Culicidae genome sequences and the top 5 hits were retrieved. GO analyses were performed merging the results from the Interpro classification and the results from the blast database with the subsequent mapping and annotation, as implemented in Blast2GO. GO enrichment was performed based on the annotation of the full sets of transcripts of Ae. albopictus with FDR cut-off of 0.05. To find the most represented pathways, target transcripts were analyzed in KOBAS 3.0 [78], selecting Ae. aegypti as reference species and searching the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway (K option). Results were considered significant at p-value < 0.05.
Small RNA coverage of nrEVEs was tested after having verified nrEVE presence in the sample under study as described above. Despite the high stringency implemented in sRNAmapper (18 nt perfect seed match, 1 mismatch allowed after the seed and 2 mismatches allowed on the 3 end tail of the read), single sRNA molecules could be mapped to several nrEVEs with equal confidence because of the high sequence similarity among nrEVEs [34]. To identify sRNAs mapping to nrEVEs and accumulate differentially across samples, we used two strategies to reduce any bias due to nrEVE sequence similarity: (1) we focused on single small RNAs (level 1) or (2) we took in consideration the overall small RNAs mapping to a nrEVE (level 2) ( Figure S1B). For the first strategy, reads from Viruses 2021, 13, 553 6 of 19 each condition that mapped to nrEVEs were collapsed based on their sequence identity with fastx-toolkit [79] and each sequence was used to count the amount of each piRNA and siRNA sequence present in every fastq file, using custom scripts. Individual sRNA names were given following fastx-toolkit results in the format "rank-number" where rank starts at 1 for sRNA with the highest read count and number is the actual number of reads. For the second strategy, piRNAs and siRNAs abundance from each nrEVE was estimated using subread featureCounts.sh [70]. Differential abundance of small RNAs mapping to nrEVEs (hereafter defined as EVE-sRNA) between infected vs. uninfected samples was assessed in edgeR as described in the previous paragraph. EVE-sRNAs that were found statistically differentially abundant between infected and not infected samples were used to blast (BLASTn, default settings) [80] the collection of Ae. albopictus transcripts [34]. When found, the top 10 blast hits of differentially-abundant EVE-sRNAs were retrieved and thermodynamic analyses on the RNA-RNA pairing were performed using the software IntaRNA locally with heuristic mode under the model B [81,82], setting the seed between positions 2-7 [83] and the seed energy threshold <−3.2 kcal/mol [81]. Candidate target transcripts were analyzed using Blast2GO [77] to find their Gene Ontology terms as previously described. Functional annotation was also performed in Argot 2.5 [84] searching against Culicidae with default parameters and a cut-off ≥200.

Differential Expression of Ae. albopictus Annotated miRNAs
sRNA data were mapped to the consensus sequences of 229 loci corresponding to 121 validated Ae. albopictus pre-miRNAs [34]. Counting and differential expression analysis were performed in edgeR as described above. Results of identical miRNAs from different loci were averaged. For each differentially expressed miRNA, both the consensus star sequence and the consensus mature sequence were retrieved, and the possible targets investigated using blast and IntaRNA as described for EVE-sRNAs. Reads that mapped on differentially expressed miRNAs were also aligned to the CHIKV genome to investigate their potential as antiviral miRNAs.

Quantitative PCR
Selected target transcripts from all previous analyses (i.e., from the list of differentially abundant T-sRNAs and transcripts targeted by differentially abundant EVE-sRNAs or viral-sRNAs) were tested for differential expression by quantitative PCR (qPCR) on samples from the second infection experiment. RNA from these samples was extracted using standard Trizol protocol and reverse transcribed with the GoScript reverse transcriptase kit (Promega), following manufacturer's instructions. qRT-PCR reactions were performed using the QuantiNova SYBR Green PCR Kit (Qiagen) following the manufacturer's instructions on an Eppendorf Mastercycler RealPlex4 (Table S1). Estimates of relative quantification were performed with the delta-delta-Ct method implemented in the software qBase+ (Biogazelle), using RPL34 gene as housekeeping [85]. Statistical significance was assessed by ANOVA and unpaired t-tests as implemented in qBASE+. Selected piRNAs were also investigated using qPCR (Table S1). Small RNA molecules from the same samples used for transcript quantification were reverse transcribed using the miScript II RT Kit (Qiagen) following manufacturer's instruction with the HiSpec buffer to ensure only mature sRNAs would be selected. qPCR reactions were set up using the miScript SYBR Green PCR Kit (Qiagen), following manufacturer's instruction. piRNA expression was normalized to the expression of U6B small nuclear RNA (RNU6B).

Results and Discussion
Aedes albopictus mosquitoes of the reference Foshan strain were infected with CHIKV. At 14 dpi, when persistence has established, saliva and two pools of each 15 female carcasses and ovaries were collected. Pools of carcasses and ovaries were processed to extract both DNA and RNA for WGS and small RNA-seq, respectively. WGS data were used to derive the pattern of nrEVEs of the sampled mosquitoes and verify whether CHIKV infection results in new nrEVEs formation. Small RNA-seq was used to quantify the changes in small RNAs against CHIKV, the Ae. albopictus transcriptome and nrEVEs ( Figure S1A).
WGS libraires resulted in a total of 1,324,733,838 and 1,713,328,214 clean reads in CHIKV-infected carcasses and ovaries, respectively, and 1,467,049,196 and 774,603,540 clean reads in uninfected carcasses and ovaries, respectively, after grouping biological replicates. Sequencing of small-RNA libraries resulted in a total of 1,551,205,753 and 1,010,736,055 reads, in the range of 18-30 nt, which could be mapped to the reference Ae. albopictus genome from infected ovaries and carcasses, respectively, after grouping biological replicates. Similarly, 1,568,888,349 and 1,651,306,629 reads were mapped for un-infected control ovaries and carcasses, respectively. Reads size distribution of ovaries samples shows a peak at 21 nt in both infected and uninfected samples, as well as marked piRNAs production peaking at 28 nt ( Figure 1). In uninfected and CHIKV-infected carcasses samples, only a peak at 21 nt was seen, indicating a lower production of piRNAs in carcasses (Figure 1).

Small RNA Profile of CHIKV, vDNA Fragments, and New nrEVEs
The CHIKV genome consists of a single positive strand RNA of~11-12 Kb. Upon entry into a host cell and uncoating, the CHIKV genome is used both as mRNA for translation of nonstructural proteins and as the template for the synthesis of a complementary negative strand RNA molecule, which is the replicative form of the virus. A genomic length positive strand molecule is synthesized from the negative strand RNA, along with a shorter RNA molecule from an internal promoter. This subgenomic positive strand RNA molecule encodes for nonstructural proteins [86]. Small RNA sequencing data from CHIKV-infected and control samples were mapped against the CHIKV genome. Mapped sRNAs were filtered by size to assess siRNA-(20-22 nt) and piRNA-(25-30 nt) responses. A total of 19,008 and 316,527 siRNA size reads mapped to the CHIKV genome from infected ovaries and carcasses, respectively ( Figure 2A). Average strand-specific RPKM normalized counts show that small RNAs map predominantly on the positive strand of CHIKV, with the only exception of siRNAs in CHIKV-infected ovaries. This result indicates that the RNAi activity is mostly focused on the negative strand (Table 1). Consistently, piRNA size reads were found to map against CHIKV ( Figure 2B), with a predominance of piRNA size reads on the positive strand of the virus ( Figure 2C). We found a total of 5510 piRNA size reads in ovaries and 60,824 in carcasses of CHIKV-infected samples with a clear pick between positions~7600-8700 of the viral genome, corresponding to the region encoding the C and E2 proteins. A pick of piRNAs at the 5 terminus of the CHIKV genome, around nucleotide position 8000, was previously identified in both Ae. aegypti and Ae. albopictus mosquitoes 4 dpi with CHIKV [87]. Accordingly, we saw a single vDNA form between positions 7964-8742 of the viral genome in infected ovaries ( Figure 2B). A smaller fragment in the same region was found in carcasses. In a laboratory colony of Ae. albopictus derived from Vietnam, vDNA forms from nonstructural proteins (i.e., nsP2 and nsP4) were identified as early as 2 dpi and up to 9 dpi with CHIKV [30]. These vDNA forms were shown to influence the profile of siRNAs during the early phase of infection. Taken together, these results indicate that the subgenomic mRNA is a general hot spot for piRNAs in Aedes spp. mosquitoes both during acute and persistent CHIKV infection [42]. On the contrary, vDNA forms can be generated from different parts of the CHIKV genome, possibly depending on the combination of viral and mosquito strains. Additionally, while during the early phases of arboviral infection, vDNAs modulate siRNAs in mosquitoes [30,33], during persistent infection, we could identify a unique vDNA form corresponding to a pick of piRNAs. Overlap analysis of piRNA reads revealed that the ping-pong amplification loop is active at this stage of CHIKV infection both in ovaries and carcasses ( Figure 2D).
Using WGS data from the same samples, the presence of novel nrEVEs was tested, implementing VyPER [54] followed by ViR [55], resulting in no signals. This result indicates that no integration of vDNA fragments occurred following CHIKV infection in either soma (carcass) or germline (ovary) cells. The absence of nrEVEs following CHIKV infection may be due to the fact that integrations of viral sequences is a continuous but rare event in Aedes genomes and may depend on the viral load and the frequency of infection [36]. Additionally, no nrEVE with similarity to CHIKV has been characterized in arthropod genomes so far [34,35,88], suggesting that vDNA forms of CHIKV may remain episomal.
tural proteins (i.e., nsP2 and nsP4) were identified as early as 2 dpi and up to 9 dpi with CHIKV [30]. These vDNA forms were shown to influence the profile of siRNAs during the early phase of infection. Taken together, these results indicate that the subgenomic mRNA is a general hot spot for piRNAs in Aedes spp. mosquitoes both during acute and persistent CHIKV infection [42]. On the contrary, vDNA forms can be generated from different parts of the CHIKV genome, possibly depending on the combination of viral and mosquito strains. Additionally, while during the early phases of arboviral infection, vDNAs modulate siRNAs in mosquitoes [30,33], during persistent infection, we could identify a unique vDNA form corresponding to a pick of piRNAs. Overlap analysis of piRNA reads revealed that the ping-pong amplification loop is active at this stage of CHIKV infection both in ovaries and carcasses ( Figure 2D).
Using WGS data from the same samples, the presence of novel nrEVEs was tested, implementing VyPER [54] followed by ViR [55], resulting in no signals. This result indicates that no integration of vDNA fragments occurred following CHIKV infection in either soma (carcass) or germline (ovary) cells. The absence of nrEVEs following CHIKV infection may be due to the fact that integrations of viral sequences is a continuous but rare event in Aedes genomes and may depend on the viral load and the frequency of infection [36]. Additionally, no nrEVE with similarity to CHIKV has been characterized in arthropod genomes so far [34,35,88], suggesting that vDNA forms of CHIKV may remain episomal.

Viral SmallRNAs on Ae. albopictus Transcripts
After having identified small RNAs that mapped to CHIKV, we checked whether any of these viral small RNA molecules also mapped to Ae. albopictus transcripts. A total of 68 siRNA reads from the CHIKV-infected samples, deriving primarily from two hotspots on the nsP3 coding region ( Figure 3A), mapped to Ae. albopictus transcripts with an enrichment for afadin-like protein (LOC115259774), glucose transporter type1-like proteins (LOC109428702-LOC109424279), and an uncharacterized protein (LOC115263499) ( Table S2). These siRNA reads and the corresponding CHIKV genomic sequences were further analyzed to check for hairpins using viralMir [68] and miRNAFold [69]. These small RNAs may originate from hairpin structures within the CHIKV genomic sequence  Figure S4). The expression levels of the above-mentioned three transcripts were compared by qPCR between CHIKV-infected samples and un-infected controls ( Figure 3B). Results showed a significant decrease in expression for transcripts encoding the glucose transporter and the uncharacterized protein (p-values < 0.05), but not that of afadin-like protein, in infected carcasses. Overall, these results suggest that CHIKV encodes for small RNAs which may target mosquito transcripts. However, whether these small RNAs are canonical miRNAs or are part of the heterogenous population of siRNAs produced following infection has to be further investigated. Modulation of pathways linked to energy production, including carbohydrate metabolism, has already been observed in mosquitoes following arboviral infection [89][90][91]. It was proposed that induction of glucose metabolism following viral infection may compensate the lack of energy, which derive from the breakdown of mitochondrial membranes resulting in a lack of ATP [90,92]. of 68 siRNA reads from the CHIKV-infected samples, deriving primarily from two hotspots on the nsP3 coding region ( Figure 3A), mapped to Ae. albopictus transcripts with an enrichment for afadin-like protein (LOC115259774), glucose transporter type1-like proteins (LOC109428702-LOC109424279), and an uncharacterized protein (LOC115263499) ( Table S2). These siRNA reads and the corresponding CHIKV genomic sequences were further analyzed to check for hairpins using viralMir [68] and miRNAFold [69]. These small RNAs may originate from hairpin structures within the CHIKV genomic sequence ( Figure S4). The expression levels of the above-mentioned three transcripts were compared by qPCR between CHIKV-infected samples and un-infected controls ( Figure 3B). Results showed a significant decrease in expression for transcripts encoding the glucose transporter and the uncharacterized protein (p-values < 0.05), but not that of afadin-like protein, in infected carcasses. Overall, these results suggest that CHIKV encodes for small RNAs which may target mosquito transcripts. However, whether these small RNAs are canonical miRNAs or are part of the heterogenous population of siRNAs produced following infection has to be further investigated. Modulation of pathways linked to energy production, including carbohydrate metabolism, has already been observed in mosquitoes following arboviral infection [89][90][91]. It was proposed that induction of glucose metabolism following viral infection may compensate the lack of energy, which derive from the breakdown of mitochondrial membranes resulting in a lack of ATP [90,92].

The Profile of SmallRNAs on Ae. albopictus Transcripts
Modulation of mosquito small RNAs has been shown after arboviral infection not only as a direct response to the virus, but also as a way to maintain cellular homeostasis, by regulating gene expression [28,47,48]. On these bases, we mapped small RNA data to Ae. albopictus transcripts and kept only reads that mapped on antisense orientation to assess differential abundance between infected and uninfected samples. A total of 376,119 siRNA reads and 5,684,597 piRNA reads mapped to mosquito transcripts in CHIKV-infected ovaries; 1,084,531 siRNA reads and 690,008 piRNA reads in CHIKV-infected carcasses. Regarding the corresponding un-infected controls, 302,702 siRNA reads and 5,780,782 piRNA reads mapped to transcripts in ovaries, and 244,103 siRNA reads and 243,947 piRNA reads in carcasses. A total of 30 and 534 transcripts displayed significant differential abundance (SDA) of siRNAs in CHIKV-infected ovaries and carcasses, respectively, representing six over-represented GO terms, including "DNA helicase activity", "DNA recombination", and "DNA repair" (Table S4). Interestingly, transcriptional regulation of genes associated with DNA repair and chromatin function has been recently linked to ovaries and carcasses, respectively [93] (Figure 4A,B, Table S3). 5,780,782 piRNA reads mapped to transcripts in ovaries, and 244,103 siRNA reads and 243,947 piRNA reads in carcasses. A total of 30 and 534 transcripts displayed significant differential abundance (SDA) of siRNAs in CHIKV-infected ovaries and carcasses, respectively, representing six over-represented GO terms, including "DNA helicase activity", "DNA recombination", and "DNA repair" (Table S4). Interestingly, transcriptional regulation of genes associated with DNA repair and chromatin function has been recently linked to ovaries and carcasses, respectively [93] (Figure 4A,B, Table S3). GO enrichment analyses reveled an over representation representation of 6 terms, including "DNA helicase activity", "DNA recombination", and "DNA repair" (Table S4). Interestingly, transcriptional regulation of genes associated with DNA repair and chromatin function has been recently linked to transgenerational immunopriming (TGIP) in D. melanogaster after infection with Sindbis virus, an Alphavirus-like CHIKV, which was clearly demonstrated only in females [93]. Thus, it is tempting to speculate that during persistent infection of female mosquitoes, there is transcriptional regulation of functions that not only helps with maintenance of infection, but also prepares to TGIP. Because piR-NAs have been shown to contribute in regulating mosquito gene expression [94], we also GO enrichment analyses reveled an over representation representation of 6 terms, including "DNA helicase activity", "DNA recombination", and "DNA repair" (Table S4). Interestingly, transcriptional regulation of genes associated with DNA repair and chromatin function has been recently linked to transgenerational immunopriming (TGIP) in D. melanogaster after infection with Sindbis virus, an Alphavirus-like CHIKV, which was clearly demonstrated only in females [93]. Thus, it is tempting to speculate that during persistent infection of female mosquitoes, there is transcriptional regulation of functions that not only helps with maintenance of infection, but also prepares to TGIP. Because piRNAs have been shown to contribute in regulating mosquito gene expression [94], we also checked for transcripts with significantly different piRNA profile between infected and uninfected samples. A total of 299 and 81 transcripts displayed SDA of piRNAs in CHIKV-infected ovaries and carcasses, respectively ( Figure 4C,D, Table S5). In carcasses, the majority of these transcripts showed lower abundance of piRNAs during infection. KOBAS gene-list enrichment analysis of these transcripts revealed an enrichment for several pathways, including oxidative phosphorylation, purine metabolism, and DNA damage-related pathways ( Figure S5, Table S6). Two transcripts displayed differential abundance for piRNAs in CHIKV-infected ovaries and carcasses as well as for siRNAs in CHIKV-infected ovaries; four transcripts displayed differential abundance for piRNAs and siRNAs in CHIKV-infected ovaries; 23 transcripts displayed differential abundance for piRNAs and siRNAs in CHIKV-infected carcasses ( Figure S6). The expression levels of 3 transcripts (XM_029853235.1, uncharacterized; XM_029877572.1 encoding for a neurofilament heavy polypeptide-like protein and XM_029878700.1 for a hydrolase), which had consistent differential abundance of small RNAs across conditions and the relatively highest number of reads, were compared by qPCR between infected and un-infected samples. qPCR results show all three transcripts are significantly under-expressed in carcasses during infection compared to un-infected controls ( Figure 4E).
The hydrolase XM_029878700.1 was the only tested transcript with SDA of only piR-NAs. These piRNAs were extracted and re-mapped to the Ae. albopictus genome after masking the genomic region encoding for XM_029878700.1, leading to the identification of genomic region NW_021838576.1:117401889-117402177. This region, which maps outside annotated piRNA clusters [34], shows high sequence homology to XM_029878700.1, but without annotated ORFs ( Figure 4F). Annotation as "hydrolase" is a generic term as hydrolysis can be performed on different chemical bonds, including sugar glucosyl bonds and lipids. It is interesting to notice that sugar and lipid homeostasis is a common feature of Aedes spp. mosquitoes after infection not only with CHIKV, but also Flaviviruses [22,[90][91][92][95][96][97], suggesting that the candidate transcript identified here should be further functionally investigated.
The database of Ae. albopictus miRNAs was recently published [34], so we took advantage of this resource to check for specific changes in mosquito miRNA expression levels during persistent CHIKV infection. A total of 14 miRNAs were detected as differentially modulated (Table 2), with the majority being upregulated (8) in ovaries but depleted in carcasses. All the DE miRNAs (except for miR-new6) have a homolog in Ae. aegypti. Differential expression levels ranged between~0.20-fold for aal-miR-new6 and~20-fold for aal-miR-210. We compared these results to a previous report of differentially expressed miRNAs in Ae. albopictus cells [98] and found only one miRNA present in both analyses (aal-miR-1000). Interestingly, miR-1000 was reported to be under-expressed in cells 24 h post infection, whereas we found it over-expressed in adult mosquitoes 14 dpi. We searched the Ae. albopictus transcriptome for potential target sites of the DE miRNAs, as well as the corresponding star miRNAs, performing thermodynamic analyses on the RNA-RNA pairing of the top 10 blast hits for each miRNA. We found a total of 229 possible targets (140 from over-expressed miRNAs, 89 from under-expressed miRNAs) (Table S7). KOBAS analysis revealed 12 significantly enriched pathways (Table 3), including the Toll and Imd pathways, suggesting the under regulation of these immunity pathways during persistent infection. "Lysine degradation" was also identified among the enriched pathways when looking at differentially abundant piRNAs ( Figure S5). We also tested for potential targets site of the DE miRNAs on the CHIKV genome, but no significant alignment was found, indicating that these miRNAs cannot directly target CHIKV. Overall, these results show a different modulation of miRNAs in ovaries and carcasses. Depletion of miRNAs in carcasses is a trend consistent with what has been observed following Flavivirus infection (i.e., Zika and DENV) [99,100].

Differential Abundance of nrEVEs-sRNAs Following Infection and Their Targets on the Ae. albopictus Transcriptome
The genome of Ae. albopictus harbors hundreds of nrEVEs, which are variably distributed among the genomes of Foshan mosquitoes [40]. None of these nrEVEs derive from Alphaviruses and the maximum sequence similarity between annotated nrEVE and CHIKV is 70% ( Figure 5A). Nevertheless, nrEVEs produce piRNAs [35] and responder and trailer piRNAs are produced from viral RNAs guided by endogenous piRNAs, indicating that piRNA biogenesis can spread outside a putative nrEVE targeted region [31,101]. On this basis, we first verified which nrEVEs our mosquitoes had, then tested whether the piRNA profile of these nrEVEs was different in infected vs. un-infected-fed samples and finally checked for putative targets of the differentially accumulated nrEVEs-derived piRNAs.
Differences in the piRNA profile of nrEVEs were observed only in CHIKV-infected ovaries. A total of 24 nrEVEs mapping in different clusters showed a piRNA profile that was significantly different between infected and un-infected samples ( Figure S7, Table S8). piRNAs from these nrEVEs peaked between 27 and 29 nt ( Figure 5C). When looking at single piRNAs ( Figure S1B, level 1), a total of 34 and 4 piRNA molecules were significantly differentially expressed in infected vs un-infected ovaries and carcasses ( Figure 5B), respectively, and peaked at 27 nt ( Figure 5C). Of these 38 piRNAs, 30 originated from nrEVEs embedded in piRNA clusters. We focused on CHIKV-infected ovaries. Intersecting the genomic origins of the DE piRNAs with the SDA nrEVEs, we identified 24 DE piRNAs from 8 SDA nrEVEs (Table S9). These 24 piRNAs do not blast on the CHIKV genome indicating that they do not target the viral mRNA, but they blast against Ae. albopictus transcripts. The free energy of the seed pairing between these piRNAs and their candidate target transcripts was evaluated with IntaRNA, showing that these piRNAs could interfere with the detected transcripts. Two piRNAs (1049-820 and 3135-266), both originating from nrEVE Rhabdo22 ( Figure 5D, red bar), which is embedded in piRNA cluster 461, have a common target, transcript XM_029880513.1, an uncharacterized protein ( Figure 5E). In accordance with bioinformatic data, qPCR analysis confirmed a higher expression of 3135-266 in infected vs. un-infected ovaries and a lower expression of the candidate target transcript ( Figure 5F). Taken together, these results suggest that nrEVEs-derived piRNAs can be differentially accumulated during CHIKV infection and regulate the expression profile of endogenous transcripts. Alphaviruses and the maximum sequence similarity between annotated nrEVE and CHIKV is 70% ( Figure 5A). Nevertheless, nrEVEs produce piRNAs [35] and responder and trailer piRNAs are produced from viral RNAs guided by endogenous piRNAs, indicating that piRNA biogenesis can spread outside a putative nrEVE targeted region [31,101]. On this basis, we first verified which nrEVEs our mosquitoes had, then tested whether the piRNA profile of these nrEVEs was different in infected vs. un-infected-fed samples and finally checked for putative targets of the differentially accumulated nrEVEsderived piRNAs.

Conclusions
Here, we describe the profile of small RNAs and the presence of vDNA forms and nrEVEs during persistent infection of Ae. albopictus with CHIKV. Because RNAi is the main antiviral pathway of Aedes spp. mosquitoes, a number of studies have described the profile of small RNAs following infection with arboviruses, including CHIKV [87,[98][99][100][102][103][104][105]. In line with previous reports, we found a strong exogenous siRNA response throughout the whole CHIKV genome in both ovaries and carcasses [30,88]. On the contrary piRNAs aligned primarily to a region encompassing the C and E2 coding sequences from which a unique vDNA form was also identified, in both analyzed tissues. This result differs from previous studies in which various vDNAs from multiple regions of the genome of the flaviviruses Zika, dengue, and West Nile virus were identified 24 h post infection and up to 21 dpi in mosquito cells and adults, when infection has become persistent [30,32,106]. Whether this discrepancy is dependent on the specific viral species and mosquito species/strains studied or it indicates that only the vDNA molecule producing the most piRNAs is retained once infection has become persistent requires further investigations.
CHIKV infection did not result in the formation of novel viral integrations in the genome of Ae. albopictus. However, the piRNA profile of nrEVEs in the genomes of the tested mosquitoes changed and two nrEVE-derived piRNAs were further validated by qRT-PCR to be more abundant in CHIKV-infected than uninfected ovaries. Accordingly, the transcript to which these piRNAs map showed decreased in expression in CHIKVinfected ovaries. Thus, nrEVEs may have other functions than antagonistic action against cognate viruses, which was demonstrated in ovaries [38]. This result may explain previous findings describing that the presence/absence of certain nrEVEs correlates with differences in dissemination efficiency for CHIKV and dengue viruses, even if the genome of neither Ae. aegypti not Ae. albopictus has nrEVEs derived from these arboviruses [34,35,39]. To conclude, here we provided an in-depth analysis of small RNAs in CHIKV-infected Ae. albopictus mosquitoes at 14 dpi, when CHIKV has established persistent infection and identify target transcripts of differentially abundant small RNAs that merits further investigations. These results clearly show that the activity of the RNAi pathway modulates sRNAs and piRNAs, which contribute to cellular homeostasis, affecting primarily functions related to lipid and sugar metabolisms and DNA repair/binding. Finally, the identified correlation between the differential accumulation nrEVE-piRNAs and the decreased expression of the piRNA-target expands the possible role of nrEVEs beyond immunity against cognate viruses.
Supplementary Materials: The following are available online at https://www.mdpi.com/1999-491 5/13/4/553/s1, Figure S1: (A) Scheme of the infection experiments. (B) Strategy used to study the small RNA profile of nrEVEs, Figure S2: Length distribution of the full collection of Aedes aegypti miRNAs downloaded from miRbase, Figure S3: Venn diagram representing the intersection of the sRNA differential abundance across each transcript based on the three different library sizes for normalization, Figure S4: Hairpin predictions computed with miRNAFold, Figure S5: Bar plot of significantly enriched pathways based on differential sRNA abundance in CHIKV-infected ovaries and carcasses, Figure S6: Venn diagram of the transcripts displaying differential abundance across conditions, Figure S7: Venn diagram representing the intersection of the sRNA differential abundance across each nrEVE based on the three different library sizes for normalization, Table S1: List of primers used in this study, Table S2: List of candidate targets of potential viral-sRNAs, Table S3: List of transcripts displaying significant differential abundance for siRNAs, Table S4:. GO enrichment analysis of siRNA SDA transcripts, Table S5: List of transcripts displaying significant differential abundance for piRNAs,