Next Article in Journal
Comparative Transcriptomic Analyses of Peripheral Blood Mononuclear Cells of COVID-19 Patients without Pneumonia and with Severe Pneumonia in the First Year of Follow-Up
Next Article in Special Issue
Increased Susceptibility of Rousettus aegyptiacus Bats to Respiratory SARS-CoV-2 Challenge Despite Its Distinct Tropism for Gut Epithelia in Bats
Previous Article in Journal
Emergence of Recombinant SARS-CoV-2 Variants in California from 2020 to 2022
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Endogenous Bornavirus-like Elements in Bats: Evolutionary Insights from the Conserved Riboviral L-Gene in Microbats and Its Antisense Transcription in Myotis daubentonii

1
RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany
2
European Virus Bioinformatics Center, 07743 Jena, Germany
3
Cluster of Excellence Balance of the Microverse, Friedrich Schiller University Jena, 07743 Jena, Germany
4
Institute for Virology, FB10-Veterinary Medicine, Justus Liebig University, 35392 Gießen, Germany
5
Genome Competence Center (MF1), Robert Koch Institute, 13353 Berlin, Germany
6
German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, 04103 Leipzig, Germany
7
Fritz Lipmann Institute-Leibniz Institute on Aging, 07745 Jena, Germany
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Viruses 2024, 16(8), 1210; https://doi.org/10.3390/v16081210
Submission received: 29 April 2024 / Revised: 16 July 2024 / Accepted: 24 July 2024 / Published: 27 July 2024
(This article belongs to the Special Issue Antiviral Immune Responses of Bat)

Abstract

:
Bats are ecologically diverse vertebrates characterized by their ability to host a wide range of viruses without apparent illness and the presence of numerous endogenous viral elements (EVEs). EVEs are well preserved, expressed, and may affect host biology and immunity, but their role in bat immune system evolution remains unclear. Among EVEs, endogenous bornavirus-like elements (EBLs) are bornavirus sequences integrated into animal genomes. Here, we identified a novel EBL in the microbat Myotis daubentonii, EBLL-Cultervirus.10-MyoDau (short name is CV.10-MyoDau) that shows protein-level conservation with the L-protein of a Cultervirus (Wuhan sharpbelly bornavirus). Surprisingly, we discovered a transcript on the antisense strand comprising three exons, which we named AMCR-MyoDau. The active transcription in Myotis daubentonii tissues of AMCR-MyoDau, confirmed by RNA-Seq analysis and RT-PCR, highlights its potential role during viral infections. Using comparative genomics comprising 63 bat genomes, we demonstrate nucleotide-level conservation of CV.10-MyoDau and AMCR-MyoDau across various bat species and its detection in 22 Yangochiropera and 12 Yinpterochiroptera species. To the best of our knowledge, this marks the first occurrence of a conserved EVE shared among diverse bat species, which is accompanied by a conserved antisense transcript. This highlights the need for future research to explore the role of EVEs in shaping the evolution of bat immunity.

1. Introduction

Within all vertebrates, bats (Chiroptera) are the most abundant and ecologically diverse animals [1]. Except for the polar regions, bats are globally distributed [2] and their origin has been dated in the Cretaceous period [3]. In addition to having the ability to fly [3], to learn to produce vocalizations by hearing [4], and an exceptional longevity [5,6,7], bats are also known for their unique natural resistance to many pathogenic viruses [1,8,9,10]. For example, MERS and Ebola RNA viruses cause fatal infections in humans; however, bats appear to be asymptomatic and survive the infection [11,12,13,14]. A new type of coronavirus, SARS-CoV-2, led to a worldwide pandemic starting in 2019 and caused more than 6.9 million deaths, according to the WHO (www.who.int/emergencies/diseases/novel-coronavirus-2019 (accessed on 29 April 2024)) and most probably also originated in bats [15,16]. Despite the unique biological characteristics of these flying mammals, bats are one of the least studied taxa of all mammalian [2,17,18], and the reasons for their function as reservoirs for various viruses remain unclear [19,20,21]. However, bats’ uniquely developed immune systems may provide the solution to better understanding and fighting various pathogens and preventing future pandemics [13,22,23].
The immune systems of various bat species have been studied in the past (see [9] for an overview), with most focusing on differently expressed (protein-coding) genes [13,17,24,25,26]. Genes related to or associated with the immune system are typically differently expressed during viral infections on extreme levels [27]. In bat cells, these expression changes often appear not as drastically as in human cells, as previously shown in a comparison of a human and bat cell line infected with Ebola and Marburg virus [13]. Although investigations into the bat transcriptome during viral infections provide first insights into unique features, it is still elusive how the immune system of bats functions as a whole to control viral infections and how important key players act together. In addition, there is still little knowledge about the non-protein-coding transcriptome of bats, even though noncoding RNAs (ncRNAs) also play important roles during infections in bats [5,28,29].
One part of the ncRNA transcriptome consists of transcribed endogenous viral elements (EVEs), which have recently become more and more focused in the context of infections [30,31]. When a virus infects a cell, a part of the viral genome can be integrated into the host genome. In germline cells, integrated viral fragments can become EVEs; this process is called endogenization [32]. Once integrated into the host genome, EVEs can increase their allele frequency via vertical transmission until their potential fixation [32]. The first described EVEs were derived from endogenous retroviral elements (ERVs) [33,34]. For retroviruses, the integration of the retroviral RNA genome into the host DNA is a major step in the replication cycle [35]. Although less common, EVEs originating from nonretroviruses have been identified across many eukaryotic organisms, including unicellular eukaryotes, plants, vertebrates, and arthropods [33,36,37,38,39,40,41,42,43,44]. For example, EVEs derived from bornaviruses (nonsegmented, negative strand RNA viruses) were detected in eukaryotic genomes [45]. The family Bornaviridae comprises three genera: Carbovirus, Cultervirus, and Orthobornavirus, with EVEs present in all genera [46]. Different types of endogenous bornavirus-like (EBL) elements exist depending on which part of the viral mRNA has been integrated. To date, the elements of EBLN (nucleoprotein), EBLM (matrix protein), EBLG (envelope glycoprotein), EBLL (RNA-dependent RNA polymerase), EBLP (phosphoprotein), and EBLX (accessory protein) are known [47,48,49]. Despite being RNA viruses, bornaviruses likely have their mRNA reverse transcribed and integrated into the host genome via the action of the long interspersed nuclear element 1 (LINE-1) retrotransposon [33,45].
Among other mammals, EVEs with intact open reading frames (ORFs) have already been described in bat species. For instance, in Eptesicus fuscus, an EBLL element (efEBLL-1) was identified with an intact and complete ORF spanning 1718 codons [47], thus marking the initial observation of an EVE capable of encoding a functional RNA-dependent RNA polymerase. The function of this protein remains unclear, as is the case with most described EVEs. A notable exception is the VP35-like ORF derived from ancient filoviruses, where a hypothetical regulatory function has been proposed. Interestingly, this VP35-like protein exhibits an antagonist function to its exogenous homolog [30], thus potentially serving as a regulator of innate immune signaling. Various investigations have been conducted regarding EVEs in bats and their function, yet more than speculation or vague assertions are seldom made. It is assumed that some of these EVEs in bats contribute to bat-specific immune mechanisms that may confer a virus-tolerant phenotype [21,50]. In other organisms as well, these functions are typically inferred rather than definitively established [51]. An exception is the well-described syncytins, which are found, for example, in humans and mice, thus originating from the envelope gene of an expressed endogenous retrovirus [52,53,54,55,56,57].
In this work, we describe a novel EBLL in Myotis daubentonii (EBLL-Cultervirus.10-MyoDau) and its homologs in 18 bat species, thus following the EVE nomenclature proposed by Kawasaki et al. [48]. For better readability, we refer to EBLL-Cultervirus.10-MyoDau as CV.10-MyoDau in the following, and we omit the host component (-MyoDau) if we are talking about the locus in general and not about a copy within a species.
We demonstrate conservation at the nucleotide level and similarities at the protein level. Unexpectedly, an antisense transcript composed of three exons, which we named Antisense Myotis Complementary RNA (AMCR-MyoDau), has been confirmed through RNA-Seq analysis and RT-PCR in a Myotis daubentonii cell line. This AMCR-MyoDau is partly conserved in 22 Yangochiropera and 12 Yinpterochiroptera bat genomes.
Our results pave the way for further studies on transcribed EVEs to elucidate their potential function and role in virus immunity—even beyond bat species.

2. Materials and Methods

2.1. Bat Genome References and Gene Annotation Construction

We scanned genomes of 63 bat species for potential EVEs, thus comprising 42 of the suborder Yangochiroptera and 21 of Yinpterochiroptera, which were obtained from the NCBI database [58] and are of of varying assembly qualities (Table 1). Gene annotations were only available for reference sequences (RefSeqs, marked with an asterisk in Table 1). We performed liftover annotations for the other genomes lacking annotations using Liftoff (v1.6.3) [59]. We used the reference gene annotation from Myotis daubentonii (see Table 1 for species abbreviations) to annotate genes for all microbats lacking an annotation. Myotis daubentonii is one of the more recently published (20 August 2023) reference genomes within microbats, which is characterized by a small number of contigs and even annotated chromosomes. We used the reference annotation from Rousettus aegyptiacus for megabats, thus benefiting from its assembly composed of only 29 contigs, which is the most developed assembly available for megabats. The phylogenetic tree was adopted from the study by Agnarsson et al. [60]; see Figure S1.

2.2. Processing of RNA-Seq Data, Refinement of CV.10-MyoDau Annotation, and Differential Gene Expression

We used tblastn v2.15.0+ (E-value < 10 6 ) to search for new EVEs in the genome of Myotis daubentonii using the amino acid sequence of the L protein of a RefSeq Wuhan sharpbelly bornavirus of the genus Cultervirus (YP_010085030.1) as query (see https://www.doi.org/10.17605/OSF.IO/89EF2 for tblastn results). One of the hits was defined as EBLL-Cultervirus.10-MyoDau (NC_081844.1: 39,922,667–39,923,434; short name: CV.10.MyoDau), thus following the EVE nomenclature defined by Kawasaki et al. [48]. For transcriptome analyses, we used the genome assembly of Myotis daubentonii (mMyoDau2.1, Table 1) with the corresponding annotation (GCF_963259705.1) as a reference and utilized reads from our previously published RNA-Seq dataset derived from a virus- and an interferon alpha-induced M. daubentonii cell line (https://www.ncbi.nlm.nih.gov/bioproject/?term=GSE121301 (accessed on 1 July 2024)) [23]. From this study, we downloaded the quality-trimmed and rRNA-cleaned reads in FASTQ format for further downstream analyses (https://doi.org/10.17605/OSF.IO/X9KAD). In total, we obtained 18 short-read, single-end, and strand-specific Illumina samples comprising two postinfection time points (6 h, 24 h) and three conditions (mock, interferon alpha induction, Rift Valley fever virus Clone13 infection) in three biological replicates.
First, to verify our initial homology-based identification of CV.10-MyoDau and to more precisely define exon boundaries for this novel EVE, we mapped the RNA-Seq data to the Myotis daubentonii reference genome with HISAT2 v2.1.0 using default parameters [61,62] and investigated the mapping results using IGV [63]. For a programmatic approach, we used SAMtools to extract the depth at each nucleotide position from the mapped reads for the 18 samples. We determined the exon boundaries based on the sum of reads exceeding 10 across all 18 samples and refined them with split reads. The definition of the three resulting exons in Gene Transfer Format (GTF) format and the associated sequences of the exons belonging to the AMCR-MyoDau are available in the https://www.doi.org/10.17605/OSF.IO/89EF2.
Next, we ran RNAflow v1.4.6 [64] for mapping, read count normalization, and conducted differential gene expression analysis via DESeq2 [65] with parameters to skip the rRNA depletion step and employed counting reads in reverse strand specificity to match the RNA-Seq library design. We manually extended the Myotis daubentonii annotation GTF file by adding gene, transcript, and exon features for the three exons of AMCR-MyoDau. We obtained the adjusted p values, log2-fold change values, and normalized expression counts for the three exons and all pairwise comparisons.

2.3. In Vitro Validation of EVE Candidate

Cells were either mock treated, infected with Rift Valley fever virus (RVFV) Clone13 (MOI 5), or treated with 1000 U/mL pan species type I interferon (IFN). At the indicated time point, total cellular RNA was isolated using RNeasy Mini Kit (Qiagen, Cat No./ID: 74106) according to the manufacturer’s instructions. A total of 100 ng isolated RNA was used for random primed cDNA synthesis using PrimeScript High Fidelity RT-PCR Kit (Takara, R022B). PCR was conducted using a primer set designed using the online software Primer3Plus [66] to detect the AMCR-MyoDau 90 nucleotide sequence of exon 3 (fwd primer—CTCCCTTGAGGAGTGTGACC—and rev primer—GGCTGTCAGCAACAGTTTCA), KOD polymerase (Calbiochem, 71,086.3), and 2 µL of the respective cDNAs as templates. The annealing temperature was set to 57.1 °C for 35 cycles, and the resulting DNA products were separated on a 2% agarose (Serva, 11,404.07) in TAE (40 mM Tris, 20 mM Glacial acetic acid, 1 mM EDTA) buffer gel for 30 min at 100 V. The amplicons were then visualized using incubation in 0.25 µg/mL ethidium bromide in H2O for 20 min, and pictures were taken using the BioRad ChemiDoc MP imaging system.

2.4. Synteny Analysis to Confirm the Conservation of AMCR-MyoDau and CV.10-MyoDau

As a template, we extracted the three nucleotide sequences of the transcriptome-refined exons of AMCR-MyoDau and the authentic CV.10-MyoDau region from the Myotis daubentonii genome and searched them in all other selected bat genomes (Table 1) using blastn v2.15.0+ (E-value < 10 6 ) [67]. Since CV.10-MyoDau is part of exon 3 of AMCR-MyoDau, the synteny analysis of this EVE is similar to the synteny of AMCR-MyoDau. We conducted a synteny analysis for all blastn hits using either the annotations from the reference genomes or the built liftovers from the bat genomes without appropriate references. For this purpose, we recorded all genes within 1 million nucleotides in the downstream and upstream directions; for annotations of all surrounding genes, see https://www.doi.org/10.17605/OSF.IO/89EF2. The distances between the genes and the gene lengths were logarithmically transformed for visualization. A table containing all nucleotide blastn hits from the three exons of AMCR-MyoDau and CV.10-MyoDau, and their associated synteny is available in the https://www.doi.org/10.17605/OSF.IO/89EF2 as a feature file in GFF format and a human- and machine-readable plain text file.
Additionally, we provide truncated versions of these files containing only the blastn hits associated with a curated subset of syntenies.

2.5. Identification and Further Investigation of the Previously Described EBLL-IG

We refer to the homologous family of identified genes derived from AMCR-MyDau as AMCR. Due to numerous hits beyond AMCR’s synteny, we created a coverage plot of exon 3 for each synteny unit of all blastn hits to explore additional EVEs; see Figure S2. The coverage plot was created for exon 3 on the nucleotide level, as only exon 3 exclusively overlapped with CV.10-MyoDau. This plot unveiled another conserved EVE (sequence NC_081852.1 indices 16,169,651–16,170,474), as previously described in the literature as EBLL intergenus (EBLL-IG) [68]. With the resulting EBLL-IG blastn hit from exon 3 in Myotis daubentonii, additional searches were conducted using blastn, which are similar to exons 1–3 of AMCR-MyoDau, followed by a synteny analysis of the additional hits for the FASTA template and the blastn results; see https://www.doi.org/10.17605/OSF.IO/89EF2. Given the similarity between the templates of exon 3 of AMCR-MyoDau and EBLL-IG, it is possible that blastn hits for both templates could be found in the same source region. In this scenario, we displayed only the longer hit in the synteny plot.

2.6. Multiple Sequence Alignments of the Three Exons of AMCR in Various Bat Species

For the three AMCR exons identified in other bat species through synteny confirmation, we generated multiple nucleotide sequence alignments using MAFFT v7.520 employing pairwise alignments with Smith–Waterman algorithm (L-INS-i strategy, parameters: --auto, --localpair. --maxiterate 1000) [69]. For exon 3, a more advanced alignment procedure was required. We took the nucleotide sequences from all individual synteny-confirmed exon 3 blastn hits and aligned them against the exon 3 template from Myotis daubentonii using the Smith–Waterman algorithm implemented in MAFFT (L-INS-fragment strategy, parameters: --localpair, --maxiterate 1000, --addfragments). This study defined sequences in the alignment belonging to the same contig within a distance of fewer than 2000 nucleotides as related. To preserve the alignment, their corresponding sequences were concatenated using three ambiguous nucleotide bases (`NNN’). If neighboring hits in the respective bat overlapped in the query, specific nucleotides were eliminated, and the sequences were concatenated with the NNN block. The alignments are available as FASTA files with gaps indicated by a “ -” character in the https://www.doi.org/10.17605/OSF.IO/89EF2. The concatenated alignment of the three AMCR exons was visualized using CIAlign (parameter: --visualise) [70].

3. Results

3.1. A Novel EVE and Its Antisense Transcription in Myotis daubentonii: EBLL-Cultervirus.10-MyoDau and AMCR-MyoDau

After an extensive literature screening, to the best of our knowledge, we present a previously unreported EBLL locus in Myotis daubentonii, which we call EBLL-Cultervirus.10-MyoDau (short name is CV.10-MyoDau), originating from the L gene of a Cultervirus (YP_010085030.1) [45,47,48,68,71,72] (Figure 1). The CV.10-MyoDau is situated on the plus strand on chromosome 5 of Myotis daubentonii (NC_081844.1: 39,922,667–39,923,434). Three directly overlapping sequences with the L protein as query were identified using tblastn, with an E value of 1.57 × 10 34 . These sequences resulted in a protein alignment that is 268 amino acids long between the L protein and CV.10-MyoDau, thus requiring two frameshifts. The longest of the three sequences comprises 176 amino acids, but it contains several stop codons, which suggests compromised functionality; see Figure 1.
To investigate a possible transcription of this novel EBLL locus in Myotis daubentonii, we reanalyzed the RNA-Seq data from a previous study [23]. Here, we had previously examined the transcriptional landscape of a virus- and an interferon alpha-induced Myotis daubentonii cell line at two time points postinfection (6 h and 24 h). Thus, these data set were well suited for rescreening for potential EBLL-derived transcripts, especially since the sequencing was performed in a strand-specific manner, thus allowing us to detect whether transcripts originated from the plus or minus strand. Suprisingly, we identified a transcript on the minus strand consisting of three exons; see Figure 1. This has been identified as an antisense EVE, which is henceforward referred to as AMCR-MyoDau. Collectively, these three exons span a genomic length of 1903 nucleotides, which are distributed across a 6 kb genome region. Only a specific segment of exon 3 overlaps with CV.10-MyoDau; logically, only this part shares similarities with the L protein of the Wuhan sharpbelly bornavirus. This implies that there is only partial overlap between AMCR-MyoDau and CV.10-MyoDau; see Figure 1.

3.2. AMCR-MyoDau Shows Weak but Constant Expression in a Myotis daubentonii Cell Line under Mock, Interferon-Induced, and Virus-Infected Conditions

Next, we used the RNA-Seq data to refine the exon boundaries of CV.10-MyoDau and to investigate the expression levels. AMCR-MyoDau showed a shallow but constant expression, with TPM values between 4.3 and 9.5 in all the investigated RNA-Seq samples and conditions. This was confirmed using strand-specific RNA-Seq analysis (Figure 2). Random-primed RT-PCR confirmed the RNA expression through an independent method (Figure 3). We generated a coverage plot for all samples, where the sum of all mapped reads was scaled by 10, which also formed the basis of defining the exon boundaries for AMCR-MyoDau in Myotis daubentonii; see Figure 2. After further refinement employing spliced reads, we obtained the exon features at positions 39,922,089–39,923,568 (exon 3), 39,924,253–39,924,371 (exon 2), and 39,929,747–39,930,053 (exon 1). Please note the inverse order of the exon numbering due to the identification of AMCR-MyoDau on the minus strand (5′ to 3′); see Figure 1. Additionally, we found AMCR-MyoDau to be significantly (adjusted p value < 0.05) differentially expressed in four pairwise comparisons; see https://www.doi.org/10.17605/OSF.IO/89EF2. The comparison 24 h Mock vs. 24 h Clone13 showed the highest log2-fold change of −0.93, thus indicating the downregulation of AMCR-MyoDau. We hypothesize that the decline observed after 24 h for Clone13 is most likely due to cap snatching, which involves the cleaving off of host RNA 5′ caps by the viral polymerase, thus leading to host cell shutoff [73].
We investigated further whether AMCR-MyoDau is transcribed using RT-PCR with RNA samples from our Myotis daubentonii cell line [23]. As the RNA-Seq results indicate, the expected bands were detected under all the available conditions, including mock, RVFV Clone13 infection, and IFN-Alpha treatment; see Figure 3. Therefore, AMCR-MyoDau is expressed as RNA in Myotis daubentonii and is thus an antisense transcript of the EVE.

3.3. Synteny Patterns of AMCR and CV.10-MyoDau across Various Bat Species

For a synteny analysis, we first screened the nucleotide sequences of the three exons of AMCR-MyoDau across all listed bats using blastn to find the relevant positions in the bat genomes; see Table 1. As CV.10-MyoDau was found to be part of exon 3, a direct investigation of the synteny of this newly described EVE was also conducted. In total, 311 individual sequences were identified on nucleotide similarity across 42 distinct bat species. After concatenating the related hits nearby, 277 sequences remained. For all these locations, a synteny analysis was performed, and hits from various bats that exhibited the same synteny pattern were grouped together.
In total, we observed 29 different syntenic patterns in the nucleotid context of AMCR-MyoDau and CV.10-MyoDau. All these regions could be further evaluated as potential EVEs. But, there were just two syntenic patterns covering the CV.10-MyoDau region; see Figure 4 and Figure S2. Figure S3 presents all syntenic blocks unrelated to CV.10-MyoDau that are not shown in Figure 4.
The first syntenic pattern we identified has been named AMCR after the transcript. We could identify a consistent syntenic pattern (GALNT7, HMGB2, AMCR, SAP30, and SCRG1) for at least 34 bats. This indicates that the AMCR-MyoDau gene is at least partly conserved in at least 34 other bat species, although we cannot confirm its potential transcription activity. We propose naming these genes as AMCR followed by the specific bat species (e.g., AMCR-EptFus for Eptesicus fuscus). In total, 72 nucleotide sequences (after removing recurring hits in the same source region) were identified. A definitive set comprising 24 sequences for exon 1, 17 for exon 2, and 31 for exon 3, recognized as true positive sequences for AMCR, were selected for subsequent processing. For only 18 bat species within this syntenic pattern, sequence similarities to CV.10-MyoDau were identified. This means that these 18 EVEs are orthologs to CV.10-MyoDau and should be named according to the adjusted ERV nomenclature (e.g., CV.10-EptFus for Eptesicus fuscus) [74]. Out of these 18, 8 of them have already been detected using automated pipelines in Eptesicus fuscus, Antrozous pallidus, Pipistrellus kuhlii, Pipistrellus pipistrellus, Murina aurata feae, Myotis lucifugu, Myotis brandtii, and Myotis myotis [48,72]. These pipelines utilized outdated accessions, thus making the direct transfer a challenge. Another challenge lies in the high sequence similarities among paralogous EBLLs. To ensure that the EVEs found by the pipeline are indeed the same, we examined the genomic context.
We have identified nine additional novel orthologs, alongside our novel CV.10-MyoDau, specifically in the following bat species: Aeorestes cinereus, Corynorhinus townsendii, Plecotus auritus, Ia io, Eptesicus nilssonii, Pipistrellus pygmaeus, Myotis vivesi, Myotis yumanensis, and Myotis ricketti. The specific positions in the respective sequences of the genomes can be found in Table 2 and in the https://www.doi.org/10.17605/OSF.IO/89EF2.
The second syntenic pattern, sharing sequence similarities to CV.10, is the already described EBLL-IG [68]. EBLL-IG sequences were identified as blast hits using the L protein of Wuhan sharpbelly bornavirus, which is a paralog of CV.10-MyoDau. While EBLL-IG has previously exclusively been characterized in Myotis davidii and Eptesicus fuscus to date, our findings unveil its occurrence across 13 bat species in the Vespertilionidae family. Additionally, we have also evaluated conservation at the synteny level. An alignment is available in the https://www.doi.org/10.17605/OSF.IO/89EF2. Based on the sequence similarity, EBLL-IG seems to be a duplication of CV.10-MyoDau for all Vespertilionidae and was deleted in a later evolutionary event for a subset of Myotis species (Myotis lucifugus, Myotis brandtii, Myotis vivesi, and Myotis yumanensis). However, we can only hypothesize about the duplication event and cannot exclude the possibility that independent endogenization events of similar viruses generated the EBL elements.
In addition to the two EVEs presented here (CV.10-MyoDau and EBLL-IG), there are also numerous other EBLLs [21,45,47,48,68,72]. It needs to be determined which EBLL was integrated first or if there were multiple independent integration events.
In the depicted synteny plot of CV.10 (Figure 4), homologs of exon 3 have only been found for Vespertilionidae in the SCRG1 gene (in the purple box). However, in the coverage plot in Figure S2 (purple line), we observed that these sequences do not share any commonality with the L protein. Therefore, they do not represent the EVE itself but a potential further development of the functional unit derived from CV.10. This syntenic block is only interesting because of its proximity to CV.10.
On the other hand, nine homologous L protein samples could be identified. However, they could not be assigned to any synteny, as illustrated in the red-marked box in Figure 4 and as a red line in Figure S2. These nine 600 nt long potential homologous genes with an E value smaller than 6 × 10 26 belong exclusively to Nycticeius humeralis, Lasiurus borealis, Murina aurata feae, and Myotis vivesi, with all of them being members of the Vespertilionidae family. Controversially, these four bat species are phylogenetically not closely related, but their genomes are highly fragmented (see the number of contigs and the N50 value in Table 1). Therefore, the overall relation to the L protein remains questionable.

3.4. Comparative Genome Analysis Shows AMCR Sequence Similarities in 34 Bat Species

Due to synteny similarity, a total of 72 sequences from 34 species were identified as true positive sequences for AMCR (24 sequences for exon 1, 17 for exon 2, and 31 for exon 3). With these sequences, separate nucleotide alignments were built for each exon of AMCR; see the https://www.doi.org/10.17605/OSF.IO/89EF2. We used CIAlign to visualize the nucleotide alignments of all three exons. Figure 5 illustrates the distribution of exons among different bat species, thus providing detailed information about the specific regions and conservation within each exon. Generally, a high conservation can be observed in the aligned nucleotide sequences: the identity of nongap nucleotides with the consensus sequence is 91% in exon 1, 97% in exon 2, 96% in exon 3, and 95% for the combined exons. Furthermore, it is evident that the region of exon 3 of AMCR corresponding to the CV.10 EVE is highly conserved in Vespertilionidae.

3.5. Partial Occurrence of AMCR across Multiple Bat Families and Evolutionary Conservation of CV.10 in Vespertilionidae

The exon structure of AMCR-MyoDau and conservation to the L protein seem to be preserved in the family of Vespertilionidae, as revealed by synteny and sequence similarity analyses (Figure 4 and Figure 5). However, we could not detect the entire exon 3 of AMCR-MyoDau for a few other homologs within Yangochiroptera, and no matches were found for Yinpterochiroptera. This suggests that the L protein part integrated only after the divergence of the Vespertilionidae approximately 36–47 mio years ago, but it would have been before further branching inside the Vespertilionidae family 16–20 mio years ago [60,75]. CV.10 was not found in four Vespertilionidae species (Myotis davidii, Lasiurus borealis, Nycticeius humeralis, and Myotis vivesi). In Myotis davidii, exon 1, exon 2, and parts of exon 3 of AMCR-MyoDav were identified. However, Myotis davidii’s assembly quality is poor, with over 100 thousand contigs and an N50 value of 3.5 Mb. In the other three bat species (Lasiurus borealis, Nycticeius humeralis, and Myotis vivesi), sequences were found with CV.10, but these were not confirmed by synteny, thus likely due to relatively poor assembly quality with very high contig numbers and small N50 values; see Table 1.
In addition, the presence of exon 1 in nearly all bat species implies that this functional segment likely predated the divergence of Yangochiroptera and Yinpterochiroptera.

4. Discussion

Bats (Chiroptera) are renowned for their ecological diversity and unique traits—including resistance to pathogenic viruses—and they remain understudied despite their potential as viral disease reservoirs. The recent emergence of SARS-CoV-2 highlights the urgent need to comprehend bat biology and immunity.
Our discovery of the novel endogenous bornavirus-like element (CV.10-MyoDau) in Myotis daubentonii bats, with conserved nucleotide and protein sequences, provides insight into the intricate relationship between bats and viruses. Surprisingly, we found an antisense transcript, named AMCR-MyoDau, comprising three exons, as confirmed by RNA-Seq and RT-PCR analysis in Myotis daubentonii tissues. In the bat Myotis daubentonii, the causality between the transcription originating from the minus strand and the conservation of the L protein situated on the plus strand is uncertain. The functions of CV.10-MyoDau or AMCR-MyoDau remain speculative at this time. This represents the first instance of a conserved EVE and accompanying antisense transcript among diverse bat species, thus underlining the need for further investigation into the role of EVEs in bat immunity evolution. The three exon sequences of AMCR-MyoDau, as well as the region of CV.10, are preserved in Vespertilionidae. In total, there are 17 orthologs of CV.10-MyoDau, which we confirmed through synteny analysis, with 9 of these orthologs being novel.
Future studies should prioritize unraveling the complexities of bat immune responses, thus considering the potential role of transcribed EVEs in influencing bat immunity against viral infections. The observed virus-tolerant phenotype in bats may be associated with specific adaptations, thereby possibly manifested by the diversity and abundance of various EVEs described in these flying mammals.
However, one primary challenge in the still-young field of EVE research is the absence of a standardized database and widely used nomenclature for their cataloging. Currently, there is no centralized repository documenting previously identified EVEs, thus complicating the process of verifying whether a newly discovered EVE is truly novel. As researchers who frequently search for novel EVEs, we experience this challenge firsthand, thus underscoring the urgent need for a collaborative initiative to establish such a repository.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/v16081210/s1. Figure S1: The phylogeny of bats is depicted, delineating the two suborders Yangochiroptera andYinpterochiroptera; Figure S2: All blastn hits from Exon 3 in the various bat species were mapped back to the original Exon 3 sequence of Myotis daubentonii (the reference sequence); Figure S3: Here, the missing 26 synteny blocks are depicted, which were only hinted at in Figure 4, along with all the hits for which no synteny was found.

Author Contributions

Data curation, M.R. and T.E.; Formal analysis, M.R., T.E., A.S. and M.H.; Funding acquisition, M.M.; Investigation, M.R. and M.H.; Methodology, M.R., T.E., K.L. and A.S.; Supervision, F.W., M.H. and M.M.; Validation, M.R.; Visualization, M.R. and T.E.; Writing—original draft, M.R. and T.E.; Writing—review and editing, K.L., A.S., F.W., M.H. and M.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the TMWWDG grant “DigLeben” number 5575/10-9 TMWBDG; by the Deutsche Forschungsgemeinschaft (DFG—German Research Foundation) under Germany’s Excellence Strategy—EXC 2051—Project ID 390713860; M.M. is supported by the European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie Actions Innovative Training Networks through grant no. 955974 (VIROINF) and the Werkstatt project of the Carl Zeiss Stiftung.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are contained within the article or the Supplementary Materials. The data presented in this study are available in the Supplementary Materials through the Open Science Framework at https://www.doi.org/10.17605/OSF.IO/89EF2.

Acknowledgments

We utilized ChatGPT (May 24 version) to assist in generating initial drafts and refining the language of this manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Calisher, C.H.; Childs, J.E.; Field, H.E.; Holmes, K.V.; Schountz, T. Bats: Important reservoir hosts of emerging viruses. Clin. Microbiol. Rev. 2006, 19, 531–545. [Google Scholar] [CrossRef]
  2. Teeling, E.C.; Vernes, S.C.; Dávalos, L.M.; Ray, D.A.; Gilbert, M.T.P.; Myers, E.; Consortium, B. Bat biology, genomes, and the Bat1K Project: To generate Chromosome-Level genomes for all living bat species. Annu. Rev. Anim. Biosci. 2018, 6, 23–46. [Google Scholar] [CrossRef]
  3. Simmons, N.B. An Eocene big bang for bats. Science 2005, 307, 527–528. [Google Scholar] [CrossRef]
  4. Prat, Y.; Azoulay, L.; Dor, R.; Yovel, Y. Crowd vocal learning induces vocal dialects in bats: Playback of conspecifics shapes fundamental frequency usage by pups. PLoS Biol. 2017, 15, e2002556. [Google Scholar] [CrossRef]
  5. Huang, Z.; Jebb, D.; Teeling, E.C. Blood miRNomes and transcriptomes reveal novel longevity mechanisms in the long-lived bat, Myotis myotis. BMC Genom. 2016, 17, 906. [Google Scholar] [CrossRef]
  6. Foley, N.M.; Hughes, G.M.; Huang, Z.; Clarke, M.; Jebb, D.; Whelan, C.V.; Petit, E.J.; Touzalin, F.; Farcy, O.; Jones, G.; et al. Growing old, yet staying young: The role of telomeres in bats’ exceptional longevity. Sci. Adv. 2018, 4, eaao0926. [Google Scholar] [CrossRef] [PubMed]
  7. Huang, Z.; Whelan, C.V.; Foley, N.M.; Jebb, D.; Touzalin, F.; Petit, E.J.; Puechmaille, S.J.; Teeling, E.C. Longitudinal comparative transcriptomics reveals unique mechanisms underlying extended healthspan in bats. Nat. Ecol. Evol. 2019, 3, 1110–1120. [Google Scholar] [CrossRef] [PubMed]
  8. Smith, I.; Wang, L.F. Bats and their virome: An important source of emerging viruses capable of infecting humans. Curr. Opin. Virol. 2013, 3, 84–91. [Google Scholar] [CrossRef] [PubMed]
  9. Banerjee, A.; Baker, M.L.; Kulcsar, K.; Misra, V.; Plowright, R.; Mossman, K. Novel insights into immune systems of bats. Front. Immunol. 2020, 11, 26. [Google Scholar] [CrossRef]
  10. Olival, K.J.; Hosseini, P.R.; Zambrana-Torrelio, C.; Ross, N.; Bogich, T.L.; Daszak, P. Host and viral traits predict zoonotic spillover from mammals. Nature 2017, 546, 646–650. [Google Scholar] [CrossRef]
  11. Wang, L.F.; Walker, P.J.; Poon, L.L. Mass extinctions, biodiversity and mitochondrial function: Are bats “special” as reservoirs for emerging viruses? Curr. Opin. Virol. 2011, 1, 649–657. [Google Scholar] [CrossRef]
  12. Brook, C.E.; Dobson, A.P. Bats as “special” reservoirs for emerging zoonotic pathogens. Trends Microbiol. 2015, 23, 172–180. [Google Scholar] [CrossRef] [PubMed]
  13. Hölzer, M.; Krähling, V.; Amman, F.; Barth, E.; Bernhart, S.H.; Carmelo, V.A.O.; Collatz, M.; Doose, G.; Eggenhofer, F.; Ewald, J.; et al. Differential transcriptional responses to Ebola and Marburg virus infection in bat and human cells. Sci. Rep. 2016, 6, 34589. [Google Scholar] [CrossRef] [PubMed]
  14. Anthony, S.J.; Johnson, C.K.; Greig, D.J.; Kramer, S.; Che, X.; Wells, H.; Hicks, A.L.; Joly, D.O.; Wolfe, N.D.; Daszak, P.; et al. Global patterns in coronavirus diversity. Virus Evol. 2017, 3. [Google Scholar] [CrossRef] [PubMed]
  15. Andersen, K.G.; Rambaut, A.; Lipkin, W.I.; Holmes, E.C.; Garry, R.F. The proximal origin of SARS-CoV-2. Nat. Med. 2020, 26, 450–452. [Google Scholar] [CrossRef]
  16. Wu, Z.; Han, Y.; Wang, Y.; Liu, B.; Zhao, L.; Zhang, J.; Su, H.; Zhao, W.; Liu, L.; Bai, S.; et al. A comprehensive survey of bat sarbecoviruses across China in relation to the origins of SARS-CoV and SARS-CoV-2. Natl. Sci. Rev. 2023, 10, nwac213. [Google Scholar] [CrossRef] [PubMed]
  17. Papenfuss, A.T.; Baker, M.L.; Feng, Z.P.; Tachedjian, M.; Crameri, G.; Cowled, C.; Ng, J.; Janardhana, V.; Field, H.E.; Wang, L.F. The immune gene repertoire of an important viral reservoir, the Australian black flying fox. BMC Genom. 2012, 13, 261. [Google Scholar] [CrossRef] [PubMed]
  18. Power, M.L.; Foley, N.M.; Jones, G.; Teeling, E.C. Taking flight: An ecological, evolutionary and genomic perspective on bat telomeres. Mol. Ecol. 2022, 31, 6053–6068. [Google Scholar] [CrossRef] [PubMed]
  19. Luis, A.D.; Hayman, D.T.; O’Shea, T.J.; Cryan, P.M.; Gilbert, A.T.; Pulliam, J.R.; Mills, J.N.; Timonin, M.E.; Willis, C.K.; Cunningham, A.A.; et al. A comparison of bats and rodents as reservoirs of zoonotic viruses: Are bats special? Proc. R. Soc. B Biol. Sci. 2013, 280, 20122753. [Google Scholar] [CrossRef]
  20. Wang, L.F.; Anderson, D.E. Viruses in bats and potential spillover to animals and humans. Curr. Opin. Virol. 2019, 34, 79–89. [Google Scholar] [CrossRef]
  21. Skirmuntt, E.C.; Escalera-Zamudio, M.; Teeling, E.C.; Smith, A.; Katzourakis, A. The potential role of endogenous viral elements in the evolution of bats as reservoirs for zoonotic viruses. Annu. Rev. Virol. 2020, 7, 103–119. [Google Scholar] [CrossRef] [PubMed]
  22. O’shea, T.J.; Cryan, P.M.; Cunningham, A.A.; Fooks, A.R.; Hayman, D.T.; Luis, A.D.; Peel, A.J.; Plowright, R.K.; Wood, J.L. Bat flight and zoonotic viruses. Emerg. Infect. Dis. 2014, 20, 741. [Google Scholar] [CrossRef] [PubMed]
  23. Hölzer, M.; Schoen, A.; Wulle, J.; Müller, M.A.; Drosten, C.; Marz, M.; Weber, F. Virus-and interferon alpha-induced transcriptomes of cells from the microbat Myotis daubentonii. Iscience 2019, 19, 647–661. [Google Scholar] [CrossRef]
  24. Pavlovich, S.S.; Lovett, S.P.; Koroleva, G.; Guito, J.C.; Arnold, C.E.; Nagle, E.R.; Kulcsar, K.; Lee, A.; Thibaud-Nissen, F.; Hume, A.J.; et al. The Egyptian rousette genome reveals unexpected features of bat antiviral immunity. Cell 2018, 173, 1098–1110. [Google Scholar] [CrossRef] [PubMed]
  25. Hölzer, M.; Marz, M. De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers. GigaScience 2019, 8, giz039. [Google Scholar] [CrossRef] [PubMed]
  26. Jayaprakash, A.D.; Ronk, A.J.; Prasad, A.N.; Covington, M.F.; Stein, K.R.; Schwarz, T.M.; Hekmaty, S.; Fenton, K.A.; Geisbert, T.W.; Basler, C.F.; et al. Filovirus infection induces an anti-inflammatory state in Rousettus bats. bioRxiv 2020. [Google Scholar] [CrossRef]
  27. Subudhi, S.; Rapin, N.; Misra, V. Immune system modulation and viral persistence in bats: Understanding viral spillover. Viruses 2019, 11, 192. [Google Scholar] [CrossRef]
  28. Cowled, C.; Stewart, C.R.; Likic, V.A.; Friedländer, M.R.; Tachedjian, M.; Jenkins, K.A.; Tizard, M.L.; Cottee, P.; Marsh, G.A.; Zhou, P.; et al. Characterisation of novel microRNAs in the Black flying fox (Pteropus alecto) by deep sequencing. BMC Genom. 2014, 15, 682. [Google Scholar] [CrossRef]
  29. Mostajo Berrospi, N.; Lataretu, M.; Krautwurst, S.; Mock, F.; Desirò, D.; Lamkiewicz, K.; Collatz, M.; Schoen, A.; Weber, F.; Marz, M.; et al. A comprehensive annotation and differential expression analysis of short and long non-coding RNAs in 16 bat genomes. NAR Genom. Bioinform. 2019, 2, lqz006. [Google Scholar]
  30. Edwards, M.R.; Liu, H.; Shabman, R.S.; Ginell, G.M.; Luthra, P.; Ramanan, P.; Keefe, L.J.; Köllner, B.; Amarasinghe, G.K.; Taylor, D.J.; et al. Conservation of structure and immune antagonist functions of filoviral VP35 homologs present in microbat genomes. Cell Rep. 2018, 24, 861–872.e6. [Google Scholar] [CrossRef]
  31. Ophinni, Y.; Palatini, U.; Hayashi, Y.; Parrish, N.F. piRNA-guided CRISPR-like immunity in eukaryotes. Trends Immunol. 2019, 40, 998–1010. [Google Scholar] [CrossRef]
  32. Feschotte, C.; Gilbert, C. Endogenous viruses: Insights into viral evolution and impact on host biology. Nat. Rev. Genet. 2012, 13, 283–296. [Google Scholar] [CrossRef] [PubMed]
  33. Belyi, V.A.; Levine, A.J.; Skalka, A.M. Unexpected inheritance: Multiple integrations of ancient bornavirus and ebolavirus/marburgvirus sequences in vertebrate genomes. PLoS Pathog. 2010, 6, e1001030. [Google Scholar] [CrossRef]
  34. Weiss, R. Spontaneous virus production from “non-virus producing” Rous sarcoma cells. Virology 1967, 32, 719–723. [Google Scholar] [CrossRef]
  35. Johnson, W.E. Origins and evolutionary consequences of ancient endogenous retroviruses. Nat. Rev. Microbiol. 2019, 17, 355–370. [Google Scholar] [CrossRef]
  36. Bellas, C.; Hackl, T.; Plakolb, M.S.; Koslová, A.; Fischer, M.G.; Sommaruga, R. Large-scale invasion of unicellular eukaryotic genomes by integrating DNA viruses. Proc. Natl. Acad. Sci. USA 2023, 120, e2300465120. [Google Scholar] [CrossRef]
  37. Klenerman, P.; Hengartner, H.; Zinkernagel, R.M. A non-retroviral RNA virus persists in DNA form. Nature 1997, 390, 298–301. [Google Scholar] [CrossRef]
  38. Taylor, D.J.; Bruenn, J. The evolution of novel fungal genes from non-retroviral RNA viruses. BMC Biol. 2009, 7, 88. [Google Scholar] [CrossRef]
  39. Horie, M.; Honda, T.; Suzuki, Y.; Kobayashi, Y.; Daito, T.; Oshida, T.; Ikuta, K.; Jern, P.; Gojobori, T.; Coffin, J.M.; et al. Endogenous non-retroviral RNA virus elements in mammalian genomes. Nature 2010, 463, 84–87. [Google Scholar] [CrossRef] [PubMed]
  40. Horie, M.; Tomonaga, K. Non-retroviral fossils in vertebrate genomes. Viruses 2011, 3, 1836–1848. [Google Scholar] [CrossRef]
  41. Fort, P.; Albertini, A.; Van-Hua, A.; Berthomieu, A.; Roche, S.; Delsuc, F.; Pasteur, N.; Capy, P.; Gaudin, Y.; Weill, M. Fossil rhabdoviral sequences integrated into arthropod genomes: Ontogeny, evolution, and potential functionality. Mol. Biol. Evol. 2012, 29, 381–390. [Google Scholar] [CrossRef]
  42. Takahashi, H.; Fukuhara, T.; Kitazawa, H.; Kormelink, R. Virus latency and the impact on plants. Front. Microbiol. 2019, 10, 2764. [Google Scholar] [CrossRef]
  43. Kapusta, A.; Suh, A. Evolution of bird genomes—A transposon’s-eye view. Ann. N. Y. Acad. Sci. 2017, 1389, 164–185. [Google Scholar] [CrossRef]
  44. Wallau, G.L. RNA virus EVEs in insect genomes. Curr. Opin. Insect Sci. 2022, 49, 42–47. [Google Scholar] [CrossRef]
  45. Horie, M.; Kobayashi, Y.; Suzuki, Y.; Tomonaga, K. Comprehensive analysis of endogenous bornavirus-like elements in eukaryote genomes. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2013, 368, 20120499. [Google Scholar] [CrossRef]
  46. Afonso, C.L.; Amarasinghe, G.K.; Bányai, K.; Bào, Y.; Basler, C.F.; Bavari, S.; Bejerman, N.; Blasdell, K.R.; Briand, F.X.; Briese, T.; et al. Taxonomy of the order Mononegavirales: Update 2016. Arch. Virol. 2016, 161, 2351–2360. [Google Scholar] [CrossRef]
  47. Horie, M.; Kobayashi, Y.; Honda, T.; Fujino, K.; Akasaka, T.; Kohl, C.; Wibbelt, G.; Mühldorfer, K.; Kurth, A.; Müller, M.A.; et al. An RNA-dependent RNA polymerase gene in bat genomes derived from an ancient negative-strand RNA virus. Sci. Rep. 2016, 6, 25873. [Google Scholar] [CrossRef]
  48. Kawasaki, J.; Kojima, S.; Mukai, Y.; Tomonaga, K.; Horie, M. 100-My history of bornavirus infections hidden in vertebrate genomes. Proc. Natl. Acad. Sci. USA 2021, 118, e2026235118. [Google Scholar] [CrossRef]
  49. Garcia, B.C.B.; Mukai, Y.; Tomonaga, K.; Horie, M. The hidden diversity of ancient bornaviral sequences from X and P genes in vertebrate genomes. Virus Evol. 2023, 9, vead038. [Google Scholar] [CrossRef]
  50. Mukai, Y.; Horie, M.; Kojima, S.; Kawasaki, J.; Maeda, K.; Tomonaga, K. An endogenous bornavirus-like nucleoprotein in miniopterid bats retains the RNA-binding properties of the original viral protein. FEBS Lett. 2022, 596, 323–337. [Google Scholar] [CrossRef]
  51. Frank, J.A.; Feschotte, C. Co-option of endogenous viral sequences for host cell function. Curr. Opin. Virol. 2017, 25, 81–89. [Google Scholar] [CrossRef]
  52. Roberts, R.M.; Ezashi, T.; Schulz, L.C.; Sugimoto, J.; Schust, D.J.; Khan, T.; Zhou, J. Syncytins expressed in human placental trophoblast. Placenta 2021, 113, 8–14. [Google Scholar] [CrossRef]
  53. Lokossou, A.G.; Toudic, C.; Barbeau, B. Implication of human endogenous retrovirus envelope proteins in placental functions. Viruses 2014, 6, 4609–4627. [Google Scholar] [CrossRef]
  54. Boyd, M.T.; Bax, C.M.; Bax, B.E.; Bloxam, D.L.; Weiss, R.A. The human endogenous retrovirus ERV-3 is upregulated in differentiating placental trophoblast cells. Virology 1993, 196, 905–909. [Google Scholar] [CrossRef]
  55. Venables, P.J.; Brookes, S.M.; Griffiths, D.; Weiss, R.A.; Boyd, M.T. Abundance of an endogenous retroviral envelope protein in placental trophoblasts suggests a biological function. Virology 1995, 211, 589–592. [Google Scholar] [CrossRef]
  56. Robertson, W.; Warner, B. The ultrastructure of the human placental bed. J. Pathol. 1974, 112, 203–211. [Google Scholar] [CrossRef]
  57. Hertig, A.T. The primary human oocyte: Some observations on the fine structure of Balbiani’s vitelline body and the origin of the annulate lamellae. Am. J. Anat. 1968, 122, 107–137. [Google Scholar] [CrossRef]
  58. NCBI, R.C. Database Resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2017, 45, D12. [Google Scholar]
  59. Shumate, A.; Salzberg, S.L. Liftoff: Accurate mapping of gene annotations. Bioinformatics 2021, 37, 1639–1643. [Google Scholar] [CrossRef]
  60. Agnarsson, I.; Zambrana-Torrelio, C.M.; Flores-Saldana, N.P.; May-Collado, L.J. A time-calibrated species-level phylogeny of bats (Chiroptera, Mammalia). PLoS Curr. 2011, 3. [Google Scholar] [CrossRef]
  61. Kim, D.; Langmead, B.; Salzberg, S.L. HISAT: A fast spliced aligner with low memory requirements. Nat. Methods 2015, 12, 357–360. [Google Scholar] [CrossRef]
  62. Kim, D.; Paggi, J.M.; Park, C.; Bennett, C.; Salzberg, S.L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 2019, 37, 907–915. [Google Scholar] [CrossRef]
  63. Thorvaldsdóttir, H.; Robinson, J.T.; Mesirov, J.P. Integrative Genomics Viewer (IGV): High-performance genomics data visualization and exploration. Brief. Bioinform. 2013, 14, 178–192. [Google Scholar] [CrossRef]
  64. Lataretu, M.; Hölzer, M. RNAflow: An effective and simple RNA-seq differential gene expression pipeline using nextflow. Genes 2020, 11, 1487. [Google Scholar] [CrossRef]
  65. Love, M.I.; Huber, W.; Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014, 15, 550. [Google Scholar] [CrossRef]
  66. Untergasser, A.; Cutcutache, I.; Koressaar, T.; Ye, J.; Faircloth, B.C.; Remm, M.; Rozen, S.G. Primer3—New capabilities and interfaces. Nucleic Acids Res. 2012, 40, e115. [Google Scholar] [CrossRef]
  67. Stephen, F.A. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar]
  68. Mukai, Y.; Horie, M.; Tomonaga, K. Systematic estimation of insertion dates of endogenous bornavirus-like elements in vesper bats. J. Vet. Med. Sci. 2018, 80, 1356–1363. [Google Scholar] [CrossRef]
  69. Yamada, K.D.; Tomii, K.; Katoh, K. Application of the MAFFT sequence alignment program to large data—Reexamination of the usefulness of chained guide trees. Bioinformatics 2016, 32, 3246–3251. [Google Scholar] [CrossRef]
  70. Tumescheit, C.; Firth, A.E.; Brown, K. CIAlign: A highly customisable command line tool to clean, interpret and visualise multiple sequence alignments. PeerJ 2022, 10, e12983. [Google Scholar] [CrossRef]
  71. Cui, J.; Wang, L.F. Genomic mining reveals deep evolutionary relationships between Bornaviruses and bats. Viruses 2015, 7, 5792–5800. [Google Scholar] [CrossRef] [PubMed]
  72. Blanco-Melo, D.; Campbell, M.A.; Zhu, H.; Dennis, T.P.; Modha, S.; Lytras, S.; Hughes, J.; Gatseva, A.; Gifford, R.J. A novel approach to exploring the dark genome and its application to mapping of the vertebrate virus fossil record. Genome Biol. 2024, 25, 120. [Google Scholar] [CrossRef] [PubMed]
  73. Olschewski, S.; Cusack, S.; Rosenthal, M. The cap-snatching mechanism of bunyaviruses. Trends Microbiol. 2020, 28, 293–303. [Google Scholar] [CrossRef]
  74. Gifford, R.J.; Blomberg, J.; Coffin, J.M.; Fan, H.; Heidmann, T.; Mayer, J.; Stoye, J.; Tristem, M.; Johnson, W.E. Nomenclature for endogenous retrovirus (ERV) loci. Retrovirology 2018, 15, 59. [Google Scholar] [CrossRef]
  75. Teeling, E.C.; Springer, M.S.; Madsen, O.; Bates, P.; O’brien, S.J.; Murphy, W.J. A molecular phylogeny for bats illuminates biogeography and the fossil record. Science 2005, 307, 580–584. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Top: A schematic illustration is provided for the newly described EBLL-Cultervirus.10-MyoDau (CV.10-MyoDau) alongside the L protein encoded by the RNA-dependent RNA polymerase (RdRP) of Wuhan sharpbelly bornavirus. CV.10-MyoDau is located on the plus strand. There is also an antisense transcript comprising three exons on the minus strand, which is named AMCR-MyoDau. Bottom: Amino acid sequence alignments of the L protein of Wuhan sharpbelly bornavirus and CV.10-MyoDau: The red boxes indicate frameshifts. In the amino acid sequences, “-” represents gaps in the alignment, and “*” denotes stop codons. Below the two amino acid sequences, “*” indicates identical amino acids, while “+” signifies similar amino acids according to the BLOSUM62 matrix.
Figure 1. Top: A schematic illustration is provided for the newly described EBLL-Cultervirus.10-MyoDau (CV.10-MyoDau) alongside the L protein encoded by the RNA-dependent RNA polymerase (RdRP) of Wuhan sharpbelly bornavirus. CV.10-MyoDau is located on the plus strand. There is also an antisense transcript comprising three exons on the minus strand, which is named AMCR-MyoDau. Bottom: Amino acid sequence alignments of the L protein of Wuhan sharpbelly bornavirus and CV.10-MyoDau: The red boxes indicate frameshifts. In the amino acid sequences, “-” represents gaps in the alignment, and “*” denotes stop codons. Below the two amino acid sequences, “*” indicates identical amino acids, while “+” signifies similar amino acids according to the BLOSUM62 matrix.
Viruses 16 01210 g001
Figure 2. Left: Expression box plot of AMCR-MyoDau. The plot shows the TPM values for each condition and biological replicate. Graphs show mean values and standard deviations from the three biological replicates per condition. Right: Coverage plot illustrating the accumulated read mapping to the Myotis daubentonii contig NC_081844.1. The gray lines represent the read counts for each of the 18 samples. The blue line depicts the sum of reads across all samples, which is scaled by a factor of 0.1.
Figure 2. Left: Expression box plot of AMCR-MyoDau. The plot shows the TPM values for each condition and biological replicate. Graphs show mean values and standard deviations from the three biological replicates per condition. Right: Coverage plot illustrating the accumulated read mapping to the Myotis daubentonii contig NC_081844.1. The gray lines represent the read counts for each of the 18 samples. The blue line depicts the sum of reads across all samples, which is scaled by a factor of 0.1.
Viruses 16 01210 g002
Figure 3. Top: Schematic figure of the three exons of AMCR-MyoDau, with EBLL-Cultervirus.10-MyoDau (CV.10-MyoDau) highlighted in purple, thus showing the primer pairs and positions used for RT-PCR analysis. Bottom: The expression of AMCR-MyoDau RNA in the Myotis daubentonii cell line is shown. The forward primer sequence is CTCCCTTGAGGAGTGTGACC, and the reverse primer sequence is GGCTGTCAGCAACAGTTTCA, thus targeting exon 3. Transcription of AMCR-MyoDau was observed under all available conditions, with no discernible impact on superinfection with RVFV Clone13 or IFN treatment. The no RT control consists of the Clone13 24 h RNA sample that underwent the RT step without adding primers.
Figure 3. Top: Schematic figure of the three exons of AMCR-MyoDau, with EBLL-Cultervirus.10-MyoDau (CV.10-MyoDau) highlighted in purple, thus showing the primer pairs and positions used for RT-PCR analysis. Bottom: The expression of AMCR-MyoDau RNA in the Myotis daubentonii cell line is shown. The forward primer sequence is CTCCCTTGAGGAGTGTGACC, and the reverse primer sequence is GGCTGTCAGCAACAGTTTCA, thus targeting exon 3. Transcription of AMCR-MyoDau was observed under all available conditions, with no discernible impact on superinfection with RVFV Clone13 or IFN treatment. The no RT control consists of the Clone13 24 h RNA sample that underwent the RT step without adding primers.
Viruses 16 01210 g003
Figure 4. Syntenic analysis for AMCR ((a) in blue), EBLL-Cultervirus.10 ((a) in pink; short: CV.10), the previously described EBLL-IG (b), and homologs that could not be assigned to any syntenic pattern (c). Gene lengths, gap lengths, and the length of EVE featured are logarithmically scaled. The abbreviations for the bat species are explained in Table 1. The initial sequence queries are the three exons (three arrow types) of AMCR-MyoDau (gray box) and the sequence of the previously described EBLL-IG (NC_081852.1:16,169,651–16,170,474), which exhibited high similarities with exon 3 of AMCR-MyoDau. Pink bars in exon 3 indicate sequence similarity to CV.10, yet such similarities do not definitively signify orthology with CV.10. The syntenic genes of all homologs were analyzed and subsequently grouped into syntenic blocks (colored boxed on the right. The color-coded families of the taxonomic assignments (left) from Agnarsson et al. [60] are complemented with bat species absent in the Agnarsson study (dashed lines). A comprehensive list of all sequences and their synthetic associations is provided in the https://www.doi.org/10.17605/OSF.IO/89EF2 and visualized in Figure S3. Some orthologs, marked with an asterisk (*), were already identified using automated pipelines [48,72]. Bats marked with an asterisk (*) in the EBLL-IG clade have already been described in the literature [68].
Figure 4. Syntenic analysis for AMCR ((a) in blue), EBLL-Cultervirus.10 ((a) in pink; short: CV.10), the previously described EBLL-IG (b), and homologs that could not be assigned to any syntenic pattern (c). Gene lengths, gap lengths, and the length of EVE featured are logarithmically scaled. The abbreviations for the bat species are explained in Table 1. The initial sequence queries are the three exons (three arrow types) of AMCR-MyoDau (gray box) and the sequence of the previously described EBLL-IG (NC_081852.1:16,169,651–16,170,474), which exhibited high similarities with exon 3 of AMCR-MyoDau. Pink bars in exon 3 indicate sequence similarity to CV.10, yet such similarities do not definitively signify orthology with CV.10. The syntenic genes of all homologs were analyzed and subsequently grouped into syntenic blocks (colored boxed on the right. The color-coded families of the taxonomic assignments (left) from Agnarsson et al. [60] are complemented with bat species absent in the Agnarsson study (dashed lines). A comprehensive list of all sequences and their synthetic associations is provided in the https://www.doi.org/10.17605/OSF.IO/89EF2 and visualized in Figure S3. Some orthologs, marked with an asterisk (*), were already identified using automated pipelines [48,72]. Bats marked with an asterisk (*) in the EBLL-IG clade have already been described in the literature [68].
Viruses 16 01210 g004
Figure 5. The schematic nucleotide alignment of AMCR is depicted from all sequences found in the bat genomes (see Table 1) whenever there was a commonality with the synteny to Myotis daubentonii. The exon structure and alignment lengths are depicted above the alignments. Taxonomic assignments derived from Agnarsson et al. are shown on the left margin [60]. Families are color-coded, and bat species not included in the study by Agnarsson et al. are represented with dashed lines in the tree. The abbreviations for the bat species are explained in Table 1.
Figure 5. The schematic nucleotide alignment of AMCR is depicted from all sequences found in the bat genomes (see Table 1) whenever there was a commonality with the synteny to Myotis daubentonii. The exon structure and alignment lengths are depicted above the alignments. Taxonomic assignments derived from Agnarsson et al. are shown on the left margin [60]. Families are color-coded, and bat species not included in the study by Agnarsson et al. are represented with dashed lines in the tree. The abbreviations for the bat species are explained in Table 1.
Viruses 16 01210 g005
Table 1. We selected 63 bat genomes of different assembly qualities. Genomes marked with an asterisk (*) are RefSeq genomes. We introduced a three-letter abbreviation for each bat species and listed, for each assembly, the total number of contigs (#con), how many of them represent complete chromosomes (#chr), the N50 value, and the total assembly size in gigabases (Gb) according to NCBI.
Table 1. We selected 63 bat genomes of different assembly qualities. Genomes marked with an asterisk (*) are RefSeq genomes. We introduced a three-letter abbreviation for each bat species and listed, for each assembly, the total number of contigs (#con), how many of them represent complete chromosomes (#chr), the N50 value, and the total assembly size in gigabases (Gb) according to NCBI.
SpeciesAbb.#Con|#ChrN50SizeNCBI Acc.YearFamily
[Mb][Gb]
Yangochiroptera
Aeorestes cinereusACI2536|035.12.1GCA_011751065.12020Vespertilionidae
Corynorhinus townsendiiCTO182|0177.82.0GCA_026230045.12022Vespertilionidae
Plecotus auritusPAU5570|0186.52.2GCA_963455325.12023Vespertilionidae
Nycticeius humeralisNHU1,676,240|00.0152.8GCA_007922795.12019Vespertilionidae
Ia ioIIO2008|0105.82.1GCA_025583905.12022Vespertilionidae
Eptesicus fuscusEFU48|25102.82.0GCF_027574615.1 *2023Vespertilionidae
Eptesicus nilssoniiENI726|0102.42.0GCA_030846915.12023Vespertilionidae
Lasiurus borealisLBO518,900|00.0392.9GCA_004026805.12019Vespertilionidae
Antrozous pallidusAPA93|23114.62.1GCA_027563665.12023Vespertilionidae
Pipistrellus kuhliiPKU202|080.21.8GCF_014108245.1 *2020Vespertilionidae
Pipistrellus pipistrellusPIP323|094.91.8GCA_903992545.12020Vespertilionidae
Pipistrellus pygmaeusPPY243|089.51.9GCA_949987585.12023Vespertilionidae
Murina aurata feaeMAU880,177|00.0262.3GCA_004026665.12019Vespertilionidae
Myotis lucifuguMLU11,654|04.32.0GCF_000147115.1 *2010Vespertilionidae
Myotis brandtiiMBR169,750|03.22.1GCF_000412655.1 *2013Vespertilionidae
Myotis vivesiMVI64,503|091.82.1GCA_035771395.12024Vespertilionidae
Myotis yumanensisMYU476|099.12.0GCA_028538775.12023Vespertilionidae
Myotis davidiiMDV101,769|03.52.1GCF_000327345.1 *2012Vespertilionidae
Myotis rickettiMRI105|0802.0GCA_036010255.12024Vespertilionidae
Myotis daubentoniiMDA121|23102.22.1GCF_963259705.1 *2023Vespertilionidae
Myotis myotisMMY93|094.42.0GCF_014108235.1 *2020Vespertilionidae
Molossus nigricansMNI146|081.92.4GCA_026936385.12022Molossidae
Molossus alvareziMAL187|0113.92.4GCA_031001765.12023Molossidae
Molossus molossusMMO60|0110.72.3GCF_014108415.1 *2020Molossidae
Tadarida brasiliensisTBR148|25111.12.3GCA_030848825.12023Molossidae
Rhynchonycteris nasoRNA50|0286.12.4GCA_031021685.12023Eallonurida
Sturnira hondurensisSHO25,881|010.22.1GCF_014824575.3 *2022Phyllostomidae
Tonatia saurophilaTSA249,8100.1662.1GCA_004024845.12019Phyllostomidae
Trachops cirrhosusTCI396,519124.52.2GCA_028533065.12023Phyllostomidae
Micronycteris hirsutaMHI550,090|00.0692.3GCA_004026765.12019Phyllostomidae
Carollia perspicillataCPE1,925,339|00.0102.7GCA_004027735.12019Phyllostomidae
Anoura caudiferACU337,255|00.1432.2GCA_004027475.12019Phyllostomidae
Desmodus rotundusDRO573|14160.12.1GCF_022682495.1 *2022Phyllostomidae
Phyllostomus discolorPDI78|17171.72.1GCF_004126475.2 *2020Phyllostomidae
Phyllostomus hastatusPHA534|039.22.1GCF_019186645.2 *2021Phyllostomidae
Macrotus californicusMCA1,128,787|00.0172.2GCA_007922815.12019Phyllostomidae
Artibeus jamaicensisAJA868|022.12.1GCF_021234435.1 *2021Phyllostomidae
Pteronotus parnelliiPPA333|031.52.1GCF_021234165.1 *2021Mormoopidae
Mormoops blainvilleiMBL205,259|00.1562.1GCA_004026545.12019Mormoopidae
Noctilio leporinusNLE298,222|00.1362.1GCA_004026585.12019Noctilionidae
Miniopterus natalensisMNA1269|04.31.8GCF_001595765.1 *2016Miniopteridae
Miniopterus schreibersiiMSC177,620|00.1091.8GCA_004026525.12019Miniopteridae
Yinpterochiroptera
Megaderma lyraMLY1,902,801|00.0722.6GCA_004026885.12019Megadermatidae
Craseonycteris thonglongyaiCTH1,224,256|00.0262.3GCA_004027555.12019Craseonycteridae
Aselliscus stoliczkanusAST191|161622.2GCA_033961575.12023Hipposideridae
Hipposideros pendleburyiHPE28,685|015.42.2GCA_021464545.12022Hipposideridae
Hipposideros armigerHAR7571|02.32.2GCF_001890085.2 *2016Hipposideridae
Hipposideros larvatusHLA69|18185.52.3GCA_031876335.12023Hipposideridae
Rhinolophus ferrumequinumRFE50|0922.1GCA_014108255.12020Rhinolophidae
Cynopterus sphinxCSP181|17145.21.9GCA_030015415.12023Pteropodidae
Cynopterus brachyotisCBR48,006|00.2511.8GCA_009793145.12019Pteropodidae
Macroglossus sobrinusMSO171,985|00.4531.9GCA_004027375.12019Pteropodidae
Pteropus alectoPLA65,598|062.0GCF_000325575.1 *2013Pteropodidae
Pteropus vampyrusPVA36,094|062.2GCF_000151845.1 *2014Pteropodidae
Pteropus giganteusPGI16,113|018.92.0GCF_902729225.1 *2020Pteropodidae
Pteropus rufusPRU469,091|19110.52.1GCA_028533765.12023Pteropodidae
Pteropus pselaphonPPS7513|00.7701.9GCA_014363405.12020Pteropodidae
Eidolon dupreanumEDU1,191,098|17101.62.3GCA_028627145.12023Pteropodidae
Eidolon helvumEHE133,538|00.0281.8GCA_000465285.12013Pteropodidae
Eonycteris spelaeaESP4469|082.0GCA_003508835.12018Pteropodidae
Rousettus madagascariensisRMA1,467,186|1885.82.3GCA_028533395.12023Pteropodidae
Rousettus leschenaultiiRLE8141|032.71.9GCA_015472975.12020Pteropodidae
Rousettus aegyptiacusRAE29|0113.81.9GCF_014176215.1 *2020Pteropodidae
Table 2. For all EBLL-Cultervirus.10 elements (short: CV.10) confirmed by our synteny analysis, the table lists the EVE name, the species, the position, and whether the specific EVE has already been described in the literature.
Table 2. For all EBLL-Cultervirus.10 elements (short: CV.10) confirmed by our synteny analysis, the table lists the EVE name, the species, the position, and whether the specific EVE has already been described in the literature.
Name EVESpeciesNCBI acc.Start-StopLiterature
CV.10-MyoDauMyotis daubentoniiNC_081844.139,922,667–39,923,434
CV.10-AeqCinAeorestes cinereusJAAGEH010000014.112,648,154–12,648,930-
CV.10-CorTowCorynorhinus townsendiiJAPDVU010000006.198,426,944–98,427,716-
CV.10-PleAurPlecotus auritusCAUOHH010000436.1107,056–107,829-
CV.10-IaIoIa ioJAJQQW010000006.173,804,507–73,805,270-
CV.10-EptFusEptesicus fuscusNC_072478.136,387,431–36,388,196[48,72]
CV.10-EptNilEptesicus nilssoniiJAULJE010000005.136,472,464–36,473,231-
CV.10-AntPalAntrozous pallidusCM050516.175,692,535–75,693,314[72]
CV.10-PipKuhPipistrellus kuhliiNW_023425584.164,293,707–64,294,472[72]
CV.10-PipPipPipistrellus pipistrellusLR862361.131,459,988–31,460,753[72]
CV.10-PipPymPipistrellus pygmaeusOX465307.164,495,766–64,496,530-
CV.10-MurAurMurina aurata feaePVJC01025922.114,138–14,917[72]
CV.10-MyoLucMyotis lucifugusNW_005871049.134,243,866–34,244,427[48,72]
CV.10-MyoBraMyotis brandtiiNW_005370908.11,501,243–1,501,804[48,72]
CV.10-MyoVivMyotis vivesiJAWPEG010000011.169,888,777–69,889,333-
CV.10-MyoYumMyotis yumanensisJAPQVT010000005.170,588,059–70,588,617-
CV.10-MyoRicMyotis rickettiJASKON010000005.171,087,272–71,087,392-
CV.10-MyoMyoMyotis myotisNW_023416368.171,971,796–71,972,569[72]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ritsch, M.; Eulenfeld, T.; Lamkiewicz, K.; Schoen, A.; Weber, F.; Hölzer, M.; Marz, M. Endogenous Bornavirus-like Elements in Bats: Evolutionary Insights from the Conserved Riboviral L-Gene in Microbats and Its Antisense Transcription in Myotis daubentonii. Viruses 2024, 16, 1210. https://doi.org/10.3390/v16081210

AMA Style

Ritsch M, Eulenfeld T, Lamkiewicz K, Schoen A, Weber F, Hölzer M, Marz M. Endogenous Bornavirus-like Elements in Bats: Evolutionary Insights from the Conserved Riboviral L-Gene in Microbats and Its Antisense Transcription in Myotis daubentonii. Viruses. 2024; 16(8):1210. https://doi.org/10.3390/v16081210

Chicago/Turabian Style

Ritsch, Muriel, Tom Eulenfeld, Kevin Lamkiewicz, Andreas Schoen, Friedemann Weber, Martin Hölzer, and Manja Marz. 2024. "Endogenous Bornavirus-like Elements in Bats: Evolutionary Insights from the Conserved Riboviral L-Gene in Microbats and Its Antisense Transcription in Myotis daubentonii" Viruses 16, no. 8: 1210. https://doi.org/10.3390/v16081210

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop