Next Article in Journal
Establishment and Expression of Cytokines in a Theileria annulata-Infected Bovine B Cell Line
Previous Article in Journal
Distribution and Evolution of Nonribosomal Peptide Synthetase Gene Clusters in the Ceratocystidaceae
 
 
Erratum published on 10 July 2019, see Genes 2019, 10(7), 520.
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Investigation and Functional Analysis of Sus scrofa RNA Editing Sites across Eleven Tissues

1
Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong, China
2
Department of Pig Genomic Design and Breeding, Agricultural Genome Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Genes 2019, 10(5), 327; https://doi.org/10.3390/genes10050327
Submission received: 6 March 2019 / Revised: 8 April 2019 / Accepted: 17 April 2019 / Published: 30 April 2019
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

:
Recently, the prevalence and importance of RNA editing have been illuminated in mammals. However, studies on RNA editing of pigs, a widely used biomedical model animal, are rare. Here we collected RNA sequencing data across 11 tissues and identified more than 490,000 RNA editing sites. We annotated their biological features, detected flank sequence characteristics of A-to-I editing sites and the impact of A-to-I editing on miRNA–mRNA interactions, and identified RNA editing quantitative trait loci (edQTL). Sus scrofa RNA editing sites showed high enrichment in repetitive regions with a median editing level as 15.38%. Expectedly, 96.3% of the editing sites located in non-coding regions including intron, 3′ UTRs, intergenic, and gene proximal regions. There were 2233 editing sites located in the coding regions and 980 of them caused missense mutation. Our results indicated that to an A-to-I editing site, the adjacent four nucleotides, two before it and two after it, have a high impact on the editing occurrences. A commonly observed editing motif is CCAGG. We found that 4552 A-to-I RNA editing sites could disturb the original binding efficiencies of miRNAs and 4176 A-to-I RNA editing sites created new potential miRNA target sites. In addition, we performed edQTL analysis and found that 1134 edQTLs that significantly affected the editing levels of 137 RNA editing sites. Finally, we constructed PRESDB, the first pig RNA editing sites database. The site provides necessary functions associated with Sus scrofa RNA editing study.

1. Introduction

RNA editing is a posttranscriptional regulatory RNA-processing event (excluding RNA splicing) that increases the diversity of transcriptome by changing nucleotides of RNAs. In mammals, the most common form of RNA editing is A-to-I, catalyzed by the adenosine deaminase that acts on RNA (ADAR) family enzymes which contain three members (ADAR1, ADAR2, and ADAR3) [1,2] and the A-to-I editing can lead to an A-to-G reading of the cDNA molecule [3,4]. In mice, knockout of ADAR1 and ADAR2 (or ADARB1) lead to embryonically and postnatally lethal, respectively [5,6] indicating the importance of RNA editing events in normal physiology.
A-to-I RNA editing sites in coding regions that cause synonymous or nonsynonymous mutations have been found within many RNAs, including glutamate receptor subunits [7,8,9], the G protein-coupled serotonin 2C receptor [10] and the antigenome of the hepatitis delta virus [11,12]. Functional consequences of RNA editing in non-coding regions involve miRNA biogenesis [13], editing of miRNA seed regions [14] or target sequences in mRNA [15] and nuclear retention [16]. Moreover, in humans, RNA editing is associated with diseases including autoimmune disorder Aicardi–Goutières syndrome [17], amyotrophic lateral sclerosis [18], autism [19], viral infection [20], and cancer [21].
With the RNA-Seq technology, RNA editing events have been identified in pigs [22,23]. A spatio-temporal editing study in porcine neural tissues showed editing increased during embryonic development concomitantly with an increase in ADAR2 mRNA level, revealing RNA editing may play essential roles in the development of a swine brain [24].
Although an increasing number of mammal RNA editing databases have been published [25,26,27,28,29], no database contains the Sus scrofa RNA editing sites. The domestic pig (Sus scrofa) is a non-rodent animal model used in biomedical research [30,31,32]. In comparison with traditional rodent models, the pig is more similar to humans than mice in body size, growth, development, immunity, physiology, and metabolism, as well as genetically [33,34,35]. Therefore, in this research we constructed the first RNA editing database for Sus scrofa. We collected 211 RNA-seq datasets across 11 tissues (blood, brain, fat, heart, kidney, liver, lung, muscle, ovary, spleen, and uterus) from the National Center for Biotechnology Information (NCBI). Then, we identified Sus scrofa RNA editing sites and performed statistical analysis. Moreover, we performed molecular properties analyses, including genomic feature annotations and flank sequence characteristics investigation, and potential functional effects, including the impact of A-to-I editing on miRNA–mRNA interactions and RNA editing quantitative trait loci (edQTL) identification.

2. Materials and Methods

2.1. Reads Mapping and RNA-Editing Sites Identification

The RNA-seq data were downloaded from the National Center for Biotechnology Information (NCBI) SRA database. The sequence read archive (SRA) ID number is available in Supplemental Table S1. Genomes and annotation files of Sus scrofa (Sscrofa11.1) were downloaded from the ENSEMBL database [36]. The reads were mapped to reference genomes using Hisat2 (v2.0.1) [37] with default parameters. The Sequence Alignment/Map Format (SAM) files were sorted and converted to BAM files, the compressed binary version of a SAM file, by Samtools (v1.2) [38] with default parameters. RNA editing sites were identified using the “sprint_from_bam” program of SPRINT [39] with default parameters.

2.2. Tissue-Specific Editing

To improve the accuracy of identification of tissue-specific editing sites, for each tissue, we removed the sites that were missing editing value measurements in more than one-third of the samples. The ROKU R package [40] was applied to rank the editing sites by their overall tissue specificity using Shannon entropy. All sites satisfy the requirements that the editing level range (maximum editing level minus minimum editing level) is larger than 0.1 and the Shannon entropy is less than 0.4 were reserved as tissue-specific editing sites.

2.3. Sequence Preference

Two Sample Logo [41] were used to show the enriched and depleted nucleotides nearby the A’s targeted for RNA editing (p < 0.01, t-test). Adenosine sites randomly chosen from reference were used as the negative control.

2.4. Genomic Feature Annotation

Human RNA editing sites were download from the RADAR database [28], then we adopted the UCSC LiftOver tool (https://genome.ucsc.edu/cgi-bin/hgLiftOver) to convert the genome position from human reference to pig reference. We also converted the genome position from pig reference to human reference. The RNA editing sites successfully converted on both turns (from the human to the pig and from the pig to the human) were reserved as conserved RNA editing sites. Pig quantitative trait locus (QTL) was downloaded from Pig QTLdb [42] and RNA editing sites located at QTL regions were selected using a python script.

2.5. Impacts of RNA Editing Sites on miRNA–mRNA Interactions

For each A-to-G editing site located at 3′UTR, we extracted the editing site and the flanking 25 bps from reference (in total 51 bp) as an unedited-type (UT) sequence and changed the editing site from A-to-G (or T-to-C) as an edited-type (ET) sequence. We then used RNAhybrid [43] and miRanda [44] to predict the miRNA target sites on UT and ET sequences. We generated four target datasets, which are RU (RNAhybrid predicted targets on UT sequence), MU (miRanda predicted targets on UT sequence), RE (RNAhybrid predicted targets on ET sequence), and ME (miRanda predicted targets on ET sequence). The miRNA–mRNA interactions that existed in both RU and MU, but in neither RE nor ME, were defined as interaction losses (loss = (RU and MU) – (RE or ME)). On the contrary, the miRNA–lncRNA interactions that existed in both RE and ME, but in neither RU nor MU, were defined as interaction gains (gain = (RE and ME) – (RU or MU)).

2.6. edQTL Analysis

Variant calling using Samtools (v1.2) was performed and we selected variant positions from each sample in which the mismatches were supported by at least two reads while at least 10 reads covering this site and both base and mapping quality scores were at least 20. For QTL mapping, we examined 630 RNA editing sites with editing level measured in at least one-third of the samples. For each of the 630 RNA editing sites we normalized the editing level using Z-score method across all samples. The following protocol was performed to map edQTLs genome-wide: (1) For each editing site, we fit linear models without any covariates between normalized editing levels and genotypes of each variant in the genome using Plink [45]. We only used variants in which the minor allele was present in at least 20 samples. We identified genome-wide edQTLs using a significance threshold of 2 · 10−7 (Bonferroni method).

2.7. Database Construction

PRESDB was built using Django Python Web framework (https://www.djangoproject.com) coupled with MySQL database. The front-end interface was developed based on Bootstrap open source toolkit (https://getbootstrap.com) and jQuery JavaScript library (https://jquery.com). PRESDB was published using Apache Http server and is accessible at https://presdb.deepomics.org.

3. Results

3.1. Identification and Profiling of Sus scrofa RNA Editing Sites

To identify RNA editing sites in Sus scrofa, we collected 211 RNA-seq samples across 11 tissues from the National Center for Biotechnology Information (NCBI). We performed reads mapping and RNA-editing sites identification with Histat2 and SPRINT (see Methods). The genome-wide screening of RNA editing yielded a total of over 490,000 RNA editing events in all eleven tissues. The percentage of A-to-G mismatch, which is consistent with A-to-I editing, was 94.6% in repetitive region and 77.4% in non-repetitive (Figure 1a). Because the primate Alu elements influenced RNA editing in both coding and non-coding regions [46], we restricted our search to swine repetitive sequences and searched the short interspersed nuclear elements (SINE) in pigs which were capable of attracting the majority of ADAR activity. The results showed that, similar to those reported in the pig [23], more than 94.0% (445,057 of the 473,412) A-to-G editing sites were within pig SINE elements as opposed to the long interspersed nuclear elements (LINE) (4.6%) and others (1.4%) (Figure 1b), although SINEs contributed 11.4% of the swine genome, while LINEs contributed 17.5%. Of the 445,057 repetitive A-to-G mismatches, 65.7% were found within the Pre0_SS element (Figure 1c), which was the most identical to the consensus PRE-1 sequence among all elements of the PRE-1 family.
The editing level of each site is calculated as the number of reads supporting this editing over the number of reads covering this locus. Similar to humans [47], rhesus macaque and flies [48], the editing level varied from 1% to 100% among different editing sites, with a median level of 15.38% in Sus scrofa (Figure 1d). Then, we investigated the difference of RNA editing level among repetitive element families. The result showed the median level of LINE, SINE, and others were 16.6%, 16.6%, and 17.3%, respectively (Figure 1e), although the SINE family account for the most A-to-G mismatches. In order to statistic RNA editing sites at the genome-wide level, each chromosome was split into contiguous 1-Mb windows from the beginning to the end, and the number of total RNA editing sites and average editing level within each window were calculated. The result showed that RNA editing events appeared on every chromosome without preference (Figure 1f), highlighting the pervasive nature of RNA editing.

3.2. Annotation of RNA Editing Sites

To annotate the RNA editing sites, we used the SnpEff [49], a genomic variant annotations tool to find the genomic feature of each mismatch to the annotated gene. We annotated the mismatches according to eight genomic features: Intron variant, intergenic variant, downstream gene variant, upstream gene variant, missense variant, 3′-UTR variant, 5′-UTR variant, and others. Consistent with the previous study in pigs [23], 31.1% of all detected mismatches were located in retained introns (Figure 2a). The remaining sites were enriched in other non-coding regions including 3′ UTRs, intergenic, and gene proximal regions. According to our data, 980 pig mismatches within coding regions would result in a missense variant (Supplemental Table S2). Moreover, we identified tissue-specific RNA editing sites using Shannon entropy (see Methods), while the number of detected RNA editing events varied greatly among tissues (Figure 2b). Also, we compared with the human RNA editing sites using UCSC LiftOver tool (see Methods) and found 554 conserved RNA editing sites. Moreover, we compared these 554 conserved editing sites with the 59 previously published conserved mammalian sites [50] and found 13 common conserved editing sites (Table S3). Then, we used a python script to find all RNA editing sites that were located in pig quantitative trait locus (QTL) regions. The results showed that about 14,238 RNA editing sites were located in the pig QTL regions where 53% and 17% of these sites were related with conformation score and front feet conformation, respectively (Table 1).

3.3. Sequence Preference of A-to-I Editing Sites

When investigating the flanking sequences of the A-to-I editing sites identified in this study, we observed C enrichment one nucleotide upstream (−1) and G enrichment one nucleotide downstream (+1) RNA editing sites located in the repetitive region (Figure 3a,c). Meanwhile, consistent with what has been found in human [48], RNA editing sites in the non-repetitive region showed similar significate characters as the repetitive region (Figure 3b). Also, in both repetitive and non-repetitive regions, we observed a weak base preference at the positions beyond the nearest neighboring nucleotides. Specifically, the −2 and +2 positions were slightly enriched for C and G, respectively. (Figure 3a,b). These findings suggest that A-to-I editing was influenced by more than the −1 and +1 nucleotides of the edited A’s. Our data showed that editing levels in non-repetitive regions varied more than these in repetitive regions (Figure 3d), which was similar with what has been reported in other mammals [51].

3.4. Impact of A-to-I Editing on miRNA–mRNA Interactions

Similar to Single Nucleotide Polymorphism sites (SNPs), editing sites also have the potential to influence the miRNA–mRNA interactions. In this work, we selected all A-to-G editing sites (38,272) that located at 3′UTR region and predicted miRNAs that can target on these editing sites using RNAhybrid and miRanda. After that, we identified editing sites potentially to disturb original (loss) miRNA target sites and/or to create new potential (gain) miRNA target sites (see Methods). The results showed that 4552 RNA editing sites disturbed 5189 miRNAs target sites and 4,176 RNA editing sites created 5729 new potential miRNA target sites. Together, we found 1525 mRNA were impacted, and gene ontology and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis showed that biological function of these genes was enriched in metabolic pathways including small molecule metabolic process, organic acid metabolic process and cellular metabolic process (Figure 4a,b). Since pigs are primarily used to provide protein for humans, we selected several miRNAs that participated in skeletal muscle development including miR-1 [52], miR-143 [53], miRNA-206 [54], miRNA-21 [55] and miRNA-378 [57] and constructed the network of the changed miRNA–mRNA pairs caused by the A-to-G mismatches. The result provided nodes and connections between edited mRNAs and their target miRNAs. According to our data, more than one targets of these miRNAs were lost or gained after editing (Figure 4c,d).

3.5. edQTL Analysis

Quantitative trait loci (QTL) mapping has been successfully applied to identify the regulatory mechanisms of many molecular phenotypes such as gene expression (eQTLs) [57] and splicing patterns (sQTLs) [58]. To identify genetic variants that could impact RNA editing levels, we ran association tests between editing levels of selected editing sites and genotypes for all variants (see Methods). In order to focus on cis-effects, we restricted the search space of RNA editing QTL (edQTL) between –200 kb to approximately 200 kb of the RNA editing site. We found most of the selected RNA editing events were associated with genomic polymorphisms (Figure 5a). Meanwhile, 21.7% of the selected RNA editing sites (137 of 630) were associated with more than one edQTL. As expected, edQTLs were highly enriched within 5 kb of their associated RNA editing site (Figure 5b). Moreover, we also observed there were a higher statistical significance and greater association with edQTLs that were closer to the RNA editing site (Figure 5c). An example of an RNA editing event that was associated strongly with a genetic polymorphism occured at chr3: 102, 985, 380 where the T-allele is associated with a high level of RNA editing while the G-allele nearly abolishes RNA editing (Figure 5d).

3.6. Sus scrofa RNA Editing Sites Database

In order to provide users an easy access to pig RNA editing data, we built a database of pig RNA editing sites (PRESDB), containing more than 490,000 RNA editing sites across 11 tissues. PRESDB has four functional pages: An RNA editing site search page, a flanking sequence extraction page, a miRNA–mRNA interaction page, and an RNA editing QTL (edQTL) page (Figure 6). The search page allows users to search interested RNA editing sites by giving a gene name (e.g., PLPPR1) or a genome region (e.g., 1:271477204-271479179). The search result table contains 10 information columns: Tissue, chromosome, position, editing type, editing level, gene name, gene location, quantitative trait locus (QTL), conserved in human, and tissue-specific. Users can click position column to view one specific editing sites in the UCSC genome browser or click the gene name column to view the specific function of this gene on the GeneCards website. The flanking sequence extraction page provides the user with the ability to extract upstream and downstream sequences of the interested RNA editing sites to perform some downstream analysis (e.g., ADAR-binding sequence preferences). The extracted sequences can be downloaded in standard FASTA format. A data table is used to display all editing sites with the potential to disturb original miRNA target sites or to create new potential miRNA target sites for each tissue in the miRNA–mRNA interaction page. The edQTL page collects all RNA editing QTL (edQTL) sites within 200 kb of the RNA editing sites. To facilitate more detailed searches, we have made the entire database contents available as CSV files on the web page.

4. Discussion

Recently, an increasing number of A-to-I RNA editing sites have been published in mice and human. As RNA editing sites appeared in a tissue-specific manner, it is essential that studies of RNA editing in mammals assess various tissues. The pig is not only an important farm animal that provides protein for humans but an important non-rodent animal model that is widely used in biomedical research. Thus, to perform genome-wide identification of Sus scrofa RNA editing sites and explore their patterns, we collected RNA sequencing data across 11 tissues (blood, brain, fat, heart, kidney, liver, lung, muscle, ovary, spleen, and uterus). We identified more than 490,000 Sus scrofa RNA editing sites, which represent a significant addition to the growing catalog of annotated mammalian RNA editing sites. Moreover, similar to results that have been reported in human and mouse [47], Sus scrofa RNA editing sites showed high enrichment in repeated regions and low editing level. Our data indicated that the pig transcriptomes are highly editable among PRE-1 SINE retrotransposons which have similar features to the primate Alu. Since Alu elements enable substantial RNA editing among primate genomes, RNA editing in pigs may be achieved through a similar mechanism.
Consistent with previous studies, our data showed that the majority of RNA editing sites are located in non-coding regions, including intron, intergenic, and gene proximal regions, while little is known about the mechanism of the average effect of RNA editing in these non-coding regions. An instance in human RNA demonstrated that intronic A-to-G editing events contribute to alternative splicing of nuclear prelamin A. Thus, these editing sites are useful to understand the extent to which this phenomenon enhances pig genetic variation. Furthermore, we also identified 980 RNA editing sites that could result in missense mutation in 560 genes. Among these 560 genes, GRIA2 and GRIK2, both found to be edited in brain tissue, were previously identified as genes edited during porcine neural tissue development. ACOX1, CPT1B, CPT2, CYP27A1, and ACSL1, etc. participate in Peroxisome proliferator-activated receptor (PPAR) signaling pathway which plays a critical physiological role in the regulation of numerous biological processes, including lipid and glucose metabolism, and overall energy homeostasis. Therefore, these missense mutations may play important roles in the development and energy metabolism of the pig.
Although the molecular functions of RNA editing sites are mostly unclear, functional consequences of A-to-I RNA editing sites in coding and non-coding regions have been found, including synonymous or nonsynonymous within many RNAs, miRNA biogenesis, and editing of miRNA seed regions. To assess whether Sus scrofa RNA editing sites affect miRNA and mRNA interactions, we predicted miRNA-originating targets and miRNA-edited targets by two computational methods. Moreover, we found editing sites with the potential to disturb original miRNA target sites and to create new potential miRNA target sites. The function of these target genes was enriched in metabolic pathways including a small molecule metabolic process and an organic acid metabolic process, indicating the importance of RNA editing events in metabolic processes. Moreover, as most RNA editing sites reside in intergenic and intron regions, RNA editing sites are considered to have impacts on lncRNA secondary structures. Because of the incomplete annotation of lncRNA in the Sus scrofa reference, we did not explore the effects of RNA editing sites on lncRNA secondary structure, which can be done in a future study.
In order to investigate the cis variation of RNA editing in Sus scrofa, we tested the correlations between RNA editing sites and variants within 200 kb of the RNA editing site. We observed that variants within 1 kb of editing sites were more likely to have significant associations. Therefore, in a future study, these edQTLs can be used to bridge the gap in our understanding between RNA editing level and their respective susceptibility loci. Although edQTLs have been proved to affect editing levels through changes in the local secondary structure for edited dsRNA in human beings, the mechanism by which edQTLs affect editing levels in the pig remain to be studied.
At present, RNA editing sites have been discovered in plants, animals, and humans, but are rare in Sus scrofa. Exploring the pig genome can illuminate facets of human culture, organic evolution, biomedical research, and animal breeding. Therefore, we constructed the database of Sus scrofa RNA editing sites based on our data. To the best of our knowledge, this database is the first public resource containing information about the RNA editing sites of Sus scrofa. Although these identified editing sites in our database, especially in the coding regions, remain to be experimentally validated, our database will enhance pig genetic variation and provide explanations for a variety of traits of interest to both biomedical and agricultural communities.

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4425/10/5/327/s1. Table S1: Summary of tissue samples for total RNA sequencing in Sus scrofa. Table S2: RNA editing sites resulting in amino acid changes.. Table S3: List of 13 common conserved editing sites.

Author Contributions

Conceptualization, Z.W., X.F. and S.C.L.; data curation, Z.T.; formal analysis, Z.W. and X.F.; methodology, Z.W. and X.F.; project administration, Z.T. and S.C.L.; software, X.F.; Writing—Original Draft, Z.W., X.F. and S.C.L.

Funding

This research was funded by a GRF Project grant from the RGC General Research Fund (9042181; CityU 11203115), the GRF Research Project (9042348; CityU 11257316), the National Natural Science Foundation of China (31830090).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Bass, B.L. RNA editing by adenosine deaminases that act on RNA. Annu. Rev. Biochem. 2002, 71, 817–846. [Google Scholar] [CrossRef]
  2. Nishikura, K. Functions and regulation of RNA editing by adar deaminases. Annu. Rev. Biochem. 2010, 79, 321–349. [Google Scholar] [CrossRef] [PubMed]
  3. Gott, J.M.; Emeson, R.B. Functions and mechanisms of RNA editing. Annu. Rev. Genet. 2000, 34, 499–531. [Google Scholar] [CrossRef]
  4. Lee, J.-H.; Ang, J.K.; Xiao, X. Analysis and design of RNA sequencing experiments for identifying RNA editing and other single-nucleotide variants. RNA 2013, 19, 725–732. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Wang, Q.; Miyakoda, M.; Yang, W.; Khillan, J.; Stachura, D.L.; Weiss, M.J.; Nishikura, K. Stress-induced apoptosis associated with null mutation of adar1 RNA editing deaminase gene. J. Biol. Chem. 2004, 279, 4952–4961. [Google Scholar] [CrossRef] [PubMed]
  6. Higuchi, M.; Maas, S.; Single, F.N.; Hartner, J.; Rozov, A.; Burnashev, N.; Feldmeyer, D.; Sprengel, R.; Seeburg, P.H. Point mutation in an ampa receptor gene rescues lethality in mice deficient in the RNA-editing enzyme adar2. Nature 2000, 406, 78. [Google Scholar] [CrossRef]
  7. Sommer, B.; Köhler, M.; Sprengel, R.; Seeburg, P.H. RNA editing in brain controls a determinant of ion flow in glutamate-gated channels. Cell 1991, 67, 11–19. [Google Scholar] [CrossRef]
  8. Köhler, M.; Burnashev, N.; Sakmann, B.; Seeburg, P.H. Determinants of Ca2+ permeability in both TM1 and TM2 of high affinity kainate receptor channels: Diversity by RNA editing. Neuron 1993, 10, 491–500. [Google Scholar] [CrossRef]
  9. Lomeli, H.; Mosbacher, J.; Melcher, T.; Hoger, T.; Kuner, T.; Monyer, H.; Higuchi, M.; Bach, A.; Seeburg, P.H. Control of kinetic properties of ampa receptor channels by nuclear RNA editing. Science 1994, 266, 1709–1713. [Google Scholar] [CrossRef]
  10. Burns, C.M.; Chu, H.; Rueter, S.M.; Hutchinson, L.K.; Canton, H.; Sanders-Bush, E.; Emeson, R.B. Regulation of serotonin-2c receptor g-protein coupling by RNA editing. Nature 1997, 387, 303. [Google Scholar] [CrossRef]
  11. Casey, J.L.; Gerin, J.L. Hepatitis d virus RNA editing: Specific modification of adenosine in the antigenomic RNA. J. Virol. 1995, 69, 7593–7600. [Google Scholar]
  12. Poison, A.G.; Bass, B.L.; Casey, J.L. RNA editing of hepatitis delta virus antigenome by dsrna-adenosine deaminase. Nature 1996, 380, 454. [Google Scholar] [CrossRef]
  13. Blow, M.J.; Grocock, R.J.; van Dongen, S.; Enright, A.J.; Dicks, E.; Futreal, P.A.; Wooster, R.; Stratton, M.R. RNA editing of human micrornas. Genome Biol. 2006, 7, R27. [Google Scholar] [CrossRef]
  14. Kume, H.; Hino, K.; Galipon, J.; Ui-Tei, K. A-to-i editing in the mirna seed region regulates target mrna selection and silencing efficiency. Nucleic Acids Res. 2014, 42, 10050–10060. [Google Scholar] [CrossRef] [PubMed]
  15. Zhang, L.; Yang, C.-S.; Varelas, X.; Monti, S. Altered RNA editing in 3′ utr perturbs microrna-mediated regulation of oncogenes and tumor-suppressors. Sci. Rep. 2016, 6, 23226. [Google Scholar] [CrossRef]
  16. Prasanth, K.V.; Prasanth, S.G.; Xuan, Z.; Hearn, S.; Freier, S.M.; Bennett, C.F.; Zhang, M.Q.; Spector, D.L. Regulating gene expression through RNA nuclear retention. Cell 2005, 123, 249–263. [Google Scholar] [CrossRef] [PubMed]
  17. Rice, G.I.; Kasher, P.R.; Forte, G.M.; Mannion, N.M.; Greenwood, S.M.; Szynkiewicz, M.; Dickerson, J.E.; Bhaskar, S.S.; Zampini, M.; Briggs, T.A. Mutations in adar1 cause aicardi-goutieres syndrome associated with a type i interferon signature. Nat. Genet. 2012, 44, 1243. [Google Scholar] [CrossRef] [PubMed]
  18. Yamashita, T.; Kwak, S. The molecular link between inefficient glua2 q/r site-RNA editing and tdp-43 pathology in motor neurons of sporadic amyotrophic lateral sclerosis patients. Brain Res. 2014, 1584, 28–38. [Google Scholar] [CrossRef] [PubMed]
  19. Eran, A.; Li, J.B.; Vatalaro, K.; McCarthy, J.; Rahimov, F.; Collins, C.; Markianos, K.; Margulies, D.M.; Brown, E.N.; Calvo, S.E. Comparative RNA editing in autistic and neurotypical cerebella. Mol. Psychiatry 2013, 18, 1041. [Google Scholar] [CrossRef] [PubMed]
  20. Toth, A.M.; Li, Z.; Cattaneo, R.; Samuel, C.E. RNA-specific adenosine deaminase adar1 suppresses measles virus-induced apoptosis and activation of protein kinase pkr. J. Biol. Chem. 2009, 284, 29350–29356. [Google Scholar] [CrossRef]
  21. Han, L.; Diao, L.; Yu, S.; Xu, X.; Li, J.; Zhang, R.; Yang, Y.; Werner, H.M.; Eterovic, A.K.; Yuan, Y. The genomic landscape and clinical relevance of a-to-i RNA editing in human cancers. Cancer Cell 2015, 28, 515–528. [Google Scholar] [CrossRef]
  22. Martínez-Montes, A.; Fernández, A.; Pérez-Montarelo, D.; Alves, E.; Benítez, R.; Nuñez, Y.; Ovilo, C.; Ibañez-Escriche, N.; Folch, J.; Fernandez, A. Using RNA-seq snp data to reveal potential causal mutations related to pig production traits and RNA editing. Anim. Genet. 2017, 48, 151–165. [Google Scholar] [CrossRef]
  23. Funkhouser, S.A.; Steibel, J.P.; Bates, R.O.; Raney, N.E.; Schenk, D.; Ernst, C.W. Evidence for transcriptome-wide RNA editing among sus scrofa pre-1 sine elements. Bmc Genom. 2017, 18, 360. [Google Scholar] [CrossRef]
  24. Stulić, M.; Jantsch, M.F. Spatio-temporal profiling of filamin a RNA-editing reveals adar preferences and high editing levels outside neuronal tissues. RNA Biol. 2013, 10, 1611–1617. [Google Scholar] [CrossRef] [PubMed]
  25. Neeman, Y.; Levanon, E.Y.; Jantsch, M.F.; Eisenberg, E. RNA editing level in the mouse is determined by the genomic repeat repertoire. RNA 2006, 12, 1802–1809. [Google Scholar] [CrossRef]
  26. Picardi, E.; Regina, T.M.R.; Brennicke, A.; Quagliariello, C. Redidb: The RNA editing database. Nucleic Acids Res. 2006, 35, D173–D177. [Google Scholar] [CrossRef]
  27. Kiran, A.; Baranov, P.V. Darned: A database of RNA editing in humans. Bioinformatics 2010, 26, 1772–1776. [Google Scholar] [CrossRef]
  28. Ramaswami, G.; Li, J.B. Radar: A rigorously annotated database of a-to-i RNA editing. Nucleic Acids Res. 2013, 42, D109–D113. [Google Scholar] [CrossRef]
  29. Picardi, E.; D’Erchia, A.M.; Lo Giudice, C.; Pesole, G. Rediportal: A comprehensive database of a-to-i RNA editing events in humans. Nucleic Acids Res. 2016, 45, D750–D757. [Google Scholar] [CrossRef]
  30. Kumar, S.; Hedges, S.B. A molecular timescale for vertebrate evolution. Nature 1998, 392, 917. [Google Scholar] [CrossRef]
  31. Larson, G.; Dobney, K.; Albarella, U.; Fang, M.; Matisoo-Smith, E.; Robins, J.; Lowden, S.; Finlayson, H.; Brand, T.; Willerslev, E. Worldwide phylogeography of wild boar reveals multiple centers of pig domestication. Science 2005, 307, 1618–1621. [Google Scholar] [CrossRef]
  32. Groenen, M.A.; Archibald, A.L.; Uenishi, H.; Tuggle, C.K.; Takeuchi, Y.; Rothschild, M.F.; Rogel-Gaillard, C.; Park, C.; Milan, D.; Megens, H.-J. Analyses of pig genomes provide insight into porcine demography and evolution. Nature 2012, 491, 393. [Google Scholar] [CrossRef]
  33. Wernersson, R.; Schierup, M.H.; Jørgensen, F.G.; Gorodkin, J.; Panitz, F.; Stærfeldt, H.-H.; Christensen, O.F.; Mailund, T.; Hornshøj, H.; Klein, A. Pigs in sequence space: A 0.66 x coverage pig genome survey based on shotgun sequencing. BMC Genom. 2005, 6, 70. [Google Scholar] [CrossRef]
  34. Liu, Y.; Ma, Y.; Yang, J.-Y.; Cheng, D.; Liu, X.; Ma, X.; West, F.D.; Wang, H. Comparative gene expression signature of pig, human and mouse induced pluripotent stem cell lines reveals insight into pig pluripotency gene networks. Stem Cell Rev. Rep. 2014, 10, 162–176. [Google Scholar] [CrossRef] [PubMed]
  35. Wall, R.; Shani, M. Are animal models as good as we think? Theriogenology 2008, 69, 2–9. [Google Scholar] [CrossRef] [PubMed]
  36. Aken, B.L.; Achuthan, P.; Akanni, W.; Amode, M.R.; Bernsdorff, F.; Bhai, J.; Billis, K.; Carvalho-Silva, D.; Cummins, C.; Clapham, P. Ensembl 2017. Nucleic Acids Res. 2016, 45, D635–D642. [Google Scholar] [CrossRef] [Green Version]
  37. Kim, D.; Langmead, B.; Salzberg, S.L. Hisat: A fast spliced aligner with low memory requirements. Nat. Methods 2015, 12, 357–360. [Google Scholar] [CrossRef]
  38. Li, H. A statistical framework for snp calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 2011, 27, 2987–2993. [Google Scholar] [CrossRef]
  39. Zhang, F.; Lu, Y.; Yan, S.; Xing, Q.; Tian, W. Sprint: An snp-free toolkit for identifying rna editing sites. Bioinformatics (Oxford, England) 2017, 33, 3538–3548. [Google Scholar]
  40. Kadota, K.; Ye, J.; Nakai, Y.; Terada, T.; Shimizu, K. Roku: A novel method for identification of tissue-specific genes. Bmc Bioinform. 2006, 7, 294. [Google Scholar] [CrossRef] [PubMed]
  41. Vacic, V.; Iakoucheva, L.M.; Radivojac, P. Two sample logo: A graphical representation of the differences between two sets of sequence alignments. Bioinformatics 2006, 22, 1536–1537. [Google Scholar] [CrossRef]
  42. Hu, Z.-L.; Park, C.A.; Reecy, J.M. Developmental progress and current status of the animal qtldb. Nucleic Acids Res. 2015, 44, D827–D833. [Google Scholar] [CrossRef]
  43. Krüger, J.; Rehmsmeier, M. Rnahybrid: Microrna target prediction easy, fast and flexible. Nucleic Acids Res. 2006, 34, W451–W454. [Google Scholar] [CrossRef]
  44. Betel, D.; Wilson, M.; Gabow, A.; Marks, D.S.; Sander, C. The microrna. Org resource: Targets and expression. Nucleic Acids Res. 2008, 36, D149–D153. [Google Scholar] [CrossRef]
  45. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.; Bender, D.; Maller, J.; Sklar, P.; De Bakker, P.I.; Daly, M.J. Plink: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef]
  46. Ramaswami, G.; Lin, W.; Piskol, R.; Tan, M.H.; Davis, C.; Li, J.B. Accurate identification of human alu and non-alu RNA editing sites. Nat. Methods 2012, 9, 579. [Google Scholar] [CrossRef]
  47. Li, J.B.; Levanon, E.Y.; Yoon, J.-K.; Aach, J.; Xie, B.; LeProust, E.; Zhang, K.; Gao, Y.; Church, G.M. Genome-wide identification of human RNA editing sites by parallel DNA capturing and sequencing. Science (New York N. Y.) 2009, 324, 1210–1213. [Google Scholar] [CrossRef]
  48. Ramaswami, G.; Zhang, R.; Piskol, R.; Keegan, L.P.; Deng, P.; O’connell, M.A.; Li, J.B. Identifying RNA editing sites using RNA sequencing data alone. Nat. Methods 2013, 10, 128. [Google Scholar] [CrossRef]
  49. Cingolani, P.; Platts, A.; Wang, L.L.; Coon, M.; Nguyen, T.; Wang, L.; Land, S.J.; Lu, X.; Ruden, D.M. A program for annotating and predicting the effects of single nucleotide polymorphisms, snpeff: Snps in the genome of drosophila melanogaster strain w1118; iso-2; iso-3. Fly 2012, 6, 80–92. [Google Scholar]
  50. Pinto, Y.; Cohen, H.Y.; Levanon, E.Y. Mammalian conserved adar targets comprise only a small fragment of the human editosome. Genome Biol. 2014, 15, R5. [Google Scholar] [CrossRef]
  51. Tan, M.H.; Li, Q.; Shanmugam, R.; Piskol, R.; Kohler, J.; Young, A.N.; Liu, K.I.; Zhang, R.; Ramaswami, G.; Ariyoshi, K. Dynamic landscape and regulation of RNA editing in mammals. Nature 2017, 550, 249. [Google Scholar] [CrossRef] [PubMed]
  52. Chen, J.-F.; Mandel, E.M.; Thomson, J.M.; Wu, Q.; Callis, T.E.; Hammond, S.M.; Conlon, F.L.; Wang, D.Z. The role of microrna-1 and microrna-133 in skeletal muscle proliferation and differentiation. Nat. Genet. 2006, 38, 228. [Google Scholar]
  53. Jordan, S.D.; Krüger, M.; Willmes, D.M.; Redemann, N.; Wunderlich, F.T.; Brönneke, H.S.; Merkwirth, C.; Kashkar, H.; Olkkonen, V.M.; Böttger, T. Obesity-induced overexpression of mirna-143 inhibits insulin-stimulated akt activation and impairs glucose metabolism. Nat. Cell Biol. 2011, 13, 434. [Google Scholar] [CrossRef] [PubMed]
  54. McCarthy, J.J. Microrna-206: The skeletal muscle-specific myomir. Biochim. Et Biophys. Acta Gene Regul. Mech. 2008, 1779, 682–691. [Google Scholar] [CrossRef] [PubMed]
  55. Bai, L.; Liang, R.; Yang, Y.; Hou, X.; Wang, Z.; Zhu, S.; Wang, C.; Tang, Z.; Li, K. Microrna-21 regulates pi3k/akt/mtor signaling by targeting tgfβi during skeletal muscle development in pigs. PLoS ONE 2015, 10, e0119396. [Google Scholar] [CrossRef] [PubMed]
  56. Gerin, I.; Bommer, G.T.; McCoin, C.S.; Sousa, K.M.; Krishnan, V.; MacDougald, O.A. Roles for mirna-378/378* in adipocyte gene expression and lipogenesis. Am. J. Physiol.-Endocrinol. Metab. 2010, 299, E198–E206. [Google Scholar] [CrossRef]
  57. Montgomery, S.B.; Sammeth, M.; Gutierrez-Arcelus, M.; Lach, R.P.; Ingle, C.; Nisbett, J.; Guigo, R.; Dermitzakis, E.T. Transcriptome genetics using second generation sequencing in a caucasian population. Nature 2010, 464, 773. [Google Scholar] [CrossRef] [PubMed]
  58. Lappalainen, T.; Sammeth, M.; Friedländer, M.R.; AC‘t Hoen, P.; Monlong, J.; Rivas, M.A.; Gonzalez-Porta, M.; Kurbatova, N.; Griebel, T.; Ferreira, P.G. Transcriptome and genome sequencing uncovers functional variation in humans. Nature 2013, 501, 506. [Google Scholar] [CrossRef]
Figure 1. Identification and profiling of A-to-I RNA editing. (a) Percentage of 12 types of RNA editing events in repetitive and non-repetitive regions. (b) Distribution of repetitive A-to-G mismatches across major repetitive element families (c) and further broken down into specific repetitive element types. (d) Histogram and box plot showing the frequency of RNA editing levels. (e) Overall RNA editing levels across major repetitive element families. (f) Genome-wide distribution of RNA editing sites. From outside to inside, each circle represents the number of all editing sites and the average editing level of all RNA editing, respectively.
Figure 1. Identification and profiling of A-to-I RNA editing. (a) Percentage of 12 types of RNA editing events in repetitive and non-repetitive regions. (b) Distribution of repetitive A-to-G mismatches across major repetitive element families (c) and further broken down into specific repetitive element types. (d) Histogram and box plot showing the frequency of RNA editing levels. (e) Overall RNA editing levels across major repetitive element families. (f) Genome-wide distribution of RNA editing sites. From outside to inside, each circle represents the number of all editing sites and the average editing level of all RNA editing, respectively.
Genes 10 00327 g001
Figure 2. Annotation of RNA editing sites. (a) A-to-I mismatch locations relative to the nearest annotated gene. Percentages shown are out of all A-to-G mismatches. (b) Heat map of editing levels from sites that are specifically edited in one tissue.
Figure 2. Annotation of RNA editing sites. (a) A-to-I mismatch locations relative to the nearest annotated gene. Percentages shown are out of all A-to-G mismatches. (b) Heat map of editing levels from sites that are specifically edited in one tissue.
Genes 10 00327 g002
Figure 3. Sequence context of RNA editing sites. Sequence preferences for base positions flanking (−3, +3) detected A-to-I editing sites (a) in repetitive region and (b) non-repetitive region. (c) The number of 16 possible triplets centered on the edited adenosine (NAN). (d) Editing levels of editing sites in different triplets.
Figure 3. Sequence context of RNA editing sites. Sequence preferences for base positions flanking (−3, +3) detected A-to-I editing sites (a) in repetitive region and (b) non-repetitive region. (c) The number of 16 possible triplets centered on the edited adenosine (NAN). (d) Editing levels of editing sites in different triplets.
Genes 10 00327 g003
Figure 4. Impact of RNA editing on miRNA–mRNA interactions. (a) Gene Ontology (GO) analysis of all edited mRNAs. (b) Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis of all edited mRNAs. Network of (c) gain pairs or (d) loss pairs appeared with miR-1, miR-143, miRNA-206, miRNA-21 and miRNA-378.
Figure 4. Impact of RNA editing on miRNA–mRNA interactions. (a) Gene Ontology (GO) analysis of all edited mRNAs. (b) Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis of all edited mRNAs. Network of (c) gain pairs or (d) loss pairs appeared with miR-1, miR-143, miRNA-206, miRNA-21 and miRNA-378.
Genes 10 00327 g004
Figure 5. RNA editing quantitative trait loci (edQTLs) analysis. (a) Quantile–quantile (QQ) plot for association testing p-values of RNA levels with genetic variants within 200 kb of the 630 editing sites. (b) Statistic of the number of edQTLs located at different distance (kb) regions of the 630 RNA editing sites. (c) Significance of association tests in relation to the distance between the editing sites and variants. (d) Example of an edQTL. Box plot showing the association of edQTL (chr3: 102921373) with the editing level at chr3: 102985380.
Figure 5. RNA editing quantitative trait loci (edQTLs) analysis. (a) Quantile–quantile (QQ) plot for association testing p-values of RNA levels with genetic variants within 200 kb of the 630 editing sites. (b) Statistic of the number of edQTLs located at different distance (kb) regions of the 630 RNA editing sites. (c) Significance of association tests in relation to the distance between the editing sites and variants. (d) Example of an edQTL. Box plot showing the association of edQTL (chr3: 102921373) with the editing level at chr3: 102985380.
Genes 10 00327 g005
Figure 6. Framework of the database construction in PRESDB.
Figure 6. Framework of the database construction in PRESDB.
Genes 10 00327 g006
Table 1. Statistic of RNA editing events in QTL regions.
Table 1. Statistic of RNA editing events in QTL regions.
Quantitative Trait Locus (QTL)NumberPercentage
Conformation score753653.01%
Front feet conformation243117.10%
Maternal infanticide7955.59%
Ear weight6044.25%
Rhinitis5694.00%
Hind leg conformation4753.34%
Vertebra number4423.11%
Spinal curvature4222.97%
Left teat number4142.91%
Lumbar vertebra number2571.81%
Time spent feeding1881.32%
Ear erectness540.38%
Ear size150.11%
Umbilical hernia110.08%
Thoracic vertebra number20.01%

Share and Cite

MDPI and ACS Style

Wang, Z.; Feng, X.; Tang, Z.; Li, S.C. Genome-Wide Investigation and Functional Analysis of Sus scrofa RNA Editing Sites across Eleven Tissues. Genes 2019, 10, 327. https://doi.org/10.3390/genes10050327

AMA Style

Wang Z, Feng X, Tang Z, Li SC. Genome-Wide Investigation and Functional Analysis of Sus scrofa RNA Editing Sites across Eleven Tissues. Genes. 2019; 10(5):327. https://doi.org/10.3390/genes10050327

Chicago/Turabian Style

Wang, Zishuai, Xikang Feng, Zhonglin Tang, and Shuai Cheng Li. 2019. "Genome-Wide Investigation and Functional Analysis of Sus scrofa RNA Editing Sites across Eleven Tissues" Genes 10, no. 5: 327. https://doi.org/10.3390/genes10050327

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop