Next Article in Journal
Sandwich Enzyme-Linked Immunosorbent Assay for Quantification of Callose
Previous Article in Journal
Combined Use of Non-Invasive and Micro-Invasive Analytical Investigations to Understand the State of Conservation and the Causes of Degradation of I Tesori del Mare (1901) by Plinio Nomellini
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Imaging Intron Evolution

by
Maria Antonietta Panaro
,
Rosa Calvello
,
Daniela Valeria Miniero
,
Vincenzo Mitolo
and
Antonia Cianciulli
*
Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari, Via Orabona, 4, 70126 Bari, Italy
*
Author to whom correspondence should be addressed.
Methods Protoc. 2022, 5(4), 53; https://doi.org/10.3390/mps5040053
Submission received: 28 March 2022 / Revised: 13 June 2022 / Accepted: 21 June 2022 / Published: 24 June 2022
(This article belongs to the Section Biochemical and Chemical Analysis & Synthesis)

Abstract

:
Intron evolution may be readily imaged through the combined use of the “dot plot” function of the NCBI BLAST, aligning two sequences at a time, and the Vertebrate “Multiz” alignment and conservation tool of the UCSC Genome Browser. With the NCBI BLAST, an ideal alignment of two highly conserved sequences generates a diagonal straight line in the plot from the lower left corner to the upper right corner. Gaps in this line correspond to non-conserved sections. In addition, the dot plot of the alignment of a sequence with the same sequence after the removal of the Transposable Elements (TEs) can be observed along the diagonal gaps that correspond to the sites of TE insertion. The UCSC Genome Browser can graph, along the entire sequence of a single gene, the level of overall conservation in vertebrates. This level can be compared with the conservation level of the gene in one or more selected vertebrate species. As an example, we show the graphic analysis of the intron conservation in two genes: the mitochondrial solute carrier 21 (SLC25A21) and the growth hormone receptor (GHR), whose coding sequences are conserved through vertebrates, while their introns show dramatic changes in nucleotide composition and even length. In the SLC25A21, a few short but significant nucleotide sequences are conserved in zebrafish, Xenopus and humans, and the rate of conservation steadily increases from chicken/human to mouse/human alignments. In the GHR, a less conserved gene, the earlier indication of intron conservation is a small signal in chicken/human alignment. The UCSC tool may simultaneously display the conservation level of a gene in different vertebrates, with reference to the level of overall conservation in Vertebrates. It is shown that, at least in SLC25A21, the sites of higher conservation are not always coincident in chicken and zebrafish nor are the sites of higher vertebrate conservation.

1. Introduction

In eukaryotes, the coding DNA sequence of most protein-coding genes is interrupted by the interpolation of non-coding DNA sequences termed introns (“intragenic regions”), as opposed to the coding DNA segments termed exons [1,2]. In the cell nucleus, the whole gene is sequentially transcribed as a pre-mRNA, but before it is passed to the cytoplasm as mature mRNA, the introns are removed by a process called RNA splicing, carried out by a large ribonucleoprotein machinery called the Spliceosome [3,4,5]. In evolutionary conserved genes, the sites of splicing are also most often conserved, although alternatively spliced transcripts may be expressed in addition to the canonical transcript [6,7]. Although originally considered “junk” DNA, the introns are now believed to play a role in the control of gene expression [8,9]. In addition, introns are thought to control their own splicing by carrying nucleotide signaling sequences at their ends (5’ss and 3’ss) [10,11], as well as internal splicing enhancer or silencing sequences and sequences that interact with components of the spliceosome [12]. The origin and evolution of introns is still poorly understood. It is debated whether introns are very ancient and are gradually being lost, or whether their relatively recent evolution has caused them to gradually accumulate: the intron-early and intron-late hypotheses, respectively [13,14,15,16,17]. It is likely that introns have played a role in the main steps of animal evolution.
This short graphical essay aims to demonstrate the use of NCBI BLAST dot plots and the UCSC Genome Browser graphics for ready visual demonstrations of evolutionary patterns in introns of homologous genes. Here, we show the evolution—from Euteleostomi to Amphibia, birds and mammals—of two representative genes: the mitochondrial 2-oxodicarboxylate carrier (mitochondrial solute carrier 21; SLC25A21) [18] and the growth hormone receptor (GHR) [19]. Possibly due to the very basic life functions they serve, in the mitochondrial transporter family (SLC25), members of the coding sequences (and protein products) are conserved between fungi and primates; these genes are among the most studied within vertebrates [20,21]. In particular, the SLC25A21 gene was selected as the oldest member of this family in humans. The GHR gene is also “conserved”, but limited to vertebrates.
The sequence evolution in introns has been studied with reference to specific sites or motifs only. For instance, the conservation in homologous 5’ss and 3’ss [7,22] and the occurrence of short k-mer words [23] were investigated in vertebrates. However, from these studies a comprehensive scheme of the intron evolution did not emerge.
Two-sequence BLAST comparisons were made for each gene between the intron sequences of different species, and the results are mainly presented as dot plot diagrams. The genes studied are, in all species, considered heavily burdened with specific transposable elements (TEs) [24] intervening between the genuine original intronic sequences. We thus made additional comparisons between the actual intronic sequence of a gene and the intronic sequence of the same gene after TE clearing, or between TE-cleared introns of two different species. This procedure points out the sites of TES as blanks in the alignment.
The UCSC Genome graphics was used to display, for SLC25A21 or GHR, along their entire sequence, the average level of conservation calculated from up to 100 multialigned vertebrate species. In addition, the conservation of the relevant gene in selected vertebrate species was graphed in parallel and this allowed both comparisons among species and comparisons between individual species and the vertebrate average.
The extensive information that may be gained from analyses of the type outlined here might help in drafting a scheme of the evolutionary flow of the information carried through introns. The primary information concerning protein structure is conveyed by exons, which maintain a relative structural stability through the negative selection of modified key nucleotides. As a result, key proteins are conserved through very long evolutionary periods, as in SLC25 genes. Introns possibly carry supplemental information, regulating the gene expression, for example. The results of the present investigation show that the structural conservation in introns is much poorer than in exons, but the mutation rate during evolution may be very different depending on the gene concerned. Furthermore, the insertions of species-specific TEs contribute to the structural divergence between species. In conclusion, the information carried by introns (if any) is likely to fade out quite rapidly, albeit with some differences between genes.
The graphical approaches discussed in this paper have the advantage of readily providing quick comparative overviews of the intron structure of two or more homologous genes, as well as the possibility of zooming in on specific segments. Under the guidance of the visual information gained, further quantitative analyses may focus on regions of specific interest, for instance, pertaining to the conservation of nucleotide composition or motifs, the similarity of secondary structures or the presence of TEs common to different species.

2. Material and Methods

The NCBI bank of homologous genes (https://www.ncbi.nlm.nih.gov/homologene, accessed on 1 January 2020) was used to select the homologous SLC25A21 and GHR genes of human (Homo sapiens), mouse (Mus musculus), chicken (Gallus gallus), zebrafish (Danio rerio), and Xenopus (Xenopus tropicalis). Genomic and mRNA sequences of the genes were retrieved from the NCBI GenBank (https://www.ncbi.nlm.nih.gov, accessed on 1 January 2020) (Table 1 and Table 2). The genomic sequences (from start codon to stop codon) were processed by deleting the coding sequences, thus leaving the introns only (fourth column of Table 3). Some analyses were performed after clearing the intron sequences of all transposable elements (TEs), using the CENSOR software (https://www.girinst.org/censor/, accessed on 1 January 2020) [25] (Table 3, last column). Nucleotide alignments were usually performed with BLAST (Basic Local Alignment Search Tool; https://blast.ncbi.nlm.nih.gov/Blast.cgi, accessed on 1 January 2020), setting the Expect threshold at 0.0005. BLAST finds regions of local similarity between sequences and calculates the statistical significance of matches [26]. The dot plot is a graph on the XY plane, each dot having the positions of the aligned segment in one gene and in the other gene as coordinates. When comparing homologous genes, the series of dots will align along a somewhat regular straight line stretching from the lower left corner to the upper right corner. The gaps in the line correspond to non-homologous segments.
The UCSC Genome Browser was accessed at https://genome.ucsc.edu on 1 January 2020, and the function “Comparative Genomics-Conservation” was used. This method shows multiple alignments of 100 vertebrate species and measurements of evolutionary conservation using two methods (phastCons and phyloP).
Further instructions for both BLAST and UCSC Genome Browser plots are available at the indicated sites.
BLAST dot plots and UCSC graphics photos are unretouched in order to show the actual outputs of the analysis. However, when the matching “dots” were too small to be viewed on the original print (e.g., human/zebrafish alignments, accessed on 1 January 2020), the plots were redrawn with suitably sized dots at the sites of alignment.
The percentages of nucleotide conservation between species were calculated as follows: From the list of the individual aligned sections (with their length) provided by BLAST, we computed the total number of the aligning nucleotides and divided it by the total number of intron nucleotides.

3. Results

3.1. The TEs

The alignment of an intron sequence with itself, apart from the expected exact diagonal alignment, usually shows a large number of matches outside the diagonal (Figure 1A,B).
In the next experiment, we removed the SINE Alus from the SLC25A21 intron sequence which are the most represented TEs in human genes, but average 280 nucleotides only in length. Their removal generates a dramatic decrease in the matches outside the diagonal but also reveals gaps along the diagonal. Due to the short length of the Alus, the gaps in the diagonal are not apparent at a low magnification (Figure 1C), but may be clearly demonstrated at a higher magnification (Figure 1D). When all the other non-Alu TEs were also removed (a total of 192,584 nucleotides, i.e., about 39% of the introns) the small gaps along the diagonal could be appreciated, even at a low magnification. It was also seen that all matches outside the diagonal disappeared, showing that the latter were all due to coupling between TEs (Figure 1E). The same experiment with GHR yielded similar results (not shown).

3.2. Human/Mouse Alignments

The BLAST dot plot alignment of the human SLC25A21 with the homologous mouse gene shows that the large majority of matches lie along a slightly irregular diagonal line. The additional sparse matches are likely to correspond to TEs, which are at least partly similar in human and mouse, such as LINE 1 transposons (Figure 2A); indeed, when all TEs are cancelled in both genes before the alignment, the sparse matches are practically absent (and the alignment diagonal is more regular) (Figure 2B). The gaps (usually short) in the alignment diagonal correspond to poorly conserved sections.
The BLAST dot plot alignment of the human GHR with the homologous mouse gene shows that approximately 1 to 140,000 human nucleotides have no match with the mouse sequence and are likely to be an additional insertion peculiar in human. On the contrary, the rest of the human sequence significantly aligns with the mouse sequence, although the alignment line is considerably distorted and exhibits relatively large gaps (Figure 2C). Following the removal of all TEs in both sequences, the diagonal alignment becomes more regular, but with gaps that are much larger than in the SLC25A21 alignments (Figure 2D; compare with Figure 1E).

3.3. Human/Chicken Alignments

The BLAST dot plot alignment of the human SLC25A21 with the homologous chicken gene yielded virtually identical results, whether using the complete sequences or after TEs deletion, since practically no TE is shared between the two species. Figure 3A shows that the vast majority of matches, i.e., only 32, lie along a slightly irregular diagonal line; the gaps between matching segments, which correspond to segments that do not share significant homologies, are relatively wide. The matching segments are so short that they appear as simple dots at a lower magnification (Figure 3A), but at a higher magnification, they appear to be short segments (Figure 3B).
On the contrary, the BLAST 2-nucleotide search of the human GHR intron sequence against the homologous chicken sequence yielded only one 182 nucleotide-long match (i.e., 0.30 of the chicken sequence).

3.4. Human/Zebrafish Alignments

The BLAST dot plot alignment of the human SLC25A21 with the homologous zebrafish gene yielded five non-TE matches (with a total of 1022 nucleotides; 0.51% of the total) positioned near the theoretical diagonal line (Figure 4).
On the contrary, no significant similarity below the Expect threshold was found when aligning the human GHR intron sequence against the intron sequence of the GHR of zebrafish.

3.5. Human/Xenopus Alignments

The BLAST dot plot alignment of the human SLC25A21 with the homologous Xenopus gene yielded five non-TE matches (with a total of 846 nucleotides; 0.45% of the total) positioned near the theoretical diagonal line (Figure 5).
On the contrary, the alignment of the human GHR intron sequence against the intron sequence of the GHR of Xenopus did not yield significant similarities below the Expect threshold.

3.6. Zebrafish/Chicken Alignments

The BLAST dot plot alignment of the zebrafish slc25a21 with the homologous chicken gene (not shown) yielded five non-TE matches positioned near the theoretical diagonal line. The aligned nucleotides constitute 0.42% of the chicken sequence.
The alignment of the zebrafish GHR intron sequence against the intron sequence of the GHR of chicken did not yield significant similarities.

3.7. Zebrafish/Xenopus Alignments

The BLAST dot plot alignment of the zebrafish slc25a21 with the homologous Xenopus gene (not shown) yielded one match only corresponding to 0.11% of the zebrafish sequence and 0.10% of the Xenopus sequence.
The alignment of the zebrafish GHR intron sequence against the intron sequence of the GHR of Xenopus did not yield significant similarities.

3.8. Chicken/Xenopus Alignments

The BLAST dot plot alignment of the chicken slc25a21 with the homologous Xenopus gene (not shown) yielded six non-TE matches positioned near the theoretical diagonal line. The aligned nucleotides constitute 0.44% of the chicken sequence.
The alignment of the chicken GHR intron sequence against the intron sequence of the GHR of Xenopus did not yield significant similarities.

3.9. Global Analysis of the SLC25A21 and GHR Conservation in Vertebrates

The UCSC Genome Browser, using the function Comparative Genomics/Conservation, plots the estimated evolutionary trend of a given vertebrate gene, calculated along its entire length as an average from 100 multialigned vertebrate species. The system calculates, for each position in the gene, the conservation expected under neutral drift (the zero level in the plot) and then determines the evolutionary “acceleration” (faster evolution than expected, i.e., poor conservation) and the “conservation” (slower than expected evolution) values. The values of “conservation” are plotted as positives, while the values of “acceleration” are plotted as negatives. An example is given in Figure 6A. In addition, the system provides the statistical distribution of the readings and the general average in the section studied.
The conservation profiles of SLC25A21 and GHR are shown in Figure 6B,C, respectively. In both graphs, the positive values largely prevail, although a few small negative peaks may be appreciated in spite of the small enlargement. The general averages of all readings are 0.256 in SLC25A21 and 0.093 in GHR. These averages forcefully include exons; however, these represent a very small fraction of the sequences (0.18% of SLC25A21 in human and 0.66% in GHR). The average figures indicate that, in both genes, the intron sequences tend to be conserved, but the level of conservation is about 2.8 times higher in SLC25A21.

3.10. Differences among Species

The peaks of conservation in the different vertebrate species do not always match the general vertebrate pattern. Here, we provide two examples covering two different 53,000-nucleotide segments in intron 1 (Figure 7A,B).

3.11. Repeats in Introns of Human SLC25A21 and GHR

Within the intronic sequence of human SLC25A21, we found two short Plus/Plus repeats of non-TE sequences: the longest comprised 96 nt (Expect = 5 × 10−34). Within the intronic sequence of human GHR, we found four short Plus/Plus repeats of non-TE sequences: the longest comprised 81 nt (Expect = 3 × 10−22).

3.12. Exon Nucleotide and Amino Acid Conservation in SLC25A21 and GHR (Zebrafish/Human)

In SLC25A21, the zebrafish/human conservation was 69.6% for exon nucleotides and 81.2% for the amino acids. In GHR, the zebrafish/human conservation was 56.3% for exon nucleotides and 43.5% for the amino acids.

4. Discussion

Bony fishes (Euteleostomi) appeared about 430 million years ago (MYA) and evolved into Actinopterygii (including zebrafish) and Sarcopterygii. About 350 MYA the Sarcopterygii evolved into Tetrapoda, of which Amphibia (including Xenopus) and Amniota are specific clades. Amniota evolved, possibly around 320 MYA, into Synapsida (ancestors of Mammals) and Sauropsida (comprising Reptilia and Birds). Eventually, rodents/primates divergence took place around 65–100 MYA, while the species Homo sapiens emerged only 160,000 years ago [27,28,29,30,31,32].
Actinopterygii/Sarcopterygii divergence seems to have entailed a radical re-editing of introns. In SLC25A21 alignments of the actinopterygian zebrafish with human, chicken, and Xenopus (all of the sarcopterygian group), there were five (0.51% of the intron sequence), five (0.42%), and one (0.11%) matches, respectively. The corresponding alignments with the less conserved GHR yielded no matches.
In the sarcopterygian group of Tetrapoda, the subsequent Amphibia/Amniota divergence also entailed a similarly radical re-editing of introns. In SLC25A21 alignments of Xenopus with human and chicken, the matches were 5 (0.45% of the intron sequence), and 6 (0.44%), respectively. The corresponding alignments with the less conserved GHR yielded no match.
In Amniota, the Synapsida/Sauropsida divergence appears to be much more conservative. In the SLC25A21 alignment of human with chicken, the matches were 32 (4.66% of the intron sequence). The corresponding alignment with the less conserved GHR yielded only one match (0.30%).
Eventually, as expected, the more recent rodents/primates divergence was more conservative. The human/mouse alignments yielded 266 matches (40.03%) with SLC25A21 and 27 matches (11.07%) with GHR.
The evolution of exons was also more conservative in SLC25A21 than in GHR, as reported in paragraph 3.12. Thus, the nucleotide dynamics of introns seems to parallel that of exons, although changes in introns are possibly less restricted and can accumulate, while exons are subject to a selective pressure, which minimizes changes that would alter their protein product.
A comparison of the total number of intronic nucleotides in human and zebrafish shows that the figures in human are almost three times those in zebrafish, both in SLC25A21 and GHR (Table 3). According to our investigations (unpublished results), through the whole mitochondrial carrier subfamily, the number of intron nucleotides is significantly higher in human than in zebrafish. The reason for a trend of intron length increase during evolution has not yet been completely elucidated. The hypothesis is that introns can grow by progressive addition of new “structure units” at their 3′ end [33]. In addition, the spontaneous duplication of DNA segments, apparently by accidental replication, was described [34]. In the SLC25A3 gene of vertebrates, a whole exon (about 120 nt long) is duplicated, and two copies which retain a high level of similarity are alternatively transcribed, giving rise to two similar transcripts [35,36,37]. In the present material, we found a few short but significant repeats in the human sequences (Section 3.11) that could be the recognizable remnants of larger duplications which had diverged by a summation of mutations. The relatively regular distribution of matches along the entire gene length seems to contradict the hypothesis of additions at the 3′ end only; on the other hand, the number of duplications observed appears too low to support weighty intron elongations. These observations suggest that the increase in intron length during evolution may occur through different mechanisms.

5. Conclusions

Using the NIH Nucleotide BLAST suite to align the introns of homologous genes of two species, the “dot plot” provides a complete visual image of the intron conservation between the two species. Aligning a genuine intronic sequence with the same sequence after TEs clearing, the “dot plot” returns a clear (negative) image of the numerousness and positions of TEs (Figure 1E). However, when the two species share similar TEs, as is the case of the human SINE Alu and mouse SINE B1 [38], several matches show outside of the alignment diagonal (Figure 2A), but after TE removal, these matches are no longer present (Figure 2B). The UCSC Genome graphics suite may be a powerful tool for displaying multicomparisons along the entire gene of reference, the average conservation level in vertebrates and the conservation level profile in selected individual species, simultaneously. Thus, the combination of the two approaches may be helpful to gain a better understanding of the general evolutionary dynamics in vertebrate introns. For this connection, the graphically basic approaches outlined here may help in providing a qualitative, but readily visual, evaluation of the intron structural conservation/mutation and integration with transposable elements in vertebrate evolution, as a preliminary step to more quantitative approaches.

Author Contributions

M.A.P. Supervision; R.C. and D.V.M. Data curation; V.M. conceptualization, methodology, writing—original draft preparation; A.C. review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data and methods used in this paper are presented in sufficient details. Any additional questions should be directed to the authors.

Acknowledgments

The authors thank to Mary V. C. Pragnell for linguistic text revision.

Conflicts of Interest

The authors state no conflict of interest.

References

  1. Rogozin, I.B.; Carmel, L.; Csuros, M.; Koonin, E.V. Origin and evolution of spliceosomal introns. Biol. Direct 2012, 16, 7–11. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Poverennaya, I.V.; Roytberg, M.A. Spliceosomal Introns: Features, Functions, and Evolution. Biochemistry (Moscow) 2020, 85, 725–734. [Google Scholar] [CrossRef] [PubMed]
  3. Matlin, A.J.; Moore, M.J. Spliceosome assembly and composition. Adv. Exp. Med. Biol. 2007, 623, 14–35. [Google Scholar]
  4. Shi, Y. Mechanistic insights into precursor messenger RNA splicing by the spliceosome. Nat. Rev. Mol. Cell. Biol. 2017, 18, 655–670. [Google Scholar] [CrossRef] [PubMed]
  5. Kastner, B.; Will, C.L.; Stark, H.; Lührmann, R. Structural Insights into Nuclear pre-mRNA Splicing in Higher Eukaryotes. Cold. Spring. Harb. Perspect. Biol. 2019, 11, a032417. [Google Scholar] [CrossRef] [PubMed]
  6. De Conti, L.; Baralle, M.; Buratti, E. Exon and intron definition in pre-mRNA splicing. Wiley Interdiscip. Rev. RNA 2013, 4, 49–60. [Google Scholar] [CrossRef] [PubMed]
  7. Panaro, M.A.; Calvello, R.; Mitolo, V.; Cianciulli, A. 5′ and 3′ splicing signals evolution in vertebrates: Analysis in a conserved gene family. Comput. Biol. Chem. 2020, 86, 107251. [Google Scholar] [CrossRef] [PubMed]
  8. Shaul, O. How introns enhance gene expression. Int. J. Biochem. Cell. Biol. 2017, 91, 145–155. [Google Scholar] [CrossRef]
  9. Rose, A.B. Introns as Gene Regulators: A Brick on the Accelerator. Front. Genet. 2019, 7, 9–672. [Google Scholar] [CrossRef] [Green Version]
  10. Schwartz, S.H.; Silva, J.; Burstein, D.; Pupko, T.; Eyras, E.; Ast, G. Large-scale comparative analysis of splicing signals and their corresponding splicing factors in eukaryotes. Genome Res. 2008, 18, 88–103. [Google Scholar] [CrossRef]
  11. Calvello, R.; Cianciulli, A.; Panaro, M.A. Conservation/Mutation in the splice sites of cytokine receptor genes of mouse and human. Int. J. Evol. Biol. 2013, 2013, 818954. [Google Scholar] [CrossRef]
  12. Chasin, L.A. Searching for splicing motifs. Adv. Exp. Med. Biol. 2007, 623, 85–106. [Google Scholar]
  13. Fedorova, L.; Fedorov, A. Introns in gene evolution. Genetica 2003, 118, 123–131. [Google Scholar] [CrossRef]
  14. Roy, S.W. The origin of recent introns: Transposons? Genome Biol. 2004, 5, 251. [Google Scholar] [CrossRef] [Green Version]
  15. Koonin, E.V. The origin of introns and their role in eukaryogenesis: A compromise solution to the introns-early versus introns-late debate? Biol. Direct 2006, 14, 22. [Google Scholar] [CrossRef] [Green Version]
  16. Penny, D.; Hoeppner, M.P.; Poole, A.M.; Jeffares, D.C. An overview of the introns-first theory. J. Mol. Evol. 2009, 69, 527–540. [Google Scholar] [CrossRef]
  17. Irimia, M.; Roy, S.W. Origin of spliceosomal introns and alternative splicing. Cold. Spring. Harb. Perspect. Biol. 2014, 6, a016071. [Google Scholar] [CrossRef]
  18. Fiermonte, G.; Dolce, V.; Palmieri, L.; Ventura, M.; Runswick, M.J.; Palmieri, F.; Walker, J.E. Identification of the human mitochondrial oxodicarboxylate carrier. Bacterial expression, reconstitution, functional characterization, tissue distribution, and chromosomal location. J. Biol. Chem. 2001, 276, 8225–8230. [Google Scholar] [CrossRef] [Green Version]
  19. Lin, S.; Li, C.; Li, C.; Zhang, X. Growth Hormone Receptor Mutations Related to Individual Dwarfism. Int. J. Mol. Sci. 2018, 19, 1433. [Google Scholar] [CrossRef] [Green Version]
  20. Palmieri, F. The mitochondrial transporter family SLC25: Identification, properties and physiopathology. Mol. Aspects Med. 2013, 34, 465–484. [Google Scholar] [CrossRef]
  21. Palmieri, F.; Scarcia, P.; Monné, M. Diseases Caused by Mutations in Mitochondrial Carrier Genes SLC25: A Review. Biomolecules 2020, 10, 655. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Abril, J.F.; Castelo, R.; Guigó, R. Comparison of splice sites in mammals and chicken. Genome Res. 2005, 15, 111–119. [Google Scholar] [CrossRef] [Green Version]
  23. Sievers, A.; Sauer, L.; Hausmann, M.; Hildenbrand, G. Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions. Genes 2021, 12, 1571. [Google Scholar] [CrossRef] [PubMed]
  24. Bourgeois, Y.; Boissinot, S. On the Population Dynamics of Junk: A Review on the Population Genomics of Transposable Elements. Genes 2019, 10, 419. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Kohany, O.; Gentles, A.J.; Hankus, L.; Jurka, J. Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor. BMC Bioinform. 2006, 7, 474. [Google Scholar] [CrossRef] [Green Version]
  26. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef]
  27. Foote, M.; Hunter, J.P.; Janis, C.M.; Sepkoski, J.J., Jr. Evolutionary and preservational constraints on origins of biologic groups: Divergence times of eutherian mammals. Science 1999, 283, 1310–1314. [Google Scholar] [CrossRef] [Green Version]
  28. Lee, M.S. Molecular clock calibrations and metazoan divergence dates. J. Mol. Evol. 1999, 49, 385–391. [Google Scholar] [CrossRef]
  29. Nei, M.; Xu, P.; Glazko, G. Estimation of divergence times from multiprotein sequences for a few mammalian species and several distantly related organisms. Proc. Natl. Acad. Sci. USA 2001, 98, 2497–2502. [Google Scholar] [CrossRef] [Green Version]
  30. Nobrega, M.A.; Pennacchio, L.A. Comparative genomic analysis as a tool for biological discovery. J. Physiol. 2004, 554, 31–39. [Google Scholar] [CrossRef]
  31. Broughton, R.E.; Betancur-R, R.; Li, C.; Arratia, G.; Ortí, G. Multi-locus phylogenetic analysis reveals the pattern and tempo of bony fish evolution. PLoS Curr. 2013, 5. [Google Scholar] [CrossRef]
  32. Betancur-R, R.; Broughton, R.E.; Wiley, E.O.; Carpenter, K.; López, J.A.; Li, C.; Holcroft, N.I.; Arcila, D.; Sanciangco, M.; Cureton, J.C., II; et al. The tree of life and a new classification of bony fishes. PLoS Curr. 2013, 5. [Google Scholar] [CrossRef]
  33. Zhang, Q.; Li, H.; Zhao, X.Q.; Xue, H.; Zheng, Y.; Meng, H.; Jia, Y.; Bo, S.L. The evolution mechanism of intron length. Genomics 2016, 108, 47–55. [Google Scholar] [CrossRef]
  34. Koszul, R.; Caburet, S.; Dujon, B.; Fischer, G. Eucaryotic genome evolution through the spontaneous duplication of large chromosomal segments. EMBO J. 2004, 23, 234–243. [Google Scholar] [CrossRef]
  35. Dolce, V.; Iacobazzi, V.; Palmieri, F.; Walker, J.E. The sequences of human and bovine genes of the phosphate carrier from mitochondria contain evidence of alternatively spliced forms. J. Biol. Chem. 1994, 269, 10451–11060. [Google Scholar] [CrossRef]
  36. Fiermonte, G.; Dolce, V.; Palmieri, F. Expression in Escherichia coli, functional characterization, and tissue distribution of isoforms A and B of the phosphate carrier from bovine mitochondria. J. Biol. Chem. 1998, 273, 22782–22787. [Google Scholar] [CrossRef] [Green Version]
  37. Calvello, R.; Cianciulli, A.; Panaro, M.A. Unusual structure and splicing pattern of the vertebrate mitochondrial solute carrier SLC25A3 gene. J. Genet. 2018, 97, 225–233. [Google Scholar] [CrossRef]
  38. Krayev, A.S.; Markusheva, T.V.; Kramerov, D.A.; Ryskov, A.P.; Skryabin, K.G.; Bayev, A.A.; Georgiev, G.P. Ubiquitous transposon-like repeats B1 and B2 of the mouse genome: B2 sequencing. Nucleic Acids Res. 1982, 10, 7461–7475. [Google Scholar] [CrossRef] [Green Version]
Figure 1. (A) Abscissa: Human SLC25A21 (490,793 nucleotides; the sequence from start to stop codons after removal of exons). Ordinate: the same sequence as in the Abscissa. Number of matches: 7704. Note the perfect alignment along the diagonal from the lower left corner to the upper right corner and the large number of matches outside the diagonal. (B) Abscissa: Human GHR (292,898 nucleotides; the sequence from start to stop codons after removal of exons). Ordinate: the same sequence as in the Abscissa. Number of matches: 1840. Note the perfect alignment along the diagonal from the lower left corner to the upper right corner and the large number of matches outside the diagonal. (C) Abscissa: Human SLC25A21 (490,793 nucleotides). Ordinate: Human SLC25A21 after removal of all the 130 SINE Alus (462,572 nucleotides). Number of matches: 3220 (compared with A). Note the seemingly perfect alignment along the diagonal from the lower left corner to the upper right corner. (D) A partial, magnified, view of panel (C), showing a 10 K per 10 K area from the lower left corner, magnified by 50×. The gaps correspond to the removed SINE Alus. (E) Abscissa: Human SLC25A21 (490,793 nucleotides). Ordinate: Human SLC25A21 after removal of all TEs (298,209 nucleotides). These additional TEs belong to different categories, and some of them are much larger than the SINE Alus. Number of matches: 505. Note that the gaps (due to TEs) along the diagonal from the lower left corner to the upper right corner are now more visible at low magnification.
Figure 1. (A) Abscissa: Human SLC25A21 (490,793 nucleotides; the sequence from start to stop codons after removal of exons). Ordinate: the same sequence as in the Abscissa. Number of matches: 7704. Note the perfect alignment along the diagonal from the lower left corner to the upper right corner and the large number of matches outside the diagonal. (B) Abscissa: Human GHR (292,898 nucleotides; the sequence from start to stop codons after removal of exons). Ordinate: the same sequence as in the Abscissa. Number of matches: 1840. Note the perfect alignment along the diagonal from the lower left corner to the upper right corner and the large number of matches outside the diagonal. (C) Abscissa: Human SLC25A21 (490,793 nucleotides). Ordinate: Human SLC25A21 after removal of all the 130 SINE Alus (462,572 nucleotides). Number of matches: 3220 (compared with A). Note the seemingly perfect alignment along the diagonal from the lower left corner to the upper right corner. (D) A partial, magnified, view of panel (C), showing a 10 K per 10 K area from the lower left corner, magnified by 50×. The gaps correspond to the removed SINE Alus. (E) Abscissa: Human SLC25A21 (490,793 nucleotides). Ordinate: Human SLC25A21 after removal of all TEs (298,209 nucleotides). These additional TEs belong to different categories, and some of them are much larger than the SINE Alus. Number of matches: 505. Note that the gaps (due to TEs) along the diagonal from the lower left corner to the upper right corner are now more visible at low magnification.
Mps 05 00053 g001aMps 05 00053 g001bMps 05 00053 g001c
Figure 2. (A) Abscissa: Human SLC25A21 (490,793 nucleotides). Ordinate: Mouse slc25a21 (444,514 nucleotides). Number of matches: 1660. Note the significant alignment of most matches along the diagonal from the lower left corner to the upper right corner. The matches which are outside the diagonal mostly correspond to human and mouse TEs similar in structure. (B) Abscissa: Human SLC25A21 after the removal of all TEs. (298,209 nucleotides). Ordinate: Mouse slc25a21 after the removal of all TEs. (296,350 nucleotides). Number of matches: 266.The plot demonstrates a good overall alignment between the two sequences, while the gaps (which are usually short) correspond to poorly conserved sections. Note also the absence of significant matches outside the diagonal. (C) Abscissa: Human GHR (292,898 nucleotides). Ordinate: Mouse GHR (136,305 nucleotides). Number of matches: 280. Note the significant alignment of most matches along the diagonal from human GHR nucleotide 140,000 to the upper right corner, indicating that human nucleotides 1 to 140,000 have no matches with the mouse sequence, while the rest of the human sequence and the whole mouse sequence are homologous. The matches that are outside the diagonal mostly correspond to TEs that are similar in human and mouse. (D) Abscissa: Human GHR (homologous section only) after the removal of all TEs (85,058 nucleotides; from 78,000 to163,058). Ordinate: Mouse GHR after the removal of all TEs (89,777 nucleotides). Number of matches: 27. The plot demonstrates a general overall alignment between the two sequences, but the gaps are numerous and wide, corresponding to poorly conserved sections.
Figure 2. (A) Abscissa: Human SLC25A21 (490,793 nucleotides). Ordinate: Mouse slc25a21 (444,514 nucleotides). Number of matches: 1660. Note the significant alignment of most matches along the diagonal from the lower left corner to the upper right corner. The matches which are outside the diagonal mostly correspond to human and mouse TEs similar in structure. (B) Abscissa: Human SLC25A21 after the removal of all TEs. (298,209 nucleotides). Ordinate: Mouse slc25a21 after the removal of all TEs. (296,350 nucleotides). Number of matches: 266.The plot demonstrates a good overall alignment between the two sequences, while the gaps (which are usually short) correspond to poorly conserved sections. Note also the absence of significant matches outside the diagonal. (C) Abscissa: Human GHR (292,898 nucleotides). Ordinate: Mouse GHR (136,305 nucleotides). Number of matches: 280. Note the significant alignment of most matches along the diagonal from human GHR nucleotide 140,000 to the upper right corner, indicating that human nucleotides 1 to 140,000 have no matches with the mouse sequence, while the rest of the human sequence and the whole mouse sequence are homologous. The matches that are outside the diagonal mostly correspond to TEs that are similar in human and mouse. (D) Abscissa: Human GHR (homologous section only) after the removal of all TEs (85,058 nucleotides; from 78,000 to163,058). Ordinate: Mouse GHR after the removal of all TEs (89,777 nucleotides). Number of matches: 27. The plot demonstrates a general overall alignment between the two sequences, but the gaps are numerous and wide, corresponding to poorly conserved sections.
Mps 05 00053 g002aMps 05 00053 g002b
Figure 3. (A) Abscissa: Human SLC25A21 after the removal of all TEs (298,209 nucleotides). Ordinate: Chicken slc25a21 after the removal of all the TEs (200,341 nucleotides). Number of matches: 32. Short but likely significant matching segments align almost exactly along the diagonal from the lower left corner to the upper right corner. Gaps between matching segments are relatively wide and correspond to segments that do not share significant homologies. Chicken nucleotides aligning with human: 10,717 over 230,188, i.e., 4.66%. (B) A detail (magnification 5.7 times) of the lower left corner of (A) (52,000 by 32,000 nt).
Figure 3. (A) Abscissa: Human SLC25A21 after the removal of all TEs (298,209 nucleotides). Ordinate: Chicken slc25a21 after the removal of all the TEs (200,341 nucleotides). Number of matches: 32. Short but likely significant matching segments align almost exactly along the diagonal from the lower left corner to the upper right corner. Gaps between matching segments are relatively wide and correspond to segments that do not share significant homologies. Chicken nucleotides aligning with human: 10,717 over 230,188, i.e., 4.66%. (B) A detail (magnification 5.7 times) of the lower left corner of (A) (52,000 by 32,000 nt).
Mps 05 00053 g003aMps 05 00053 g003b
Figure 4. Abscissa: Human SLC25A21 (490,793 nucleotides). Ordinate: Zebrafish slc25a21 (200,517 nucleotides). The matching points significantly align along the diagonal from the lower left corner to the upper right corner. Aligned nucleotides = 1022 over 200,517, i.e., 0.51%.
Figure 4. Abscissa: Human SLC25A21 (490,793 nucleotides). Ordinate: Zebrafish slc25a21 (200,517 nucleotides). The matching points significantly align along the diagonal from the lower left corner to the upper right corner. Aligned nucleotides = 1022 over 200,517, i.e., 0.51%.
Mps 05 00053 g004
Figure 5. Abscissa: Human SLC25A21 (490,793 nucleotides). Ordinate: Xenopus slc25a21 (187,108 nucleotides). The matching points do not correspond to TEs and significantly align along the diagonal from the lower left corner to the upper right corner. Aligned nucleotides = 846 over 187,108, i.e., 0.45%.
Figure 5. Abscissa: Human SLC25A21 (490,793 nucleotides). Ordinate: Xenopus slc25a21 (187,108 nucleotides). The matching points do not correspond to TEs and significantly align along the diagonal from the lower left corner to the upper right corner. Aligned nucleotides = 846 over 187,108, i.e., 0.45%.
Mps 05 00053 g005
Figure 6. (A) An example of the analysis of conservation with the UCSC Genome Browser. The data, plotted as a bar graph, derive from the multialignment of 100 representative vertebrates. Each bar measures the average conservation/mutation of a series of 800 nucleotides; the blue bars (positive values) measure the conservation; the red bars (negative values) measure the mutation; the zero level is the conservation expected under neutral drift on the left of the scale (from −1.8647 to 2.20568). (B) SLC25A21: bar graph of the conservation profile in vertebrates. Each of the (minute) bars corresponds to 800 nucleotides. The series of arrows (top of the figure) indicate the direction of translation. The small vertical lines on the line of arrows mark the position of the exons. Other captions are the same as those in (A). (C) GHR: bar graph of the conservation profile in vertebrates. The same scale and other captions are used in (B), but each bar corresponds to 480 nucleotides.
Figure 6. (A) An example of the analysis of conservation with the UCSC Genome Browser. The data, plotted as a bar graph, derive from the multialignment of 100 representative vertebrates. Each bar measures the average conservation/mutation of a series of 800 nucleotides; the blue bars (positive values) measure the conservation; the red bars (negative values) measure the mutation; the zero level is the conservation expected under neutral drift on the left of the scale (from −1.8647 to 2.20568). (B) SLC25A21: bar graph of the conservation profile in vertebrates. Each of the (minute) bars corresponds to 800 nucleotides. The series of arrows (top of the figure) indicate the direction of translation. The small vertical lines on the line of arrows mark the position of the exons. Other captions are the same as those in (A). (C) GHR: bar graph of the conservation profile in vertebrates. The same scale and other captions are used in (B), but each bar corresponds to 480 nucleotides.
Mps 05 00053 g006
Figure 7. (A) A section of the SLC25A21 sequence: The general vertebrate conservation exhibits five distinct peaks, but the chicken sequence matches only two of them. This section approximately corresponds to nucleotides 36,731 K to 36,784 K in human (refers to genomic sequences; Table 2). (B) A section of the SLC25A21 sequence: the general vertebrate conservation exhibits a long series of conserved nucleotides, while the chicken matches only a small part of the series and zebrafish exhibits no signal at all. This section approximately corresponds in human to nucleotides 36,792 K to 36,845 K.
Figure 7. (A) A section of the SLC25A21 sequence: The general vertebrate conservation exhibits five distinct peaks, but the chicken sequence matches only two of them. This section approximately corresponds to nucleotides 36,731 K to 36,784 K in human (refers to genomic sequences; Table 2). (B) A section of the SLC25A21 sequence: the general vertebrate conservation exhibits a long series of conserved nucleotides, while the chicken matches only a small part of the series and zebrafish exhibits no signal at all. This section approximately corresponds in human to nucleotides 36,792 K to 36,845 K.
Mps 05 00053 g007
Table 1. The transcripts and their mRNAs (NCBI GenBank).
Table 1. The transcripts and their mRNAs (NCBI GenBank).
TranscriptExonsSequence IDmRNA
mRNANucleotides
SLC25A21HumanVariant 110NM_030631.4900
SLC25A21MouseVariant 210NM_001167976.1918
SLC25A21ChickenVariant X211XM_025150918.11002
SLC25A21Zebrafish-----------10NM_001076632.1897
SLC25A21Xenopus-----------10NM_001017069.2900
GHRHumanVariant 29NC_000005.101938
GHRMouseVariant 410NM_001286370.11953
GHRChicken-----------8NM_001001293.11827
GHRZebrafish-----------8NM_001083578.11713
GHRXenopus-----------9XM_002933047.41866
Table 2. The genomic sequences (NCBI Reference Sequence). Coverage: FROM the first nucleotide of the Start codon to the last nucleotide of the Stop codon.
Table 2. The genomic sequences (NCBI Reference Sequence). Coverage: FROM the first nucleotide of the Start codon to the last nucleotide of the Stop codon.
ChromosomeSequence IDFROMTO
Genomic
SLC25A21Human14NC_000014.936,680,65837,172,350
SLC25A21Mouse12NC_000078.756,760,59857,206,029
SLC25A21Chicken5NC_006092.536,947,21137,178,401
SLC25A21Zebrafish17NC_007128.738,030,62738,232,040
SLC25A21Xenopus8NC_030684.2108,640,883108,828,890
GHRHuman5NC_000005.1042,424,58942,719,424
GHRMouse15NC_000081.73,349,2243,487,481
GHRChickenZNC_006127.513,495,91913,558,411
GHRZebrafish8NC_007119.731,465,53431,571,560
GHRXenopus1NC_030677.2195,152,927195,320,257
Table 3. Total: total number of nucleotides (start to stop codons); Intronic: number of intronic nucleotides; Transposable Elements: % of nucleotides of transposable elements over the total intronic nucleotides; Intronic Non-TE: intronic nucleotides minus TE nucleotides.
Table 3. Total: total number of nucleotides (start to stop codons); Intronic: number of intronic nucleotides; Transposable Elements: % of nucleotides of transposable elements over the total intronic nucleotides; Intronic Non-TE: intronic nucleotides minus TE nucleotides.
TotalIntronicTransposableIntronic
ntntElements %Non-TE nt
SLC25A21Human491,693490,79339.24298,209
SLC25A21Mouse445,432444,51433.33296,350
SLC25A21Chicken231,191230,18912.97200,341
SLC25A21Zebrafish201,414200,51749.02102,233
SLC25A21Xenopus188,008187,10841.23109,972
GHRHuman294,836292,89844.09163,758
GHRMouse138,258136,30534.1489,777
GHRChicken62,49360,66615.0551,537
GHRZebrafish106,027104,31474.8426,244
GHRXenopus167,331165,46543.0694,223
2,312,769 1,432,644
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Panaro, M.A.; Calvello, R.; Miniero, D.V.; Mitolo, V.; Cianciulli, A. Imaging Intron Evolution. Methods Protoc. 2022, 5, 53. https://doi.org/10.3390/mps5040053

AMA Style

Panaro MA, Calvello R, Miniero DV, Mitolo V, Cianciulli A. Imaging Intron Evolution. Methods and Protocols. 2022; 5(4):53. https://doi.org/10.3390/mps5040053

Chicago/Turabian Style

Panaro, Maria Antonietta, Rosa Calvello, Daniela Valeria Miniero, Vincenzo Mitolo, and Antonia Cianciulli. 2022. "Imaging Intron Evolution" Methods and Protocols 5, no. 4: 53. https://doi.org/10.3390/mps5040053

Article Metrics

Back to TopTop