Next Article in Journal
Virgin Olive Oil Quality Is Affected by the Microbiota that Comprise the Biotic Fraction of the Oil
Next Article in Special Issue
Natural Function and Structural Modification of Climacostol, a Ciliate Secondary Metabolite
Previous Article in Journal
The Microbiome of Peri-Implantitis: A Systematic Review and Meta-Analysis
Previous Article in Special Issue
Comparative Transcriptomics Reveals Distinct Gene Expressions of a Model Ciliated Protozoan Feeding on Bacteria-Free Medium, Digestible, and Digestion-Resistant Bacteria
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Characterization of Simple Sequence Repeats (SSRs) in Ciliated Protists Inferred by Comparative Genomics

Institute of Evolution & Marine Biodiversity, KLMME, Ocean University of China, Qingdao 266003, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally.
Microorganisms 2020, 8(5), 662; https://doi.org/10.3390/microorganisms8050662
Submission received: 22 January 2020 / Revised: 24 April 2020 / Accepted: 26 April 2020 / Published: 1 May 2020

Abstract

:
Simple sequence repeats (SSRs) are prevalent in the genomes of all organisms. They are widely used as genetic markers, and are insertion/deletion mutation hotspots, which directly influence genome evolution. However, little is known about such important genomic components in ciliated protists, a large group of unicellular eukaryotes with extremely long evolutionary history and genome diversity. With recent publications of multiple ciliate genomes, we start to get a chance to explore perfect SSRs with motif size 1–100 bp and at least three motif repeats in nine species of two ciliate classes, Oligohymenophorea and Spirotrichea. We found that homopolymers are the most prevalent SSRs in these A/T-rich species, with AAA (lysine, charged amino acid; also seen as an SSR with one-adenine motif repeated three times) being the codons repeated at the highest frequencies in coding SSR regions, consistent with the widespread alveolin proteins rich in lysine repeats as found in Tetrahymena. Micronuclear SSRs are universally more abundant than the macronuclear ones of the same motif-size, except for the 8-bp-motif SSRs in extensively fragmented chromosomes. Both the abundance and A/T content of SSRs decrease as motif-size increases, while the abundance is positively correlated with the A/T content of the genome. Also, smaller genomes have lower proportions of coding SSRs out of all SSRs in Paramecium species. This genome-wide and cross-species analysis reveals the high diversity of SSRs and reflects the rapid evolution of these simple repetitive elements in ciliate genomes.

1. Introduction

Simple sequence repeats (SSRs), also known as tandem repeats, are abundant components present in all known genomes. They are major contributors of genome repetivity and are associated with transposable elements [1,2,3,4]. Homopolymer runs and microsatellites are two well-known representatives of SSRs. These repeats are usually insertion/deletion (indel) mutation hotspots that cause replication slippage of DNA polymerases. They could lead to high genome instability thus causing certain diseases, for example Lynch syndrome, a hereditary non-polyposis colorectal cancer in humans [5,6,7,8]. The high indel mutation rate of SSRs increases genetic variation between individuals in a population, making SSRs suitable tools for developing genetic markers and for studies of population genetics in a variety of organisms; tandem repeats of amino acids may also facilitate rapid generation of morphological variation [9,10,11,12,13,14].
Ciliates are microbial eukaryotes with high species and genomic diversity, and are characterized by nuclear dimorphism [15,16,17,18,19,20,21,22,23]. The macronucleus is transcriptionally active whereas the micronucleus is only active during sexual reproduction [24]. Genomes of these unicellular organisms are highly A/T-rich and repetitive, causing difficulties in genome-sequencing. Nonetheless, genomes have been deciphered for increasing numbers of species, thus providing the opportunity to study genome evolution using comparative genomics methods [15,25,26,27,28,29,30,31,32].
During development of the new macronucleus, most micronuclear non-coding sequences, including repetitive ones, are eliminated, while some long repeats are still retained in macronuclear genomes [15,33,34]. It remains a question how the genome rearrangement process changes the shape and span of the frequency distribution of macronuclear SSRs, compared with that of the micronucleus.
In this study, we explore the genome-wide variation of SSR characteristics using published high-quality genomes of nine ciliates: Ichthyophthirius multifiliis, Oxytricha trifallax, Paramecium biaurelia, P. caudatum, P. sexaurelia, P. tetraurelia, Pseudocohnilembus persalinus, Stylonychia lemnae, and Tetrahymena thermophila (Table 1). We focus on the patterns of distribution, structure, and codons of SSRs, and the evolutionary mechanisms that determine these patterns.

2. Materials and Methods

2.1. Genome Sequences and Annotations

Genome and annotation data of the following species were downloaded from the National Center for Biotechnology Information (NCBI) Genome database: Ichthyophthirius multifiliis (macronucleus: GCF_000220395.1), Oxytricha trifallax (macronucleus: GCA_000295675.1; micronucleus: GCA_000711775.1), Paramecium tetraurelia (macronucleus: GCA_000715435.1), Pseudocohnilembus persalinus (macronucleus: GCA_001447515.1), Stylonychia lemnae (macronucleus: GCA_000751175.1), and Tetrahymena thermophila (macronucleus: GCF_000189635.1; micronucleus: GCA_000261185.1). Those of Paramecium biaurelia, P. caudatum, and P. sexaurelia were downloaded from the ParameciumDB database (https://paramecium.i2bc.paris-saclay.fr/; access on 20 February 2020).

2.2. Analysis of Simple Sequence Repeats (SSRs)

Perfect SSRs with motif size 1–100 bp (each motif has ≥3 repeats; no SSR with motif size >100 bp was detected in any genomes involved in this study) were detected with a Perl program originally developed by Dr. Way Sung, University of North Carolina, Charlotte. This program applies a greedy algorithm to find the maximum number of repeats. For motifs nested in one SSR, which are rare, only the smallest motif was counted. Details are described in Sung et al. [38]. Codons in SSRs were iterated from coding sequences of each genome, with both the strand and starting codon position taken into account. All statistical tests were carried out in R 3.4.4 [39]. Plotting was performed using R packages ggplot2 and ggpmisc.

3. Results

The detailed genomic features of the nine ciliate species are shown in Table 1. All genomes are A/T-rich (A/T content: 68.30%–84.09%; Table 1) with a wide range of genome sizes and total gene numbers. The species belong to one of two ciliate classes: Oligohymenophorea (Ichthyophthirius multifiliis, Paramecium biaurelia, P. caudatum, P. sexaurelia, P. tetraurelia, Pseudocohnilembus persalinus, Tetrahymena thermophila) and Spirotrichea (Oxytricha trifallax, Stylonychia lemnae). Most macronuclear chromosomes in the two spirotricheans are extremely fragmented and amplified during genome rearrangement.

3.1. Size Distribution and A/T Content of SSRs

SSRs are abundant in all macronuclear genomes, accounting for ~7.59% to 11.97% of the whole genome (Table 2; Figure 1). Such abundance is strongly correlated with the genome-wide A/T content (Pearson’s r = 0.94, p = 0.0002). This confirms that the more polarized the A/T content, the more repetitive the genome. Here, we define a motif as the shortest repeating unit of any given SSR. SSRs with motif sizes 1–10 bp are more abundant than those with longer motifs, especially mononucleotide repeats as homopolymer runs, such as (A)n, (C)n, (G)n, and (T)n (Table 2; Figure 1). In addition to these homopolymer motifs, there are another 166 motifs with sizes of 2–6 bp that are shared in all nine species (Supplementary Table S1). These motifs form similar microsatellite sequences, but their distribution and repeat number do not show specific relevance to each other.
The number of repeats decreases as the motif gets larger (Figure 2). Interestingly, there are peaks at 8-bp motifs in the two spirotricheans, O. trifallax and S. lemnae, with (G)4(T)4 or (A)4(C)4 at the ends of scaffolds being the majority (50.22% and 70.92%, respectively; Figure 1). These repeat motifs are known telomeric sequences that are added mostly to the ends of the gene-sized chromosomes by telomerases during macronuclear development. However, there are extremely rare internal telomeric repeats, defined as (G)4(T)4 or (A)4(C)4 motifs repeated at least twice in contigs with telomeric repeats at both ends and not located at the first or last 10% of the contigs. In S. lemnae, 36 possible internal telomeres are distributed in 36 gene-sized chromosomes; in O. trifallax, 39 in 38 chromosomes (Supplementary Table S2). However, the presence of 1000–1500 internal telomeres in the micronuclear polytene chromosomes has been previously reported in S. lemnae [40,41]. This indicates that most internal telomeres are eliminated or rearranged during macronuclear development, or unknown internal telomeric sequence difference exists between the macronucleus and micronucleus, as previously reported in T. thermophila [42]. In addition, both species have numerous extremely short, gene-sized (i.e., <1 kbp) chromosomes. This is consistent with the assertion that extreme genome fragmentation and amplification increases genome repetivity. By contrast, motifs larger than 10 bp are rare, especially in the two spirotricheans, the assembly scaffolds of which are extremely short (Table 1).
The A/T content of SSRs is significantly higher than that of the corresponding genomes (one-sided paired t-test, t = -21.563, df = 8, p = 1.13 × 10−8; Table 1 and Table 2) and they are strongly correlated (r = 0.90, p = 0.0008). The higher A/T content of SSRs is likely due to the dominance of A/T homopolymers in SSRs (Table 2). This domination also elevates the median A/T content of SSRs in all nine species almost to 1.0 (Figure 2). A/T content generally decreases as motif size gets larger (Figure 2; Table 2).

3.2. Association between SSRs and Genome Architecture

It is known that repetitive elements contribute to the generation or positional rearrangement of overlapping genes [43,44], for example, in mosquitos the overlapping events are significantly associated with the microsatellite sequences’ amount in the overlapped genes. The microsatellite sequences might have facilitated the crossover events, which lead to positional rearrangement of neighboring genes [44]. Thus, we ask whether ciliate genomes with more SSRs would have more overlapping genes. The proportion of overlapping genes and the proportion of SSRs in the genome are not correlated with each other (Pearson’s r = 0.55; p = 0.12), giving no significant support to the assertion that SSRs elevate the number of overlapping genes. Nonetheless, the possibility that such lack of correlation is an artifact caused by insufficient annotation quality cannot be excluded. It is noteworthy that there are only three species with overlapping genes and the two with the most overlapping genes, i.e., Paramecium tetraurelia and Tetrahymena thermophila, have the best-annotated/maintained genomes (Table 1).
We also ask the question whether SSRs in the macronuclear and micronuclear genomes follow the same size distributions. Due to the paucity of available micronuclear genomes, only O. trifallax and T. thermophila are included in this analysis. In O. trifallax, for the same motif size, there are more SSRs in the micronuclear genome than in the macronuclear genome, except for those with 8-bp motifs (Figure 3). Of these repeat motifs, 50.22% are in telomeres, probably because the chromosomes are extensively fragmented and amplified during macronuclear development. In O. trifallax, 8-bp-motif SSRs account for about 9.46% of all non-homopolymer SSRs in the macronuclear genome, whereas this proportion is only 0.04% in the micronuclear genome. By contrast, in T. thermophila, a species with low levels of genome rearrangement, micronuclear SSRs are universally more abundant than the macronuclear SSRs, i.e., there is higher repetivity in the micronuclear than the macronuclear genome (Figure 3).
In order to show more specific SSR patterns, we picked two genes (MTA6, MTB6; each contains one internally eliminated sequence (IES); NCBI accession numbers: KC405252.1, KC405257.1) in the T. thermophila mating type gene family, which are well-studied and have clear gene structural annotations [45]. For each gene, we ran the SSR pipelines and aligned the MDSs (Macronucleus-Destined Sequences) in the micronuclear genome with those in the macronuclear genome (Supplementary Table S3). Consistent with the genome-wide comparison shown in Figure 3, after taking into account all sites of both genes, the macronuclear genes have fewer SSRs than the micronuclear ones. We also parsed out micronuclear intronic SSRs of the two genes and aligned them with those in the macronuclear introns. These conserved SSRs (at least in the two focal genes) do not only include homopolymers such as 5′AAAAAAAA3′, 5′AAAAA3′, but also include microsatellites 5′AATAATAAT3′, 5′ATATAT3′, 5′TATATA3′. The specific functions for these SSRs are unclear, and they could be motifs associated with the rearrangement process. Analyzing SSRs in MDSs shared by both MIC and MAC MTA6 and MTB6 genes, we found that ~50% of SSRs have a higher copy number in the macronucleus than in the micronucleus, with the remaining ~50% being equal in the two nuclei. As mentioned above, the total number of SSRs in the two genes (full length) are higher in the micronucleus than in the macronucleus, thus implying that IESs greatly elevate the repetitiveness of the micronuclear genome. This observation from the two genes might be extended to whole-genome-level, although a robust test with fully-annotated macronuclear and micronuclear genomes would be needed. We also found a few SSRs unique to the macronuclear MDSs (i.e., not present in the corresponding MIC genes), for example, 5′CTCCTCCTC3′, 5′CTGCTGCTG3′, 5′GCTGCTGCT3′, 5′TCTCTC3′, 5′TGCTGCTGC3′ in MTA6; 5′AACAACAAC3′, 5′AGCAGCAGC3′, 5′AGTAGTAGT3′, 5′CTTCTTCTT3′, 5′GAGAGA3′, 5′TGGTGGTGG3′ in MTB6 (Supplementary Table S3), suggesting that novel SSRs might be created during the rearrangement process.
Since some tandem repeats with 10–20 bp repeat units are involved in the genome rearrangement [46], we searched SSRs with repeat motifs of 10–20 bases in the micronuclear and macronuclear genomes of both Tetrahymena thermophila and Oxytricha trifallax (Supplementary Table S4). These SSRs are more abundant in the micronucleus than in the macronucleus (42 in the micronucleus vs. 25 in the macronucleus of T. thermophila, and among them 10 are shared with mostly the same sequence and length in both genomes; 368 vs. 8 in O. trifallax and 4 are shared; Supplementary Table S4) and are distributed evenly along the scaffolds/chromosomes in both genomes. We also compared these SSRs to those previously published. Interestingly, two identical 19mer SSRs have been detected in two different micronuclear scaffolds (5′ATTATTTCTTTTTACATTT3′; Supplementary Table S4). These are known tandem repeats in Tlr1 [Tetrahymena long repeat 1; a member of a gene family with 20-30 DNA elements encoding a polynucleotide transferase; 45], which is involved in genome rearrangement of T. thermophila [47] (Supplementary Table S4). This example and the identification of other 10-20bp SSRs confirm the quality of the genomes, the fidelity of the analysis, as well as provide unexplored SSR candidates possibly functioning in the genome arrangement process of both T. thermophila and O. trifallax.

3.3. SSRs in Coding Regions

SSRs are evenly distributed in gene regions, without upstream or downstream biases (Table 2, RPG). As is shown in Figure 4, the top four codons in SSRs of all nine species are AAA (codes for lysine, a charged amino acid), TTT (phenylalanine, a hydrophobic amino acid), GGG (glycine, a hydrophobic amino acid), and CCC (proline, a hydrophobic amino acid). This is consistent with the observation that the vast majority of SSRs are homopolymers.
In order to identify codons that are frequently repeated in coding regions, or possibly most tolerated by the gene, we analyzed codons that are repeated more than 10 times. Isoleucine (hydrophobic), asparagine (hydrophilic), leucine (hydrophobic), tyrosine (hydrophilic), and glutamic acid (charged) codon repetitions are the most abundant in most species. Ichthyophthirius multifiliis, Paramecium biaurelia, P. sexaurelia, and P. tetraurelia are the four species with the highest numbers of repeated codons (Table 3). Of the oligohymenophoreans, P. caudatum seems to have extremely rare repeated codons. This result suggests that in the four Paramecium species included in the present study, the relative abundance of coding SSRs is strongly correlated with genome size (adjusted R2 = 0.98, p = 0.006; Table 1 and Table 2). However, when all nine species were analyzed, the correlation is not significant (adjusted R2 = 0.13, p = 0.19).

4. Discussion

In this study, we investigated perfect SSRs in nine ciliate species for which high-quality genomic data are available in order to determine their size distribution, A/T content, repeated codons, and their association with other genomic features. Nevertheless, characterization of SSRs is not the equivalent of a comprehensive investigation of genome repetivity since similar studies have yet to be carried out on large repetitive elements, e.g., transposable elements.
A/T content generally decreases as motif size increases (Figure 2; Table 2), which is consistent with the observation of minisatellites (motif size > 10 bp) being GC-rich in other organisms [48]. In the macronuclear genomes of all the nine ciliates in this study, we also confirm that A/T content of each single motif is also associated with A/T content of the flanking region (the two nucleotides flanking each SSR; Pearson’s r ~1, p < 2.20 × 10−16), which indicates the origin of non-dispersal repeats.
We found that A/T content is strongly associated with SSR abundance. In comparison with other protists, the level of SSR content in ciliates is similar to that of the malaria pathogen Plasmodium falciparum (~9% of the genome is SSRs; A/T content 80.67%) [49], while it is much lower than that of Trypanosoma cruzi (~30% of the genome is SSRs; A/T content 48.30%) [50], suggesting that the positive correlation between A/T content and SSR abundance is not a general rule in protists, and infers diversifying mechanisms in genome repetitive elements evolution.
Amino acid repeats in proteins are known to play important roles in pathogenesis, cell interaction, motility, cytoskeleton and morphological evolution [13,51,52]. In parasitic ciliates such as Ichthyophthirius multifiliis and Cryptocaryon irritans, amino acid repeats are important components of the cell surface immobilization antigens (i-ags), which are targets of host antibodies, and codons for amino acids repeats are usually repeated also at the DNA level [53,54,55]. These repeats could cause unequal crossover, creating new alleles and thus increasing antigen diversity. Such recombinogenic expansion of surface antigens might be an adaptive strategy to increase the survival of parasitic ciliates when facing the harsh environment of host secretions. Therefore, the unstable nature of SSRs/tandem repeats could be partially advantageous for ciliate genome evolution, especially for parasitic species.
Across all the ciliate species in this study, the most abundant 3-bp SSRs in coding regions are AAAs, which code for lysines. Lysine-repeats are the most abundant amino-acid repeats in the pellicle alveolins of the alveoli, which are important cellular structures in ciliates for occupying diverse habitats and reflect highly divergent protein evolution [51,56,57,58]. This finding suggests that the SSR motifs are conserved in ciliates with different morphology and life histories. Homopolymers are prone to occur in non-coding regions (Table 2, coding SSR proportion column). It has previously been suggested that homopolymers in non-coding regions can be involved in protein binding, e.g., as upstream promoter elements [59], which implies that the presence of SSRs might be a key factor in driving genome evolution in ciliates. Besides, repeated-codons (>=10 repeats) are rare, potentially as a result of stronger selection against gene mis/dysfunction caused by repetivity in smaller genomes.
In ciliates, the macronucleus is resorbed in each sexual cycle, and its evolution is more driven by epigenetic mechanisms other than classical genetic mechanisms. Relating macronuclear SSRs to the genome evolution of ciliates thus seems to be difficult; however, the macronuclear genome structurally corresponds to the macronucleus-destined sequences in the micronucleus, and the haploid genome sizes of the macronucleus and micronucleus do not usually differ much in most ciliates. In other words, studying macronuclear SSRs’ roles in genome evolution is like an investigation by subsampling the short repetitive elements in the MIC genome (as is shown in Figure 3), with the assumption that short non-IES (internally eliminated sequences) repeats are conserved in both the MAC and MIC, although this might not always be true especially in species with highly fragmented and scrambled genes. Of course, a full picture of SSRs in genome evolution would definitely need the micronuclear genome sequences well annotated in more species.

5. Conclusions

This genome-wide and cross-species analysis reveals general features of ciliate SSRs and demonstrates the association between SSRs and the unique genome architectures of ciliates. SSRs might thus be an important driver in genome evolution of this large, charismatic group of microbial eukaryotes.

Supplementary Materials

The following are available online at https://www.mdpi.com/2076-2607/8/5/662/s1, Table S1. The SSR motifs that are shared in all nine ciliate species in this study; Table S2. Details of internal telomeric repeats in Stylonychia lemnae and Oxytricha trifallax. Table S3. The SSRs from the MTA6 and MTB6 genes in the macronucleus and micronucleus of Tetrahymena thermophila. Table S4. Details of the macronuclear and micronuclear SSRs with motif size 10–20 bp in Tetrahymena thermophila and Oxytricha trifallax.

Author Contributions

Conceptualization, Y.L., H.L. and Y.Y.; data curation, K.W.; formal analysis, X.C. and H.L.; visualization, Y.L., X.C. and H.L.; methodology, K.W. and H.L.; project administration, H.L. and Y.Y.; resources, H.L.; software, J.P.; supervision, H.L. and Y.Y.; validation, H.L. and Y.Y.; writing—original draft preparation, H.L., Y.Y. and Y.L.; writing—review and editing, Y.Y. and Y.L.; funding acquisition, H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by grants from the Marine S and T Fund of Shandong Province for Pilot National Laboratory for Marine Science and Technology (Qingdao) (2018SDKJ0406-5), Distinguished Scholars Support Program of Laboratory for Marine Biology and Biotechnology (YJ2019NO04), the National Natural Science Foundation of China (31872228, 31961123002), Young Taishan Scholars Program (tsqn201812024), and Fundamental Research Funds for the Central Universities of China.

Acknowledgments

We thank Way Sung for technical help. Our special thanks are given to Weibo Song (OUC), Alan Warren (the Natural History Museum, UK), and the insightful reviewers for their constructive suggestions on the manuscript. We appreciate the computation support from the IEMB-1 cluster at OUC.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Lupski, J.R.; Weinstock, G.M. Short, interspersed repetitive DNA sequences in prokaryotic genomes. J. Bacteriol. 1992, 174, 4525–4529. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Tautz, D.; Renz, M. Simple sequences are ubiquitous repetitive components of eukaryotic genomes. Nucleic Acids Res. 1984, 12, 4127–4138. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Temnykh, S.; DeClerck, G.; Lukashova, A.; Lipovich, L.; Cartinhour, S.; McCouch, S. Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): Frequency, length variation, transposon associations, and genetic marker potential. Genome Res. 2001, 11, 1441–1452. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Zheng, W.B.; Wang, C.D.; Yan, Y.; Gao, F.; Doak, T.G.; Song, W.B. Insights into an extensively fragmented eukaryotic genome: De novo genome sequencing of the multinuclear ciliate Uroleptopsis citrina. Genome Biol. Evol. 2018, 10, 883–894. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Baglietto, L.; Lindor, N.M.; Dowty, J.G.; White, D.M.; Wagner, A.; Gomez Garcia, E.B.; Vriends, A.H.; Dutch Lynch Syndrome Study, G.; Cartwright, N.R.; Barnetson, R. A Risks of Lynch syndrome cancers for MSH6 mutation carriers. J. Natl. Cancer Inst. 2010, 102, 193–201. [Google Scholar] [CrossRef] [PubMed]
  6. Caskey, C.T.; Pizzuti, A.; Fu, Y.H.; Fenwick, R.G.; Nelson, D.L. Triplet repeat mutations in human-disease. Science 1992, 256, 784–789. [Google Scholar] [CrossRef]
  7. Fondon, J.W., III; Hammock, E.A.; Hannan, A.J.; King, D.G. Simple sequence repeats: Genetic modulators of brain function and behavior. Trends Neurosci. 2008, 31, 328–334. [Google Scholar] [CrossRef]
  8. Sutherland, G.R.; Richards, R.I. Simple tandem DNA repeats and human genetic disease. Proc. Natl. Acad. Sci. USA 1995, 92, 3636–3641. [Google Scholar] [CrossRef] [Green Version]
  9. Di Rienzo, A.; Peterson, A.C.; Garza, J.C.; Valdes, A.M.; Slatkin, M.; Freimer, N.B. Mutational processes of simple-sequence repeat loci in human populations. Proc. Natl. Acad. Sci. USA 1994, 91, 3166–3170. [Google Scholar] [CrossRef] [Green Version]
  10. Gupta, M.; Chyi, Y.S.; Romero-Severson, J.; Owen, J.L. Amplification of DNA markers from evolutionarily diverse genomes using single primers of simple-sequence repeats. Theor. Appl. Genet. 1994, 89, 998–1006. [Google Scholar] [CrossRef]
  11. Powell, W.; Machray, G.C.; Provan, J. Polymorphism revealed by simple sequence repeats. Trends Plant Sci. 1996, 1, 215–222. [Google Scholar] [CrossRef]
  12. Reddy, M.P.; Sarla, N.; Siddiq, E.A. Inter simple sequence repeat (ISSR) polymorphism and its application in plant breeding. Euphytica 2002, 128, 9–17. [Google Scholar] [CrossRef]
  13. Fondon, J.W., 3rd; Garner, H.R. Molecular origins of rapid and continuous morphological evolution. Proc. Natl. Acad. Sci. USA 2004, 101, 18058–18063. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Huntley, M.A.; Clark, A.G. Evolutionary analysis of amino acid repeats across the genomes of 12 Drosophila species. Mol. Biol. Evol. 2007, 24, 2598–2609. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Eisen, J.A.; Coyne, R.S.; Wu, M.; Wu, D.; Thiagarajan, M.; Wortman, J.R.; Badger, J.H.; Ren, Q.; Amedeo, P.; Jones, K.M.; et al. Macronuclear genome sequence of the ciliate Tetrahymena thermophila, a model eukaryote. PLoS Biol. 2006, 4, e286. [Google Scholar] [CrossRef]
  16. Foissner, W. Protist diversity: Estimates of the near-imponderable. Protist 1999, 150, 363–368. [Google Scholar] [CrossRef]
  17. Gao, F.; Warren, A.; Zhang, Q.; Gong, J.; Miao, M.; Sun, P.; Xu, D.; Huang, J.; Yi, Z.; Song, W. The all-data-based evolutionary hypothesis of ciliated protists with a revised classification of the phylum Ciliophora (Eukaryota, Alveolata). Sci. Rep. 2016, 6, 24874. [Google Scholar] [CrossRef] [Green Version]
  18. McGrath, C.L.; Katz, L.A. Genome diversity in microbial eukaryotes. Trends Ecol. Evol. 2004, 19, 32–38. [Google Scholar] [CrossRef]
  19. Wang, Y.; Wang, C.; Jiang, Y.; Katz, L.A.; Gao, F.; Yan, Y. Further analyses of variation of ribosome DNA copy number and polymorphism in ciliates provide insights relevant to studies of both molecular ecology and phylogeny. Sci. China Life Sci. 2019, 62, 203–214. [Google Scholar] [CrossRef]
  20. Warren, A.; Patterson, D.J.; Dunthorn, M.; Clamp, J.C.; Achilles-Day, U.E.M.; Aescht, E.; Al-Farraj, S.A.; Al-Quraishy, S.; Al-Rasheid, K.; Carr, M.; et al. Beyond the “Code”: A guide to the description and documentation of biodiversity in ciliated protists (Alveolata, Ciliophora). J. Eukaryot. Microbiol. 2017, 64, 539–554. [Google Scholar] [CrossRef] [Green Version]
  21. Zhang, T.T.; Wang, C.D.; Katz, L.A.; Gao, F. A paradox: Rapid evolution rates of germline-limited sequences are associated with conserved patterns of rearrangements in cryptic species of Chilodonella uncinata (Protist, Ciliophora). Sci. China Life Sci. 2018, 61, 1071–1078. [Google Scholar] [CrossRef] [PubMed]
  22. Zhao, Y.; Yi, Z.; Warren, A.; Song, W.B. Species delimitation for the molecular taxonomy and ecology of the widely distributed microbial eukaryote genus Euplotes (Alveolata, Ciliophora). Proc. R. Soc. B 2018, 285, 20172159. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Cheng, T.; Wang, Y.; Huang, J.; Chen, X.; Zhao, X.; Gao, S.; Song, W. Our recent progress in epigenetic research using the model ciliate, Tetrahymena thermophila. Mar. Life Sci. Technol. 2019, 1, 4–14. [Google Scholar]
  24. Prescott, D.M. The DNA of ciliated protozoa. Microbiol. Rev. 1994, 58, 233–267. [Google Scholar] [CrossRef]
  25. Aeschlimann, S.H.; Jonsson, F.; Postberg, J.; Stover, N.A.; Petera, R.L.; Lipps, H.J.; Nowacki, M.; Swart, E.C. The draft assembly of the radically organized Stylonychia lemnae macronuclear genome. Genome Biol. Evol. 2014, 6, 1707–1723. [Google Scholar] [CrossRef]
  26. Aury, J.M.; Jaillon, O.; Duret, L.; Noel, B.; Jubin, C.; Porcel, B.M.; Segurens, B.; Daubin, V.; Anthouard, V.; Aiach, N.; et al. Global trends of whole-genome duplications revealed by the ciliate Paramecium tetraurelia. Nature 2006, 444, 171–178. [Google Scholar] [CrossRef]
  27. Chen, X.; Wang, Y.; Sheng, Y.; Warren, A.; Gao, S. GPS it: An automated method for evolutionary analysis of nonculturable ciliated microeukaryotes. Mol. Ecol. Resour. 2018, 18, 700–713. [Google Scholar] [CrossRef]
  28. Coyne, R.S.; Hannick, L.; Shanmugam, D.; Hostetler, J.B.; Brami, D.; Joardar, V.S.; Johnson, J.; Radune, D.; Singh, I.; Badger, J.H.; et al. Comparative genomics of the pathogenic ciliate Ichthyophthirius multifiliis, its free-living relatives and a host species provide insights into adoption of a parasitic lifestyle and prospects for disease control. Genome Biol. 2011, 12, R100. [Google Scholar] [CrossRef] [Green Version]
  29. McGrath, C.L.; Gout, J.F.; Doak, T.G.; Yanagi, A.; Lynch, M. Insights into three whole-genome duplications gleaned from the Paramecium caudatum genome sequence. Genetics 2014, 197, 1417–1428. [Google Scholar] [CrossRef] [Green Version]
  30. Swart, E.C.; Bracht, J.R.; Magrini, V.; Minx, P.; Chen, X.; Zhou, Y.; Khurana, J.S.; Goldman, A.D.; Nowacki, M.; Schotanus, K.; et al. The Oxytricha trifallax macronuclear genome: A complex eukaryotic genome with 16,000 tiny chromosomes. PLoS Biol. 2013, 11, e1001473. [Google Scholar] [CrossRef] [Green Version]
  31. Wang, Y.; Chen, X.; Sheng, Y.; Liu, Y.; Gao, S. N6-adenine DNA methylation is associated with the linker DNA of H2A. Z-containing well-positioned nucleosomes in Pol II-transcribed genes in Tetrahymena. Nucleic Acids Res. 2017, 45, 11594–11606. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Xiong, J.; Wang, G.; Cheng, J.; Tian, M.; Pan, X.; Warren, A.; Jiang, C.; Yuan, D.; Miao, W. Genome of the facultative scuticociliatosis pathogen Pseudocohnilembus persalinus provides insight into its virulence through horizontal gene transfer. Sci. Rep. 2015, 5, 15470. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Forney, J.; Rodkey, K. A repetitive DNA sequence in Paramecium macronuclei is related to the beta subunit of G proteins. Nucleic Acids Res. 1992, 20, 5397–5402. [Google Scholar] [CrossRef] [PubMed]
  34. Couvillion, M.T.; Lee, S.R.; Hogstad, B.; Malone, C.D.; Tonkin, L.A.; Sachidanandam, R.; Hannon, G.J.; Collins, K. Sequence, biogenesis, and function of diverse small RNA classes bound to the Piwi family proteins of Tetrahymena thermophila. Genes Dev. 2009, 23, 2016–2032. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Chen, X.; Bracht, J.R.; Goldman, A.D.; Dolzhenko, E.; Clay, D.M.; Swart, E.C.; Perlman, D.H.; Doak, T.G.; Stuart, A.; Amemiya, C.T.; et al. The architecture of a scrambled genome reveals massive levels of genomic rearrangement during development. Cell 2014, 158, 1187–1198. [Google Scholar] [CrossRef] [Green Version]
  36. Coyne, R.S.; Thiagarajan, M.; Jones, K.M.; Wortman, J.R.; Tallon, L.J.; Haas, B.J.; Cassidy-Hanley, D.M.; Wiley, E.A.; Smith, J.J.; Collins, K.; et al. Refined annotation and assembly of the Tetrahymena thermophila genome sequence through EST analysis, comparative genomic hybridization, and targeted gap closure. BMC Genomics 2008, 9, 562. [Google Scholar] [CrossRef] [Green Version]
  37. Hamilton, E.P.; Kapusta, A.; Huvos, P.E.; Bidwell, S.L.; Zafar, N.; Tang, H.; Hadjithomas, M.; Krishnakumar, V.; Badger, J.H.; Caler, E.V.; et al. Structure of the germline genome of Tetrahymena thermophila and relationship to the massively rearranged somatic genome. eLife 2016, 5. [Google Scholar] [CrossRef]
  38. Sung, W.; Tucker, A.; Bergeron, R.D.; Lynch, M.; Thomas, W.K. Simple sequence repeat variation in the Daphnia pulex genome. BMC Genom. 2010, 11, 691. [Google Scholar] [CrossRef] [Green Version]
  39. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2018; Available online: https://www.r-project.org/ (accessed on 1 March 2020).
  40. Stoll, S.; Zirlik, T.; Maercker, C.; Lipps, H.J. The organization of internal telomeric repeats in the polytene chromosomes of the hypotrichous ciliate Stylonychia lemnae. Nucleic Acids Res. 1993, 21, 1783–1788. [Google Scholar] [CrossRef] [Green Version]
  41. Williams, K.; Doak, T.G.; Herrick, G. Developmental precise excision of Oxytricha trifallax telomere-bearing elements and formation of circles closed by a copy of the flanking target duplication. EMBO J. 1993, 12, 4593–4601. [Google Scholar] [CrossRef]
  42. Kirk, K.E.; Blackburn, E.H. An unusual sequence arrangement in the telomeres of the germ-line micronucleus in Tetrahymena thermophila. Genes Dev. 1995, 9, 59–71. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Makalowska, I.; Lin, C.F.; Hernandez, K. Birth and death of gene overlaps in vertebrates. BMC Evol. Biol. 2007, 7, 193. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Behura, S.K.; Severson, D.W. Overlapping genes of Aedes aegypti: Evolutionary implications from comparison with orthologs of Anopheles gambiae and other insects. BMC Evol. Biol. 2013, 13, 124. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Cervantes, M.D.; Hamilton, E.P.; Xiong, J.; Lawson, M.J.; Yuan, D.; Hadjithomas, M.; Miao, W.; Orias, E. Selecting one of several mating types through gene segment joining and deletion in Tetrahymena thermophila. PLoS Biol. 2013, 11, e1001518. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Gershan, J.A.; Karrer, K.M. A family of developmentally excised DNA elements in Tetrahymena is under selective pressure to maintain an open reading frame encoding an integrase-like protein. Nucleic Acids Res. 2000, 28, 4105–4112. [Google Scholar] [CrossRef] [Green Version]
  47. Wells, J.M.; Ellingson, J.L.; Catt, D.M.; Berger, P.J.; Karrer, K.M. A small family of elements with long inverted repeats is located near sites of developmentally regulated DNA rearrangement in Tetrahymena thermophila. Mol. Cell. Biol. 1994, 14, 5939–5949. [Google Scholar] [CrossRef] [Green Version]
  48. Vergnaud, G.; Denoeud, F. Minisatellites: Mutability and genome architecture. Genome Res. 2000, 10, 899–907. [Google Scholar] [CrossRef] [Green Version]
  49. Tan, J.C.; Tan, A.; Checkley, L.; Honsa, C.M.; Ferdig, M.T. Variable numbers of tandem repeats in Plasmodium falciparum genes. J. Mol. Evol. 2010, 71, 268–278. [Google Scholar] [CrossRef] [Green Version]
  50. El-Sayed, N.M.; Myler, P.J.; Bartholomeu, D.C.; Nilsson, D.; Aggarwal, G.; Tran, A.N.; Ghedin, E.; Worthey, E.A.; Delcher, A.L.; Blandin, G.; et al. The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas disease. Science 2005, 309, 409–415. [Google Scholar] [CrossRef] [Green Version]
  51. Gould, S.B.; Kraft, L.G.; van Dooren, G.G.; Goodman, C.D.; Ford, K.L.; Cassin, A.M.; Bacic, A.; McFadden, G.I.; Waller, R.F. Ciliate pellicular proteome identifies novel protein families with characteristic repeat motifs that are common to alveolates. Mol. Biol. Evol. 2011, 28, 1319–1331. [Google Scholar] [CrossRef] [Green Version]
  52. Davies, H.M.; Thalassinos, K.; Osborne, A.R. Expansion of lysine-rich repeats in Plasmodium proteins generates novel localization sequences that target the periphery of the host erythrocyte. J. Biol. Chem. 2016, 291, 26188–26207. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Clark, T.G.; Lin, T.L.; Dickerson, H.W. Surface immobilization antigens of Ichthyophthirius multifiliis: Their role in protective immunity. Annu. Rev. Fish Dis. 1995, 5, 113–131. [Google Scholar] [CrossRef]
  54. Gerber, C.A.; Lopez, A.B.; Shook, S.J.; Doerder, F.P. Polymorphism and selection at the SerH immobilization antigen locus in natural populations of Tetrahymena thermophila. Genetics 2002, 160, 1469–1479. [Google Scholar] [PubMed]
  55. Mo, Z.Q.; Xu, S.; Cassidy-Hanley, D.M.; Li, Y.W.; Kolbin, D.; Fricke, J.M.; Li, A.X.; Clark, T.G.; Dan, X.M. Characterization and immune regulation role of an immobilization antigen from Cryptocaryon irritans on groupers. Sci. Rep. 2019, 9, 1029. [Google Scholar] [CrossRef] [PubMed]
  56. Gould, S.B.; Tham, W.H.; Cowman, A.F.; McFadden, G.I.; Waller, R.F. Alveolins, a new family of cortical proteins that define the protist infrakingdom Alveolata. Mol. Biol. Evol. 2008, 25, 1219–1230. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  57. Katti, M.V.; Ranjekar, P.K.; Gupta, V.S. Differential distribution of simple sequence repeats in eukaryotic genome sequences. Mol. Biol. Evol. 2001, 18, 1161–1167. [Google Scholar] [CrossRef] [Green Version]
  58. Katti, M.V.; Sami-Subbu, R.; Ranjekar, P.K.; Gupta, V.S. Amino acid repeat patterns in protein sequences: Their diversity and structural-functional implications. Protein Sci. 2000, 9, 1203–1209. [Google Scholar] [CrossRef]
  59. Bizzaro, J.W.; Marx, K.A. Poly: A quantitative analysis tool for simple sequence repeat (SSR) tracts in DNA. BMC Bioinform. 2003, 4, 22. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Counts of simple sequence repeats (SSRs) with 1–100 bp motifs (≥three repeats) in the nine ciliate macronuclear genomes. The y-axis is log10 transformed.
Figure 1. Counts of simple sequence repeats (SSRs) with 1–100 bp motifs (≥three repeats) in the nine ciliate macronuclear genomes. The y-axis is log10 transformed.
Microorganisms 08 00662 g001
Figure 2. Number of motif repeats, which is represented by y-axis values ≥3, and A/T content in SSRs with different sizes of motifs, represented by y-axis values ≤1. Dots are jittered. Due to the limited jittering-distance, the sizes of dots do not reflect the dominating number of homopolymer SSRs. The y-axis is log10 transformed.
Figure 2. Number of motif repeats, which is represented by y-axis values ≥3, and A/T content in SSRs with different sizes of motifs, represented by y-axis values ≤1. Dots are jittered. Due to the limited jittering-distance, the sizes of dots do not reflect the dominating number of homopolymer SSRs. The y-axis is log10 transformed.
Microorganisms 08 00662 g002
Figure 3. Comparison of SSR counts in the macronucleus and micronucleus of Oxytricha trifallax and Tetrahymena thermophila. The arrow marks the 8-bp-motif SSRs in the macronuclear genome. The y-axis is log10-transformed.
Figure 3. Comparison of SSR counts in the macronucleus and micronucleus of Oxytricha trifallax and Tetrahymena thermophila. The arrow marks the 8-bp-motif SSRs in the macronuclear genome. The y-axis is log10-transformed.
Microorganisms 08 00662 g003
Figure 4. Numbers of codons that are in SSR regions. White boxes represent 0. Ich, Ichthyophthirius multifiliis; Oxy, Oxytricha trifallax; Pbia, Paramecium biaurelia; Pcau, P. caudatum; Psex, P. sexaurelia; Ptet, P. tetraurelia; Pseudo, Pseudocohnilembus persalinus; Sty, Stylonychia lemnae; Tetra, Tetrahymena thermophila.
Figure 4. Numbers of codons that are in SSR regions. White boxes represent 0. Ich, Ichthyophthirius multifiliis; Oxy, Oxytricha trifallax; Pbia, Paramecium biaurelia; Pcau, P. caudatum; Psex, P. sexaurelia; Ptet, P. tetraurelia; Pseudo, Pseudocohnilembus persalinus; Sty, Stylonychia lemnae; Tetra, Tetrahymena thermophila.
Microorganisms 08 00662 g004
Table 1. Features of macronuclear and micronuclear genomes analyzed in this study.
Table 1. Features of macronuclear and micronuclear genomes analyzed in this study.
SpeciesG (Mbp)A/TTNGnN50 (kbp)PlatformClassData Source
Ichthyophthirius multifiliis (MAC)48.8084.0980964955.11454, SangerOligohymenophorea[28]
Oxytricha trifallax (MAC)67.1668.651850003.74Illumina, 454, SangerSpirotrichea[30]
Oxytricha trifallax (MIC)496.2971.56810 a-27.81Illumina, PacBioSpirotrichea[35]
Paramecium biaurelia (MAC)79.9674.23392420-Illumina, 454Oligohymenophorea[29]
P. caudatum (MAC)30.4871.80185090-Illumina, 454Oligohymenophorea[29]
P. sexaurelia (MAC)68.0275.93349390-Illumina, 454Oligohymenophorea[29]
P. tetraurelia (MAC)72.0971.9539521144413SangerOligohymenophorea[26]
Pseudocohnilembus persalinus (MAC)55.4681.19131860368IlluminaOligohymenophorea[32]
Stylonychia lemnae (MAC)50.1668.30207400-IlluminaSpirotrichea[25]
Tetrahymena thermophila (MAC)103.0177.682472560521SangerOligohymenophorea[36]
Tetrahymena thermophila (MIC)157.6977.9247 b-486.55IlluminaOligohymenophorea[37]
A/T, A/T content of the genome; Class, the taxonomic class in which the species is; G, genome size; MAC, macronucleus; MIC, micronucleus; n, number of overlapping genes; N50, scaffold N50; Platform, genome sequencing platform; TNG, total number of genes in the genome; a, not including internally eliminated sequences (IES)-less genes; b, genes only predicted in non-maintained macronuclear chromosomes, which are lost after macronuclear differentiation.
Table 2. Macronuclear simple sequence repeats information.
Table 2. Macronuclear simple sequence repeats information.
SpeciesA/TSSR/GH/SSRA/T-Hr1(P)r2(P)CSPRPG(SEM)
Ichthyophthirius multifiliis97.6311.9791.0497.62−0.72(3.76 × 10−6)−0.55(0.01)17.08(20.60)0.50(2.62 × 10−4)
Oxytricha trifallax87.748.0295.1287.76−0.73(1.27 × 10−3)−0.80(6.08 × 10−4)63.41(70.50)0.50(1.58 × 10−4)
Paramecium biaurelia95.188.2294.5293.95−0.19(0.33)−0.02(0.93)73.67(72.77)0.51(1.41 × 10−4)
P. caudatum92.177.5995.1591.86−0.81(4.51 × 10−4)−0.79(7.10 × 10−4)15.34(86.46)0.51(2.01 × 10−4)
P. sexaurelia95.548.6894.8395.49−0.31(0.09)−0.40(0.05)69.24(73.43)0.51(1.97 × 10−4)
P. tetraurelia91.977.8094.9992.07−0.31(0.15)−0.08(0.74)72.24(75.55)0.50(1.49 × 10−4)
Pseudocohnilembus persalinus95.9111.3893.7595.95−0.48(1.23 × 10−3)−0.56(0.01)34.59(39.34)0.50(1.59 × 10−4)
Stylonychia lemnae87.357.8194.9687.39−0.71(4.76 × 10−3)−0.72(8.67 × 10−3)63.70(71.39)0.50(1.88 × 10−4)
Tetrahymena thermophila96.6910.0995.2996.61−0.35(0.05)−0.72(8.67 × 10−3)41.40(49.39)0.50(1.21 × 10−4)
All numbers are percentages, except for those in the r1, r2, and RPG columns. A/T, A/T content of SSRs in the genome; SSR/G, proportion of SSR sequences in the whole genome; H/SSR, proportion of homopolymer runs in SSR sequences; A/T-H, A or T homopolymers out of all homopolymers; r1(P), Pearson’s correlation coefficient (P value) of motif size vs. A/T content at all sites; r2(P), Pearson’s correlation coefficient (P value) of motif size vs. A/T content at coding sites; CSP, coding SSR proportion, proportions of SSRs in coding regions out of all SSRs, proportions of coding sequences out of the whole-genome sequences are in the parentheses; RPG, relative position of homopolymer SSRs in a gene, calculated by (|homopolymer median genomic coordinate-gene start position|+1)/(gene length); SEM, standard error of the mean.
Table 3. Total counts of SSRs with codon repeats (>=10) in the nine ciliate genomes.
Table 3. Total counts of SSRs with codon repeats (>=10) in the nine ciliate genomes.
CodonsAmino AcidIchOxyPbiaPcauPsexPtetPseudoStyTetra
GCA|GCG|GCC|GCTAlanine000000000
CGA|CGG|CGC|CGT|AGA|AGGArginine800050101
AAC|AATAsparagine650700111388012
GAC|GATAspartic acid1300013201
TGC|TGTCysteine101001000
GGA|GGG|GGC|GGTGlycine111121111
GAA|GAGGlutamic acid1601165703
CAA|CAGGlutamine001013100
CAC|CATHistidine400000000
ATA|ATC|ATTIsoleucine80170111320616
CTA|CTG|GTC|CTT|TTA|TTGLeucine130980048102
AAA|AAGLysine150501011005
ATGMethionine200010301
TTC|TTTPhenylalanine205000200
CCA|CCG|CCC|CCTProline101005100
TCA|TCT|TCC|TCT|AGC|AGTSerine402000000
ACA|ACG|ACC|ACTThreonine1001043201
TGGTryptophan200000001
TAC|TATTyrosine170600030000
GTA|GTG|GTC|GTTValine300010301
Ich, Ichthyophthirius multifiliis; Oxy, Oxytricha trifallax; Pbia, Paramecium biaurelia; Pcau, P. caudatum; Psex, P. sexaurelia; Ptet, P. tetraurelia; Pseudo, Pseudocohnilembus persalinus; Sty, Stylonychia lemnae; Tetra, Tetrahymena thermophila.

Share and Cite

MDPI and ACS Style

Li, Y.; Chen, X.; Wu, K.; Pan, J.; Long, H.; Yan, Y. Characterization of Simple Sequence Repeats (SSRs) in Ciliated Protists Inferred by Comparative Genomics. Microorganisms 2020, 8, 662. https://doi.org/10.3390/microorganisms8050662

AMA Style

Li Y, Chen X, Wu K, Pan J, Long H, Yan Y. Characterization of Simple Sequence Repeats (SSRs) in Ciliated Protists Inferred by Comparative Genomics. Microorganisms. 2020; 8(5):662. https://doi.org/10.3390/microorganisms8050662

Chicago/Turabian Style

Li, Yuan, Xiao Chen, Kun Wu, Jiao Pan, Hongan Long, and Ying Yan. 2020. "Characterization of Simple Sequence Repeats (SSRs) in Ciliated Protists Inferred by Comparative Genomics" Microorganisms 8, no. 5: 662. https://doi.org/10.3390/microorganisms8050662

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop