Next Article in Journal
Diversity and Big Tree Patterns in the Brazilian Amazon
Previous Article in Journal
Assessment Potential of Zooplankton to Establish Reference Conditions in Lowland Temperate Lakes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The CRISPR/Cas Machinery Evolution and Gene Flow in the Hot Spring Cyanobacterium Thermostichus

by
Eva Jahodářová
1,
Aloisie Poulíčková
2 and
Petr Dvořák
2,*
1
Department of Biology, Faculty of Education, Palacký University in Olomouc, Purkrabská 2, CZ-771 40 Olomouc, Czech Republic
2
Department of Botany, Faculty of Science, Palacký University in Olomouc, Šlechtitelů 27, CZ-783 71 Olomouc, Czech Republic
*
Author to whom correspondence should be addressed.
Diversity 2022, 14(7), 502; https://doi.org/10.3390/d14070502
Submission received: 6 May 2022 / Revised: 10 June 2022 / Accepted: 18 June 2022 / Published: 21 June 2022
(This article belongs to the Section Microbial Diversity and Culture Collections)

Abstract

:
Drivers of the speciation in bacteria, including geographical isolation and horizontal gene transfer, are still poorly understood. Here, we characterized a new lineage within an anciently diverged thermophilic cyanobacteria. We sequenced the whole genome of a strain Thermostichus vulcanus isolated from the Rupite spring (Bulgaria), which is closely related to Theromstichus strains JA-2-3Aa and JA-2-3B′a(2-13). We performed phylogenetic inference, horizontal gene transfer estimation, and CRISPR/Cas system characterization. We found that the Rupite strain is a distinct species from strains JA-2-3Aa, and JA-2-3B′a(2-13). Furthermore, the horizontal gene transfer seemed to be more frequent among the geographically distant species than between the two species within the same hot spring. The CRISPR/Cas system had variable complexity among the species of Thermostichus in terms of both the number of spacers and genes. The Rupite strain had the highest, and JA-2-3Aa the lowest number of spacers among the analyzed strains, and the CRISPR spacers were only rarely shared among the strains. We conclude that the CRISPR/Cas system size varied among the lineages as well as the gene flow.

1. Introduction

Mechanisms driving the diversification of the bacterial species are still far from being explained. The view of bacterial species as clonal populations that are changing via mutations and adaptation was challenged when homologous and non-homologous recombination (horizontal gene transfer—HGT) was discovered as abundant in many bacterial species. HGT may provide a channel for evolutionary adaptation [1] as well as genome diversification (reviewed by [2,3]). HGT events seem to be spread through most bacteria taxa, and thus, HGT played an important role in the evolution of bacteria [4]. Furthermore, HGT does not appear to be random, but it is more likely to happen among closely related species [5,6].
Geographical isolation has been traditionally recognized as a crucial driver of diversification in animals and plants because it represents a solid barrier to gene flow. However, with rapidly growing population genomic data, it has been shown that species can diverge while facing gene flow [2,7]. A view on geographical isolation in speciation of prokaryotes has taken an opposite course. Microorganisms, in general, were considered so small that they can disperse anywhere with no barriers [8]. This assumption seems to be working when only morphological characters are analyzed (e.g., [9]), but microorganisms possess relatively low morphological diversity in comparison with genome diversity. Molecular phylogeny provided evidence for the geographical structuring of microbial species in prokaryotes (e.g., [10,11]). Possible geographical isolation was also identified in cyanobacteria, e.g., in Microcoleus [12], and thermophilic cyanobacteria (see below).
Hot springs are isolated habitats inhabited by a specific group of microbes adapted to high temperatures. With a sufficient light intensity, oxyphototrophic bacteria (cyanobacteria) thrive in hot springs. The diversity of cyanobacteria in hot springs ranges from simple unicellular forms (e.g., Synechococcus [13]) to morphologically complex species characterized by cell differentiation, such as Mastigocladus laminosus [14].
Thermostichus was found in Yellowstone National Park in Octopus and Mushroom springs [15,16]. Thermostichus was derived from Synechococcus [17]. This lineage was identified as one of the earliest derived in the phylogeny of cyanobacteria [18,19]. There were two recognized lineages represented by two strains, JA-2-3Aa and JA-2-3B′a(2-13), whose full genome sequence is available. As mentioned above, hot springs are isolated biotopes and the distance between them can be thousands of kilometers. Thus, they are an ideal model system to study the importance of geographical isolation in the speciation of bacteria. Papke et al. [13] suggested that geographical isolation plays an important role in the speciation of Thermostichus and Synechococcus. Later, Ionescu et al. [20] analyzed a larger dataset of sequences which suggested that Thermostichus is much more diverse. It should be noted that mentioned analyses are limited to 16S rRNA obtained either from environmental samples or strains of Thermostichus. No genome of Thermostichus has been sequenced outside the Yellowstone hot spring. The purpose of this paper is to fill this gap.
CRISPR (clustered regularly interspaced short palindromic repeats)/Cas (CRISPR-associated) is an immune system in prokaryotes with an adaptive ability. It provides immunity against phage infection (reviewed, e.g., in [21]). Although its potential in biotechnology and molecular biology has already been widely recognized [22], the role of CRISPR/Cas systems in evolution is still not understood [23]. It has been shown that sympatric species more likely share the same spacer rather than allopatric [24,25]. This suggests that CRISPR spacers evolve rapidly with the phage community [21]. Moreover, the immunity function of CRISPR/Cas systems suggests that it also provides a barrier to horizontal gene transfer, which potentially can play a significant role in genome diversification [26,27].
In this paper, we will focus on the whole-genome sequencing of a Thermostichus strain isolated from the Rupite hot spring in Bulgaria. We will perform phylogenomic reconstruction of the evolutionary history of our and related strains, we will compare their CRISPR/Cas system, and we will infer the intensity of the horizontal gene transfer among the strains.

2. Materials and Methods

2.1. Strain Information and Cultivation

We analyzed one strain of Thermostichus vulcanus isolated from the Rupite thermal spring. The hot spring is slightly mineralized (lat. 41°21′ N, long. 23°14′ E) with an inlet temperature of 71 °C [28]. It is located in the Sandanski-Petrich basin, Blagoevgrad Province, SW Bulgaria. The strain was isolated for a previous study [29]. Details of strain isolation and culture maintenance can be found there. The strain was identified as Synechococcus bigranulatus based on morphology in the light microscope. The BLAST search of the 16S rRNA sequence suggested its close relationship to Synechococcus strains JA-2-3Aa and JA-2-3B′a(2-13).

2.2. DNA Isolation and Sequencing

The genomic DNA was extracted by UltraClean Microbial DNA Isolation kit (Mobio, Carlsbad, CA, USA). The quality of DNA was assessed by 1.5% agarose electronic gel electrophoresis stained with ethidium bromide.
The sequencing library was prepared by Nextera DNA Library Preparation Kit (Illumina Inc., San Diego, CA, USA) and 50 ng of DNA was used. The selection of fragments was performed by Agilent High-Sensitivity DNA Kit (Agilent Technologies, Inc., Santa Clara, CA, USA) to obtain insert sizes of 1000 bp. The sequencing library concentration was evaluated by KAPA Library Quantification Kit for Illumina (Kapa Biosystems, Woburn, MA, USA). A sequencing run was performed using MiSeq Reagent Kit v3 (Illumina Inc., San Diego, CA, USA)—300 bp pair-end reads were generated. The sequencing was performed commercially at the Institute of Experimental Botany of the Czech Academy of Sciences (Olomouc, Czech Republic).
The raw reads were clipped and quality-filtered, and pair-ended reads were identified using Trimmomatic 0.36 [30]; by following settings: java -jar trimmomatic-0.36.jar PE -threads 24 -phred33 input.fq.gz output.fq.gz ILLUMINACLIP:Nextera-PE-PE.fa:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:50. Filtered reads were assembled using Spades 3.10.1 [31]; with default settings. Spades 3.10.1 does not support metagenomics assembling with unpaired reads. Produced scaffolds were binned by Maxbin 2.2.4 [32]. The completeness of the genome was analyzed using CheckM v1.1.6 [33] and reached 100% with only 1.75% of contaminants. The scaffolds identified by BLASTn search as Thermostichus were annotated in RAST [34]. The annotation also included the identification of repeats and CRISPR spacer and repeats. The rest of the scaffolds were discarded. The genome assembly was submitted to GenBank (https://www.ncbi.nlm.nih.gov/ (accessed on 24 February 2021)) with the accession number JAFIRA000000000.

2.3. Phylogenetic Inference and Horizontal Gene Transfer Detection

We downloaded all proteomes of species that were related based on BLAST search to our genome and strains Thermostichus JA-2-3Aa and JA-2-3B′a(2-13). It was a total of nine genomes plus Thermostichus sp. PCC 7336 as an outgroup. The average nucleotide identity (ANI) represents the similarity of all homologous regions which are shared among the genomes. The BLAST (ANIb) and mummer (ANIm) algorithms were estimated using Jspecies 1.2.1 [35]. Orthologs were identified using OrthoFinder 2.3.1 [36]. The search yielded 1513 ortholog groups. The best model was identified using ModelFinder [37]; based on BIC (LG+F+G4). Subsequently, ML phylogenetic reconstruction was performed in IQ-TREE 1.6.5 [38]. The tree topology was tested by 2000 ultrafast bootstrap re-samplings. The relationships between the analyzed strains were also visualized by Densitree 2.2.5 [39]. The 1513 individual trees were constructed based on ortholog sets from the previous step using IQ-TREE with option -m TEST, which selected the most appropriate maximum likelihood model for each tree. The ortholog sets were aligned in MUSCLE 3.8.31 [40] with default settings.
The 16S rRNA phylogeny was performed in IQ-TREE 1.6.5. [38] using GTR+I+G (identified by ModelFinder) and it was tested by 2000 ultrafast bootstrap re-samplings. Sequences were identified using BLAST search. The 16S rRNA of Thermostichus (extracted from annotated ribosomal sequences of the draft genome) was used as a query. Sequences of 90% similarity were downloaded and aligned using MUSCLE 3.8.31 [40].
Horizontal gene transfer was analyzed using T-REX web server version 3.4 [41] with the following settings: consensus HGT tool, HGT detection mode—several HGTs by iteration, and optimization criterion—bipartition dissimilarity. The tree used for HGT analyses was obtained in the previous step.

3. Results

We obtained a draft genome of a strain of Thermostichus isolated from the Rupite hot spring in Bulgaria. The quality of genomes was provided by CheckM. Completeness of the genome was 100% and contamination was only 1.75%. N50 was 59 kb and the assembly was composed of 120 scaffolds. The estimated genome size based on the total length of scaffolds was 3.7 Mb and G+C content was 54.9%. The annotation revealed 3832 protein-coding regions and 44 RNAs.
The evolutionary history of the Thermostichus Rupite strain was reconstructed using phylogenomic analysis of 1513 single-copy orthologs. It revealed that Thermostichus Rupite is the sister group to the Yellowstone lineages represented by strains JA-2-3B′a(2-13) and JA-3-3Ab, but the node had low bootstrap support (Figure 1). Moreover, we estimated average nucleotide identity (ANI) using both BLAST and mummer algorithms. The ANI values for Thermostichus Rupite and the rest of the strains were below the species threshold (95–96% [35])—ANIb = 67–77% and ANIm = 83–87%. Similarly, the strain JA-2-3B′a(2-13) and the cluster of JA-3-3Ab had ANI values below the species threshold—ANIb = 84–85% and ANIm = 87–88%. The strains within the cluster of JA-3-3Ab had ANIb and ANIm between 98–99%, which is indicative that they belong to the same species (Supplementary Table S1). Further, we used densely sampled 16S rRNA phylogeny to clarify the relationship of Thermostichus Rupite to the other related taxa. The result confirmed the independent origin of the Thermostichus Rupite strain. Moreover, it is distantly related to Synechococcus group T1 and Synechococcus group A/B (Supplementary Figure S1).
We identified 1149 CRISPR spacers in all strains, and the number of spacers largely varied among them. The largest number of spacers was found in the Rupite Bulgaria strain (390; Figure 1). The second largest was identified in the non-thermophilic outgroup strain PCC 7336–286. Strains JA-2-3B′a(2-13) and JA-3-3Ab had 124 and 88, respectively. The strains clustering with JA-3-3Ab had between 21 and 72 spacers (Figure 1). The CRISPR spacers carry information about historical viral infections. They serve as a memory; therefore, if the same spacers are present in the genome, the strains may have been infected by the same virus. Moreover, a number of CRISPR spacers are indicative of the complexity of the CRISPR/Cas system within a strain [23]. Interestingly, all the strains had between four (63AY4M1) and 11 (65AY6Li; Table 1) CRISPR arrays. Thus, although the number of the CRISPR spacer was lower in the group of JA-3-3Ab, the number of the CRISPR arrays was not lower or was larger.
We found that the strains shared only a minority of the spacers; at most 14, in the case of 65AY6A5 and 60AY4M2 (Table 2). These two strains belong to the cluster of JA-3-3Ab and they were isolated from the same locality. The Rupite Bulgaria strain shared no spacers with the rest of the strains, and JA-2-3B′a(2-13) shared only one spacer with JA-3-3Ab (Table 2). All spacers were submitted to the BLAST search, which revealed no significant hit with any sequenced viruses stored in the NCBI GenBank.
The number of CRISPR spacers has varied during the evolution of Thermostichus. The Rupite Bulgaria strain had the most complex CRISPR system indicated by the number of genes. The CRISPR system was severely reduced in JA-2-3B′a(2-13) and again expanded in JA-3-3Ab and its related strains. All investigated strains possess class 1 [23] cas genes.
It should be also noted that the Rupite Bulgaria strain had a larger genome (estimated, 3.7 Mbp) than the rest of the thermophilic strains, which had genome sizes of approximately 3 Mbp. Besides the larger number of cas genes, RAST functional gene annotation revealed 81 genes with annotated functions that were not present in the JA-2-3B′a(2-13) genome (Supplementary Tables S2–S4). In total, JA-2-3B′a(2-13) had 727 fewer genes, but most of them were identified with a hypothetical function. The genes with identified functions were involved in the metabolism of amino acids, carbohydrates, cofactors and vitamins, DNA, membrane transport, nitrogen, nucleotides, phosphorus, protein, RNA, respiration, stress response, sulfur, and defense.
A number of horizontal gene transfers were used as a proxy for gene flow intensity between the investigated strains. A visual inspection of the Densitree visualization of 1513 individual phylogenetic trees revealed that the species tree topology (Figure 2) is violated by many gene trees in the case of Thermostichus Rupite and Synechococcus JA-2-3B′a(2-13), which are geographically distant. Further entanglement of gene trees appeared among Synechococcus sp. JA-3-3Ab and its related strains that were found at the same place, and they seem to belong to the same species. To provide statistical evidence for this pattern, we used T-Rex inference of HGT. Indeed, we found that HGT events were the most frequent within the clade of Synechococcus sp. JA-3-3Ab and its related strains (4741). Moreover, HGT events were more frequent between Thermostichus Rupite and Synechococcus JA-2-3B′a(2-13) (347) than between Synechococcus JA-2-3B′a(2-13) and Synechococcus sp. JA-3-3Ab and its related strains (52).

4. Discussion

Horizontal gene transfer (HGT) plays an important role in the evolution of bacterial species. Barriers to HGT can be raised by a geographical distance and genome defense mechanisms—CRISPR/Cas system or various restriction-modification systems (e.g., [2,12,23,42]). Here, we shed some new light on the patterns of HGT and the CRISPR/Cas evolution in the Thermostichus.
The Thermostichus strain isolated from the Rupite hot spring formed a distinct lineage from both Yellowstone strains in the phylogeny (Figure 1) and based on ANI values. The divergence is so deep that all three lineages (represented by strains Rupite, JA-2-3B′a(2-13), and JA-3-3Ab) can be considered as three species based on the criteria of [35] (ANI < 95–96). These three lineages together with Synechococcus PCC 7336 (it can also be identified as Thermostichus as in [17]) form the earliest diverged lineage in the photosynthetic cyanobacteria [19,43,44]. Although metagenomic studies and isolation efforts suggest that there are many cyanobacterial strains related to the Yellowstone Thermostichus in the other hot springs [20], this is the first genome of thermophilic strain from this clade that has been sequenced outside Yellowstone. It provides a unique source of information that can bring important insights into patterns in the evolution of the thermophilic cyanobacteria.
The Rupite strain had a one Mb larger predicted genome in comparison with the rest of the strains, which suggests that its genome is more complex. The genes with known annotation based on homology belonged to various metabolic pathways (Supplementary Table S2). We focused on the CRISPR/Cas system, which varied with respect to both genes and CRISPR spacers. Rupite strain had three times more spacer than JA-2-3B′a(2-13) and four times more than JA-3-3Ab and associated strains. Rupite strain also possessed more CRISPR-associated genes (Figure 1). However, they had almost the same number of CRISPR arrays, suggesting that the reduction happened relatively recently. The larger CRISPR spacer diversity may be suggestive of the adaptation to wider phage diversity (reviewed in [45]). Moreover, the CRISPR/Cas system has been shown to provide an effective barrier to HGT in Bacillus cereus [46]. Thus, diversification and adaptation of species of Thermostichus may have affected the intensity of the phage predation in the investigated species.
We identified over 1000 different spacers, but they were only rarely shared by the investigated strains and were predominantly in the clade of JA-3-3Ab. This evidences that the three hot spring Thermostichus species recently encountered different viruses, because the evolution of the spacers seems to be quite rapid [47]. It contradicts patterns observed in the thermophilic bacterium Thermus. Lopatina et al. [48] found that the Thermus strains which were isolated from the springs several hundred meters distant from each other have similar spacer sets. With an increasing distance, the spacers were more different. Furthermore, the BLAST search did not reveal any significant hit with any virus in the GenBank database. It suggests that the Thermostichus cyanobacteria are infected by unknown viruses.
The role of geographical isolation in the speciation of the microbes has been discussed for a long time with contrasting results. Thermophilic cyanobacteria were thought to have been geographically isolated due to scarce dispersal routes and because the hot spring has a very specific environment (e.g., [13,14]). These studies were based on several conserved markers, especially 16S rRNA, and had small taxon sampling. A larger dataset by Ionescu et al. [20] showed that the geographical isolation among the hot spring is less likely. We provide further evidence for the gene flow across long distances using the whole-genome data. We found that there was a more frequent horizontal transfer between the Rupite strain and JA-2-3B′a(2-13) than among the species originating from the same locality in the Yellowstone National Park (Figure 2). The two species JA-2-3B′a(2-13) and JA-3-3Ab showed very low HGT intensity.
The two species JA-2-3B′a(2-13) and JA-3-3Ab were suggested to represent two separate putative ecotypes inhabiting the same hot spring [49,50,51]. Their temperature optima concur with that—JA-2-3B′a(2-13) prefers 58−65 °C and JA-3-3Ab 51−65 °C [52]. Strunecký et al. [52]; showed that the Thermostichus Rupite strain occurs in temperatures of about 62 °C. Thus, the temperature optima lie between the putative ecotypes JA-2-3B′a(2-13) and JA-3-3Ab. This suggests that Thermostichus Rupite may be adapted to a different temperature regime than the rest of the strains.
Taken together, there have evolved at least three species of Thermostichus. Their evolution has been affected by an HGT and there seemed to be local barriers to HGT. CRISPR spacer diversity suggests that the trajectory of the evolution of the three species may have been directed by the different intensities of the virus predation.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/d14070502/s1, Figshare with https://doi.org/10.6084/m9.figshare.14423189, https://doi.org/10.6084/m9.figshare.14423195.v1, https://doi.org/10.6084/m9.figshare.14386688.v1, and https://doi.org/10.6084/m9.figshare.14386661.v1. Table S1: Average nucleotide values estimated by both Mummer and BLAST for all strain combinations. The third matrix summarizes Tetra values. https://doi.org/10.6084/m9.figshare.14423189. Table S2: A table of comparison of gene function between the Rupite strain and JA-2-3B′a(2-13). https://doi.org/10.6084/m9.figshare.14423195. Table S3: A table with a list of CRISPR repeats for each investigated strain. https://doi.org/10.6084/m9.figshare.19710835. Table S4: A list of investigated strains with assembly accession numbers in GenBank and their finishing level according to the GenBank nomenclature. https://doi.org/10.6084/m9.figshare.19710841. Figure S1: A phylogenetic reconstruction based on 16S rRNA using IQ-TREE inference. Thermostichus Rupite is marked with an arrow. Orange color represents strains from Synechococcus group A and B. Green color represents members of group T1. Supports at the nodes represent only significant values (>99 ultrafast bootstraps). Dataset S1: Multiple sequence alignment of the protein sequences used for the phylogenomic reconstruction. Fasta format. Size: 10 sequences, 492,086 amino acid positions. https://doi.org/10.6084/m9.figshare.14386661. Dataset S2: Phylogenetic trees, which were used for the horizontal gene analysis by T-Rex. The trees are in the Newick format. https://doi.org/10.6084/m9.figshare.14386688.

Author Contributions

Conceptualization, P.D.; methodology, P.D. and E.J.; data analysis, P.D. and E.J.; writing—original draft preparation, P.D., E.J. and A.P.; writing—review and editing, P.D. and E.J.; visualization, P.D. and E.J.; supervision, A.P.; project administration, P.D.; funding acquisition, P.D. and A.P. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the project Internal Fund of Faculty of Education VaV_PdF_2021_04 and the Internal Grant Agency of the Palacký University Prf-2022-002.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets and supplements are available at Figshare with https://doi.org/10.6084/m9.figshare.14423189, https://doi.org/10.6084/m9.figshare.14423195.v1, https://doi.org/10.6084/m9.figshare.14386688.v1, and https://doi.org/10.6084/m9.figshare.14386661.v1. The genome was stored at the GenBank with an accession number: JAFIRA000000000.

Acknowledgments

We would like to thank Jaromír Lukavský (Institute of Botany, Czech Academy of Sciences) who provided the strain of Thermostichus.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Arnold, B.J.; Huang, I.-T.; Hanage, W.P. Horizontal gene transfer and adaptive evolution in bacteria. Nat. Rev. Genet. 2021, 20, 206–218. [Google Scholar] [CrossRef] [PubMed]
  2. Shapiro, B.J.; Leducq, J.-B.; Mallet, J. What Is Speciation? PLoS Genet. 2016, 12, e1005860. [Google Scholar] [CrossRef] [Green Version]
  3. Shapiro, B.J.; Polz, M.F. Microbial Speciation. Cold Spring Harb. Perspect. Biol. 2015, 7, a018143. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Gogarten, J.P.; Townsend, J.P. Horizontal gene transfer, genome innovation and evolution. Nat. Rev. Genet. 2005, 3, 679–687. [Google Scholar] [CrossRef]
  5. Soucy, S.M.; Huang, J.; Gogarten, J.P. Horizontal gene transfer: Building the web of life. Nat. Rev. Genet. 2015, 16, 472–482. [Google Scholar] [CrossRef] [PubMed]
  6. Popa, O.; Hazkani-Covo, E.; Landan, G.; Martin, W.; Dagan, T. Directed networks reveal genomic barriers and DNA repair bypasses to lateral gene transfer among prokaryotes. Genome Res. 2011, 21, 599–609. [Google Scholar] [CrossRef] [Green Version]
  7. Feder, J.L.; Egan, S.P.; Nosil, P. The genomics of speciation-with-gene-flow. Trends Genet. 2012, 28, 342–350. [Google Scholar] [CrossRef]
  8. Baas Becking, L.G.M. Geobiologie of Inleiding Tot De Milieukunde; W. P. van Stockum & Zoon: Hague, The Netherlands, 1934. [Google Scholar]
  9. Finlay, B.J. Global Dispersal of Free-Living Microbial Eukaryote Species. Science 2002, 296, 1061–1063. [Google Scholar] [CrossRef] [Green Version]
  10. Ribeiro, K.F.; Ferrero, A.P.; Duarte, L.; Turchetto-Zolet, A.C.; Crossetti, L.O. Comparative phylogeography of two free-living cosmopolitan cyanobacteria: Insights on biogeographic and latitudinal distribution. J. Biogeogr. 2020, 47, 1106–1118. [Google Scholar] [CrossRef]
  11. Reno, M.L.; Held, N.L.; Fields, C.J.; Burke, P.V.; Whitaker, R.J. Biogeography of the Sulfolobus islandicus pan-genome. Proc. Natl. Acad. Sci. USA 2009, 106, 8605–8610. [Google Scholar] [CrossRef] [Green Version]
  12. Dvořák, P.; Hašler, P.; Poulíčková, A. Phylogeography of the Microcoleus vaginatus (Cyanobacteria) from Three Continents—A Spatial and Temporal Characterization. PLoS ONE 2012, 7, e40153. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Papke, R.T.; Ramsing, N.B.; Bateson, M.M.; Ward, D.M. Geographical isolation in hot spring cyanobacteria. Environ. Microbiol. 2003, 5, 650–659. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Miller, S.R.; Castenholz, R.W.; Pedersen, D. Phylogeography of the Thermophilic Cyanobacterium Mastigocladus laminosus. Appl. Environ. Microbiol. 2007, 73, 4751–4759. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Castenholz, R.W. Isolation and Cultivation of Thermophilic Cyanobacteria. In The Prokaryotes; Springer: Berlin/Heidelberg, Germany, 1981; pp. 236–246. [Google Scholar]
  16. Ferris, M.J.; Ruff-Roberts, A.L.; Kopczynski, E.D.; Bateson, M.M.; Ward, D.M. Enrichment culture and microscopy conceal diverse thermophilic Synechococcus populations in a single hot spring microbial mat habitat. Appl. Environ. Microbiol. 1996, 62, 1045–1050. [Google Scholar] [CrossRef] [Green Version]
  17. Komárek, J.; Johansen, J.R.; Šmarda, J.; Strunecký, O. Phylogeny and taxonomy of Synechococcus-like cyanobacteria. Fottea 2020, 20, 171–191. [Google Scholar] [CrossRef]
  18. Shih, P.M.; Wu, D.; Latifi, A.; Axen, S.D.; Fewer, D.P.; Talla, E.; Calteau, A.; Cai, F.; de Marsac, N.T.; Rippka, R.; et al. Improving the coverage of the cyanobacterial phylum using diversity-driven genome sequencing. Proc. Natl. Acad. Sci. USA 2013, 110, 1053–1058. [Google Scholar] [CrossRef] [Green Version]
  19. Dvořák, P.; Casamatta, D.A.; Poulíčková, A.; Hašler, P.; Ondřej, V.; Sanges, R. Synechococcus: 3 billion years of global dominance. Mol. Ecol. 2014, 23, 5538–5551. [Google Scholar] [CrossRef]
  20. Ionescu, D.; Hindiyeh, M.; Malkawi, H.; Oren, A. Biogeography of thermophilic cyanobacteria: Insights from the Zerka Ma’in hot springs (Jordan). FEMS Microbiol. Ecol. 2010, 72, 103–113. [Google Scholar] [CrossRef] [Green Version]
  21. Bhaya, D.; Davison, M.; Barrangou, R. CRISPR-Cas Systems in Bacteria and Archaea: Versatile Small RNAs for Adaptive Defense and Regulation. Annu. Rev. Genet. 2011, 45, 273–297. [Google Scholar] [CrossRef] [Green Version]
  22. Donohoue, P.D.; Barrangou, R.; May, A.P. Advances in Industrial Biotechnology Using CRISPR-Cas Systems. Trends Biotechnol. 2018, 36, 134–146. [Google Scholar] [CrossRef] [Green Version]
  23. Westra, E.R.; Dowling, A.J.; Broniewski, J.M.; van Houte, S. Evolution and Ecology of CRISPR. Annu. Rev. Ecol. Evol. Syst. 2016, 47, 307–331. [Google Scholar] [CrossRef] [Green Version]
  24. Rho, M.; Wu, Y.-W.; Tang, H.; Doak, T.; Ye, Y. Diverse CRISPRs Evolving in Human Microbiomes. PLoS Genet. 2012, 8, e1002441. [Google Scholar] [CrossRef]
  25. Kunin, V.; He, S.; Warnecke, F.; Peterson, S.B.; Martin, H.G.; Haynes, M.; Ivanova, N.; Blackall, L.L.; Breitbart, M.; Rohwer, F.; et al. A bacterial metapopulation adapts locally to phage predation despite global dispersal. Genome Res. 2007, 18, 293–297. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Marraffini, L.A.; Sontheimer, E.J. CRISPR Interference Limits Horizontal Gene Transfer in Staphylococci by Targeting DNA. Science 2008, 322, 1843–1845. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Jiang, W.; Bikard, D.; Cox, D.; Zhang, F.; Marraffini, L.A. RNA-guided editing of bacterial genomes using CRISPR-Cas systems. Nat. Biotechnol. 2013, 31, 233–239. [Google Scholar] [CrossRef]
  28. Lukavský, J.; Furnadzhieva, S.; Pilarski, P. Cyanobacteria of the thermal spring at Pancharevo, Sofia, Bulgaria. Acta Bot. Croat. 2011, 70, 191–208. [Google Scholar] [CrossRef] [Green Version]
  29. Strunecký, O.; Kopejtka, K.; Goecke, F.; Tomasch, J.; Lukavský, J.; Neori, A.; Kahl, S.; Pieper, D.H.; Pilarski, P.; Kaftan, D.; et al. High diversity of thermophilic cyanobacteria in Rupite hot spring identified by microscopy, cultivation, single-cell PCR and amplicon sequencing. Extremophiles 2018, 23, 35–48. [Google Scholar] [CrossRef]
  30. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [Green Version]
  31. Bankevich, A.; Nurk, S.; Antipov, D.; Gurevich, A.A.; Dvorkin, M.; Kulikov, A.S.; Lesin, V.M.; Nikolenko, S.I.; Pham, S.; Prjibelski, A.D.; et al. SPAdes: A new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 2012, 19, 455–477. [Google Scholar] [CrossRef] [Green Version]
  32. Wu, Y.-W.; Simmons, B.; Singer, S.W. MaxBin 2.0: An automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics 2015, 32, 605–607. [Google Scholar] [CrossRef]
  33. Parks, D.H.; Imelfort, M.; Skennerton, C.T.; Hugenholtz, P.; Tyson, G.W. CheckM: Assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015, 25, 1043–1055. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Aziz, R.K.; Bartels, D.; Best, A.A.; DeJongh, M.; Disz, T.; Edwards, R.A.; Formsma, K.; Gerdes, S.; Glass, E.M.; Kubal, M.; et al. The RAST server: Rapid annotations using subsystems technology. BMC Genom. 2008, 9, 75. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Richter, M.; Rosselló-Móra, R. Shifting the genomic gold standard for the prokaryotic species definition. Proc. Natl. Acad. Sci. USA 2009, 106, 19126–19131. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Emms, D.M.; Kelly, S. OrthoFinder: Solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 2015, 16, 157. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Kalyaanamoorthy, S.; Minh, B.Q.; Wong, T.K.F.; Von Haeseler, A.; Jermiin, L.S. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods 2017, 14, 587–589. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Nguyen, L.-T.; Schmidt, H.A.; Von Haeseler, A.; Minh, B.Q. IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies. Mol. Biol. Evol. 2015, 32, 268–274. [Google Scholar] [CrossRef]
  39. Bouckaert, R.R.; Heled, J. DensiTree 2: Seeing Trees Through the Forest. bioRxiv 2014, 012401. [Google Scholar] [CrossRef] [Green Version]
  40. Edgar, R.C. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32, 1792–1797. [Google Scholar] [CrossRef] [Green Version]
  41. Boc, A.; Diallo, A.B.; Makarenkov, V. T-REX: A web server for inferring, validating and visualizing phylogenetic trees and networks. Nucleic Acids Res. 2012, 40, W573–W579. [Google Scholar] [CrossRef] [Green Version]
  42. Shapiro, B.J.; Friedman, J.; Cordero, O.X.; Preheim, S.P.; Timberlake, S.C.; Szabó, G.; Polz, M.F.; Alm, E.J. Population Genomics of Early Events in the Ecological Differentiation of Bacteria. Science 2012, 336, 48–51. [Google Scholar] [CrossRef] [Green Version]
  43. Shih, P.M.; Hemp, J.; Ward, L.; Matzke, N.; Fischer, W.W. Crown group Oxyphotobacteria postdate the rise of oxygen. Geobiology 2016, 15, 19–29. [Google Scholar] [CrossRef] [Green Version]
  44. Magnabosco, C.; Moore, K.R.; Wolfe, J.M.; Fournier, G.P. Dating phototrophic microbial lineages with reticulate gene histories. Geobiology 2018, 16, 179–189. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Koskella, B.; Brockhurst, M.A. Bacteria–phage coevolution as a driver of ecological and evolutionary processes in microbial communities. FEMS Microbiol. Rev. 2014, 38, 916–931. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Zheng, Z.; Zhang, Y.; Liu, Z.; Dong, Z.; Xie, C.; Bravo, A.; Soberón, M.; Mahillon, J.; Sun, M.; Peng, D. The CRISPR-Cas systems were selectively inactivated during evolution of Bacillus cereus group for adaptation to diverse environments. ISME J. 2020, 14, 1479–1493. [Google Scholar] [CrossRef]
  47. Heidelberg, J.; Nelson, W.; Schoenfeld, T.; Bhaya, D. Germ Warfare in a Microbial Mat Community: CRISPRs Provide Insights into the Co-Evolution of Host and Viral Genomes. PLoS ONE 2009, 4, e4169. [Google Scholar] [CrossRef] [Green Version]
  48. Lopatina, A.; Medvedeva, S.; Artamonova, D.; Kolesnik, M.; Sitnik, V.; Ispolatov, Y.; Severinov, K. Natural diversity of CRISPR spacers of Thermus: Evidence of local spacer acquisition and global spacer exchange. Philos. Trans. R. Soc. B Biol. Sci. 2019, 374, 20180092. [Google Scholar] [CrossRef] [Green Version]
  49. Nowack, S.; Olsen, M.T.; Schaible, G.; Becraft, E.D.; Shen, G.; Klapper, I.; Bryant, D.A.; Ward, D.M. The molecular dimension of microbial species: 2. Synechococcus strains representative of putative ecotypes inhabiting different depths in the Mushroom Spring microbial mat exhibit different adaptive and acclimative responses to light. Front. Microbiol. 2015, 6, 626. [Google Scholar] [CrossRef] [Green Version]
  50. Becraft, E.D.; Wood, J.; Rusch, D.B.; Kühl, M.; Jensen, S.I.; Bryant, D.A.; Roberts, D.; Cohan, F.M.; Ward, D.M. The molecular dimension of microbial species: 1. Ecological distinctions among, and homogeneity within, putative ecotypes of Synechococcus inhabiting the cyanobacterial mat of Mushroom Spring, Yellowstone National Park. Front. Microbiol. 2015, 6, 590. [Google Scholar] [CrossRef] [Green Version]
  51. Olsen, M.T.; Nowack, S.; Wood, J.; Becraft, E.D.; LaButti, K.; Lipzen, A.; Martin, J.; Schackwitz, W.S.; Rusch, D.B.; Cohan, F.M.; et al. The molecular dimension of microbial species: 3. Comparative genomics of Synechococcus strains with different light responses and in situ diel transcription patterns of associated putative ecotypes in the Mushroom Spring microbial mat. Front. Microbiol. 2015, 6, 604. [Google Scholar] [CrossRef]
  52. Allewalt, J.P.; Bateson, M.M.; Revsbech, N.P.; Slack, K.; Ward, D.M. Effect of Temperature and Light on Growth of and Photosynthesis by Synechococcus Isolates Typical of Those Predominating in the Octopus Spring Microbial Mat Community of Yellowstone National Park. Appl. Environ. Microbiol. 2006, 72, 544–550. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Maximum likelihood phylogenomic inference based on 1513 orthologous genes. The studied strain is in bold. Asterisk represents 100 ultrafast bootstraps. Each taxon has a CRISPR spacer number and a list of CRISPR genes annotated within the respective genome.
Figure 1. Maximum likelihood phylogenomic inference based on 1513 orthologous genes. The studied strain is in bold. Asterisk represents 100 ultrafast bootstraps. Each taxon has a CRISPR spacer number and a list of CRISPR genes annotated within the respective genome.
Diversity 14 00502 g001
Figure 2. The Densitree visualization of the 1513 maximum likelihood phylogenetic trees. The thickened line represents the root canal—consensus tree with the highest support. The HGT frequency and direction estimated by T-Rex for each species are visualized by arrows and numbers.
Figure 2. The Densitree visualization of the 1513 maximum likelihood phylogenetic trees. The thickened line represents the root canal—consensus tree with the highest support. The HGT frequency and direction estimated by T-Rex for each species are visualized by arrows and numbers.
Diversity 14 00502 g002
Table 1. The table summarizes basic CRISPR/Cas system features for each strain.
Table 1. The table summarizes basic CRISPR/Cas system features for each strain.
StrainCRISPR ArraysCRISPR SpacersCRISPR Genes
Synechoccus sp. Rupite Bulgaria639016
Synechocccus sp. JA-3-3Ab78814
Synechocccus sp. JA-2-3B′a(2-13)71242
Synechocccus sp. 60AY4M25336
Synechocccus sp. 63AY4M14216
Synechocccus sp. 63AY4M28455
Synechocccus sp. 65AY6A59566
Synechocccus sp. 65AY6Li117213
Synechocccus sp. 65AY6407346
Synechocccus sp. PCC7336828612
Table 2. The matrix of common identical spacers among the investigated strains.
Table 2. The matrix of common identical spacers among the investigated strains.
Rupite BulgariaJA-3-3AbJA-2-3B′a(2-13)60AY4M263AY4M163AY4M265AY6A565AY6Li
Rupite Bulgaria
JA-3-3Ab0
JA-2-3B′a(2-13)01
60AY4M2000
63AY4M100010
63AY4M2000813
65AY6A500014813
65AY6Li0000010
65AY64001000000
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Jahodářová, E.; Poulíčková, A.; Dvořák, P. The CRISPR/Cas Machinery Evolution and Gene Flow in the Hot Spring Cyanobacterium Thermostichus. Diversity 2022, 14, 502. https://doi.org/10.3390/d14070502

AMA Style

Jahodářová E, Poulíčková A, Dvořák P. The CRISPR/Cas Machinery Evolution and Gene Flow in the Hot Spring Cyanobacterium Thermostichus. Diversity. 2022; 14(7):502. https://doi.org/10.3390/d14070502

Chicago/Turabian Style

Jahodářová, Eva, Aloisie Poulíčková, and Petr Dvořák. 2022. "The CRISPR/Cas Machinery Evolution and Gene Flow in the Hot Spring Cyanobacterium Thermostichus" Diversity 14, no. 7: 502. https://doi.org/10.3390/d14070502

APA Style

Jahodářová, E., Poulíčková, A., & Dvořák, P. (2022). The CRISPR/Cas Machinery Evolution and Gene Flow in the Hot Spring Cyanobacterium Thermostichus. Diversity, 14(7), 502. https://doi.org/10.3390/d14070502

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop