Next Article in Journal
Mitochondrial Dysfunction in Aging, HIV, and Long COVID: Mechanisms and Therapeutic Opportunities
Previous Article in Journal
Human Papillomavirus: An Old New History
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bioinformatic Identification of CRISPR–Cas Systems in Leptospira Genus: An Update on Their Distribution Across 77 Species

by
Ronald Guillermo Peláez Sánchez
1,*,
Juanita González Restrepo
1,
Santiago Pineda
1,
Alexandra Milena Cuartas-López
1,
Juliana María Martínez Garro
2,
Marco Torres-Castro
3,
Rodrigo Urrego
4,
Luis Ernesto López-Rojas
5,
Jorge Emilio Salazar Florez
6 and
Fernando P. Monroy
7
1
Life and Health Sciences Research Group, Graduate School, CES University, Medellin 050021, Colombia
2
CES Biology, Science and Biotechnology School, CES University, Medellin 050021, Colombia
3
Dr. Hideyo Noguchi Regional Research Center, Laboratory of Zoonoses and Other Vector-Borne Diseases, Autonomous University of Yucatan, Mérida 97000, Mexico
4
INCA-CES Group, School of Veterinary Medicine and Zootechnic, Universidad CES, Medellín 050021, Colombia
5
Colombian Institute of Tropical Medicine (ICMT), Sabaneta 055450, Colombia
6
GEINCRO Research Group, School of Health Sciences, San Martin University, Sabaneta 055457, Colombia
7
Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ 86011, USA
*
Author to whom correspondence should be addressed.
Pathogens 2025, 14(10), 1044; https://doi.org/10.3390/pathogens14101044
Submission received: 4 September 2025 / Revised: 23 September 2025 / Accepted: 8 October 2025 / Published: 16 October 2025

Abstract

Leptospirosis is a globally distributed zoonotic disease caused by pathogenic bacteria of the Leptospira genus. Genome editing in Leptospira has been difficult to perform. Currently, the functionality of the CRISPR-Cas system has been demonstrated in species such as Leptospira interrogans. However, the different CRISPR-Cas systems present in most of the 77 species are unknown. Therefore, the objective of this study was to identify these arrays across the genomes of all described Leptospira species using bioinformatics tools. Methods: a bioinformatics workflow was followed: genomes were downloaded from the NCBI database; Cas protein detection was carried out using the CRISPR-CasFinder and RAST web servers; functional analyses of Cas proteins were performed with InterProScan, ProtParam, Swiss Model, Alphafold3, Swiss PDB Viewer, and Pymol; conservation pattern detection was conducted using MEGA12, and Seqlogos; spacer identification was carried out with the Actinobacteriophages database and BLAST version 1.4.0; and bacteriophage detection was performed using PHASTER, and PHASTEST. Results: Cas proteins were detected in 36 out of the 77 species of the Leptospira species, including Cas1 to Cas9 and Cas12. These proteins were classified into Class 1 and Class 2 systems, corresponding to types I, II, and V. Direct repeats and spacers were detected in 19 species, with the direct repeats exhibiting two conserved nucleotide motifs. Analysis of spacer sequences revealed 323 distinct bacteriophages. Additionally, three intact bacteriophages were detected in the genomes of four Leptospira species. Notably, two saprophytic species have complete CRISPR-Cas systems. Conclusions: The presence of Cas proteins, direct repeats, and spacer sequences with homology to bacteriophage genomes provides evidence for a functional CRISPR-Cas system in at least 19 species.

1. Introduction

Leptospirosis is a globally distributed zoonosis disease caused by pathogenic bacteria of the Leptospira genus [1]. The disease affects wild and domestic animals, and the bacteria can survive in moist soils and environmental water sources for long periods of time. Moreover, humans are considered accidental hosts of the disease [2]. Leptospirosis is considered a neglected disease that occurs primarily in tropical regions of developing countries [3]. Outbreaks of the disease have been associated with occupations involving animal handling, flooding, environmental disasters, extreme climate change, water sports, prolonged exposure to environmental water sources, and wet soils [4]. Additionally, in endemic regions, severe forms such as Weil’s disease and severe pulmonary hemorrhage syndrome have emerged as the leading cause of fatal cases [5]. Epidemiological studies have estimated that 1.03 million cases and 58,900 deaths occur worldwide each year due to leptospirosis [6]. Currently, the Leptospira genus comprises 77 genomic species, which are distributed into pathogenic subgroups (P1 and P2) and saprophytic subgroups (S1 and S2) [7,8,9,10,11,12].
Genome editing in Leptospira has been difficult to perform because the bacteria grow slowly in culture media (liquid and solid) and have high nutritional requirements [13]. Gene editing techniques such as random transposon mutagenesis [14,15,16,17], suicide plasmids [18], allelic exchange [19], and shuttle vectors [20] have been used in the Leptospira genus. However, these techniques are inefficient, laborious, expensive, and difficult to implement [13]. The lack of efficient gene-editing tools in Leptospira has hindered research on hypothetical proteins, pathogenicity mechanisms, virulence factors, and key proteins for drug and vaccine development. Therefore, generating knockout mutant strains remains challenging.
However, the CRISPR-Cas system (Clustered Regularly Interspaced Palindromic Repeats—associated with Cas proteins) has emerged as a novel tool that is highly specific, efficient, easy to design and implement, broadly applicable to bacterial species, and cost-effective [21,22,23]. Additionally, multiple classes, types, subtype, variants, and molecular targets of CRISPR-Cas systems have been described in other bacterial species [24,25,26]. The CRISPR-Cas system is an adaptive immune mechanism in prokaryotes that prevents phage and plasmid infections by storing immunological memory in DNA fragments (spacers). These spacers are integrated into the host chromosome and regularly separated by direct repeats, forming CRISPR arrays adjacent to the leader sequence. These arrays are typically located near genes encoding Cas proteins [27,28,29]. Such proteins are responsible for detecting, cleaving and storing foreign genetic material from bacteriophages or plasmids, which can later serve as molecular guides complementary to the genomic sequences of these invaders during future infections [30,31,32]. The CRISPR-Cas system functions through three main steps: adaptation, expression, and interference. During adaptation (also called insertion or acquisition), foreign genetic material from bacteriophages or plasmids is recognized, cleaved into small fragments, and incorporated in the CRISPR array near the leader sequence. Expression consists of the transcription of the CRISPR locus into precursor RNAs (pre-crRNA), which are subsequently processed into mature CRISPR RNAs (crRNAs). Finally, in the interference step, mature crRNAs associate with multiprotein effector complexes or single effector proteins, which recognize identical or similar sequences in the genomes of invading viruses or plasmids and cleave them to prevent infection [33,34]. This system has been emulated, modified, and adapted as a genome-editing tool in both prokaryotic and eukaryotic cells [35,36]. Therefore, investigating the presence and diversity of these endogenous systems in the genomes of the Leptospira genus is of particular interest.
In recent years, significant progress has been made in applying CRISPR-Cas-based gene silencing to the genus Leptospira, including the detection of subtypes IB, and IC in pathogenic species [37,38,39]. Additionally, it has been shown that these subtypes are transcriptionally active [40], form interference complexes and different protospacer adjacent motifs (PAMs) have been evaluated to optimize cutting sites [41], and process immature RNAs (Pre-crRNA) [42]. Furthermore, Cas1, Cas2, Cas4, Cas5, and Cas6 proteins have been biologically and functionally characterized [37,38,39,40,42,43,44]. The first attempts at gene silencing were unsuccessful, because the Cas9 protein includes double-strand breaks in DNA that are lethal to Leptospira cells [13,45,46]. Therefore, techniques such as CRISPR interference (CRISPRi) [13], CRISPR-Cas9/non-homologous end-joining components (NHEJ) [45], and CRISPR-prime editing [46] have been developed, and successfully used to generate knock-out mutants of multiple proteins in both pathogenic and saprophytic Leptospira species. Interestingly, the analysis of the spacer sequences detected in the genomes of Leptospira interrogans corresponds to mobile genetic elements, indicating their functionality and importance in the adaptive immune defense of the bacteria [47]. In parallel, applications based on the components of the CRISPR-Cas system have been used as diagnostic tools, including a CRISPR dFnCas9-based quantitative lateral flow immunoassay and CRISPR/Cas12a platform combined with isothermal amplification [48]. Additionally, the nucleotide variability of CRISPR-Cas systems in the Leptospira genus has been used as a taxonomic tool for the identification and differentiation of species and serovars in 18 Leptospira species [47].
However, little information is available regarding the presence, diversity, and functionality of the CRISPR-Cas machinery, including CAS proteins, spacers, direct repeats, leader sequences, crRNA, and protospacer adjacent motifs (PAMs), across all species of the Leptospira genus. Therefore, the objective of this research was to identify the different CRISPR-Cas systems in the 77 species of the Leptospira genus. This knowledge may contribute to optimizing gene-editing approaches and to elucidate the biological function of the many proteins that remain hypothetical in Leptospira.

2. Materials and Methods

2.1. Downloading Reference Genomes

A reference genome for each of the 77 Leptospira species was downloaded from the NCBI-Genome database (https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?id=171, accessed on 15 July 2025). Genomes were grouped according to established subgroups (P1, P2, S1, and S2). For each species, the sequences of the two chromosomes (three in some saprophytic species) were concatenated into a continuous linear sequence using the CAT command in the Linux operating system (Ubuntu 24.04 LTS). Genomes were then organized by species subgroups, and the genomic characteristics (species names, clade, accession number, genome size (Mb), GC content (%), number of genes, proteins, and non-coding sequence) were annotated with Prokka v1.14.5 (https://github.com/tseemann/prokka, accessed on 15 July 2025), Pseudopipe (https://github.com/mosilab/pseudopipe, accessed on 15 July 2025), Pseudofinder (https://github.com/filip-husnik/pseudofinder, accessed on 15 July 2025), and RetroScan (https://github.com/Vicky123wzy/RetroScan, accessed on 15 July 2025) [49]. Finally, the genomic characteristics of the different species were compared.

2.2. Cas Proteins Detection

The concatenated sequences of each of the 77 species of the Leptospira genus were analyzed using the web server RAST (Rapid Annotations using Subsystem Technology) (https://rast.nmpdr.org/rast.cgi, accessed on 15 July 2025) to identify genes encoding Cas proteins, spacers and direct repeats of the CRISPR-Cas system. Each genome was analyzed independently, and the results were consolidated in a database using the Excel program (Microsoft®—Microsoft 365). The results were analyzed and grouped into three categories: Cas protein, spacer, or direct repetitions. RAST assigns gene functions and metabolic pathways by comparing annotated genomes; its results are based on subsystems and protein families that are manually curated within the SEED integration, providing the basis for metabolic reconstructions and gene function assignments [50,51,52].
In parallel, the same 77 Leptospira species were analyzed using the CRISPRCasFinder web server (https://crisprcas.i2bc.paris-saclay.fr, accessed on 15 July 2025) to detect genes related to the CRISPR-Cas system. As with the RAST analysis, each sequence was examined independently, and the results were consolidated using the Excel program (Microsoft®—Microsoft 365). The results were analyzed and grouped into three categories: Cas protein, spacer, or direct repetitions. CRISPRCasFinder program enables the detection of CRISPR arrays and Cas genes in user-supplied sequence data. This is an updated version of CRISPRFinder with increased specificity and improved accuracy in detecting CRISPR-associated sequences. MacSyFinder is used to identify Cas genes, CRISPR-Cas type, and subtype [53].
Finally, the datasets generated by RAST and CRISPRCasFinder were merged into a single database. Duplicate entries were carefully checked and removed, resulting in a non-redundant dataset for downstream comparative analyses.

2.3. Analysis of the Biological Function of Cas Proteins

Sequences encoded by CRISPR-Cas-associated genes were identified and downloaded using the BLAST algorithm version 1.4.0 (https://blast.ncbi.nlm.nih.gov/Blast.cgi, accessed on 15 July 2025) from the NCBI database [54].
Functional annotation was performed with InterProScan bioinformatics program (https://www.ebi.ac.uk/interpro/search/sequence/, accessed on 15 July 2025), to identify protein domains, biological processes, molecular function, and cellular components [55]. Additional biological information for each protein was obtained from the Panther GO Terms (https://pantherdb.org/, accessed on 15 July 2025) and GeneCards databases (https://www.genecards.org/, accessed on 15 July 2025). Biological parameters of the proteins were calculated using the ProtParam web server (https://web.expasy.org/protparam/, accessed on 15 July 2025).
The three-dimensional structures of proteins were modeled with SwissModel web server (https://swissmodel.expasy.org, accessed on 15 July 2025) [56], and AlphaFold3 Server (https://alphafoldserver.com/about, accessed on 15 July 2025) [57]. Structural visualization and analysis were carried out with Swiss-pdbViewer version 4.1 (http://spdbv.unil.ch, accessed on 15 July 2025) [58] and PyMOL version 2.6 (https://www.pymol.org/, accessed on 15 July 2025) [59].

2.4. Classification of Cas Proteins into Functional Stages

The CRISPR-Cas proteins detected were manually classified according to the functional stage in which they participate: adaptation (integration of spacers), expression (processing of pre-crRNA), or interference (participation in the effector complex or target cleavage), based on the currently known biological functions of each protein in the CRISPR-Cas system.

2.5. Conservation Patterns of Direct Repeats in the Leptospira Genus

A search for direct repeats was conducted across the 77 species of the Leptospira species. The following direct repeats were found according to the subgroups: P1, 332 sequences; P2, 48 sequences; S1, 26 sequences; and S2, 18 sequences. A database was compiled, and multiple sequence alignment was performed per clade using MEGA version 12 (https://www.megasoftware.net/, accessed on 15 July 2025) [60]. The alignments were subsequently visualized with SeqLogos web server (http://imed.med.ucm.es/Tools/seqlogo.html, accessed on 15 July 2025) to determine nucleotide frequencies in the spacer sequences [61].

2.6. Bioinformatic Identification of the Immunological Memory (Spacer)

The actinobacteriophage database at phagesDB.org (https://phagesdb.org/, accessed on 15 July 2025) was used to identify the immunological memory of Leptospira species against different bacteriophages [62]. For this analysis, spacer sequences detected in the 77 genomes using RAST (Rapid Annotations using Subsystem Technology) and CRISPRCasFinder web servers were used.

2.7. Identification of Unique Effector Proteins

The genomes of the 77 Leptospira species were scanned using the BLAST algorithm version 1.4.0 (https://blast.ncbi.nlm.nih.gov/Blast.cgi, accessed on 15 July 2025) of the NCBI database to detect the Cas9, Cas12, and Cas13 proteins, which are unique effector proteins in CRISPR-Cas systems [54].

2.8. Bioinformatic Detection of Intact Bacteriophages in Genomes

The 77 genomes of the Leptospira species were scanned using the PHASTER program (Phage Search Tool Enhanced Release; https://phaster.ca/, accessed on 15 July 2025) and the PHASTEST web server (Phage Search Tool with Enhanced Sequence Translation; https://phastest.ca/, accessed on 15 July 2025). PHASTER is an upgraded version of the PHAST web server that enables rapid identification and annotation of prophage sequences in bacterial genomes and plasmids. PHASTEST further extends this functionality by providing enhanced sequence translation and visualization capabilities for prophage detection [63,64,65].

3. Results

3.1. Downloading Reference Genomes

In this research, the 77 reference genomes for the Leptospira genus were analyzed and classified into the following subgroups of P1 (21 species), P2 (22 species), S1 (29 species) and S2 (5 species). Based on assembly status, 14 genomes were assembled at the chromosome level, 8 at the scaffold level, and 55 at the contig level. Genome sizes ranged from 3.7 to 4.9 with Leptospira ainazelensis having the largest genome (4.9 Mb) and Leptospira fletcheri and Leptospira fluminis the smallest (3.7 Mb). The genomic GC content ranged from 35% present in Leptospira interrogans to 48% in Leptospira ellisii.
Gene counts ranged from 3409 in Leptospira fletcheri and 4789 in Lepstopira adleri. Similarly, protein counts varied between 3326 and 4626, respectively, in the same species. The number of pseudogenes ranged from 39 in Leptospira ryugenii and Leptospira ellinghausenii to 352 in Leptospira mayottensis (Table 1).

3.2. Cas Proteins Detection

The main objective of this research was to analyze the reference genomes, detect the proteins belonging to the CRISPR-Cas system and classify them according to their class, type, subtype, genetic variants, and molecular target. The presence of these proteins was detected bioinformatically in 36 out of the 77 species of the Leptospira genus. The number of species containing Cas proteins in the respective subgroups were as follow: 15 in P1; 11 in P2; 7 in S1; and 3 in S2. From the 36 species that have genes encoding Cas proteins, 19 species had multiple Cas proteins, and 17 species only had the Cas3 protein, with only 3 of them having the Cas3 and Cas3a variants. In terms of class distribution, 32 species were classified as Class 1, and 4 species as Class 2 (L. gorisiae, L. fletcheri, L. inadai, and L. ilyithenensis). Notably, L. gorisiae and L. inadai exhibited proteins from both Classes 1 and 2. Across the genus, proteins from CRISPR-Cas types I, II, III, and V were detected. At the subtype level, proteins belonging to subtypes IA, IB, IE, IC, IIB, and U were detected identified. Variants detected included Cas3a, Cas5a, Cas5a2, Cas5b, Cas5c, Cas7b, Cas7c, Cas8c, and Cas8a1a3.
With respect to molecular targets, 35 species encoded proteins that cleave DNA molecules, while only one species (L. haakeii) encoded a protein capable of RNA cleavage (csm2_TypeIIIA). Importantly, this work reports for the first time the presence of CRISPR-Cas system in two saprophytic species (L. ilyithenensis and L. ryugenii), both taxonomically assigned to the S2 subgroup (Table 2).

3.3. Analysis of the Biological Function of Cas Proteins

Currently, 13 proteins are described as components of the CRISPR-Cas system (Cas1–Cas13). Analysis of the 77 Leptospira genomes revealed the presence of 10 out of 13 Cas proteins: Cas1, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9, and Cas12. In contrast, Cas10, Cas 11, and Cas13 proteins were not detected in any of the genomes analyzed. Furthermore, Cas1 was found in 18 species, Cas2 in 19, Cas3 in 35, Cas4 in 10, Cas 5 in 17, Ca6 in 14, Cas7 in 17, Cas8 in 5, Cas9 in 1, and Cas12 in 3. Cas3 was the most frequent, present in 35 out of the 36 species encoding Cas proteins, while the least frequent were the unique effector proteins Cas 9 and Cas12.
To study the biological function, biological processes, three-dimensional structure, and identify the functional domains of the CAS proteins, the following proteins and species were used as models: Cas1 (Leptospira interrogans) is a DNA nuclease and participates in the adaptation stage of the CRISPR-Cas system. It helps in the process of spacer integration into the CRISPR matrix. It is associated with biological processes including defense against bacteriophages, plasmids, as well as the maintenance of repetitive elements within the CRISPR array. The protein consists of 254 amino acids, with an isoelectric point of 9.51, a molecular weight of 29,817.57 Da, a predicted bacterial half-life exceeding 10 h, and an instability index of 43.37, classifying it as unstable. Structurally, it is composed of 8 β-sheets, 18 α-helices, and contains a Cas1_I-II-II domain that may facilitate the linkage of DNA fragments to the CRISPR array (Figure 1).
Cas2 (Leptospira interrogans) functions as an RNA nuclease involved in the adaptation stage of the CRISPR-Cas system, contributing to spacer integration into the CRISPR array. It is associated with biological processes such as defense against bacteriophages and plasmids, as well as the maintenance of repetitive elements. The protein consists of 90 amino acids, with an isoelectric point of 8.49, a molecular weight of 10,317.93 Da, a predicted bacterial half-life exceeding 10 h, and an instability index of 31.57, classifying it as stable. Structurally, Cas2 contains 10 β-sheets and 4 α-helices, and includes a SSF:TTP0101/SSO1404-like domain. Cas2 proteins have been described as either endoribonuclease (acting on ssRNA) or endodeoxyribonuclease (acting on dsDNA), depending on the system in which they occur (Figure 1).
Cas3 (Leptospira interrogans) is a multifunctional protein with nuclease and helicase activity, participating in the interference stage of the CRISPR-Cas system (target cleavage). It is associated with biological processes such as defense against bacteriophages and plasmids, ATP binding, hydrolase activity, and DNA binding. The protein consists of 377 amino acids, with an isoelectric point of 8.59, a molecular weight of 43,082.83 Da, and a bioinformatically predicted half-life in bacteria of more than 10 h. Its instability index is 37.43, classifying it as stable. Structurally, Cas3 contains 6 β-sheets and 21 α-helices, and exhibits motifs characteristic of superfamily 2 helicases, including a DEAD/DEAH box region and a conserved C-terminal domain. The Cas3-type HD domain confers nuclease activity against ssDNA and ssRNA (Figure 1).
Cas4 (Leptospira interrogans) is a DNA nuclease that participates in the adaptation stage of the CRISPR-Cas system (spacer integration). It is linked to biological processes such as defense against bacteriophages and plasmids. The protein consists of 142 amino acids, with an isoelectric point of 8.95, and a molecular weight of 16,606.38 Da, a predicted bacterial half-life exceeding 10 h, and an instability index of 50.59, classifying it as unstable. Structurally, it contains 6 β-sheets and 6 α-helices. The endonuclease activity of Cas4 proteins contributes to crRNA generation by cleaving sequences complementary to foreign genetic elements, and Cas4 has been characterized as a 5′→3′ single-stranded DNA exonuclease (Figure 1).
Cas5 (Leptospira interrogans) is a protein that participates in the interference stage of the CRISPR-Cas system (effector complex). It is associated with biological processes including defense against bacteriophages and plasmids, maintenance of CRISPR repeat elements, and RNA binding. The protein consists of 234 amino acids, with an isoelectric point of 7.72, a molecular weight of 27,128.07 Da, a predicted bacterial half-life exceeding 10 h, and an instability index of 46.99, classifying it as unstable. Structurally, it contains 10 β-sheets and 3 α-helices. Cas5 contributes to the processing or stabilization of pre-crRNA into individual crRNA units, and together with Cas6 is required for optimal crRNA maturation and stability (Figure 1).
Cas6 (Leptospira santarosai) is a protein that participates in the expression stage of the CRISPR-Cas system (pre-crRNA processing). It is associated with processes such as defense against bacteriophages and plasmids. The protein consists of 203 amino acids, with an isoelectric point of 9.81, a molecular weight of 22,721.75 Da, a predicted bacterial half-life exceeding 10 h, and an instability index of 25.34, classifying it as stable. Structurally, it contains 16 β-sheets and 5 α-helices. Members of this protein family are found across multiple CRISPR-Cas subtypes. Cas6 proteins share the ability to recognize and cleave a single phosphodiester bond with short-repeated sequence of the pre-crRNA transcript (Figure 1).
Cas7 (Leptospira interrogans) is a protein that participates in the interference stage of the CRISPR-Cas system (effector complex). It is associated with biological processes such as defense against bacteriophages and plasmids, RNA recognition, and crRNA binding. The protein consists of 279 amino acids, with an isoelectric point of 6.68, a molecular weight of 31,146.15 Da, a predicted bacterial half-life exceeding 10 h, and an instability index of 38.92, classifying it as stable. Structurally, it contains 18 β-sheets and 6 α-helices. Cas7–11 variants can cleave single-stranded RNA (ssRNA) and self-process pre-crRNA (guide RNA). In other systems, Cas7 (DevR) has also been implicated in fruiting body development, sporulation and aggregation (Figure 1).
Cas8 (Leptospira santarosai) is a protein participates in the interference stage of the CRISPR-Cas system (effector complex). It is associated with biological processes such as defense against bacteriophages, plasmids, functions as the large subunit of the Cascade complex. The protein consists of 532 amino acids, with an isoelectric point of 9.51, a molecular weight of 61,404.33 Da, a predicted bacterial half-life exceeding 10 h, and an instability index of 31.72, classifying it as stable. Structurally, it contains 23 β-sheets and 19 α-helices. In Myxococcus xanthus, the Cas8a1 homolog (also known as DevT) stimulates the synthesis of a signal transduction protein required for fruiting body morphogenesis, a process in which rod-shaped cells differentiate into spherical spores under starvation conditions (Figure 1).
Cas9 (Leptospira fletcheri) is a DNA nuclease that participates in the interference stage of the CRISPR-Cas system (target cleavage). It is associated with biological processes such as defense against bacteriophages and plasmids. The protein consists of 1469 amino acids, with an isoelectric point of 9.43, a molecular weight of 17,0367.32 Da, a predicted bacterial half-life exceeding 10 h, and an instability index of 43.69, classifying it as unstable. Structurally, it contains 42 β-sheets and 51 α-helices. Cas9 is inactive in the absence of the two guide RNAs (gRNAs). It recognizes a short sequence motif known as the protospacer adjacent motif (PAM), which enables discrimination between self of non-self DNA, since CRISPR loci lack PAM sequences. PAM recognition is essential for catalytic activity, and cleavage of target DNA occurs only when Cas9 is bound to its gRNAs (Figure 1).
Cas12 (Leptospira inadai) is a DNA nuclease and participates in the interference stage of the CRISPR-Cas system (target cleavage). It is associated with biological processes such as defense against bacteriophages and plasmids. The protein consists of 1250 amino acids, with an isoelectric point of 9.26, a molecular weight of 148,356.40 Da, a bioinformatically predicted bacterial half-life of more than 2 min, and an instability index of 30.29, classifying it as stable. Structurally, it contains 33 β-sheets and 50 α-helices. The CRISPR-associated protein Cas12a (Cpf1) possesses two distinct nuclease activities: an endoribonuclease activity that processes its own guide RNAs and an RNA-guided DNase activity that cleaves target DNA. Cas12a, an RNA-guided endonuclease of the type V CRISPR-Cas system adopts a bilobed architecture composed of an α-helical recognition (REC) lobe and a nuclease (NUC) lobe. The crRNA-target DNA heteroduplex binds within the positively charged central channel between the two lobes (Figure 1).

3.4. Classification of Cas Proteins into Functional Stages

The CRISPR-Cas system degrades foreign genetic material from viruses through three functional stages: adaptation (adapter integration), expression (pre-crRNA processing), and interference (effector complex formation and target cleavage). Among the 36 species with genes encoding Cas proteins, 19 encoded multiple Cas proteins, whereas 17 encoded only Cas3 protein. The latter group likely represents species with a partial or incomplete CRISPR-Cas system.Therefore, the purpose of this analysis was to establish the distribution of Cas proteins across the three functional stages of the CRISPR-Cas system in the 19 Leptospira species that encode multiple proteins, and to identify species with a complete system. Fifteen species carried Class 1, Type I CRISPR-Cas system, one species (L. fletcheri) had a Class 2, Type II system, and one species (L. ilyithenensis) had a Class 2, Type V system. Two species, L. gorisiae and L. inadai, encoded a combination of Class 1, Type I and Class 2, Type V systems.
Notably, L. kirschneri, L. interrogans, and L. noguchii lacked Cas6, which is normally required for the expression process (pre-crRNA processing), although Cas5 may compensate for this function. The absence of Cas1 in L. borgpetersenii, and Cas8 in L. adleri, L. alstonii, and L. inadai, was also unexpected. Additionally, the presence of csm2, a protein of the class 1—Type IIIA system in L. haakeii, is striking (Table 3).

3.5. Conservation Patterns of the Direct Repeats in the Leptospira Genus

A search for direct repeats was performed across the 77 species, identifying 434 spacer sequences distributed among the four species subgroups. Spacer length ranged from 23 to 39 nucleotides. Alignment of these sequences and analysis of nucleotide frequency reveled subgroup-specific conservation patterns: in P1, a central conserved motif (GTTTGAACTCCCACAAGTT), in P2 a central motif (CTGATCCCCACACACGTGGGGATTAA), in S1 a central motif (CCAGATCTGCAAGTGGATCTGC), and in S2, conservation was observed across the entire sequence (CTCACCACGCATATGGGGTTCAACCAT). Furthermore, when the 434 sequences from all four subgroups were compared collectively, no conservation pattern was detected (Figure 2).

3.6. Bioinformatic Identification of the Immunological Memory (Spacer Sequences)

This analysis is performed on the 19 species with the presence of spacer sequences, to verify the functionality of the different CRISPR-Cas systems detected. The presence of spacer sequences within the CRISPR arrays (Leader Sequence—Direct Repeats—Spacer Sequences) is clear evidence of the functionality of the different systems, because the bacteria incorporate spacer sequences as a kind of immunological memory against bacteriophages or plasmids. Among the 19 Leptospira species with Cas proteins, all presented spacer sequences and direct repeats. Among the 19 species with a complete and functional CRISPR-Cas system (presence of Cas proteins, spacer sequences, and direct repeats), a range of (12–58) spacers per species was detected. Leptospira noguchii with 58 spacer sequences was the species with the widest repertoire of immunological memory against bacteriophages, while Leptospira kirschneri with 12 spacer sequences was the species with the lowest immunological memory. A total of 617 spacer sequences were detected among the 19 Leptospira species, but only 323 were identified as belonging to different bacteriophages in the database (The actinobacteriophage database). Forty-two spacer sequences could not be identified, representing bacteriophages not reported in the database. These spacer sequences that represent the different bacteriophages have been found in the genome of 12 bacterial genera (Gordonia, Arthrobacter, Streptomyces, Rhodococcus, Propionibacterium, Microbacterium, Mycobacterium, Curtobacterium, Brevibacterium, Corynebacterium, and Tsukamurella). It should be noted that these 12 bacterial genera are represented by multiple spacer sequences that were found in various Leptospira species (Table S1).

3.7. Identification of Unique Effector Proteins

Given the difficulty of establishing gene-editing systems with CRISPR-Cas complexes composed of multiple effector proteins, a search was conducted for single-protein effectors such as Cas9, Cas12, and Cas13. One species (L. fletcheri) encoded Cas9, a protein Class 2, Type II, while three species (L. ilyithenensis, L. gorisiae, and L. inadai) encoded Cas12, a protein Class 2, Type V. No species were found with Cas13. Cas9 and Cas12 were structurally modeled, and functional domain analyses were performed to better understand the structure and function of these endogenous proteins, which could potentially be used as gene-editing tools in Leptospira (Figure 1). When comparing the Cas9 and Cas12a proteins of the Leptospira genus with homologs from model organisms commonly used in CRISPR–Cas studies (Streptococcus pyogenes Cas9 and Francisella novicida Cas12a), we found that both proteins show strong similarity to those identified in other Leptospira species. Moreover, they exhibit high sequence identity, genetic relatedness, and coverage with members of phylogenetically related genera such as Turnerella and Leptonema. Notably, the Leptospira fletcheri Cas9 protein differs from S. pyogenes Cas9 in amino acid sequence, functional domain composition, and predicted tertiary structure. In contrast, the Leptospira inadai Cas12a protein displays nearly identical sequence, domain architecture, and structural configuration to that of F. novicida, a well-established model species for CRISPR–Cas research (Supplementary Material S2).

3.8. Bioinformatic Detection of Intact Bacteriophages in Genomes

Bioinformatic detection of bacteriophages was performed across the genomes of 77 Leptospira species. Three intact bacteriophages were identified in four Leptospira species (L. mayottensis, L. weilii, L. bandrabouensis, and L. ellinghausenii), while two questionable or nearly complete bacteriophages were detected in L. weilii and L. fluminis. In addition, 58 species contained incomplete bacteriophages, and 13 showed no evidence of bacteriophage sequences (Figure 3).

4. Discussion

The Leptospira genus has made great progress in recent years, with 77 species de-scribed and 1157 genomes sequenced (https://www.ncbi.nlm.nih.gov/datasets/genome/?taxon=171, accessed on 5 August 2025). Each species has a reference genome and multiple genomes from different strains. These genomes are a valuable source of genetic information for computational biology studies, where different bacterial biological processes can be explored. The major problem with the Leptospira genomes, and many prokaryotes, is that a large portion of the proteins encoded in their genomes are hypothetical (predicted by an ORF, but without experimental validation), thus rendering their functions unknown. For example, in the species L. interrogans serovar Lai, only 11.5% of the proteins have been characterized [66,67]. According to the functional annotation of the proteins in the 77 genomes of the Leptospira genus, we found a range of hypothetical proteins between (41.4–57.9%), being the species with the most hypothetical proteins L. ainlahdjerensis (57.9%) and the species with the least hypothetical proteins L. Harrisiae (41.4%). This result reflects a large gap in the understanding of biological functions of proteins in the Leptospira genus and the urgent need for a gene editing tool (Figure 4).
Moreover, the gene editing process in the Leptospira genus has been difficult to standardize, which has hampered the functional analysis of thousands of proteins and their assignment to the bacteria’s biological processes [13]. However, the CRISPR-Cas methodology is emerging as a promising tool for gene silencing in Leptospira, yielding knockout mutants of multiple proteins [13,45,46]. Therefore, it is important to explore endogenous CRISPR-Cas systems in the 77 currently described species to uncover their diversity, characterize their molecular components (Cas proteins, direct repeats, and spacers), assess their immunological memory, and identify intact bacteriophages that have evaded the immune system. Regarding genome availability, a reference genome was obtained for each of the 77 Leptospira species, providing a comprehensive overview of Cas protein distribution in the genus (Table 1). Similar work has identified CRISPR-Cas systems in 18 Leptospira species, with the objective of assessing the taxonomic utility of CRISPR-Cas arrays for distinguishing species and serovars of the genus [47].
A total number of 166 Cas proteins were detected in 36 from the 77 (46.75%) Leptospira species, corresponding to Cas1 to Cas9, and Cas12. The proteins were classified into class 1 and class 2 systems, and types I, II, and V. Additionally, proteins belonging to the subtypes IA, IB, IC, IE, IIB, and IIIA, and variants Cas3a, Cas5a, Cas5a2, Cas5b, Cas5c, Cas7b, Cas7c, Cas8a1a3, and Cas8c were also detected. All proteins target DNA for cleavage, except for the csm2_TypeIIIA protein in L. haakeii, which also targets RNA molecules. The Csm complex comprises five Cas proteins (Csm1-Csm5) and a crRNA, which degrades invading DNA and RNA [68].
Currently, there are 43 pathogenic species divided into P1 (21 species) and P2 (22 species) subgroups, from which 27 species encoded Cas proteins; therefore, concluding that not all pathogenic species have CRISPR-Cas systems. Moreover, there are 34 saprophytic species grouped into S1 (29 species) and S2 (5) subgroups, with only two species (L. ilyithenensis, and L. ryugenii) having complete CRISPR-Cas systems and the Cas3a, type I protein present in ten species. Based on these findings, we can conclude that some saprophytic species encode CRISPR-Cas systems, contrary to what was reported in a previous study, that did not include these species [47]. Having found such systems potentially allows for experimental trials without the risk of infection in research laboratories. It is striking to find the cas3a, typeI protein as the sole representative of the CRISPR-Cas system in 17 Leptospira species (9 pathogens and 8 saprophytes). This suggests that these species have lost their CRISPR–Cas systems, and this vestigial protein has been conserved in their genomes to perform different functions in other biological processes; beyond defense against viruses (CRISPR immunity), the Cas3 protein may be involved in regulating biofilm formation and virulence in bacteria [69], as well as influencing genes related to quorum sensing (a form of bacterial communication where cells release and detect signal molecules called autoinducers, allowing them to sense the population density of their community and coordinate gene expression to alter collective behavior) [70], secretion systems [71], and flagella formation [69]. These non-canonical functions of Cas3 are being investigated and may have implications for gene editing and other cellular biological processes.
According to the results obtained, the Leptospira genus has a wide diversity of Cas proteins in 36 species that could be used as genetic editing tools.
Thus far, 13 types of Cas proteins have been described. Analysis of Leptospira genomes, revealed 10 of these proteins 13 (76.92%), indicating that the genus possesses a broad repertoire of Cas components. Eight belong to multi-protein effector systems (Cas1-Cas8), while two correspond to single-protein effectors (Cas9 and Cas12a). Notably, this study reports for the first time the presence of endogenous Cas9 and Cas12a in the Leptospira genus.
Previous studies have reported gene silencing in Leptospira species using Cas9, but the proteins employed were heterologous, genetically modified nucleases designed to introduce a single cut in the target DNA [46].
A functional bioinformatics analysis for the ten types of Cas proteins detected (Cas1-Cas9, and Cas12a) was performed to characterize their physicochemical properties, biological functions, associated biological processes, three-dimensional structures, and functional domains. This analysis enabled us to assess their roles within the CRISPR-Cas system and to assign each protein to one of the three functional stages: adaptation, expression, or interference. As a result of the analysis, it was found that the functionality of Cas proteins in Leptospira genus corresponds to other bacterial species (Figure 1).
Another important step in characterizing the CRISPR-Cas system is the classification of proteins into functional stages according to their biological roles. The first stage, adaptation, involves recognition of foreign DNA and RNA, cleavage of the viral genome, and integration of genome fragments into the bacterial chromosome as spacer sequences. This stage typically includes Cas1, Cas2, and Cas4, although Cas4 may be absent in most systems and entirely lacking in types III, V, and VI [22]. Among the 19 Leptospira species encoding Cas proteins, ten contained Cas1, Cas2, and Cas4; 8 contained Cas1 and Cas2; and 1 contained only Cas2. According to our results, the adaptation stage is complete in all 19 species. Although L. borgpetersenii lacks Cas2, its system appears to be functional, as spacer sequences were identified originating from bacteriophages.
The functional stage of expression consists of the transcription of the CRISPR microarray locus into immature RNA precursors (pre-crRNA) and their subsequent processing into mature RNA (cr-RNA) [22]. In this state, Cas6 or Cas5 are key components in Class 1, type I system, whereas RNAse III fulfills this role in Class 2, type II system. Additionally, Class 2, types V and VI systems, the unique effector proteins Cas12 and Cas13 also participate in pre-crRNA processing [22]. According to our results, all 19 Leptospira species analyzed possess the proteins necessary for a fully functional expression stage. (Table 3).
The interference stage consists of binding mature RNAs (crRNA) to multiprotein effector complexes or single effector proteins, which recognize an identical or similar sequence in the genome of invading viruses or plasmids and cleave them to inactivate them [33,34]. In Class 1 type I systems, Cas5, Cas7, and Cas8 are important to form the effector complex, along with Cas3 protein, which is responsible for targeting and cleaving DNA, whereas in Class 2 type II, V, and VI systems, Cas9, Cas12, and Cas13 are essential to form the effector complex. According to our results, all species have their respective complete effector complexes (Table 3).
Once we verified that the adaptation, expression, and interference stages were complete, we proceeded to assign the classes, types, and subtypes according to the taxonomic classification of CRISPR-Cas systems proposed by Makarova et al. in 2020 [36]. In our study, we found seventeen species classified as Class 1 type I; one species as Class 2 type II; and three species as class 2 type V. The following CRISPR-Cas subtypes were identified: IB (L. santarosai, and L. broomii), IC (L. kirschneri, L. interrogans, and L. noguchii), IE (L. alexanderi, L. borgpetersenii, L. mayottensis, L. stimsonii, L. weilii, L. gorisiae, L. fainei, L. koniambonensis, and L. ryugenii), IIB (L. fletcheri), VA (L. gorisiae, L. inadai, and L. ilyithenensis), IF2 (L. adleri, L. alstonii, and L. inadai). It is worth highlighting that the species L. inadai (Class 1 type IF2, and Class 2 type V) and L. gorisiae (Class 1, type IE, and Class 2 type V) each contain two distinct CRISPR-Cas systems (Table 3). To date, only subtypes IB, and IC have been described in the Leptospira genus [37,38,39]. This study therefore provides the first report of subtypes IE, IIB, VA, and IF2 in Leptospira. These findings significantly contribute to our understanding of the diversity and functionality of CRISPR–Cas systems in the Leptospira genus. Bacterial species can harbor and activate multiple types of CRISPR–Cas systems, allowing them to defend against a wider range of viral and genetic threats. The presence of multiple systems reflects an evolutionary war with phages, which in turn have developed inhibitors such as anti-CRISPR proteins to evade detection [72]. In addition, an additional system can act as a backup in case the primary CRISPR–Cas system fails, ensuring the continuity of bacterial defense. The presence of multiple CRISPR–Cas systems in a single bacterial species is a known phenomenon: some Streptococcus species can harbor up to three different types of CRISPR–Cas systems [73]. Clostridium and Bacillus genera have multiple types of CRISPR–Cas systems in different strains, such as types IB, II-B, II-C, and III-B in Clostridium [74], and types IB and IC in Bacillus [75].
Another fundamental component of the CRISPR-Cas system is the direct repeats (DR), which serve to separate the spacer sequences (segments of DNA from viruses). These nucleotide sequences are identical in size and sequence. Direct repeats can be similar between related species, but also very different between distant species. The average spacer size is 32 nucleotides, but they can vary between 21 and 47 base pairs [22]. Analysis of the 77 genomes uncovered 434 unique direct repeats, with sizes ranging between 23 and 55 base pairs. Conservation patterns were observed within the species subgroups (P1, P2, S1, and S2), but no conserved pattern was found among the 77 species. This is consistent with the literature, where genetically closely related species retain similar direct repeats, while genetically distant species use completely different direct repeats in their CRISPR-Cas arrays [22] (Figure 2).
Spacers are nucleotide sequences of fixed length but high variability, as they originate from diverse bacteriophages and plasmids. Sizes range from 20 to 72 base pairs [22]. A total of 617 spacer sequences were identified, representing the bacterial immunological memory against bacteriophage infection. Of these, 404 sequences matched entries in the Actinobacteriophage database and corresponded to 323 unique becteriophages. The remaining 81 sequences could not be identified, reflecting the presence of many bacteriophages yet to be reported in the database. The identified bacteriophages were associated with 11 different bacterial genera (Table S1). These findings confirm the presence and functionality of CRISPR-Cas systems in 19 Leptospira species. Structurally, genes encoding Cas proteins, direct repeats, and spacer sequences were detected. Functionally, spacers corresponded to bacteriophages sequences, indicating that Leptospira species had been exposed to these bacteriophages, avoided infection, and generated an immunological memory against them. However, the bioinformatics analysis of CRISPR–Cas systems in the Leptospira genus is a preliminary result that demonstrates the existence of the components, but it is necessary to experimentally validate these results to ensure that the systems work correctly. For example, this could involve performing experimental evaluations of the degradation of a bacteriophage or blocking a gene in a Leptospira species that has the complete CRISPR–Cas system. These findings have significant implications for potential applications in bacteriophage therapy, where the CRISPR–Cas system could be harnessed to develop targeted treatments for bacterial infections.
An interesting result was the partial or total loss of the CRISPR–Cas system in 58 species of the Leptospira genus. This raises the following question: why are some Leptospira species losing this defense system? Regarding this phenomenon, it is known that some bacterial species lack CRISPR–Cas systems due to evolutionary inactivation. CRISPR–Cas systems can be selectively inactivated in certain bacteria, as observed in the Bacillus cereus group [76], allowing them to adapt to specific environments. Another cause of CRISPR–Cas system loss is selective pressure induced by antibiotic exposure. Intensive antibiotic use can lead to the loss of the CRISPR–Cas system, as it can act as a barrier to the acquisition of antibiotic resistance genes [77]. Additionally, total or partial loss of essential components of the CRISPR–Cas system can render the system inoperable. For these reasons, we assume that some species of the Leptospira genus have lost their CRISPR–Cas defense system.
CRISPR-Cas systems can cleave sequences from bacteriophages or plasmids using multi-protein complexes (Class 1-types I, III, and IV) or single effector proteins (Class 2-types II, V, and VI). Single effector systems are particularly attractive for gene editing due to their simplicity, which facilitates application across multiple bacterial species [22]. According to our results, three species encode single-protein effector: L. fletcheri with a Class 2, type II system (Cas9 as cleavage protein), and L. inadai and L. ilyithenensis with Class 2, type V systems (Cas12a, as cleavage protein). This finding is significant for the Leptospira genus, as it represents the first report of endogenous Cas9 and Cas12a effectors. These proteins could serve as tools to optimize future gene-editing approaches while avoiding the toxicity associated with exogenous proteins from other bacterial species.
Finally, CRISPR-Cas systems are not infallible. Therefore, it is important to identify bacteriophages that have successfully evaded the CRISPR-Cas defense system, leaving their intact sequences embedded into the bacterial genome. Detecting such prophages is crucial not only for understanding the evolutionary dynamics between bacteria and their viruses but also for developing novel biotechnological applications. Bacteriophages have gained attention as alternative tools for gene editing [78,79,80] and as promising candidates for antibacterial therapy against multidrug-resistant infections in humans and animals [81,82,83,84,85].
Based on our analysis, we detected the following bacteriophages infecting four Leptospira species: L. mayottensis (PHAGE_Pseudo_phi3_NC_030940), L. weilii (PHAGE_Paenib_Tripp_NC_028930), L. bandrabouensis and L. ellinghausenii (PHAGE_Leptos_LE1_NC_048892). The only phages that have been isolated, purified, and phenotypically characterized in the Leptospira genus are LE1, LE3, and LE4. These bacteriophages were described infecting the saprophytic species Leptospira biflexa in the 1990s [86], and the LE1 genome was subsequently sequenced [87], and in 2018, sequencing and proteomic analyses of LE3 and LE4 were reported [88]. Despite these advances, research on Leptospira bacteriophages remains limited.
In this study, two additional bacteriophages infecting Leptospira were identified (PHAGE_Pseudo_phi3_NC_030940, and PHAGE_Paenib_Tripp_NC_028930). These findings expand current knowledge of phage diversity in this genus and may prove valuable for future applications in Leptospira genome engineering and the development of next-generation bacteriophage-based antibacterial therapies.

5. Conclusions

The presence of Cas proteins, direct repeats, and spacer sequences homologous to bacteriophage genomes indicates a functional CRISPR-Cas system in 19 Leptospira species. Notably, the detection of endogenous Cas9 and Cas12a proteins, both unique effectors, highlight their potential as precise tools for gene editing in this genus. The identification of saprophytic species with complete and functional CRISPR-Cas systems suggests a recent acquisition of this system in subgroup S2 and provides non-pathogenic models that may be valuable for experimental studies under laboratory conditions. Additionally, the three intact bacteriophages detected in four Leptospira species represent promising candidates both for genetic manipulation of the bacteria, due to their ability to evade the CRISPR-Cas system, and for the development of phage-based therapeutics aimed at selectively lysing Leptospira during human and animal infections.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/pathogens14101044/s1, Table S1: The table summarizes the number of direct repeats, spacer sequences, and bacteriophages identified (based on gene fragments, score, e-value, identity [%]), in the 19 Leptospira species with a complete CRISPR-Cas system (Cas proteins, direct repeats, and spacer sequences). It also includes the original hosts of the spacer sequences und unidentified spacers. Color coding indicates subgroup: green for pathogenic P1, blue for pathogenic P2, yellow for saprophytic S1, and red for saprophytic S2.; Supplementary Material S1: The file shows the detection of orthologous proteins, analysis of three-dimensional structures, and functional domain analysis between species Leptospira fletcheri (Cas9)/Streptococcus pyogenes(Cas9), and Leptospira inadai (Cas12a)/Francisella novicida (Cas12a).

Author Contributions

Conceptualization, R.G.P.S., J.E.S.F., S.P., A.M.C.-L., J.M.M.G., R.U., L.E.L.-R. and F.P.M.; Methodology, R.G.P.S., S.P. and F.P.M.; Software, R.G.P.S., J.G.R., S.P. and F.P.M.; Validation, R.G.P.S.; Formal analysis, R.G.P.S. and J.G.R.; Investigation, R.G.P.S., J.G.R., M.T.-C. and J.E.S.F.; Resources, R.G.P.S., M.T.-C., J.E.S.F. and S.P.; Data curation, R.G.P.S. and S.P.; Writing—original draft, R.G.P.S., M.T.-C., J.E.S.F., S.P., A.M.C.-L., J.M.M.G., R.U., L.E.L.-R. and F.P.M.; Writing—review and editing, R.G.P.S., M.T.-C., J.E.S.F., S.P. and F.P.M.; Supervision, R.G.P.S. and J.E.S.F.; Project administration, R.G.P.S. and J.E.S.F.; Funding acquisition, R.G.P.S., J.E.S.F. and F.P.M. All authors have read and agreed to the published version of the manuscript.

Funding

This project was funded by the Science, Technology and Innovation Direction of the CES University, Autonomous University of Yucatan, San Martin University, Colombian Institute of Tropical Medicine (ICMT), and Northern Arizona University.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article (Appendix A).

Acknowledgments

The authors thank CES University, San Martin University, Autonomous University of Yucatan, Colombian Institute of Tropical Medicine (ICMT), and Northern Arizona University for the financial support given to the research.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

The reference genomic sequences used in this research are available in the NCBI-Genome database under the following accession numbers:
  • GCA_002811985.1, GCA_016918785.1, GCA_016919175.1, GCA_000243815.3, GCA_000347175.1, GCA_002811925.1, GCA_003516145.1, GCA_002811955.2, GCA_004770155.1, GCA_000243695.3, GCA_002073495.2, GCA_003722295.1, GCA_000306675.3, GCA_000306255.2, GCA_022267325.1, GCA_000313175.2, GCA_003545875.1, GCA_001729245.1, GCA_006874765.1, GCA_003545925.1, GCA_004770105.1, GCA_000243715.3, GCA_004770895.1, GCA_000306235.2, GCA_004769195.1, GCA_004771275.1, GCA_002812225.1, GCA_002811475.1, GCA_000243675.3, GCA_003112675.1, GCA_004769555.1, GCA_004770615.1, GCA_000526875.1, GCA_002812205.1, GCA_002811875.1, GCA_002811765.1, GCA_004769615.1, GCA_004769405.1, GCA_004770055.1, GCA_002150035.1, GCA_004770635.1, GCA_016918735.1, GCA_004770555.1, GCA_000017685.1, GCA_004770145.1, GCA_004770625.1, GCA_004769295.1, GCA_016919165.1, GCA_004770265.1, GCA_003114815.1, GCA_002811945.1, GCA_004769775.1, GCA_004769235.1, GCA_004769665.1, GCA_002812085.1, GCA_004368965.1, GCA_004770045.1, GCA_004770475.1, GCA_004770765.1, GCA_004769575.1, GCA_000332495.2, GCA_004770365.1, GCA_000332515.2, GCA_004769275.1, GCA_004770995.1, GCA_004771005.1, GCA_003114835.3, GCA_004770745.1, GCA_003114855.1, GCA_026151345.1, GCA_026151335.1, GCA_026151395.1

References

  1. Adler, B.; De La Peña Moctezuma, A. Leptospira and leptospirosis. Vet. Microbiol. 2010, 140, 287–296. [Google Scholar] [CrossRef]
  2. Haake, D.A. Spirochaetal lipoproteins and pathogenesis. Microbiol. Read. Engl. 2000, 146 Pt 7, 1491–1504. [Google Scholar] [CrossRef] [PubMed]
  3. Levett, P.N. Leptospirosis. Clin. Microbiol. Rev. 2001, 14, 296–326. [Google Scholar] [CrossRef] [PubMed]
  4. Vijayachari, P.; Sugunan, A.P.; Shriram, A.N. Leptospirosis: An emerging global public health problem. J. Biosci. 2008, 33, 557–569. [Google Scholar] [CrossRef]
  5. McBride, A.J.A.; Athanazio, D.A.; Reis, M.G.; Ko, A.I. Leptospirosis. Curr. Opin. Infect. Dis. 2005, 18, 376–386. [Google Scholar] [CrossRef]
  6. Costa, F.; Hagan, J.E.; Calcagno, J.; Kane, M.; Torgerson, P.; Martinez-Silveira, M.S.; Stein, C.; Abela-Ridder, B.; I Ko, A. Global Morbidity and Mortality of Leptospirosis: A Systematic Review. PLoS Negl. Trop. Dis. 2015, 9, e0003898. [Google Scholar] [CrossRef]
  7. Vincent, A.T.; Schiettekatte, O.; Goarant, C.; Neela, V.K.; Bernet, E.; Thibeaux, R.; Ismail, N.; Mohd Khalid, M.K.; Amran, F.; Masuzawa, T.; et al. Revisiting the taxonomy and evolution of pathogenicity of the genus Leptospira through the prism of genomics. PLoS Negl. Trop. Dis. 2019, 13, e0007270. [Google Scholar] [CrossRef]
  8. Cerqueira, G.M.; Picardeau, M. A century of Leptospira strain typing. Infect. Genet. Evol. J. Mol. Epidemiol. Evol. Genet. Infect. Dis. 2009, 9, 760–768. [Google Scholar] [CrossRef]
  9. Fernandes, L.G.V.; Stone, N.E.; Roe, C.C.; Goris, M.G.A.; van der Linden, H.; Sahl, J.W.; Wagner, D.M.; Nally, J.E. Leptospira sanjuanensis sp. nov., a pathogenic species of the genus Leptospira isolated from soil in Puerto Rico. Int. J. Syst. Evol. Microbiol. 2022, 72, 005560. [Google Scholar] [CrossRef]
  10. Korba, A.A.; Lounici, H.; Kainiu, M.; Vincent, A.T.; Mariet, J.F.; Veyrier, F.J.; Goarant, C.; Picardeau, M. Leptospira ainlahdjerensis sp. nov., Leptospira ainazelensis sp. nov., Leptospira abararensis sp. nov. and Leptospira chreensis sp. nov., four new species isolated from water sources in Algeria. Int. J. Syst. Evol. Microbiol. 2021, 71, 005148. [Google Scholar] [CrossRef] [PubMed]
  11. Hamond, C.; Tibbs-Cortes, B.; Fernandes, L.G.V.; LeCount, K.; Putz, E.J.; Anderson, T.; Camp, P.; Stuber, T.; Hicks, J.; van der Linden, H.; et al. Leptospira gorisiae sp. nov, L. cinconiae sp. nov, L. mgodei sp. nov, L. milleri sp. nov and L. Iowaensis sp. nov: Five new species isolated from water sources in the Midwestern United States. Int. J. Syst. Evol. Microbiol. 2025, 75, 006595. [Google Scholar] [CrossRef]
  12. Dos Santos Ribeiro, P.; Carvalho, N.B.; Aburjaile, F.; Sousa, T.; Veríssimo, G.; Gomes, T.; Neves, F.; Blanco, L.; Lima, J.A.; de Oliveira, D.; et al. Environmental Biofilms from an Urban Community in Salvador, Brazil, Shelter Previously Uncharacterized Saprophytic Leptospira. Microb. Ecol. 2023, 86, 2488–2501. [Google Scholar] [CrossRef]
  13. Fernandes, L.G.V.; Hornsby, R.L.; Nascimento, A.L.T.O.; Nally, J.E. Genetic manipulation of pathogenic Leptospira: CRISPR interference (CRISPRi)-mediated gene silencing and rapid mutant recovery at 37 °C. Sci. Rep. 2021, 11, 1768. [Google Scholar] [CrossRef]
  14. Pappas, C.J.; Xu, H.; Motaleb, M.A. Creating a Library of Random Transposon Mutants in Leptospira. In Leptospira spp.: Methods and Protocols; Koizumi, N., Picardeau, M., Eds.; Springer: New York, NY, USA, 2020; pp. 77–96. [Google Scholar] [CrossRef]
  15. Bourhy, P.; Louvel, H.; Saint Girons, I.; Picardeau, M. Random Insertional Mutagenesis of Leptospira interrogans, the Agent of Leptospirosis, Using a mariner Transposon. J. Bacteriol. 2005, 187, 3255–3258. [Google Scholar] [CrossRef] [PubMed]
  16. Murray, G.L.; Morel, V.; Cerqueira, G.M.; Croda, J.; Srikram, A.; Henry, R.; Ko, A.I.; Dellagostin, O.A.; Bulach, D.M.; Sermswan, R.W.; et al. Genome-wide transposon mutagenesis in pathogenic Leptospira species. Infect. Immun. 2009, 77, 810–816. [Google Scholar] [CrossRef] [PubMed]
  17. Lourdault, K.; Matsunaga, J.; Evangelista, K.V.; Haake, D.A. High-throughput Parallel Sequencing to Measure Fitness of Leptospira interrogans Transposon Insertion Mutants During Golden Syrian Hamster Infection. J. Vis. Exp. JoVE 2017, 56442. [Google Scholar]
  18. Liao, S.; Sun, A.; Ojcius, D.M.; Wu, S.; Zhao, J.; Yan, J. Inactivation of the fliY gene encoding a flagellar motor switch protein attenuates mobility and virulence of Leptospira interrogansstrain Lai. BMC Microbiol. 2009, 9, 253. [Google Scholar] [CrossRef]
  19. Croda, J.; Figueira, C.P.; Wunder, E.A.; Santos, C.S.; Reis, M.G.; Ko, A.I.; Picardeau, M. Targeted mutagenesis in pathogenic Leptospira species: Disruption of the LigB gene does not affect virulence in animal models of leptospirosis. Infect. Immun. 2008, 76, 5826–5833. [Google Scholar] [CrossRef]
  20. Pappas, C.J.; Benaroudj, N.; Picardeau, M. A Replicative Plasmid Vector Allows Efficient Complementation of Pathogenic Leptospira Strains. Appl. Environ. Environ. Microbiol. 2015, 81, 3176–3181. [Google Scholar] [CrossRef]
  21. Pacesa, M.; Pelea, O.; Jinek, M. Past, present, and future of CRISPR genome editing technologies. Cell 2024, 187, 1076–1100. [Google Scholar] [CrossRef]
  22. Nidhi, S.; Anand, U.; Oleksak, P.; Tripathi, P.; Lal, J.A.; Thomas, G.; Kuca, K.; Tripathi, V. Novel CRISPR–Cas Systems: An Updated Review of the Current Achievements, Applications, and Future Research Perspectives. Int. J. Mol. Sci. 2021, 22, 3327. [Google Scholar] [CrossRef] [PubMed]
  23. Chehelgerdi, M.; Chehelgerdi, M.; Khorramian-Ghahfarokhi, M.; Shafieizadeh, M.; Mahmoudi, E.; Eskandari, F.; Rashidi, M.; Arshi, A.; Mokhtari-Farsani, A. Correction: Comprehensive review of CRISPR—Based gene editing: Mechanisms, challenges, and applications in cancer therapy. Mol. Cancer 2024, 23, 43. [Google Scholar] [CrossRef]
  24. Hillary, V.E.; Ceasar, S.A. A Review on the Mechanism and Applications of CRISPR/Cas9/Cas12/Cas13/Cas14 Proteins Utilized for Genome Engineering. Mol. Biotechnol. 2023, 65, 311–325. [Google Scholar] [CrossRef]
  25. Sternberg, S.H.; Richter, H.; Charpentier, E.; Qimron, U. Adaptation in CRISPR-Cas Systems. Mol. Cell 2016, 61, 797–808. [Google Scholar] [CrossRef]
  26. Hille, F.; Charpentier, E. CRISPR-Cas: Biology, mechanisms and relevance. Philos. Trans. R. Soc. B Biol. Sci. 2016, 371, 20150496. [Google Scholar] [CrossRef]
  27. Jiang, F.; Doudna, J.A. The structural biology of CRISPR-Cas systems. Curr. Opin. Struct. Biol. 2015, 30, 100–111. [Google Scholar] [CrossRef]
  28. Koonin, E.V.; Makarova, K.S. Origins and evolution of CRISPR-Cas systems. Philos. Trans. R. Soc. B Biol. Sci. 2019, 374, 20180087. [Google Scholar] [CrossRef]
  29. Alkhnbashi, O.S.; Meier, T.; Mitrofanov, A.; Backofen, R.; Voß, B. CRISPR-Cas bioinformatics. Methods 2020, 172, 3–11. [Google Scholar] [CrossRef]
  30. Koonin, E.V.; Makarova, K.S.; Zhang, F. Diversity, classification and evolution of CRISPR-Cas systems. Curr. Opin. Microbiol. 2017, 37, 67–78. [Google Scholar] [CrossRef] [PubMed]
  31. Li, C.; Chu, W.; Gill, R.A.; Sang, S.; Shi, Y.; Hu, X.; Yang, Y.; Zaman, Q.U.; Zhang, B. Computational Tools and Resources for CRISPR/Cas Genome Editing. Genom. Proteom. Bioinform. 2023, 21, 108–126. [Google Scholar] [CrossRef] [PubMed]
  32. Mojica, F.J.M.; Montoliu, L. On the Origin of CRISPR-Cas Technology: From Prokaryotes to Mammals. Trends Microbiol. 2016, 24, 811–820. [Google Scholar] [CrossRef]
  33. Strich, J.R.; Chertow, D.S. CRISPR-Cas Biology and Its Application to Infectious Diseases. J. Clin. Microbiol. 2019, 57, e01307-18. [Google Scholar] [CrossRef]
  34. Bhatia, S.; Pooja Yadav, S.K. CRISPR-Cas for genome editing: Classification, mechanism, designing and applications. Int. J. Biol. Macromol. 2023, 238, 124054. [Google Scholar] [CrossRef]
  35. Mougiakos, I.; Bosma, E.F.; De Vos, W.M.; Van Kranenburg, R.; Van Der Oost, J. Next Generation Prokaryotic Engineering: The CRISPR-Cas Toolkit. Trends Biotechnol. 2016, 34, 575–587. [Google Scholar] [CrossRef]
  36. Makarova, K.S.; Wolf, Y.I.; Iranzo, J.; Shmakov, S.A.; Alkhnbashi, O.S.; Brouns, S.J.J.; Charpentier, E.; Cheng, D.; Haft, D.H.; Horvath, P.; et al. Evolutionary classification of CRISPR–Cas systems: A burst of class 2 and derived variants. Nat. Rev. Microbiol. 2020, 18, 67–83. [Google Scholar] [CrossRef] [PubMed]
  37. Dixit, B.; Ghosh, K.K.; Fernandes, G.; Kumar, P.; Gogoi, P.; Kumar, M. Dual nuclease activity of a Cas2 protein in CRISPR—Cas subtype I–B of Leptospira interrogans. FEBS Lett. 2016, 590, 1002–1016. [Google Scholar] [CrossRef]
  38. Anand, V.; Prabhakaran, H.S.; Gogoi, P.; Kanaujia, S.P.; Kumar, M. Structural and functional characterization of Cas2 of CRISPR-Cas subtype I-C lacking the CRISPR component. Front. Mol. Biosci. 2022, 9, 988569. [Google Scholar] [CrossRef] [PubMed]
  39. Anand, V.; Prabhakaran, H.S.; Prakash, A.; Hussain, M.S.; Kumar, M. Differential processing of CRISPR RNA by LinCas5c and LinCas6 of Leptospira. Biochim. Biophys. Acta BBA—Gen. Subj. 2023, 1867, 130469. [Google Scholar] [CrossRef] [PubMed]
  40. Prakash, A.; Kumar, M. Transcriptional analysis of CRISPR I-B arrays of Leptospira interrogans serovar Lai and its processing by Cas6. Front. Microbiol. 2022, 13, 960559. [Google Scholar] [CrossRef]
  41. Hussain, M.S.; Anand, V.; Kumar, M. Functional PAM sequence for DNA interference by CRISPR-Cas I-B system of Leptospira interrogans and the role of LinCas11b encoded within lincas8b. Int. J. Biol. Macromol. 2023, 237, 124086. [Google Scholar] [CrossRef]
  42. Prakash, A.; Kumar, M. Characterizing the transcripts of Leptospira CRISPR I-B array and its processing with endoribonuclease LinCas6. Int. J. Biol. Macromol. 2021, 182, 785–795. [Google Scholar] [CrossRef]
  43. Dixit, B.; Prakash, A.; Kumar, P.; Gogoi, P.; Kumar, M. The core Cas1 protein of CRISPR-Cas I-B in Leptospira shows metal-tunable nuclease activity. Curr. Res. Microb. Sci. 2021, 2, 100059. [Google Scholar] [CrossRef]
  44. Dixit, B.; Anand, V.; Hussain, M.d.S.; Kumar, M. The CRISPR-associated Cas4 protein from Leptospira interrogans demonstrate versatile nuclease activity. Curr. Res. Microb. Sci. 2021, 2, 100040. [Google Scholar] [CrossRef]
  45. Fernandes, L.G.V.; Nascimento, A.L.T.O. A Novel Breakthrough in Leptospira spp. Mutagenesis: Knockout by Combination of CRISPR/Cas9 and Non-homologous End-Joining Systems. Front. Microbiol. 2022, 13, 915382. [Google Scholar] [CrossRef]
  46. Fernandes, L.G.V.; Hamond, C.; Tibbs-Cortes, B.W.; Putz, E.J.; Olsen, S.C.; Palmer, M.V.; Nally, J.E. CRISPR-prime editing, a versatile genetic tool to create specific mutations with a single nucleotide resolution in Leptospira. mBio 2024, 15, e01516-24. [Google Scholar] [CrossRef]
  47. Xiao, G.; Yi, Y.; Che, R.; Zhang, Q.; Imran, M.; Khan, A.; Yan, J.; Lin, X. Characterization of CRISPR—Cas systems in Leptospira reveals potential application of CRISPR in genotyping of Leptospira interrogans. Apmis 2019, 127, 202–216. [Google Scholar] [CrossRef] [PubMed]
  48. Natarajan, S.; Joseph, J.; Vinayagamurthy, B.; Estrela, P. A Lateral Flow Assay for the Detection of Leptospira lipL32 Gene Using CRISPR Technology. Sensors 2023, 23, 6544. [Google Scholar] [CrossRef]
  49. Seemann, T. Prokka: Rapid prokaryotic genome annotation. Bioinformatics 2014, 30, 2068–2069. [Google Scholar] [CrossRef] [PubMed]
  50. Aziz, R.K.; Bartels, D.; Best, A.A.; DeJongh, M.; Disz, T.; Edwards, R.A.; Formsma, K.; Gerdes, S.; Glass, E.M.; Kubal, M.; et al. The RAST Server: Rapid Annotations using Subsystems Technology. BMC Genom. 2008, 9, 75. [Google Scholar] [CrossRef] [PubMed]
  51. Brettin, T.; Davis, J.J.; Disz, T.; Edwards, R.A.; Gerdes, S.; Olsen, G.J.; Olson, R.; Overbeek, R.; Parrello, B.; Pusch, G.D.; et al. RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes. Sci. Rep. 2015, 5, 8365. [Google Scholar] [CrossRef]
  52. Overbeek, R.; Olson, R.; Pusch, G.D.; Olsen, G.J.; Davis, J.J.; Disz, T.; Olson, R.; Overbeek, R.; Parrello, B.; Pusch, G.D.; et al. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res. 2014, 42, D206–D214. [Google Scholar] [CrossRef]
  53. Couvin, D.; Bernheim, A.; Toffano-Nioche, C.; Touchon, M.; Michalik, J.; Néron, B.; Rocha, E.P.C.; Vergnaud, G.; Gautheret, D.; Pourcel, C. CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for Cas proteins. Nucleic Acids Res. 2018, 46, W246–W251. [Google Scholar] [CrossRef]
  54. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef]
  55. Zdobnov, E.M.; Apweiler, R. InterProScan—An integration platform for the signature-recognition methods in InterPro. Bioinformatics 2001, 17, 847–848. [Google Scholar] [CrossRef]
  56. Waterhouse, A.; Bertoni, M.; Bienert, S.; Studer, G.; Tauriello, G.; Gumienny, R.; Heer, F.T.; De Beer, T.A.P.; Rempfer, C.; Bordoli, L.; et al. SWISS-MODEL: Homology modelling of protein structures and complexes. Nucleic Acids Res. 2018, 46, W296–W303. [Google Scholar] [CrossRef]
  57. Abramson, J.; Adler, J.; Dunger, J.; Evans, R.; Green, T.; Pritzel, A.; Ronneberger, O.; Willmore, L.; Ballard, A.J.; Bambrick, J.; et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 2024, 630, 493–500. [Google Scholar] [CrossRef] [PubMed]
  58. Johansson, M.U.; Zoete, V.; Michielin, O.; Guex, N. Defining and searching for structural motifs using DeepView/Swiss-PdbViewer. BMC Bioinform. 2012, 13, 173. [Google Scholar] [CrossRef] [PubMed]
  59. DeLano, W.L.; Scientific, D.; Carlos, S. PyMOL: An Open-Source Molecular Graphics Tool. CCP4 Newsl. Protein Crystallogr. 2002, 40, 82–92. [Google Scholar]
  60. Kumar, S.; Stecher, G.; Suleski, M.; Sanderford, M.; Sharma, S.; Tamura, K. MEGA12: Molecular Evolutionary Genetic Analysis Version 12 for Adaptive and Green Computing. Mol. Biol. Evol. 2024, 41, msae263. [Google Scholar] [CrossRef]
  61. Garcia-Boronat, M.; Diez-Rivero, C.M.; Reinherz, E.L.; Reche, P.A. PVS: A web server for protein sequence variability analysis tuned to facilitate conserved epitope discovery. Nucleic Acids Res. 2008, 36, W35–W41. [Google Scholar] [CrossRef]
  62. Russell, D.A.; Hatfull, G.F. PhagesDB: The actinobacteriophage database. Bioinformatics 2017, 33, 784–786. [Google Scholar] [CrossRef]
  63. Wishart, D.S.; Han, S.; Saha, S.; Oler, E.; Peters, H.; Grant, J.R.; Stothard, P.; Gautam, V. PHASTEST: Faster than PHASTER, better than PHAST. Nucleic Acids Res. 2023, 51, W443–W450. [Google Scholar] [CrossRef] [PubMed]
  64. Arndt, D.; Grant, J.R.; Marcu, A.; Sajed, T.; Pon, A.; Liang, Y.; Wishart, D.S. PHASTER: A better, faster version of the PHAST phage search tool. Nucleic Acids Res. 2016, 44, W16–W21. [Google Scholar] [CrossRef] [PubMed]
  65. Zhou, Y.; Liang, Y.; Lynch, K.H.; Dennis, J.J.; Wishart, D.S. PHAST: A Fast Phage Search Tool. Nucleic Acids Res. 2011, 39, W347–W352. [Google Scholar] [CrossRef]
  66. Samaha Th, S. In-Silico Characterisation of a Hypothetical Protein (LA_1016) of Leptospira Interrogans Serovar Lai Strain 56601. Austin J. Proteom. Bioinform. Genom. 2017, 4, 1022. [Google Scholar]
  67. Ijaq, J.; Chandrasekharan, M.; Poddar, R.; Bethi, N.; Sundararajan, V.S. Annotation and curation of uncharacterized proteins- challenges. Front. Genet. 2015, 6, 119. [Google Scholar] [CrossRef]
  68. Takeshita, D.; Sato, M.; Inanaga, H.; Numata, T. Crystal Structures of Csm2 and Csm3 in the Type III-A CRISPR–Cas Effector Complex. J. Mol. Biol. 2019, 431, 748–763. [Google Scholar] [CrossRef]
  69. Guo, T.; Yang, J.; Zhou, N.; Sun, X.; Huan, C.; Lin, T.; Bao, G.; Hu, J.; Li, G. Cas3 of type I-Fa CRISPR-Cas system upregulates bacterial biofilm formation and virulence in Acinetobacter baumannii. Commun. Biol. 2025, 8, 750. [Google Scholar] [CrossRef] [PubMed]
  70. Cui, L.; Wang, X.; Huang, D.; Zhao, Y.; Feng, J.; Lu, Q.; Pu, Q.; Wang, Y.; Cheng, G.; Wu, M.; et al. CRISPR-cas3 of Salmonella Upregulates Bacterial Biofilm Formation and Virulence to Host Cells by Targeting Quorum-Sensing Systems. Pathogens 2020, 9, 53. [Google Scholar] [CrossRef]
  71. He, L.; St John James, M.; Radovcic, M.; Ivancic-Bace, I.; Bolt, E.L. Cas3 Protein—A Review of a Multi-Tasking Machine. Genes 2020, 11, 208. [Google Scholar] [CrossRef]
  72. Zhang, F.; Song, G.; Tian, Y. Anti-CRISPRs: The natural inhibitors for CRISPR-Cas systems. Anim. Models Exp. Med. 2019, 2, 69–75. [Google Scholar] [CrossRef] [PubMed]
  73. Carte, J.; Christopher, R.T.; Smith, J.T.; Olson, S.; Barrangou, R.; Moineau, S.; Glover, C.V., 3rd; Graveley, B.R.; Terns, R.M.; Terns, M.P. The three major types of CRISPR-Cas systems function independently in CRISPR RNA biogenesis in Streptococcus thermophilus. Mol. Microbiol. 2014, 93, 98–112. [Google Scholar] [CrossRef] [PubMed]
  74. McAllister, K.N.; Sorg, J.A. CRISPR Genome Editing Systems in the Genus Clostridium: A Timely Advancement. J. Bacteriol. 2019, 201, e00219-19. [Google Scholar] [CrossRef]
  75. Sheykholeslami, N.; Mirzaei, H.; Nami, Y.; Khandaghi, J.; Javadi, A. Ecological and evolutionary dynamics of CRISPR-Cas systems in Clostridium botulinum: Insights from genome mining and comparative analysis. Infect. Genet. Evol. 2024, 123, 105638. [Google Scholar] [CrossRef]
  76. Zheng, Z.; Zhang, Y.; Liu, Z.; Dong, Z.; Xie, C.; Bravo, A.; Soberón, M.; Mahillon, J.; Sun, M.; Peng, D. The CRISPR-Cas systems were selectively inactivated during evolution of Bacillus cereus group for adaptation to diverse environments. ISME J. 2020, 14, 1479–1493. [Google Scholar] [CrossRef]
  77. Arias, C.A.; Murray, B.E. The rise of the Enterococcus: Beyond vancomycin resistance. Nature Reviews. Microbiology 2012, 10, 266–278. [Google Scholar] [CrossRef]
  78. Campbell, A. The future of bacteriophage biology. Nat. Rev. Genet. 2003, 4, 471–477. [Google Scholar] [CrossRef]
  79. Hussain, W.; Yang, X.; Ullah, M.; Wang, H.; Aziz, A.; Xu, F.; Asif, M.; Ullah, M.W.; Wang, S. Genetic engineering of bacteriophages: Key concepts, strategies, and applications. Biotechnol. Adv. 2023, 64, 108116. [Google Scholar] [CrossRef]
  80. Jia, H.J.; Jia, P.P.; Yin, S.; Bu, L.K.; Yang, G.; Pei, D.S. Engineering bacteriophages for enhanced host range and efficacy: Insights from bacteriophage-bacteria interactions. Front. Microbiol. 2023, 14, 1172635. [Google Scholar] [CrossRef]
  81. Chen, Y.; Batra, H.; Dong, J.; Chen, C.; Rao, V.B.; Tao, P. Genetic Engineering of Bacteriophages Against Infectious Diseases. Front. Microbiol. 2019, 10, 954. [Google Scholar] [CrossRef] [PubMed]
  82. Segundo-Arizmendi, N.; Arellano-Maciel, D.; Rivera-Ramírez, A.; Piña-González, A.M.; López-Leal, G.; Hernández-Baltazar, E. Bacteriophages: A Challenge for Antimicrobial Therapy. Microorganisms 2025, 13, 100. [Google Scholar] [CrossRef]
  83. Girma, A. Bacteriophages as an alternative strategy for the treatment of drug resistant bacterial infections: Current approaches and future perspectives. Cell Surf. 2025, 14, 100149. [Google Scholar] [CrossRef]
  84. Subramanian, A. Emerging roles of bacteriophage-based therapeutics in combating antibiotic resistance. Front. Microbiol. 2024, 15, 1384164. [Google Scholar] [CrossRef]
  85. Summers, W.C. Bacteriophage Therapy. Annu. Rev. Microbiol. 2001, 55, 437–451. [Google Scholar] [CrossRef]
  86. Girons, I.S.; Margarita, D.; Amouriaux, P.; Baranton, G. First isolation of bacteriophages for a spirochaete: Potential genetic tools for Leptospira. Res. Microbiol. 1990, 141, 1131–1138. [Google Scholar] [CrossRef] [PubMed]
  87. Bourhy, P.; Frangeul, L.; Couvé, E.; Glaser, P.; Saint Girons, I.; Picardeau, M. Complete Nucleotide Sequence of the LE1 Prophage from the Spirochete Leptospira biflexa and Characterization of Its Replication and Partition Functions. J. Bacteriol. 2005, 187, 3931–3940. [Google Scholar] [CrossRef] [PubMed]
  88. Schiettekatte, O.; Vincent, A.T.; Malosse, C.; Lechat, P.; Chamot-Rooke, J.; Veyrier, F.J.; Picardeau, M.; Bourhy, P. Characterization of LE3 and LE4, the only lytic phages known to infect the spirochete Leptospira. Sci. Rep. 2018, 8, 11781. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The figure shows the ten Cas proteins described to date and indicates which were detected in the genomes of the 77 species of the Leptospira species. Their biological functions and processes were annotated using InterPro. The three-dimensional structures of Cas1, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9, and Cas12 were modeled with Swiss-Model and AlphaFold3. Functional domains were also identified, and their corresponding biological roles were assigned.
Figure 1. The figure shows the ten Cas proteins described to date and indicates which were detected in the genomes of the 77 species of the Leptospira species. Their biological functions and processes were annotated using InterPro. The three-dimensional structures of Cas1, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9, and Cas12 were modeled with Swiss-Model and AlphaFold3. Functional domains were also identified, and their corresponding biological roles were assigned.
Pathogens 14 01044 g001aPathogens 14 01044 g001bPathogens 14 01044 g001c
Figure 2. Conservation patterns of direct repeats in the Leptospira genus. The figure shows the alignment of spacer sequences (nucleotides) from 30 Leptospira species (those encoding Cas proteins, direct repeats, and spacers) and the resulting conservation patterns analysis. Colors in the alignment represent nucleotides: adenine (black), thymine, guanine, and cytosine (green). The size of the letters in the sequence logos is proportional to the nucleotide frequency in the alignment (measured in bits).
Figure 2. Conservation patterns of direct repeats in the Leptospira genus. The figure shows the alignment of spacer sequences (nucleotides) from 30 Leptospira species (those encoding Cas proteins, direct repeats, and spacers) and the resulting conservation patterns analysis. Colors in the alignment represent nucleotides: adenine (black), thymine, guanine, and cytosine (green). The size of the letters in the sequence logos is proportional to the nucleotide frequency in the alignment (measured in bits).
Pathogens 14 01044 g002
Figure 3. Detection of bacteriophages integrated into the genome of four species of the genus Leptospira (L. mayottensis, L. weilii, L. bandrabouensis, and L. ellinghausenii), indicating successful evasion of the CRISPR-Cas system. The figure displays the genomic regions where bacteriophage proteins were detected (highlighted in green), including species, genomic location, region length, completeness, score, number of phage proteins identified, insertion site, phage identity, and GC content. Color coding denotes completeness: dark green, intact bacteriophages (complete genome detected); light green, nearly complete bacteriophages (almost the entire genome was detected); red, incomplete bacteriophages (small genomic region detected).
Figure 3. Detection of bacteriophages integrated into the genome of four species of the genus Leptospira (L. mayottensis, L. weilii, L. bandrabouensis, and L. ellinghausenii), indicating successful evasion of the CRISPR-Cas system. The figure displays the genomic regions where bacteriophage proteins were detected (highlighted in green), including species, genomic location, region length, completeness, score, number of phage proteins identified, insertion site, phage identity, and GC content. Color coding denotes completeness: dark green, intact bacteriophages (complete genome detected); light green, nearly complete bacteriophages (almost the entire genome was detected); red, incomplete bacteriophages (small genomic region detected).
Pathogens 14 01044 g003aPathogens 14 01044 g003b
Figure 4. The figure shows the genomic annotation of the Leptospira species with the highest and lowest number of hypothetical proteins, determined using the RAST (Rapid Annotation using subsystem Technology) web server. Displayed information includes the species name, subgroup, the annotated or identified proteins, hypothetical proteins, total number of proteins, percentage of hypothetical proteins, and the rank based on hypothetical protein abundance.
Figure 4. The figure shows the genomic annotation of the Leptospira species with the highest and lowest number of hypothetical proteins, determined using the RAST (Rapid Annotation using subsystem Technology) web server. Displayed information includes the species name, subgroup, the annotated or identified proteins, hypothetical proteins, total number of proteins, percentage of hypothetical proteins, and the rank based on hypothetical protein abundance.
Pathogens 14 01044 g004
Table 1. The table summarizes the 77 Leptospira species used in the study, their taxonomic distribution within subgroups (P1, P2, S1, and S2), NCBI Taxonomy accession numbers, genome size (Mb), CG content (%), number of genes, proteins, and non-coding sequences. Color coding indicates subgroup: green for pathogenic P1, blue for pathogenic P2, yellow for saprophytic S1, and red for saprophytic S2.
Table 1. The table summarizes the 77 Leptospira species used in the study, their taxonomic distribution within subgroups (P1, P2, S1, and S2), NCBI Taxonomy accession numbers, genome size (Mb), CG content (%), number of genes, proteins, and non-coding sequences. Color coding indicates subgroup: green for pathogenic P1, blue for pathogenic P2, yellow for saprophytic S1, and red for saprophytic S2.
SpeciesSubgroupAccess NumberSequencing LevelGenome Size (Mb)GC (%)GenesProteinsNon-Coding
L. adleriP1GCA_002811985.1Scaffold4.843.547894626163
L. ainazelensisP1GCA_016918785.1Contig4.942.544174310107
L. ainlahdjerensisP1GCA_016919175.1Contig4.842.54409431099
L. alexanderiP1GCA_000243815.3Contig4.2404582454141
L. alstoniiP1GCA_000347175.1Contig4.442.54423438043
L. barantoniiP1GCA_002811925.1Contig4.44441354033102
L. borgpeterseniiP1GCA_003516145.1Chromosome44037923463329
L. ellisiiP1GCA_002811955.2Contig4.34839983896102
L. gomenensisP1GCA_004770155.1Contig4.34639313802129
L. gorisiaeP1GCA_040833975.1Chromosome4.541.541043939165
L. interrogansP1GCA_002073495.2Chromosome4.63540493772277
L. kirschneriP1GCA_000243695.3Contig4.4364029398643
L. kmetyiP1GCA_003722295.1Chromosome4.44541604035125
L. mayottensisP1GCA_000306675.3Chromosome4.239.539443592352
L. noguchiiP1GCA_000306255.2Contig4.735.54565452045
L. sanjuanensisP1GCA_022267325.1Contig4.54541524052100
L. santarosaiP1GCA_000313175.2Chromosome44241914080111
L. stimsoniiP1GCA_003545875.1Contig4.742.547474599148
L. tipperaryensisP1GCA_001729245.1Chromosome4.642.54342425587
L. weiliiP1GCA_006874765.1Chromosome4.44143123965347
L. yasudaeP1GCA_003545925.1Contig4.445.542864162124
L. andrefontaineaeP2GCA_004770105.1Contig4.3403969389475
L. broomiiP2GCA_000243715.3Contig4.4434249420544
L. dzoumogneensisP2GCA_004770895.1Contig4.1413814372490
L. faineiP2GCA_000306235.2Contig4.343.54157411344
L. fletcheriP2GCA_004769195.1Contig3.747.53409332683
L. fluminisP2GCA_004771275.1Contig3.747.53427334285
L. haakeiiP2GCA_002812225.1Scaffold4.24039203814106
L. hartskeerliiP2GCA_002811475.1Scaffold4.140.53787370879
L. inadaiP2GCA_000243675.3Contig4.544.54314426450
L. johnsoniiP2GCA_003112675.1Contig4.141.53754371341
L. koniambonensisP2GCA_004769555.1Contig4.3394000392971
L. langatensisP2GCA_004770615.1Contig4.1453751367180
L. licerasiaeP2GCA_000526875.1Contig4.2413899383465
L. neocaledonicaP2GCA_002812205.1Scaffold4.2403978388989
L. perolatiiP2GCA_002811875.1Contig442.53712363181
L. saintgironsiaeP2GCA_002811765.1Contig4.1393830373694
L. sarikeiensisP2GCA_004769615.1Contig4.440.54026392898
L. selangorensisP2GCA_004769405.1Contig4.2403894381480
L. semungkisensisP2GCA_004770055.1Contig3.9433626356264
L. venezuelensisP2GCA_002150035.1Contig4.3394030397357
L. wolffiiP2GCA_004770635.1Contig4.2463851377180
L. cinconiaeP2GCA_040833995.1Chromosome4.1423801374160
L. abararensisS1GCA_016918735.1Contig4.2393968390068
L. bandrabouensisS1GCA_004770555.1Contig4.237.53928385771
L. biflexaS1GCA_000017685.1Chromosome4393775372649
L. bourretiiS1GCA_004770145.1Contig4.2383923384083
L. bouyouniensisS1GCA_004770625.1Contig4.1373833374687
L. brenneriS1GCA_004769295.1Contig3.938.53662359369
L. chreensisS1GCA_016919165.1Contig4.5404176408690
L. congkakensisS1GCA_004770265.1Contig4383712364765
L. ellinghauseniiS1GCA_003114815.1Contig4.237.53960392139
L. harrisiaeS1GCA_002811945.1Scaffold3.9383726365175
L. jelokensisS1GCA_004769775.1Contig4.1393833375578
L. kanakyensisS1GCA_004769235.1Contig4.138.53884381074
L. kemamanensisS1GCA_004769665.1Contig3.8393533343697
L. levettiiS1GCA_002812085.1Scaffold3.937.53655357976
L. meyeriS1GCA_004368965.1Contig4.2384028393098
L. montravelensisS1GCA_004770045.1Contig437.53760368674
L. mtsangambouensisS1GCA_004770475.1Contig4.1383836376472
L. noumeaensisS1GCA_004770765.1Contig4.138.53832375874
L. perdikensisS1GCA_004769575.1Contig438.53740368060
L. terpstraeS1GCA_000332495.2Contig4.1383932388943
L. vanthieliiS1GCA_004770365.1Contig4.1393839375386
L. wolbachiiS1GCA_000332515.2Contig4.1393956391244
L. yanagawaeS1GCA_004769275.1Contig438.53704363173
L. mgodeiS1GCA_040833985.1Chromosome4393807375255
L. milleriS1GCA_040833955.1Chromosome3.938.53628355573
L. iowaensisS1GCA_040833965.1Chromosome4.1373867381255
L. paudalimensisS1GCA_026151345.1Contig4.137.53769371158
L. soteropolitanensisS1GCA_026151335.1Contig4.137.53934386371
L. limiS1GCA_026151395.1Contig3.937.53679361960
L. idoniiS2GCA_004770995.1Contig4.1413797372473
L. ilyithenensisS2GCA_004771005.1Contig4.240.539503849101
L. kobayashiiS2GCA_003114835.3Chromosome4.340.53945390243
L. ognonensisS2GCA_004770745.1Scaffold439.53733366073
L. ryugeniiS2GCA_003114855.1Scaffold4403698365939
Table 2. The table summarizes the 77 Leptospira species analyzed, including their taxonomic subgroup distribution, CRISPR-Cas proteins detected, and their classification into classes, types, subgroups, and variants. The molecular targets cleaved by effector Cas proteins are also indicated. Color coding indicates subgroup: green for pathogenic P1, blue for pathogenic P2, yellow for saprophytic S1, and red for saprophytic S2.
Table 2. The table summarizes the 77 Leptospira species analyzed, including their taxonomic subgroup distribution, CRISPR-Cas proteins detected, and their classification into classes, types, subgroups, and variants. The molecular targets cleaved by effector Cas proteins are also indicated. Color coding indicates subgroup: green for pathogenic P1, blue for pathogenic P2, yellow for saprophytic S1, and red for saprophytic S2.
SpeciesCladeProteinsClassTypeSubtypeVariantNative Target
L. adleriP1cas1_TypeIA, cas4_TypeI-II, cas3_TypeI, cas3a_TypeI, cas2_TypeI-II-III, cas5a2_TypeIA, cas6_TypeIA, cas7b_TypeIBClass 1I, II, IIIIA, IBcas3a, cas5a2, cas7bDNA
L. ainazelensisP1Not detected-----
L. ainlahdjerensisP1cas3_TypeIClass 1I--DNA
L. alexanderiP1cse1_TypeIE, cse2_TypeIE, cas1_TypeIE, cas2_TypeIE, cas3_TypeI, cas4_TypeI-II, cas5_TypeIE, cas6_TypeIE, cas7_TypeIEClass 1I, IIIE-DNA
L. alstoniiP1cas1_TypeIA, cas2_TypeI-II-III, cas3_TypeI, cas5a2_TypeIA, cas6_TypeIE, cas7_TypeI, cas4_TypeI-II, cas5_TypeIEClass 1I, II, IIIIA, IECas5aDNA
L. barantoniiP1cas3_TypeIClass 1I--DNA
L. borgpeterseniiP1cse2_TypeIE, cse1_TypeIE, cas1_TypeIE, cas2_TypeIE, cas3_TypeI, cas5_TypeI, cas6_TypeIE, cas7_TypeIEClass 1IIE-DNA
L. ellisiiP1cas3a_TypeIClass 1I--DNA
L. gomenensisP1Not detected-----
L. kirschneriP1cas1_TypeIC, cas2_TypeI-II-III, cas3_TypeI, cas4_TypeI-II, cas5_TypeIA, cas5c_TypeIC, cas7c_TypeIC, cas8c_TypeICClass 1I, II, IIIIC, IACas5c, cas7c, cas8cDNA
L. interrogansP1cas1_TypeIC, cas2_TypeI-II-III, cas3_TypeI, cas3a_TypeI, cas4_TypeI-II, cas5c_TypeIC, cas7c_TypeIC, cas8c_TypeICClass 1I, II, IIIICCas3aDNA
L. kmetyiP1Not detected-----
L. mayottensisP1cse1_TypeIE, cse2_TypeIE, cas1_TypeIE, cas2_TypeIE, cas3_TypeI, cas5_TypeIE, cas6_TypeIE, cas7_TypeIEClass 1IIE-DNA
L. noguchiiP1cas2_TypeI-II-III, cas3_TypeI, cas3a_TypeI, cas4_TypeI-II, cas5c_TypeIC, cas7c_TypeIC, cas8c_TypeICClass 1I, II, IIIICCas3a, cas5c, cas7c,cas8cDNA
L. sanjuanensisP1Not detected-----
L. santarosaiP1cse1_TypeIE, cas1_TypeIE, cas1_TypeIA, cas2_TypeI-II-III, cas2_TypeIE, cas3_TypeI, cas5_TypeIE, cas5a2_TypeIA, cas6_TypeIE, cas6_TypeIA, cas7_TypeIE, cas7_TypeI, cas8a1a3_TypeIAClass 1I, II, IIIIE, IACas5a2, cas8a1a3DNA
L. stimsoniiP1cse1_TypeIE, cse2_TypeIE, cas1_TypeIE, cas2_TypeIE, cas3_TypeI, cas5_TypeIE, cas6_TypeIE, cas7_TypeIEClass 1IIE-DNA
L. tipperaryensisP1Not detected-----
L. weiliiP1cse1_TypeIE, cse2_TypeIE, cas1_TypeIE, cas2_TypeIE, cas3_TypeI, cas5_TypeIE, cas6_TypeIE, cas7_TypeIEClass 1IIE-DNA
L. yasudaeP1Not detected-----
L. gorisiaeP1cse1_TypeIE, cpf1_TypeU (Cas12a), cas1_TypeIE, cas1_TypeU, cas2_TypeIE, cas2_Type I-II-III, cas3_TypeI, cas4_TypeU, cas5_TypeIE, cas6_TypeIE, cas7_TypeIEClass 1
Class 2
II, VIE-DNA
L. andrefontaineaeP2cas3_TypeIClass 1I--DNA
L. broomiiP2cas1_TypeIA, cas2_TypeI-II-III, cas3_TypeI, cas5a2_TypeIA, cas6_TypeIA, cas7_TypeI, cas8a1a3_TypeIAClass 1I, II, IIIIACas5a2, cas8a1a3DNA
L. dzoumogneensisP2cas3_TypeI, cas3a_TypeIClass 1I-Cas3aDNA
L. faineiP2cse1_TypeIE, cse2_TypeIE, cas1_TypeIE, cas2_TypeIE, cas3_TypeI, cas5_TypeIE, cas6_TypeIE, cas7_TypeIEClass 1IIE-DNA
L. fletcheriP2cas1_TypeI-II-III, cas2_TypeI-II-III, cas4_TypeI-II, cas9_TypeIIBClass 2IIIIB-DNA
L. fluminisP2Not detected-----
L. haakeiiP2csm2_TypeIIIA, cas3_TypeIClass 1I, IIIIIIA-DNA and RNA
L. hartskeerliiP2cas3_TypeIClass 1I--DNA
L. inadaiP2cas1_TypeIB, cas1_TypeU, cpf1_TypeU (Cas12a), cas1_TypeI-II-III, cas2_TypeI-II-III, cas3_TypeI, cas4_TypeI-II, cas4_TypeU, cas4_TypeI-II, cas5b_TypeIB, cas6_TypeI-III, cas7b_TypeIBClass 1
Class 2
I, II, II, VIBCas5b, cas7bDNA
L. johnsoniiP2Not detected-----
L. koniambonensisP2cse1_TypeIE, cse2_TypeIE, cas1_TypeIE, cas2_TypeIE, cas3_TypeI, cas5_TypeIE, cas6_TypeIE, cas7_TypeIEClass 1IIE-DNA
L. langatensisP2Not detected-----
L. licerasiaeP2cas3_TypeIClass 1I--DNA
L. neocaledonicaP2Not detected-----
L. perolatiiP2Not detected-----
L. saintgironsiaeP2cas3_TypeIClass 1I--DNA
L. sarikeiensisP2Not detected-----
L. selangorensisP2Not detected-----
L. semungkisensisP2Not detected-----
L. venezuelensisP2Not detected-----
L. wolffiiP2Not detected-----
L. cinconiaeP2Not detected-----
L. abararensisS1cas3_TypeI, cas3a_TypeIClass 1I-Cas3aDNA
L. bandrabouensisS1cas3_TypeIClass 1I--DNA
L. biflexaS1Not detected-----
L. bourretiiS1Not detected-----
L. bouyouniensisS1cas3_TypeIClass 1I--DNA
L. brenneriS1Not detected-----
L. chreensisS1cas3a_TypeIClass 1I--DNA
L. congkakensisS1cas3a_TypeIClass 1I--DNA
L. ellinghauseniiS1Not detected-----
L. harrisiaeS1Not detected-----
L. jelokensisS1Not detected-----
L. kanakyensisS1Not detected-----
L. kemamanensisS1cas3_TypeIClass 1I--DNA
L. levettiiS1Not detected-----
L. meyeriS1Not detected-----
L. montravelensisS1Not detected-----
L. mtsangambouensisS1Not detected-----
L. noumeaensisS1Not detected-----
L. perdikensisS1Not detected-----
L. terpstraeS1Not detected-----
L. vanthieliiS1cas3a_TypeIClass 1I--DNA
L. wolbachiiS1Not detected-----
L. yanagawaeS1Not detected-----
L. mgodeiS1Not detected-----
L. milleriS1Not detected-----
L. iowaensisS1Not detected-----
L. paudalimensisS1Not detected-----
L. soteropolitanensisS1Not detected-----
L. limiS1Not detected-----
L. idoniiS2cas3_TypeI, cas3a_TypeIClass 1I-cas3aDNA
L. ilyithenensisS2cas1_TypeU, cas2_TypeI-II-III, cas3_TypeI, cas4_TypeU, cpf1_TypeU(Cas12a)Class 2I, II, III, V--DNA
L. kobayashiiS2Not detected-----
L. ognonensisS2Not detected-----
L. ryugeniiS2cse1_TypeIE, cse2_TypeIE, cas1_TypeIE, cas2_TypeIE, cas3_TypeI, cas5_TypeIE, cas6_TypeIE, cas7_TypeIEClass 1IIE-DNA
Table 3. Distribution of Cas proteins across the three functional stages of the CRISPR-Cas system in the 19 Leptospira species encoding multiple proteins: Adaptation (adapter integration), Expression (pre-crRNA processing), and Interference (effector complex formation or target cleavage). Color coding indicates subgroup: green for pathogenic P1, blue for pathogenic P2, and red for saprophytic S2.
Table 3. Distribution of Cas proteins across the three functional stages of the CRISPR-Cas system in the 19 Leptospira species encoding multiple proteins: Adaptation (adapter integration), Expression (pre-crRNA processing), and Interference (effector complex formation or target cleavage). Color coding indicates subgroup: green for pathogenic P1, blue for pathogenic P2, and red for saprophytic S2.
SpeciesAdaptation
Spacer Integration
Expression
Pre-crRNA Processing
InterferenceTaxonomic
Classification
Effector ComplexTarget Cleavage
L. adleriCas1_TypeIA, Cas2_TypeI-II-III, Cas4_TypeI-IICas6_TypeIACas5a2_TypeIA, Cas7b_TypeIBCas3_TypeI, Cas3a_TypeIClass 1 type IF2
L. alexanderiCas1_TypeIE, Cas2_TypeIE, Cas4_TypeI-IICas6_TypeIECas5_TypeIE, Cas7_TypeIE, Cse1_TypeIE, Cse2_TypeIECas3_TypeIClass 1, type IE
L. alstoniiCas1_TypeIA, Cas2_TypeI-II-III, Cas4_TypeI-IICas6_TypeIECas5a2_TypeIA, Cas5_TypeIE, Cas7_TypeICas3_TypeIClass 1 type IF2
L. borgpeterseniiCas2_TypeIECas6_TypeIECas5_TypeI, Cas7_TypeIE, Cse1_TypeIECas3_TypeIClass 1, type IE
L. gorisiaeCas1_TypeU, Cas1_TypeIE, Cas2_Type_I-II-III, Cas2_TypeIE, Cas4_TypeUCas6_TypeIECas5_TypeIE, Cas7_TypeIE, Cse1_TypeIECas12a
Cas3_TypeI
Class 1, type IE
Class 2 type V
L. interrogansCas1_TypeIC, Cas2_TypeI-II-III, Cas4_TypeI-II Cas5c_TypeIC, Cas7c_TypeIC, Cas8c_TypeICCas3_TypeI, Cas3a_TypeIClass 1 type IC
L. kirschneriCas1_TypeIC, Cas2_TypeI-II-III, Cas4_TypeI-II Cas5_TypeIA, Cas5c_TypeIC, Cas7c_TypeIC, Cas8c_TypeICCas3_TypeIClass 1 type IC
L. mayottensisCas1_TypeIE, Cas2_TypeIECas6_TypeIECas5_TypeIE, Cas7_TypeIE, Cse1_TypeIE, Cse2_TypeIECas3_TypeIClass 1, type IE
L. noguchiiCas2_TypeI-II-III, Cas4_TypeI-II Cas5c_TypeIC, Cas7c_TypeIC, Cas8c_TypeICCas3_TypeI, Cas3a_TypeIClass 1 type IC
L. santarosaiCas1_TypeIA, Cas1_TypeIE, Cas2_TypeI-II-III, Cas2_TypeIECas6_TypeIA, Cas6_TypeIECas5_TypeIE, Cas5a2_TypeIA, Cas7_TypeIE, Cas7_TypeI, Cas8a1a3_TypeIA, Cse1_TypeIECas3_TypeIClass 1 type B
L. stimsoniiCas1_TypeIE, Cas2_TypeIECas6_TypeIECas5_TypeIE, Cas7_TypeIE, Cse1_TypeIE, Cse2_TypeIECas3_TypeIClass 1, type IE
L. weiliiCas1_TypeIE, Cas2_TypeIECas6_TypeIECas5_TypeIE, Cas7_TypeIE, Cse1_TypeIE, Cse2_TypeIECas3_TypeIClass 1, type IE
L. broomiiCas1_TypeIA, Cas2_TypeI-II-IIICas6_TypeIACas5a2_TypeIA, Cas7_TypeI, Cas8a1a3_TypeIACas3_TypeIClass 1 type B
L. faineiCas1_TypeIE, cas2_TypeIECas6_TypeIECas5_TypeIE, Cas7_TypeIE, Cse1_TypeIE, Cse2_TypeIECas3_TypeIClass 1, type IE
L. fletcheriCas1_TypeI-II-III, Cas2_TypeI-II-III, Cas4_TypeI-IIRNAse III Cas9_TypeIIBClass 2 type IIB
L. inadaiCas1_TypeIB, Cas1_TypeU, Cas1_TypeI-II-III, Cas2_TypeI-II-III, Cas4_TypeI-II, Cas44_TypeU, Cas4_TypeI-IICas6_TypeI-IIICas5b_TypeIB, Cas7b_TypeIBCas3_TypeI, Cas12aClass 1 type IF2
Class 2 type V
L. koniambonensisCas1_TypeIE, Cas2_TypeIECas6_TypeIECas5_TypeIE, Cas7_TypeIE, Cse1_TypeIE, Cse2_TypeIECas3_TypeIClass 1, type IE
L. ilyithenensisCas1_TypeU, Cas2_TypeI-II-III, Cas4_TypeU Cas3_TypeI, Cas12aClass 2 type V
L. ryugeniiCas1_TypeIE, Cas2_TypeIECas6_TypeIECas5_TypeIE, Cas7_TypeIE, Cse1_TypeIE, Cse2_TypeIECas3_TypeIClass 1, type IE
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Peláez Sánchez, R.G.; González Restrepo, J.; Pineda, S.; Cuartas-López, A.M.; Martínez Garro, J.M.; Torres-Castro, M.; Urrego, R.; López-Rojas, L.E.; Salazar Florez, J.E.; Monroy, F.P. Bioinformatic Identification of CRISPR–Cas Systems in Leptospira Genus: An Update on Their Distribution Across 77 Species. Pathogens 2025, 14, 1044. https://doi.org/10.3390/pathogens14101044

AMA Style

Peláez Sánchez RG, González Restrepo J, Pineda S, Cuartas-López AM, Martínez Garro JM, Torres-Castro M, Urrego R, López-Rojas LE, Salazar Florez JE, Monroy FP. Bioinformatic Identification of CRISPR–Cas Systems in Leptospira Genus: An Update on Their Distribution Across 77 Species. Pathogens. 2025; 14(10):1044. https://doi.org/10.3390/pathogens14101044

Chicago/Turabian Style

Peláez Sánchez, Ronald Guillermo, Juanita González Restrepo, Santiago Pineda, Alexandra Milena Cuartas-López, Juliana María Martínez Garro, Marco Torres-Castro, Rodrigo Urrego, Luis Ernesto López-Rojas, Jorge Emilio Salazar Florez, and Fernando P. Monroy. 2025. "Bioinformatic Identification of CRISPR–Cas Systems in Leptospira Genus: An Update on Their Distribution Across 77 Species" Pathogens 14, no. 10: 1044. https://doi.org/10.3390/pathogens14101044

APA Style

Peláez Sánchez, R. G., González Restrepo, J., Pineda, S., Cuartas-López, A. M., Martínez Garro, J. M., Torres-Castro, M., Urrego, R., López-Rojas, L. E., Salazar Florez, J. E., & Monroy, F. P. (2025). Bioinformatic Identification of CRISPR–Cas Systems in Leptospira Genus: An Update on Their Distribution Across 77 Species. Pathogens, 14(10), 1044. https://doi.org/10.3390/pathogens14101044

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop