Next Article in Journal
Prevalence and Risk Factors of Borrelia burgdorferi Sensu Lato IgG Antibodies Among Blood Donors in Western Romania
Previous Article in Journal
Plesiomonas shigelloides Bacteremia: A Scoping Review of Epidemiology, Clinical Characteristics, Outcomes, and Implications of Antimicrobial Stewardship
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Resequencing and De Novo Assembly of Leishmania (Viannia) guyanensis from Amazon Region: Genome Assessment, Phylogenetic Insights and Therapeutic Targets

by
Lucas George Assunção Costa
1,
Edivaldo Costa Sousa Junior
1,
Camila Cristina Cardoso
1,
Millena Arnaud Franco da Igreja
1,
Franklyn Samudio Acosta
2,3,
Fabiano Reis da Silva
1 and
Lourdes Maria Garcez
1,4,*
1
Laboratório de Epidemiologia das Leishmanioses, Seção de Parasitologia, Instituto Evandro Chagas, Rodovia BR 316, km 07, s/n, Ananindeua 67.030-000, PA, Brazil
2
Instituto Conmemorativo Gorgas de Estudios de la Salud, Avenida Justo Arosemena, Panama City P.O. Box 0816-02593, Panama
3
Facultad de Ciencias Naturales Exactas y Tecnología, Universidad de Panamá, Panama City P.O. Box 0824-00073, Panama
4
Departamento de Patologia, Centro de Ciências Biológicas e da Saúde, Universidade do Estado do Pará, Travessa Perebebui, 2623, Belém 66.087-662, PA, Brazil
*
Author to whom correspondence should be addressed.
Pathogens 2026, 15(1), 124; https://doi.org/10.3390/pathogens15010124
Submission received: 18 December 2025 / Revised: 14 January 2026 / Accepted: 16 January 2026 / Published: 22 January 2026
(This article belongs to the Section Parasitic Pathogens)

Abstract

Leishmania guyanensis is one of 15 American human-pathogenic species, frequently linked to therapeutic failure due to its marked genetic plasticity and adaptability under drug pressure. To broaden the genomic understanding of this species, its biological traits, and potential therapeutic alternatives, we sequenced the L. guyanensis strain MHOM/BR/75/M4147. Raw reads underwent quality-filtering and assembly. Taxonomic classification utilized BLASTn and Kraken2, confirming that 99.95% of contigs matched Leishmania. The assembled genome size was 31 Mb, with an N50 of 4743 bp and 40.85× coverage. Variant calling subsequently identified 36,665 SNPs, 8210 indels, and chromosomal aneuploidies. Genomic annotation identified 3119 proteins with known molecular functions in L. guyanensis, alongside 6371 orthologous genes shared with L. major and L. panamensis. The search for pharmacological relevance yielded ten candidate genes, including one calpain and nine GSK3 family members. Phylogenetic reconstruction using the polA1 gene consistently grouped L. guyanensis, demonstrating strong discriminatory capacity, with L. martiniquensis emerging as the most divergent species. Overall, these findings expand the available genomic framework for L. guyanensis and support advances in species-specific diagnostic approaches.

Graphical Abstract

1. Introduction

The Leishmania genus (Kinetoplastida; Trypanosomatidae) includes species responsible for a spectrum of tropical zooanthroponotic diseases known as leishmaniasis. The common clinical manifestations of the disease are cutaneous leishmaniasis (CL), mucocutaneous leishmaniasis (ML), and visceral leishmaniasis (VL). The protozoans of this genus are transmitted to mammal reservoirs, including humans, by the bite of several phlebotomous species (Psychodidadae: Phlebotominae), which are broadly spread in forest areas [1,2,3].
The diverse species of the Leishmania genus cluster in four subgenera: Leishmania (Viannia), Leishmania (Leishmania), Leishmania (Sauroleishmania), and Leishmania (Mundinia). The first subgenus includes Leishmania pathogens broadly distributed in the Americas. Nearly fifty Leishmania species are currently known; twenty are human pathogens, including 15 endemics in the Americas [4,5]. The Leishmania (Viannia) subgenus is characterized by its development in the hindgut of the sandfly vector (peripylarian development), a distinct feature from the Leishmania (Leishmania) subgenus (suprapylarian development) [6]. While Leishmania (Viannia) species, such as L. braziliensis and L. guyanensis, are the primary agents of cutaneous and mucocutaneous leishmaniasis in the Neotropics, they remain less familiar to some international clinical settings compared to the Leishmania (Leishmania) subgenus, which has a broader global distribution including both the Old and New World. Consequently, it is mandatory to carry out studies on the comparative genomics of the species housed in this subgenus that may reveal differences useful to explain how biological traits of Leishmania (Viannia) parasites impact medically relevant characteristics such as drug resistance, clinical outcome, virulence, and pathogenicity.
Formerly, CL and ML complications were commonly associated with L. (V.) braziliensis, the most disseminated species in the Americas. However, it is currently known that L. (V.) guyanensis causes not only single skin lesions (or multiple lesions at a determined frequency) but also nasopharyngeal mucosa destruction in more severe cases [7,8]. This species occurs in several Latin American countries such as Panama, Brazil, Bolivia, Colombia, French Guiana, Suriname, Venezuela, Ecuador, and Argentina [9,10,11]. Leishmania (Viannia) guyanensis transmission is strongly related to human displacement through heavy vegetation where the presence of mammal species such as Choleopus didactylus, Tamandua tetradactyla, and Didelphis marsupialis, natural reservoirs of this parasite, occurs. In Brazil, the L. (V.) guyanensis distribution is especially broad in the Amazon region, where drug failure rates are high, and a different treatment choice is mandatory due to the predominance of this Leishmania species [12,13].
Under pressure, L. (V.) guyanensis develops adaptive mechanisms capable of inducing drug uptake/efflux modifications in the first-line therapy used to treat CL, which causes treatment failure [14,15]. These and other biological characteristics, including virulence, pathogenicity, proliferative capacity, and immune modulation, are directly related to pathogens’ genomes [16].
Genomic plasticity is a common feature of the Leishmania genus, including aneuploidy and chromosomal fission/fusion events. Moreover, the kinetoplastid genome organization shows genes grouped in polycistronic transcription units and an evident scarcity of introns in their genes. This fact evidences a basal genomic structure without most of the more complex modular genetic elements found in other eukaryotes [17,18]. The L. (V.) guyanensis genome has an average size of 32 Mb, containing nearly 8000 protein-coding genes and 35 chromosomes due to the fusion of chromosomes 20 and 34 [16]. When thoroughly described, this genomic data represents a source of accurate and robust information necessary for understanding parasite adaptability and survival strategies that might contribute to developing efficient therapeutic interventions.
In the middle of several initiatives aimed at sequencing genomes of parasites from the Leishmania genus [16,19,20], this study describes the genome of a L. (V.) guyanensis strain from the Amazon, a Leishmania species poorly investigated [21].
This study aimed to perform the resequencing and de novo assembly of the Leishmania (Viannia) guyanensis strain MHOM/BR/75/M4147, followed by comprehensive genomic annotation to characterize its functional architecture and identify potential therapeutic targets. Despite the epidemiological importance of L. guyanensis in the Americas, high-quality reference genomes for this species remain scarce, limiting comparative genomics and the identification of species-specific markers. To address this gap, we also evaluated the polA1 gene as a phylogenetic marker to assess its ability to discriminate species and complexes within the Viannia subgenus. Together, these analyses provide a robust genomic baseline to support improved diagnostics, targeted therapies, and future molecular and epidemiological studies of cutaneous and mucocutaneous leishmaniasis.

2. Materials and Methods

2.1. Ethics

This study does not involve human subjects or tissue samples but only Leishmania promastigotes and computational analyses; therefore, ethical approval is not required.

2.2. DNA Extraction and Sequencing

The Leishmania guyanensis strain was provided by a Leishmania Collection (CLIOC) that is part of the Network of Biological Resource Centers for Conformity Assessment of Biological Material, Instituto Oswaldo Cruz, Rio de Janeiro, Brazil. The genomic DNA of the L. (V.) guyanensis strain (MHOM/BR/75/M4147) was isolated using the Wizard® Genomic DNA Purification kit (Promega, Madison, WI, USA) following the manufacturer’s instructions, rehydrating the whole DNA in a final volume of 50 µL. The genomic library of this parasite was then prepared using the Nextera XT DNA library preparation kit (Illumina, Inc., San Diego, CA, USA) following the manufacturer’s protocols to subsequently be sequenced by the NextSeq 500 (Illumina, Inc., San Diego, CA, USA) platform using the NextSeq 500/550 High Output Kit v2.5 (300 cycles) and a paired-end sequencing approach.

2.3. Reads Quality Assessment

To get rid of adapters and low-quality reads (Phred < 20) after the sequencing process, we used the Fastp software v.0.20.1 [22] and the software FastQC v.0.12.1 [23] to depict read-quality data.

2.4. Genome Assembly

We used a de novo assembly approach employing no reference genome to assemble all reads in contigs using the MEGAHIT v.1.2.9 assembly software. This assembler utilizes a succinct Bruijn Graph (SdBG) and an iterative multi k-mer strategy, which efficiently handles large datasets and improves the recovery of low-coverage and repetitive regions [24]. To create scaffolds from the assembled contigs, we used the software SSPACE v.3.0 [25].

2.5. Taxonomical Classification of Reads and Contigs

Assembled contigs resolve repetitive regions of the complex Leishmania genome, minimizing ambiguities that raw reads cannot solve alone. This provides the long consensus sequences necessary for accurate gene annotation and species differentiation. To perform the taxonomical classification of reads and contigs obtained, we first infer the sequence homology of these sequences by a local alignment with sequences from a local Leishmania DNA database (Supplementary Material S1) using the BLASTn algorithm v.2.5.0 [26]. After sequence homology determination, we inferred the taxonomical status of reads and contigs by performing a classification based on k-mer and adding taxonomical tags to each sequence using Kraken2 software v.2.1.2 [27] with the PlusPFP-16 database, which contains RefSeq sequences for archaea, bacteria, viruses, humans, protozoa, fungi, plants, and the UniVec library for screening synthetic vector contaminants. For the visualization and analysis of the contig classification data, the Pavian v.1.0. application was employed [28]. This classification allowed us to extract all contigs belonging to Leishmania to perform the genomic annotation of L. (V.) guyanensis (MHOM/BR/75/M4147) subsequently.

2.6. Genome Annotation

The genomic annotation of this parasite was performed using the AUGUSTUS software v.3.5 [29], using as a reference the genomic structure of Leishmania (Sauroleishmania) tarentolae. We constructed a local genomic database of 69 Leishmania genomes (Supplementary Material S1) retrieved from GenBank to compare genomic annotations and analyze and correct open reading frames (ORFs) positions and structures. In this way, we provided more accurate evidence of gene function during the gene annotation of this parasite. To depict the genome and manually edit ORFs obtained during the annotation process, we used the bioinformatics suite Geneious v.8.1.4 [30]. The manual edition was finished when we analyzed the base quality and integrity of each ORF.

2.7. Gene Orthologs Assessment

The genomic data of L. (V.) guyanensis (MHOM/BR/75/M4147) was used to perform a Pangenome analysis aimed at identifying ortholog genes comparing this genome to the genome assembly of Leishmania (V.) shawi (GCA_962240455), Leishmania (V.) panamensis (GFC_000755165.1) and Leishmania (L.) major (GCF_000002725.2). To do so, we used the software BLASTp v2.5.0 [31] and the statistical software R v.4.1.2. using the Venn library [32].

2.8. Drug Target Identification

A structure-guided homology-based strategy was employed to screen for candidate drug targets associated with leishmaniasis. For this purpose, a curated local reference dataset was assembled comprising amino acid sequences derived from experimentally resolved three-dimensional protein structures with established relevance in antiparasitic drug research. These structures, previously explored in ongoing studies conducted in our laboratory (unpublished data), were obtained from the RCSB Protein Data Bank and included representatives of key molecular targets such as calpains (PDB IDs: 1TLO, 1MDW, 1ZCN), the dual-specificity kinase DYRK1A (3ANQ), glycogen synthase kinase 3 (GSK3; 7S6V), and an ATP-binding cassette (ABC) transporter (7OJ8). Predicted protein sequences from the analyzed genomes were queried against this reference dataset using BLASTp from the BLAST+ package (version 2.15.0) [33], with the aim of detecting conserved functional domains indicative of pharmacological relevance. To minimize spurious matches and enhance biological interpretability, BLASTp outputs were subjected to a stringent post-processing step. Only alignments that satisfied the following criteria were retained: (i) amino acid identity ≥ 35%, (ii) alignment length ≥ 60 residues, (iii) e-value ≤ 1 × 10−3 and (iv) bitscore ≥ 30. These thresholds are widely accepted in the field for detecting functionally relevant homologies with a moderate degree of conservation. These cutoffs are commonly adopted in protein homology analyses to balance sensitivity and specificity when identifying moderately conserved functional relationships [34]. Following filtering, retained matches were prioritized using a composite ranking scheme that integrated multiple alignment attributes, namely percentage identity, alignment coverage, statistical confidence expressed as the logarithm of the e-value, and bitscore. The top 200 highest-ranked hits were selected and subsequently merged into a unified dataset, enabling a global and comparative assessment of candidate drug targets across the genome.

2.9. Mapping to Reference Genome, Genome Coverage, and Variant Call

The mapping to the reference genome of L. (V.) guyanensis was performed using Bowtie2 [35], allowing an assessment of chromosomal coverage. This read mapping also allowed us to determine chromosome ploidy (chromosome copy number for each genome) by dividing each chromosome coverage by half of each 35 chromosome’s average coverage and assuming a diploid organism. Single-nucleotide polymorphisms (SNPs) and insertion/deletion (indels) events were identified using BCFtools v1.2 [36] to detect nucleotide differences between the resequencing strain of L. (V.) guyanensis (MHOM/BR/75/M4147) and the reference genome of this species (GenBank CP103914-CP103949).

2.10. Phylogenetic Inference

The DNA polymerase alpha catalytic subunit (polA1) was selected as the best molecular target to infer Leishmania phylogeny after evaluating its phylogenetic signal with the TREE-PUZZEL v.5.3 algorithm [37] using the likelihood-mapping method (Supplementary Material S2). To assess the phylogenetic signal, we used 69 Leishmania genomes (Supplementary Material S1). The polA1 gene from these genomes was identified and extracted using the software Geneious v.8.1.4 to perform a phylogenetic analysis using the maximum likelihood method (ML) included in the IQTREE2 software v.2.0.7 [38] after selecting the best nucleotide substitution model by the algorithm of the software. The software parameters were set up to find the most likely tree topology, testing each clade’s robustness by bootstrap analysis. The choice of the best phylogenetic tree considered the genetic identity threshold among Leishmania species inferred by a sequence identity matrix constructed by the software. All software parameters were set to standard conditions unless another configuration was required.

3. Results

3.1. Quality Control and Genome Assembly

The genome sequencing of the L. (V.) guyanensis (MHOM/BR/75/M4147) produced 9,847,950 reads with an average size of 151 bp. After filtering low-quality reads and excluding adapters and artifacts from the sequencing process, we obtained 8,839,174 high-quality reads, a reduction of 10%. The de novo assembly produced 14,097 contigs with an N50 value of 4743 bp, an average contig size of 2239 bp, and an average genome coverage of 40.85×. The number of scaffolds constructed was identical to the number of contigs (14,097). The genome size of the L. (V.) guyanensis was 31,565,639 bp (31 Mb) with a G + C content of 57.44%. Consequently, we obtained a new complete genome assembly (whole genome shotgun) for the Leishmania isolate analyzed herein. Table 1 gathers all the genome assembly results obtained in this study and those produced in earlier Leishmania genomic studies. The genome assembly for L. (V.) guyanensis (MHOM/BR/75/M4147) was deposited in the GenBank database under the accession number GCA_051201275.1. The data is publicly available at https://www.ncbi.nlm.nih.gov/datasets/genome/GCA_051201275.1/ (accessed on 12 January 2026).

3.2. Taxonomical Assessment

Most of the contigs produced in this study (99.95%, 14,090 contigs) were found to be related to the Leishmania genus (Supplementary Material S3). All selected contigs showed an average nucleotide identity value of 96.29% with Leishmania spp. The contigs tagged as belonging to Leishmania spp. were then used in the genomic annotation process.

3.3. Variants Call and Chromosome Coverage

We searched for variants and identified 36,665 SNPs distributed through the 35 L. (V.) guyanensis chromosomes and 8210 indels (Supplementary Material S4). Reads mapping to the L. (V.) guyanensis reference genome (GenBank CP103914-CP103949) identified chromosomes with coverage values between 41× and 111× (Supplementary Material S4). Additionally, there was great heterogeneity in the chromosome ploidy (values between 1.2 and 3.4), highlighting chromosomes 2 and 8 that presented a condition close to trisomy. The ploidy variations throughout L. (V.) guyanensis chromosomes are depicted in Figure 1.

3.4. Genomic Annotation

The gene annotation of the L. (V.) guyanensis (MHOM/BR/75/M4147) revealed a lack of introns and allowed us to identify 7698 genes, 93.42% (7192) of which were associated with known molecular functions. After analyzing all the annotated data, we identified 3119 different proteins related to the metabolic pathways of the parasite. The proteins with more associated genes were calpain-like cysteine peptidase (155), protein kinase domain-containing protein (114), and transmembrane protein (75). Figure 2 depicts the main 20 relevant proteins identified during the annotation process.

3.5. Gene Orthologs Analysis

The gene annotation of L. (V.) guyanensis (MHOM/BR/75/M4147) allowed us to study the Pangenome, ortholog genes, and core genome that altogether represent the entire group of annotated genes extant in a particular taxonomic group [5]. Therefore, we identified 6371 genes shared by L. (V.) guyanensis, L. (V.) panamensis, L. (V.) shawi, and L. (L.) major. Fourteen of these genes were exclusive to L. (V.) panamensis, 13 of L. (V.) guyanensis, 23 of L. (V.) shawi, and 299 of L. (L.) major. The Pangenome of these four Leishmania species encompasses 7791 genes. Regarding the Leishmania species core genome, each species showed a characteristic genetic expansion (expansion in ortholog copy numbers). For example, we found 22 expanded genes in L. (V.) guyanensis MHOM/BR/75/M4147 (Supplementary Material S5). The Pangenome analysis result is depicted in Figure 3.

3.6. Identification of Therapeutic Targets

Protein homology analysis in Leishmania (Viannia) guyanensis led to the identification of ten candidate genes with potential pharmacological relevance, as summarized in Table 2, with a marked predominance of proteins belonging to the GSK3 family (9/10 genes), in addition to a single representative of the calpain family. The gene g7485.t1_1, annotated as GSK3, exhibited the highest quality score (0.968), accompanied by very high amino acid identity (92.96%), an extensive alignment length (355 residues), and an exceptionally high bitscore (701), indicating strong structural and functional conservation relative to the reference target. Collectively, these parameters support this gene as the most robust pharmacological candidate identified in the analyzed genome.
The remaining GSK3-annotated genes displayed intermediate quality scores, ranging from 0.37 to 0.64, corresponding to alignments with moderate amino acid identity (35–53%), alignment lengths between 68 and 135 residues, and highly significant e-values (≤10−10). The single calpain-annotated gene (g7477.t1_1) showed a quality score comparable to those of the intermediate GSK3 candidates (0.57), despite a shorter alignment length, suggesting the presence of partially conserved functional domains (detailed results are provided in Table 2). The distribution of quality scores, shown in Figure 4, reveals a clear hierarchical organization of candidate targets, with a strong predominance of GSK3-related proteins, reinforcing this family as the principal set of potential drug targets in L. (V.) guyanensis.

3.7. Phylogenetic Tree Reconstruction

As a result of the phylogenetic analysis, the four Leishmania subgenera were grouped into independent monophyletic clusters supported by bootstrap values higher than 90%. Furthermore, all Leishmania complexes within each subgenus clustered into specific subclades supported by bootstrap values around 90% or higher. Regarding the sequence identity analyses, the species from the subgenera Leishmania (Leishmania), Leishmania (Sauroleishmania), Leishmania (Viannia), and Leishmania (Mundinia) showed sequence identity values of 90%, 99.8%, 96.8% and 84.7%, respectively (Figure 5).

4. Discussion

4.1. Genome Assembly Integrity and Taxonomic Resolution

The genome assembly of L. (V.) guyanensis (MHOM/BR/75/M4147) performed herein strengthens the extant genomic data of this Leishmania species, providing excellent values of completeness and coherence when compared with the current L. (V.) guyanensis reference genome. An N50 value of 6029 bp supported the good quality of the genome assembly as the parameter suggested that most of the contigs were long enough and well-assembled. The clear evidence that the scaffold number obtained was identical to the contigs number also supports the completeness of the assembled contigs, which are long enough and need no additional connections as links or bridges to create scaffolds. This fact reflects the good quality of the genome assembly obtained by the de novo method, a significant advancement considering that other Leishmania species of epidemiological importance still lack high-quality genome assembly metrics (Table 1). Certainly, the quality filtering and taxonomical identification of the reads were critical for obtaining the correct genome structure [40], since 9% of the reads were classified as sequencing artifacts.
The reference genome of L. (V.) guyanensis (GenBank CP103914–CP103949) presented an average coverage value of 27.0×, while the coverage value obtained in this study for the same Leishmania strain was 40.85×. This higher coverage might have helped to acquire more accurate results in other genome-based analyses, such as SNP identification and variant calls.
As the L. (V.) guyanensis genome assembly performed in this study, the assembly reported earlier for the L. (V.) guyanensis LgCL085 genome [39] presented good assembling parameters. However, a different bioinformatic workflow, including assembly corrections, gap filling, and junction joining, was necessary to complete the genome assembly. Both L. (V.) guyanensis (LgCL085) and L. (V.) panamensis (PSC-1) genomes available in the GenBank were characterized via de novo assembly, representing the only strains from the L. (V.) guyanensis complex with complete genomes available in genome databases. The genomic characterization of L. (V.) guyanensis (MHOM/BR/75/M4147) performed in this study complements the genomic data of this species, providing indicative parameters of quality and completeness lacking in most genomic studies of Leishmania. Generally, the genome assembly of L. (V.) guyanensis (MHOM/BR/75/M4147) obtained herein showed similar genomic characteristics as the ones previously reported for L. (V.) braziliensis [41], Leishmania (Leishmania) amazonensis [42], and L. (V.) panamensis [16], which present genome sizes of nearly 30 Mb, average G + C content of 57%, and a value of predicted gene quantity close to 8000. This result indicates convergence among methods used to assemble Leishmania genomes, highlighting the importance of resequencing previously characterized Leishmania species, especially those with intrinsic epidemiological and clinical relevance.

4.2. Chromosomal Aneuploidy and Genomic Plasticity

Aneuploidy, defined by chromosomal copy number variation (CNV), is a widely recognized trait in Leishmania, directly associated with the parasite’s adaptive plasticity and resulting in cellular mosaicism [43,44]. Ploidy analysis demonstrated that the resulting karyotypic profile is highly dependent on the reference genome employed for read mapping. When utilizing the homologous L. (V.) guyanensis (M4147) reference (characterized by a continuous and gapless assembly), chromosomes 2 and 8 exhibited values approaching trisomy, suggesting lineage-specific amplification events. In contrast, mapping against the L. (V.) braziliensis (M2904) reference revealed a distinct pattern, with chromosomes 8 and 31 showing supernumerary status.
This discrepancy is particularly relevant for chromosome 31, a recognized evolutionarily conserved copy number variation (CNV) locus recurrently associated with aneuploidy, which has been linked to phenotypic adaptation, including stress responses and gene expression modulation, conferring a potential selective advantage [45,46]. The observed distinctions between ploidy profiles likely reflect the interplay between strain-intrinsic characteristics (possibly influenced by conservation history) and methodological factors associated with quality metrics, architecture, and structural differences between official, despite their phylogenetic proximity, reference genomes. Thus, while the frequent detection of supernumerary chromosome 31 reinforces its constitutively polysomic nature, aneuploidy must be considered a dynamic and genome context-dependent phenomenon in Leishmania.
A more significant deviation from the disomic state was observed during the read mapping in chromosomes 2 and 8. This fact also supports that the genomic reference choice significantly affects karyotype even among species belonging to the same subgenus, highlighting the importance of using a specific reference genome from the same species to perform more accurate analyses. Although L. (V.) guyanensis (MHOM/BR/75/M4147) was mapped against its own reference genome, 36,665 SNPs were identified. These variations likely reflect micro-evolutionary changes during independent laboratory cultivation. Notably, as specific information regarding the in vitro passage history of the reference genome was unavailable, this factor, combined with the use of different sequencing technologies, may account for the observed genetic differences. Such variability often arises during parasite culture in response to distinct microenvironments, leading to adaptive genetic shifts that favor survival under varying environmental stimuli [47].

4.3. Genomic Annotation and Comparative Orthology

The study carried out herein complements the genetic architecture previously described for L. (V.) guyanensis (MHOM/BR/75/M4147) (reference), identifying genes (7726) and describing functional proteins. In this sense, the gene ontology analyses of the L. (V.) guyanensis (LgCL085) revealed the presence of 8230 protein-coding genes. Most of these proteins were associated with ortholog genes reported in the Viannia subgenus whose copy numbers vary from one species to another [39]. Due to the polycistronic nature of this genus, each protein is not strictly associated with one gene, and there is also no coding DNA and ORFs coding for more than one protein. This fact was verified after finding a high level of genes associated with one functional protein. The initial description of 3128 protein-coding genes in L. (V.) guyanensis might aid in disclosing the translational composition of this species, a fact of paramount importance for understanding its cellular and metabolic processes [41].
The gene annotation was of value to identify essential elements, including start and stop codons, exon-intron junctions, protein-coding regions, and promoters and regulatory regions. As a result, we observed a lack of introns in most of the L. (V.) guyanensis protein-coding genes, evidence previously reported in genomic studies of Leishmania earlier [48], and that could be linked to inherent polycistronic characteristics of the Leishmania genus. This polycistronic region includes a DNA region processed during transcription to produce a long polycistronic RNA including non-related genes organized into a genomic architecture resembling the prokaryotes’ gene organization [49,50].
The workflow described herein identifies proteins and ortholog genes in microorganisms in general and has also been used earlier to assess Leishmania spp. Genomes [39]. Using this strategy, we identified ortholog genes shared by L. (V.) guyanensis, L. (V.) panamensis, and L. (L.) major and a reduced group of genes (48) restricted to L. (V.) guyanensis. Among these three species, L. (L.) major showed the highest number of genes unrelated to the other species (1631), probably because it belongs to a different Leishmania subgenus. Conversely, we found fewer exclusive genes between L. (V.) panamensis and L. (V.) guyanensis. Generally, gene ortholog studies have indicated low quantities of species-specific genes in Leishmania [16,19,36]. Additionally, we observed a trend indicating that closely related Leishmania species from the same complex share more ortholog genes in the Pangenome than those Leishmania parasites from different subgenus. This might be explained by the great divergency bridge between Leishmania spp. from different subgenus due to the parasites’ adaptation to conditions found in each transmission cycle. This biological adjustment of parasites from different Leishmania subgenera during the evolutionary process might explain the functional diversity found in the Leishmania genus nowadays. Indeed, the functional diversity produced in this way might account for the differential group of orthologs belonging to the different taxonomic units within the Leishmania spp.

4.4. GSK3 and Calpain as Conserved Therapeutic Targets

The marked predominance of GSK3 family members among the highest-ranked candidate genes in Leishmania (Viannia) guyanensis highlights the central role of this kinase as a highly conserved and biologically relevant pharmacological target in this taxon [51]. The identification of nine GSK3-annotated genes among the ten top-scoring candidates reflects both the robustness of the structure-guided homology-based strategy employed in this study and the functional importance of this protein family in trypanosomatids [52]. GSK3 has been consistently implicated in the regulation of essential cellular processes, including cell-cycle control, intracellular signaling, and parasite adaptation to host-derived stresses, all of which are critical for parasite survival and pathogenicity [53].
Within this context, the gene g7485.t1_1, which exhibited the highest quality score, emerges as a particularly strong candidate for pharmacological exploration. The combination of high amino acid identity, extensive alignment coverage, and exceptional statistical significance suggests the presence of a complete and structurally conserved catalytic domain. Such features are characteristic of essential enzymes and are commonly associated with high-value therapeutic targets. The pronounced conservation of GSK3 observed in L. (V.) guyanensis is consistent with previous studies demonstrating its functional indispensability in Leishmania species, while also indicating the existence of structural, regulatory, and kinetic differences relative to mammalian homologs that may be exploited to achieve molecular selectivity [54,55].
In addition to GSK3, the identification of a calpain family member, albeit with an intermediate quality score, supports the potential relevance of this class of proteases as complementary drug targets [56]. Parasite calpains have been associated with protein turnover, differentiation, and host–parasite interactions, and the detection of partially conserved domains suggests that these proteins may remain amenable to pharmacological modulation [57]. The shorter alignment length observed for the calpain candidate in L. (V.) guyanensis likely reflects species-specific structural divergence or partial domain conservation, a pattern frequently reported for this protein family in trypanosomatids [58].
Overall, the prioritization profile obtained in this study—based on objective sequence homology criteria, statistical significance, and composite quality score ranking—shows strong concordance with the existing body of literature on validated and emerging drug targets in trypanosomatids. These findings reinforce GSK3 as the principal conserved and exploitable therapeutic target in L. (V.) guyanensis and underscore the effectiveness of structure-guided homology-based screening as a robust framework for rational drug target discovery in Leishmania genomes, with potential applicability across multiple species of the genus [54,55,59].

4.5. Taxonomic Validation Through polA1 Phylogeny

The phylogenetic analysis was important for validating the genome assembly data of L. (V.) guyanensis (MHOM/BR/75/M4147) because it might explain any discrepancies found during the assembly process [60]. The analysis using TREE-PUZZLE v.5.3 showed a high phylogenetic signal of the polA1 gene, favoring the construction of a well-supported phylogenetic tree. This molecular target has been used to identify Leishmania species [61,62], showing to be a potential candidate to differentiate cryptic species such as L. (V.) guyanensis and L. (V.) panamensis, Leishmania species associated with tegumentary leishmaniasis in the Americas. This fact is important because, in the last 24 years, the validation of Leishmania species has been a matter of discussion. The main taxonomical problem lies in the initial Leishmania taxonomical proposal based mainly on the biological and eco-epidemiological characteristics of the current species [63,64,65].
The molecular marker polA1 was previously employed to phylogenetically distinguish and group Leishmania (Mundinia) martiniquensis in cases where Multilocus Enzyme Electrophoresis (MEE) analysis was unable to differentiate this species [61]. In this study, the phylogenetic analysis using the polA1 gene clustered in different clades of cryptic species, some Leishmania species from the same complex, such as Leishmania (Leishmania) donovani and L. (L.) infantum; L. (V.) guyanensis and L. (V.) panamensis; L. (V.) braziliensis and Leishmania (Viannia) peruviana. All sequences from L. (V.) guyanensis and L. (V.) panamensis species clustered into different monophyletic clades, unlike studies based on an hsp70-based approach that presented problems differentiating these cryptic species [66]. Phylogeny based on the hsp70 gene in Leishmania has been of help with the subgenera assignment and raised some important questions on the validity of some species belonging to the L. (V.) guyanensis complex [67]. The phylogenetic analysis performed herein showed the potential of the polA1 gene for the Leishmania complex validation when compared with the Leishmania hsp70 gene region. The main pitfall reported in hsp70-based protocols is their problems discerning some species from the Viannia subgenus since the geographical distribution of the Leishmania species in the Americas has favored a high ancestral genetic flux [68].
Even using the Multilocus Sequence Typing approach (MLST), the phylogenetic differentiation between L. (V.) guyanensis and L. (V.) panamensis from the L. (V.) guyanensis complex is still problematic [39]. The advances in the etiological characterization of the Leishmania genus have suggested modifications in the taxonomic nomenclature of species [69,70] and the validation of complexes within subgenera [71]. Therefore, DNA-sequence-based strategies are recommended instead of traditional isoenzyme-based methods. In line with this, we adopted a molecular approach based on the polA1 gene. The phylogenetic species concept is based on creating a monophyletic group with characteristics different from other groups and following a model of parental ancestry [71]. Consequently, the tree topology obtained herein using the polA1 gene from Leishmania species suggests the L. (V.) guyanensis species complex validation. Furthermore, the phylogenetic tree indicated nucleotide identity grades of at least 90% among species from the same subgenus, except the Leishmania (Mundinia) subgenus (84.7%). This parameter indicates the number of identical sites among clade components. Observing the tree topology, it is evident that L. (M.) martiniquensis showed the highest phylogenetic distance when compared visually with the rest of the species from the Mundinia subgenus, therefore, influencing the global nucleotide identity value assessed for this group. Indeed, when removing the L. (V.) martiniquensis from the analysis, the Mundinia clade clustered species with a nucleotide identity value of 95%, an increasing rate of more than 10%. This result suggests that L. (V.) martiniquensis might be the most divergent species of the whole Leishmania genus.

5. Conclusions

The de novo assembly of L. (V.) guyanensis (MHOM/BR/75/M4147) provides a high-quality genomic framework that clarifies chromosomal aneuploidy and genomic plasticity within the Viannia subgenus. Comparative orthology highlights extensive evolutionary conservation and lineage-specific genetic expansions, establishing a baseline for understanding distinct biological traits. These data effectively prioritized candidate drug targets—notably from the GSK3 and calpain families—reinforcing the importance of conserved signaling and proteolytic pathways for therapeutic intervention. Furthermore, the polA1 marker proved essential for species-level validation and taxonomic resolution, while revealing high divergence within the Leishmania (Mundinia) subgenus. By integrating advanced genomic and phylogenetic workflows, this study establishes a robust foundation to accelerate the development of precise diagnostic tools and targeted treatments for leishmaniasis in the Americas.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/pathogens15010124/s1.

Author Contributions

Conceptualization, L.M.G., E.C.S.J. and L.G.A.C. Project administration, L.M.G.; Formal analysis, E.C.S.J., C.C.C. and L.G.A.C.; Software, E.C.S.J.; Data curation, E.C.S.J.; Methodology, E.C.S.J., F.R.d.S., C.C.C., M.A.F.d.I. and L.G.A.C. Investigation, E.C.S.J., F.S.A., F.R.d.S., C.C.C., M.A.F.d.I. and L.G.A.C.; Supervision, L.M.G., E.C.S.J. and F.S.A.; Writing—original draft, L.G.A.C.; Writing—review and editing, F.S.A., L.M.G. and L.G.A.C.; Visualization—F.S.A., L.M.G., L.G.A.C. and E.C.S.J. All authors have read and agreed to the published version of the manuscript.

Funding

Financier of studies and projects (FINEP), Brazil, agreement number 01.22.0495.00; Coordination for the Improvement of Higher Education Personnel (CAPES), Brazil, within the scope of Notice no. 13/2020—Postgraduate Development Program—Process: 88887.919409/2023-00.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Sequence data that support the findings of this study have been deposited in the GenBank database under the accession number GCA_051201275.1. The data is publicly available at https://www.ncbi.nlm.nih.gov/datasets/genome/GCA_051201275.1/ (accessed on 12 January 2026). All additional data supporting the findings of this study are provided within the article and its Supplementary Materials.

Acknowledgments

The authors acknowledge the following organizations in Brazil and Panama for supporting researchers and students involved in this study: Programa de Pós-Graduação Bionorte—Rede de Biodiversidade e Biotecnologia da Amazônia, Universidade Federal do Pará, Brasil; Programa Institucional de Bolsas de Iniciação Científica (Conselho Nacional de Desenvolvimento Científico e Tecnológico—CNPq), Instituto Evandro Chagas, Brasil; Programa de Pós-graduação em Biologia Parasitária na Amazônia, Universidade do Estado do Pará, Brasil. Instituto Conmemorativo Gorgas de Estudios de la Salud, Panama and Sistema Nacional de Investigación, Panama.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

CLcutaneous leishmaniasis
MLmucocutaneous leishmaniasis
MEEmultilocus enzyme electrophoresis
VLvisceral leishmaniasis
ORFopen reading frames
MLmaximum likelihood
3Dthree-dimensional
SNPsingle nucleotide polymorphisms

References

  1. Alencar, R.B.; Justiniano, S.C.B.; Scarpassa, V.M. Morphological Description of the Immature Stages of Nyssomyia umbratilis (Ward & Frahia) (Diptera: Psychodidae: Phlebotominae), the Main Vector of Leishmania guyanensis Floch (Kinetoplastida: Trypanosomatidae) in the Brazilian Amazon Region. Neotrop. Entomol. 2018, 47, 668–680. [Google Scholar] [CrossRef] [PubMed]
  2. Galati, E.A.B.; Rodrigues, B.L. A Review of Historical Phlebotominae Taxonomy (Diptera: Psychodidae). Neotrop. Entomol. 2023, 52, 539–559. [Google Scholar] [CrossRef] [PubMed]
  3. Akhoundi, M.; Kuhls, K.; Cannet, A.; Votýpka, J.; Marty, P.; Delaunay, P.; Sereno, D. A Historical Overview of the Classification, Evolution, and Dispersion of Leishmania Parasites and Sandflies. PLoS Negl. Trop. Dis. 2016, 10, e0004349. [Google Scholar] [CrossRef] [PubMed]
  4. Espinosa, O.A.; Serrano, M.G.; Camargo, E.P.; Texeira, M.M.G.; Shaw, J. An appraisal of the taxonomy and nomenclature of trypanosomatids presently classified as Leishmania and Endotrypanum. Parasitology 2018, 145, 430–442. [Google Scholar] [CrossRef]
  5. Medina, J.; Cruz-Saavedra, L.; Patiño, L.H.; Muñoz, M.; Ramírez, J.D. Comparative analysis of the transcriptional responses of five Leishmania species to trivalent antimony. Parasit. Vectors 2021, 14, 419. [Google Scholar] [CrossRef]
  6. Cupolillo, E.; Grimaldi, G., Jr.; Momen, H. A general classification of New World Leishmania using numerical zymotaxonomy. Am. J. Trop. Med. Hyg. 1994, 50, 296–311. [Google Scholar] [CrossRef]
  7. Carvalho, A.M.; Barral, A.; Costa, J.M.; Bittencourt, A.; Marsden, P. Clinical and immunopathological aspects of disseminated cutaneous leishmaniasis. Acta Trop. 1994, 56, 315–325. [Google Scholar] [CrossRef]
  8. Grimaldi, G., Jr.; McMahan-Pratt, D. Leishmaniasis and its etiologic agents in the New World: An overview. Prog. Clin. Parasitol. 1991, 2, 73–118. [Google Scholar]
  9. Miranda, A.D.C.; González, K.A.; Samudio, F.; Pineda, V.J.; Calzada, J.E.; Capitan-Barrios, Z.; Jiménez, A.; Castillo, J.; Mendoza, Y.; Suárez, J.A.; et al. Molecular Identification of Parasites Causing Cutaneous Leishmaniasis in Panama. Am. J. Trop. Med. Hyg. 2021, 104, 1326–1334. [Google Scholar] [CrossRef]
  10. Rodríguez-Barraquer, I.; GónGora, R.; Prager, M.; Pacheco, R.; Montero, L.M.; Navas, A.; Ferro, C.; Miranda, M.C.; Saravia, N.G. Etiologic agent of an epidemic of cutaneous leishmaniasis in Tolima, Colombia. Am. J. Trop. Med. Hyg. 2008, 78, 276–282. [Google Scholar] [CrossRef]
  11. Rotureau, B.; Ravel, C.; Nacher, M.; Couppié, P.; Curtet, I.; Dedet, J.-P.; Carme, B. Molecular epidemiology of Leishmania (Viannia) guyanensis in French Guiana. J. Clin. Microbiol. 2006, 44, 468–473. [Google Scholar] [CrossRef] [PubMed]
  12. Couto, D.V.; Filho, G.H.; Medeiros, M.Z.; Vicari, C.F.S.; Barbosa, A.B.; Takita, L.C. American tegumentary leishmaniasis—A case of therapeutic challenge. An. Bras. Dermatol. 2014, 89, 974–976. [Google Scholar] [CrossRef] [PubMed]
  13. Neves, L.O.; Talhari, A.C.; Gadelha, E.P.N.; Júnior, R.M.d.S.; Guerra, J.A.d.O.; Ferreira, L.C.d.L.; Talhari, S. A randomized clinical trial comparing meglumine antimoniate, pentamidine and amphotericin B for the treatment of cutaneous leishmaniasis by Leishmania guyanensis. An. Bras. Dermatol. 2011, 86, 1092–1101. [Google Scholar] [CrossRef] [PubMed]
  14. Monte-Neto, R.; Laffitte, M.-C.N.; Leprohon, P.; Reis, P.; Frézard, F.; Ouellette, M. Intrachromosomal amplification, locus deletion and point mutation in the aquaglyceroporin AQP1 gene in antimony resistant Leishmania (Viannia) guyanensis. PLoS Negl. Trop. Dis. 2015, 9, e0003476. [Google Scholar] [CrossRef]
  15. Almeida, A.N.F.; Nascimento, L.d.C.S.D.; Sousa, E.S.M.d.M.; de Oliveira, A.J.D.; de Sena, M.G.; de Resende, B.M.; Chaves, R.C.G.; Garcez, L.M. Surveillance of cutaneous leishmaniasis in clinical samples: Distribution of Leishmania guyanensis in the state of Amapá, Brazil, 2018. Epidemiol. Serv. Saude 2020, 17, e2018504. [Google Scholar] [CrossRef]
  16. Llanes, A.; Restrepo, C.M.; Del Vecchio, G.; Anguizola, F.J.; Lleonart, R. The genome of Leishmania panamensis: Insights into genomics of the L. (Viannia) subgenus. Sci. Rep. 2015, 5, 8550. [Google Scholar] [CrossRef]
  17. McDonagh, P.D.; Myler, P.J.; Stuart, K. The unusual gene organization of Leishmania major chromosome 1 may reflect novel transcription processes. Nucleic Acids Res. 2000, 28, 2800–2803. [Google Scholar] [CrossRef]
  18. Myler, P.J.; Beverley, S.M.; Cruz, A.K.; Dobson, D.E.; Ivens, A.C.; McDonagh, P.D.; Madhubala, R.; Martinez-Calvillo, S.; Ruiz, J.C.; Saxena, A.; et al. The Leishmania genome project: New insights into gene organization and function. Med. Microbiol. Immunol. 2001, 190, 9–12. [Google Scholar] [CrossRef]
  19. Ivens, A.C.; Peacock, C.S.; Worthey, E.A.; Murphy, L.; Aggarwal, G.; Berriman, M.; Sisk, E.; Rajandream, M.-A.; Adlem, E.; Aert, R.; et al. The genome of the kinetoplastid parasite, Leishmania major. Science 2005, 15, 436–442. [Google Scholar] [CrossRef]
  20. Valdivia, H.O.; Reis-Cunha, J.L.; Rodrigues-Luiz, G.F.; Baptista, R.P.; Baldeviano, G.C.; Gerbasi, R.V.; Dobson, D.E.; Pratlong, F.; Bastien, P.; Lescano, A.G.; et al. Comparative genomic analysis of Leishmania (Viannia) peruviana and Leishmania (Viannia) braziliensis. BMC Genom. 2015, 16, 715. [Google Scholar] [CrossRef]
  21. Borges, A.F.; Gomes, R.S.; Ribeiro-Dias, F. Leishmania (Viannia) guyanensis in tegumentary leishmaniasis. Pathog. Dis. 2018, 76, fty025. [Google Scholar] [CrossRef]
  22. Chen, S.; Zhou, Y.; Gu, J. Fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef] [PubMed]
  23. Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data. 2010. Available online: http://www.bioinformatics.babraham.ac.uk/projects/fastqc (accessed on 15 January 2026).
  24. Li, D.; Liu, C.M.; Luo, R.; Sadakane, K.; Lam, T.W. MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 2015, 31, 1674–1676. [Google Scholar] [CrossRef] [PubMed]
  25. Boetzer, M.; Henkel, C.V.; Jansen, H.J.; Butler, D.; Pirovano, W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 2010, 27, 578–579. [Google Scholar] [CrossRef] [PubMed]
  26. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef]
  27. Wood, D.E.; Lu, J.; Langmead, B. Improved metagenomic analysis with Kraken 2. Genome Biol. 2019, 20, 257. [Google Scholar] [CrossRef]
  28. Breitwieser, F.P.; Salzberg, S.L. PAVIAN: Interactive analysis of metagenomics data for microbiome studies and pathogen identification. Bioinformatics 2020, 36, 1303–1304. [Google Scholar] [CrossRef]
  29. Stanke, M.; Steinkamp, R.; Morgenstern, B.; Waack, S. AUGUSTUS: A web server for gene prediction in eukaryotes. Nucleic Acids Res. 2004, 32, W309–W312. [Google Scholar] [CrossRef]
  30. Kearse, M.; Moir, R.; Wilson, A.; Stones-Havas, S.; Cheung, M.; Sturrock, S.; Buxton, S.; Cooper, A.; Markowitz, S.; Duran, C.; et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, 28, 1647–1649. [Google Scholar] [CrossRef]
  31. Lavigne, R.; Seto, D.; Mahadevan, P.; Ackerman, H.W.; Kropinski, A.M. Unifying classical and molecular taxonomic classification: Analysis of the Podoviridae using BLASTP-based tools. Res. Microbiol. 2008, 159, 406–414. [Google Scholar] [CrossRef]
  32. Dusa, A. Venn: A Close to Zero Dependency Package to Draw and Display Venn Diagrams up to 7 Sets, and Any Boolean Union of Set Interceptions. 2024. Available online: https://cran.r-project.org/web/packages/venn/index.html (accessed on 15 January 2026).
  33. Camacho, C.; Coulouris, G.; Avagyan, V.; Ma, N.; Papadopoulos, J.; Bealer, K.; Madden, T.L. BLAST+: Architecture and applications. BMC Bioinform. 2009, 10, 421. [Google Scholar] [CrossRef] [PubMed]
  34. Pearson, W.R. An Introduction to Sequence Similarity (“Homology”) Searching. Curr. Protoc. Bioinform. 2013, 42, 3.1.1–3.1.8. [Google Scholar] [CrossRef] [PubMed]
  35. Langmead, B.; Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 2012, 9, 357–359. [Google Scholar] [CrossRef] [PubMed]
  36. Bonfield, J.K.; Marshall, J.; Danecek, P.; Li, H.; Ohan, V.; Whitwham, A.; Keane, T.; Davies, R.M. HTSlib: C library for reading/writing high-throughput sequencing data. Gigascience 2021, 10, giab007. [Google Scholar] [CrossRef]
  37. Schmidt, H.A.; Strimmer, K.; Vingron, M.; von Haeseler, A. TREE-PUZZLE: Maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics 2002, 18, 502–504. [Google Scholar] [CrossRef]
  38. Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; von Haeseler, A.; Lanfear, R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 2020, 37, 1530–1534. [Google Scholar] [CrossRef]
  39. Coughlan, S.; Taylor, A.S.; Feane, E.; Sanders, M.; Schonian, G.; Cotton, J.A.; Downing, T. Leishmania naiffi and Leishmania guyanensis reference genomes highlight genome structure and gene evolution in the Viannia subgenus. R. Soc. Open Sci. 2018, 5, 172212. [Google Scholar] [CrossRef]
  40. Długosz, M.; Deorowicz, S. Illumina reads correction: Evaluation and improvements. Sci. Rep. 2024, 14, 2232. [Google Scholar] [CrossRef]
  41. Peacock, C.S.; Seeger, K.; Harris, D.; Murphy, L.; Ruiz, J.C.; Quail, M.A.; Peters, N.; Adlem, E.; Tivey, A.; Aslett, M.; et al. Comparative genomic analysis of three Leishmania species that cause diverse human diseases. Nat. Genet. 2007, 9, 839–847. [Google Scholar] [CrossRef]
  42. Real, F.; Vidal, R.O.; Carazzolle, M.F.; Mondego, J.M.C.; Costa, G.G.L.; Herai, R.H.; Würtele, M.; de Carvalho, L.M.; e Ferreira, R.C.; Mortara, R.A.; et al. The genome sequence of Leishmania (Leishmania) amazonensis: Functional annotation and extended analysis of gene models. DNA Res. 2013, 20, 567–581. [Google Scholar] [CrossRef]
  43. Dumetz, F.; Imamura, H.; Sanders, M.; Seblova, V.; Myskova, J.; Pescher, P.; Vanaerschot, M.; Meehan, C.J.; Cuypers, B.; De Muylder, G.; et al. Modulation of aneuploidy in Leishmania donovani during adaptation to different in vitro and in vivo environments and its impact on gene expression. mBio 2017, 8, e00599-17. [Google Scholar] [CrossRef]
  44. Mannaert, A.; Downing, T.; Imamura, H.; Dujardin, J.C. Adaptive mechanisms in pathogens: Universal aneuploidy in Leishmania. Trends Parasitol. 2012, 28, 370–376. [Google Scholar] [CrossRef]
  45. Ubeda, J.-M.; Raymond, F.; Berriman, M.; Varkey, R.; Brunton, A.E.; Schertzer, M.; Leprohon, P.; Papadopoulou, B.; Barry, J.D.; Ouellette, M. Genome-wide stochastic adaptive DNA amplification at direct and inverted DNA repeats in the parasite Leishmania. PLoS Biol. 2014, 12, e1001868. [Google Scholar] [CrossRef]
  46. Leprohon, P.; Legaré, D.; Raymond, F.; Madore, É.; Hardiman, G.; Corbeil, J.; Ouellette, M. Gene expression modulation is associated with gene amplification, supernumerary chromosomes and chromosome loss in antimony-resistant Leishmania infantum. Nucleic Acids Res. 2009, 37, 1387–1399. [Google Scholar] [CrossRef] [PubMed]
  47. Black, J.A.; Reis-Cunha, J.L.; Cruz, A.K.; Tosi, L.R.O. Life in plastic, it’s fantastic! How Leishmania exploit genome instability to shape gene expression. Front. Cell. Infect. Microbiol. 2023, 13, 1102462. [Google Scholar] [CrossRef] [PubMed]
  48. Uliana, S.R.B.; Ruiz, J.C.; Cruz, A.K. Leishmania Genomics: Where do we stand? In Bioinformatics in Tropical Disease Research: A Practical and Case-Study Approach; Gruber, A., Durham, A.M., Huynh, C., del Portillo, H.A., Eds.; National Center for Biotechnology Information (US): Bethesda, MD, USA, 2008. Available online: https://www.ncbi.nlm.nih.gov/books (accessed on 15 January 2026).
  49. Agabian, N. Trans splicing of nuclear pre-mRNAs. Cell 1990, 61, 1157–1160. [Google Scholar] [CrossRef] [PubMed]
  50. Myler, P.J.; Audleman, L.; Devos, T.; Hixson, G.; Kiser, P.; Lemley, C.; Magness, C.; Rickel, E.; Sisk, E.; Sunkin, S.; et al. Leishmania major Friedlin chromosome 1 has an unusual distribution of protein-coding genes. Proc. Natl. Acad. Sci. USA 1999, 96, 2902–2906. [Google Scholar] [CrossRef]
  51. Kipandula, W.; Young, S.A.; MacNeill, S.A.; Smith, T.K. Screening of the MMV and GSK open access chemical boxes using a viability assay developed against the kinetoplastid Crithidia fasciculata. Mol. Biochem. Parasitol. 2018, 222, 61–69. [Google Scholar] [CrossRef]
  52. Ojo, K.K.; Gillespie, J.R.; Riechers, A.J.; Napuli, A.J.; Verlinde, C.L.; Buckner, F.S.; Gelb, M.H.; Domostoj, M.M.; Wells, S.J.; Scheer, A.; et al. Glycogen synthase kinase 3 is a potential drug target for African trypanosomiasis therapy. Antimicrob. Agents Chemother. 2008, 52, 3710–3717. [Google Scholar] [CrossRef]
  53. Martínez de Iturrate, P.; Sebastián-Pérez, V.; Nácher-Vázquez, M.; Tremper, C.S.; Smirlis, D.; Martín, J.; Martínez, A.; Campillo, N.E.; Rivas, L.; Gil, C. Towards discovery of new leishmanicidal scaffolds able to inhibit Leishmania GSK-3. J. Enzyme Inhib. Med. Chem. 2020, 35, 199–210. [Google Scholar] [CrossRef]
  54. Ojo, K.K.; Arakaki, T.L.; Napuli, A.J.; Inampudi, K.K.; Keyloun, K.R.; Zhang, L.; Hol, W.G.J.; Verlinde, C.L.M.J.; Merritt, E.A.; Van Voorhis, W.C. Structure determination of glycogen synthase kinase-3 from Leishmania major and comparative inhibitor structure-activity relationships with Trypanosoma brucei GSK-3. Mol. Biochem. Parasitol. 2011, 176, 98–108. [Google Scholar] [CrossRef]
  55. Efstathiou, A.; Smirlis, D. Leishmania protein kinases: Important regulators of the parasite life cycle and molecular targets for treating leishmaniasis. Microorganisms 2021, 9, 691. [Google Scholar] [CrossRef] [PubMed]
  56. Marinho, F.A.; Sangenito, L.S.; Oliveira, S.S.; Arruda, L.B.; D’Ávila-Levy, C.M.; Santos, A.L.; Branquinha, M.H. The potent cell-permeable calpain inhibitor MDL28170 affects the interaction of Leishmania amazonensis with macrophages and shows anti-amastigote activity. Parasitol. Int. 2017, 66, 579–583. [Google Scholar] [CrossRef] [PubMed]
  57. Ono, Y.; Saido, T.; Sorimachi, H. Calpain research for drug discovery: Challenges and potential. Nat. Rev. Drug Discov. 2016, 15, 854–876. [Google Scholar] [CrossRef] [PubMed]
  58. Branquinha, M.H.; Marinho, F.A.; Sangenito, L.S.; Oliveira, S.S.; Gonçalves, K.C.; Ennes-Vidal, V.; D’Ávila-Levy, C.M.; Santos, A.L. Calpains: Potential targets for alternative chemotherapeutic intervention against human pathogenic trypanosomatids. Curr. Med. Chem. 2013, 20, 3174–3185. [Google Scholar] [CrossRef]
  59. Diaz, A.P.; Meneses Canal, C.A.; Valdés, Á.J.; Giraldo Delgado, J.E.; Varela-M, R.E. GSK-3 Kinase a Putative Therapeutic Target in Trypanosomatid Parasites. Braz. J. Infect. Dis. 2024, 28, 103736. [Google Scholar] [CrossRef]
  60. Zwickl, D.J.; Hillis, D.M. Increased taxon sampling greatly reduces phylogenetic error. Syst. Biol. 2002, 51, 588–598. [Google Scholar] [CrossRef]
  61. Noyes, H.; Pratlong, F.; Chance, M.; Ellis, J.; Lanotte, G.; Dedet, J.-P. A previously unclassified trypanosomatid responsible for human cutaneous lesions in Martinique (French West Indies) is the most divergent member of the genus Leishmania ss. Parasitology 2002, 124, 17–24. [Google Scholar] [CrossRef]
  62. Tsokana, C.N.; Sokos, C.; Giannakopoulos, A.; Mamuris, Z.; Birtsas, P.; Papaspyropoulos, K.; Valiakos, G.; Spyrou, V.; Lefkaditis, M.; Chatzopoulos, D.C.; et al. First evidence of Leishmania infection in European brown hare (Lepus europaeus) in Greece: GIS analysis and phylogenetic position within the Leishmania spp. Parasitol. Res. 2016, 115, 313–321. [Google Scholar] [CrossRef]
  63. Shaw, J.J.; Lainson, R.; Ryan, L.; Braga, R.; McMahon-Pratt, D.; David, J. The identification of Leishmania braziliensis braziliensis in wild-caught neotropical sandflies using monoclonal antibodies. Trans. R. Soc. Trop. Med. Hyg. 1987, 81, 69–72. [Google Scholar] [CrossRef]
  64. Bray, R.S. Leishmania . Annu. Rev. Microbiol. 1974, 28, 189–217. [Google Scholar] [CrossRef] [PubMed]
  65. Lumsden, W.H. Letter: Biochemical taxonomy of Leishmania. Trans. R. Soc. Trop. Med. Hyg. 1974, 68, 74–75. [Google Scholar] [CrossRef]
  66. Fraga, J.; Montalvo, A.M.; De Doncker, S.; Dujardin, J.C.; Van der Auwera, G. Phylogeny of Leishmania species based on the heat-shock protein 70 gene. Infect. Genet. Evol. 2010, 10, 238–245. [Google Scholar] [CrossRef]
  67. Hoyos, J.; Rosales-Chilama, M.; León, C.; González, C.; Gómez, M.A. Sequencing of hsp70 for discernment of species from the Leishmania (Viannia) guyanensis complex from endemic areas in Colombia. Parasit. Vectors 2022, 15, 406. [Google Scholar] [CrossRef]
  68. Akhoundi, M.; Downing, T.; Votýpka, J.; Kuhls, K.; Lukeš, J.; Cannet, A.; Ravel, C.; Marty, P.; Delaunay, P.; Kasbari, M.; et al. Leishmania infections: Molecular targets and diagnosis. Mol. Aspects Med. 2017, 57, 1–29. [Google Scholar] [CrossRef]
  69. Maurício, I.L.; Stothard, J.R.; Miles, M.A. The strange case of Leishmania chagasi. Parasitol. Today 2000, 16, 188–189. [Google Scholar] [CrossRef]
  70. Lukes, J.; Mauricio, I.L.; Schönian, G.; Dujardin, J.-C.; Soteriadou, K.; Dedet, J.-P.; Kuhls, K.; Tintaya, K.W.Q.; Jirků, M.; Chocholová, E.; et al. Evolutionary and geographical history of the Leishmania donovani complex with a revision of current taxonomy. Proc. Natl. Acad. Sci. USA 2007, 104, 9375–9380. [Google Scholar] [CrossRef]
  71. Cupolillo, E.; Boité, M.C.; Porrozzi, R. Considerações sobre taxonomia do gênero Leishmania. In Leishmanioses do Continente Americano, 1st ed.; Conceição-Silva, F., ALVES, C.R., Eds.; Editora Fiocruz: Rio de Janeiro, Brazil, 2014; pp. 39–51. [Google Scholar]
Figure 1. Analysis of chromosomal ploidy in Leishmania guyanensis (MHOM/BR/75/M4147) strain, demonstrating aneuploidy. Chromosomal dosage increase is observed, with emphasis on chromosomes 2 and 8, which exhibit conditions approaching trisomy.
Figure 1. Analysis of chromosomal ploidy in Leishmania guyanensis (MHOM/BR/75/M4147) strain, demonstrating aneuploidy. Chromosomal dosage increase is observed, with emphasis on chromosomes 2 and 8, which exhibit conditions approaching trisomy.
Pathogens 15 00124 g001
Figure 2. Principal functional proteins identified in the genomic annotation of Leishmania guyanensis. The list presents the twenty proteins associated with the highest number of corresponding genes, highlighting functionally relevant components within the species’ genome.
Figure 2. Principal functional proteins identified in the genomic annotation of Leishmania guyanensis. The list presents the twenty proteins associated with the highest number of corresponding genes, highlighting functionally relevant components within the species’ genome.
Pathogens 15 00124 g002
Figure 3. Venn diagram comparing shared genes among Leishmania major, L. panamensis, L. guyanensis, and L. shawi. The area overlap indicates the number of genes common to the species, highlighting both species-specific (exclusive) genes and those shared across each group.
Figure 3. Venn diagram comparing shared genes among Leishmania major, L. panamensis, L. guyanensis, and L. shawi. The area overlap indicates the number of genes common to the species, highlighting both species-specific (exclusive) genes and those shared across each group.
Pathogens 15 00124 g003
Figure 4. Graphical representation of the normalized quality scores of candidate drug target genes identified in the L. (V.) guyanensis genome. Genes belonging to the GSK3 family exhibit predominantly higher scores, with gene g7485.t1_1 standing out as the most conserved and structurally robust candidate, whereas the remaining genes show intermediate scores associated with partial alignments or moderate sequence identity.
Figure 4. Graphical representation of the normalized quality scores of candidate drug target genes identified in the L. (V.) guyanensis genome. Genes belonging to the GSK3 family exhibit predominantly higher scores, with gene g7485.t1_1 standing out as the most conserved and structurally robust candidate, whereas the remaining genes show intermediate scores associated with partial alignments or moderate sequence identity.
Pathogens 15 00124 g004
Figure 5. Phylogenetic tree based on the polA1 gene sequence, illustrating the phylogenetic groupings within the Leishmania genus. The tree organizes the species into hierarchical levels, including subgenera, complexes, and species. * MHOM/BR/1975/M4147 L. guyanensis refers to the strain sequenced in the present study.
Figure 5. Phylogenetic tree based on the polA1 gene sequence, illustrating the phylogenetic groupings within the Leishmania genus. The tree organizes the species into hierarchical levels, including subgenera, complexes, and species. * MHOM/BR/1975/M4147 L. guyanensis refers to the strain sequenced in the present study.
Pathogens 15 00124 g005
Table 1. Comparative Analysis of Leishmania spp. Genomic Assembly Metrics. The table summarizes key assembly quality parameters, including number of contigs, N50 value, number of scaffolds, presence of gaps, and read coverage, among others. These metrics are used to assess the integrity and completeness of the assemblies, informing the optimal strategy for downstream genomic studies in the Leishmania genus.
Table 1. Comparative Analysis of Leishmania spp. Genomic Assembly Metrics. The table summarizes key assembly quality parameters, including number of contigs, N50 value, number of scaffolds, presence of gaps, and read coverage, among others. These metrics are used to assess the integrity and completeness of the assemblies, informing the optimal strategy for downstream genomic studies in the Leishmania genus.
Assembled Genomes of Leishmania spp.
SpeciesL. guyanensis (M4147)L. guyanensis (M4147)L. guyanensis (LgCL085)L. naiffi
(LnCL233)
L. panamensis (PSC-1)L. braziliensis
(M2904)
L. infantum
(JPCM5)
L. major
Friedlin Strain
Contigs14,0973510,30814,68236
N50
contigs (Kb *)
47431.19.65.7
Scaffolds14,097352,86.53108
N50
Scaffold (Kb)
47431.195.424.3674
Nº of Gaps0015573853553876440
Average
Coverage *
40.85×27×56×36×30×
Genome
Size (Mb *)
31.5632.531.0130.3430.6931.2431.9232.8
G + C (%) *57.445857.5657.7259.5859.7
Protein
Coding Genes
71928.2381047748817581998272
Total Nº Genes76988376826279338.378.268341
ReferenceCurrent studyGenbank[39][39][16][16][16])[19]
* Kb: kilobases; Mb: megabases; G + C (%): Guanine-Cytosine content percentage; N50: sequence length such that 50% of the total assembly is contained in sequences of that length or longer; Average Coverage: mean number of times each nucleotide is sequenced; (—): data not available or not applicable for the specific assembly version. Genomes obtained from the current study are compared with representative strains from GenBank and literature references.
Table 2. Candidate genes were prioritized using BLASTp homology analysis against reference drug targets (GSK3 and calpain). The table summarizes standard alignment metrics (pident, alignment length, e-value, and bitscore) and a combined quality score utilized for ranking. This score integrates percentage identity, alignment coverage, statistical significance, and bitscore.
Table 2. Candidate genes were prioritized using BLASTp homology analysis against reference drug targets (GSK3 and calpain). The table summarizes standard alignment metrics (pident, alignment length, e-value, and bitscore) and a combined quality score utilized for ranking. This score integrates percentage identity, alignment coverage, statistical significance, and bitscore.
GeneTargetPidentLengthE-ValueBitscoreTarget FamilyQuality_Score
g7485.t1_1GSK392,9583550701GSK30.968310746478873
g2584.t1_1GSK341,176682.92 × 10−1050.1GSK30.63774951944162
g7339.t1_1GSK352,5931351.57 × 10−41144GSK30.588149574074074
g7477.t1_1Calpain38,889723.43 × 10−1052CALPAIN0.569850062043589
g2667.t1_1GSK335,1351119.01 × 10−1562GSK30.568017545045045
g3420.t1_1GSK335,2941021.22 × 10−1359.7GSK30.564705470588235
g6383.t1_1GSK346,753774.53 × 10−1464.3GSK30.556492642857143
g6725.t1_1GSK336,905842.53 × 10−1256.2GSK30.517262738095238
g491.t1_1GSK3371005.58 × 10−1357.4GSK30.4695
g7453.t1_1GSK335,6521153.13 × 10−1154.7GSK30.368260260869565
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Costa, L.G.A.; Sousa Junior, E.C.; Cardoso, C.C.; Igreja, M.A.F.d.; Acosta, F.S.; Silva, F.R.d.; Garcez, L.M. Resequencing and De Novo Assembly of Leishmania (Viannia) guyanensis from Amazon Region: Genome Assessment, Phylogenetic Insights and Therapeutic Targets. Pathogens 2026, 15, 124. https://doi.org/10.3390/pathogens15010124

AMA Style

Costa LGA, Sousa Junior EC, Cardoso CC, Igreja MAFd, Acosta FS, Silva FRd, Garcez LM. Resequencing and De Novo Assembly of Leishmania (Viannia) guyanensis from Amazon Region: Genome Assessment, Phylogenetic Insights and Therapeutic Targets. Pathogens. 2026; 15(1):124. https://doi.org/10.3390/pathogens15010124

Chicago/Turabian Style

Costa, Lucas George Assunção, Edivaldo Costa Sousa Junior, Camila Cristina Cardoso, Millena Arnaud Franco da Igreja, Franklyn Samudio Acosta, Fabiano Reis da Silva, and Lourdes Maria Garcez. 2026. "Resequencing and De Novo Assembly of Leishmania (Viannia) guyanensis from Amazon Region: Genome Assessment, Phylogenetic Insights and Therapeutic Targets" Pathogens 15, no. 1: 124. https://doi.org/10.3390/pathogens15010124

APA Style

Costa, L. G. A., Sousa Junior, E. C., Cardoso, C. C., Igreja, M. A. F. d., Acosta, F. S., Silva, F. R. d., & Garcez, L. M. (2026). Resequencing and De Novo Assembly of Leishmania (Viannia) guyanensis from Amazon Region: Genome Assessment, Phylogenetic Insights and Therapeutic Targets. Pathogens, 15(1), 124. https://doi.org/10.3390/pathogens15010124

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop