Next Article in Journal
The Autophagic Activator GHF-201 Can Alleviate Pathology in a Mouse Model and in Patient Fibroblasts of Type III Glycogenosis
Previous Article in Journal
Biophysical Analysis of a Minimalistic Kidney Model Expressing SGLT1 Reveals Crosstalk between Luminal and Lateral Membranes and a Plausible Mechanism of Isosmotic Transport
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Complete Plastid Genome Sequences of Four Salsoleae s.l. Species: Comparative and Phylogenetic Analyses

by
Shyryn Almerekova
1,2,
Moldir Yermagambetova
1,
Bektemir Osmonali
3,
Polina Vesselova
3,
Yerlan Turuspekov
1,2 and
Saule Abugalieva
1,2,*
1
Molecular Genetics Laboratory, Institute of Plant Biology and Biotechnology, Almaty 050040, Kazakhstan
2
Faculty of Biology and Biotechnology, Al-Farabi Kazakh National University, Almaty 050040, Kazakhstan
3
Laboratory of the Flora of Higher Plants, Institute of Botany and Phytointroduction, Almaty 050040, Kazakhstan
*
Author to whom correspondence should be addressed.
Biomolecules 2024, 14(8), 890; https://doi.org/10.3390/biom14080890
Submission received: 11 June 2024 / Revised: 19 July 2024 / Accepted: 22 July 2024 / Published: 24 July 2024
(This article belongs to the Section Molecular Genetics)

Abstract

:
The taxonomic classification of the genera Salsola L., Pyankovia Akhani and Roalson, and Xylosalsola Tzvelev within Chenopodiaceae Vent. (Amaranthaceae s.l.) remains controversial, with the precise number of species within these genera still unresolved. This study presents a comparative analysis of the complete plastid genomes of S. foliosa, S. tragus, P. affinis, and X. richteri species collected in Kazakhstan. The assembled plastid genomes varied in length, ranging from 151,177 bp to 152,969 bp for X. richteri and S. tragus. These genomes contained 133 genes, of which 114 were unique, including 80 protein-coding, 30 tRNA, and 4 rRNA genes. Thirteen regions, including ndhC-ndhD, rps16-psbK, petD, rpoC2, ndhA, petB, clpP, atpF, ycf3, accD, ndhF-ndhG, matK, and rpl20-rpl22, exhibited relatively high levels of nucleotide variation. A total of 987 SSRs were detected across the four analyzed plastid genomes, primarily located in the intergenic spacer regions. Additionally, 254 repeats were identified, including 92 tandem repeats, 88 forward repeats, 100 palindromic repeats, and only one reverse repeat. A phylogenetic analysis revealed clear clustering into four clusters corresponding to the Salsoleae and Caroxyloneae tribe clades. These nucleotide sequences obtained in this study represent a valuable resource for future phylogenetic analyses within the Salsoleae s.l. tribe.

1. Introduction

The Chenopodiaceae Vent. (Amaranthaceae s.l.) family, also known as the goosefoot family, encompasses diverse plant species distributed worldwide, particularly in arid and semi-arid regions. Chenopodiaceae is one of the largest families, comprising approximately 110 genera and over 1700 species [1]. The representatives of this family play significant ecological roles as pioneer colonizers of disturbed habitats and providers of forage for wildlife and livestock [2]. One of the largest and most economically important genera within the family is Salsola L. [3]. The precise number of Salsola species worldwide is uncertain, with estimates ranging from 130 to 150, depending on the different taxonomic revisions [3,4,5,6,7]. These species are widely distributed across desert and semi-desert regions spanning Central Asia, the Middle East, Africa, and Europe [8,9]. The taxonomy of the tribe Salsoleae s.l., which includes the genus Salsola, remains controversial, and the exact number of genera within the tribe is still under debate [10]. Akhani et al. [10] revised the classification of Salsoleae s.l. based on nuclear ribosomal internal transcribed spacer (ITS) and chloroplast psbB-psbH markers. Their findings revealed that three genera are newly described (Pyankovia, Kaviria, and Turania), while four previously recognized genera (Caroxylon, Climacoptera, Kali, and Xylosalsola) have been resurrected [10]. Another extensive phylogenetic study focused on Salsoleae s.l. taxa from northwestern China, and employed nucleotide sequences from ITS, psbB–psbH, and rbcL [11]. The authors [11] reported that the tribe Salsoleae s.l. is monophyletic and consists of three distinct monophyletic subunits: Caroxyloneae, the Kali clade, and s.str. The genus Xylosalsola Tzvelev was formerly considered a section within Salsola before being recognized as an independent genus [10]. According to POWO (Plants of the World Online) [12], Xylosalsola is represented by four accepted species, with its native range extending from South European Russia to Mongolia and Pakistan. The newly recognized genus Pyankovia Akhani and Roalson is naturally found across territories from Krym to Mongolia and Afghanistan [12], comprising three distinct species. The species within this genus were previously classified as belonging to Climacoptera Botsch. [10]. The species from these genera play a dominant role in forming various plant communities in dry climates [13,14]. They demonstrate adaptability to different environmental conditions, such as saline soils and harsh climates, providing forage for livestock in such regions [15,16]. Furthermore, the plants of Salsola tragus L. and Xylosalsola rechteri (Moq.) Akhani and Roalson are widely used in folk medicine [17,18]. Despite their ecological, economic, and medicinal significance, ongoing research and debate persist regarding the taxonomic classification and phylogenetic relationships within these genera belonging to the tribe Salsoleae s.l.
In addition to the numerous studies on the taxonomy of Salsoleae s.l. [19,20,21,22], population genetics analyses have been conducted by employing diverse DNA markers. The genetic diversity of Salsola species populations has been investigated using inter-simple sequence repeat, or ISSR [23,24]; amplified length polymorphism, or AFLP [25]; and simple sequence repeat, or SSR [26,27], markers. In prior investigations [26,27], SSR markers derived from Beta were utilized for cross-genera amplification in Salsola species due to the absence of species-specific SSRs designed for Salsola. Furthermore, SSRs specific to Xylosalsola and Pyankovia species are currently unavailable.
Chloroplasts play a crucial role in photosynthesis in plants and possess their own genome [28]. Structurally, the genome of chloroplasts typically maintains a highly conserved quadripartite organization in angiosperms, characterized by a pair of inverted repeats (IRs) flanking a large single-copy (LSC) region and a small single-copy (SSC) region [29]. Recent advances in sequencing technologies have facilitated the rapid and cost-effective acquisition of nucleotide sequences of plastomes [30]. Plastid genomes’ polymorphic regions within nucleotide sequences can be utilized to generate DNA-barcoding markers across different taxonomic levels [31]. The SSRs extracted from plastid genomes may serve as valuable molecular markers for population and conservation genetics [32,33]. The highly conserved nature of chloroplast SSRs allows for cross-species amplification in the same family [34,35,36]. Because of its small genome size and conserved structure, the plastid genome is extensively utilized in phylogenetic analyses across various plant species [28,37].
Research on the plastid genomes of Salsola, Xylosalsola, and Pyankovia is limited, with only the plastid genome of Salsola abrotanoides having been characterized [38]. In this study, we report a comparative analysis of the complete plastid genome of four species, Salsola foliosa (L.) Schrad. ex Schult., Salsola tragus L., Pyankovia affinis (C.A.Mey. ex Schrenk) Mosyakin and Roalson, and Xylosalsola richteri (Moq.) Akhani and Roalson, collected in Kazakhstan. The obtained results may serve as a valuable resource for future studies on population genetics and phylogenetic analyses of species within the tribe Salsoleae s.l.

2. Materials and Methods

2.1. Plant Materials and DNA Extraction

Fresh leaves were harvested from mature plants of the Salsola foliosa, Salsola tragus, Pyankovia affinis, and Xylosalsola richteri species (Figure 1). Each species’ fresh leaves were promptly dehydrated using silica gel to facilitate subsequent DNA extraction. The collection location and GPS coordinates are detailed in Table 1. The specimens of S. foliosa, S. tragus, P. affinis, and X. richteri were deposited in the herbarium (AA) of the Institute of Botany and Phytointroduction. Genomic DNA was isolated from the dried leaves by utilizing the cetyltrimethylammonium bromide (CTAB) protocol [39]. The genomic DNA quality was assessed using 1% agarose gel electrophoresis and quantified using the NanoDrop™ One spectrophotometer (Thermofischer, Waltham, MA, USA).

2.2. Library Construction and De Novo Sequencing

Genomic DNA that met quality control standards was utilized for library preparation by employing the TruSeq Nano DNA Kit (Illumina Inc., San Diego, CA, USA). Subsequently, genome sequencing was conducted on the Illumina NovaSeq 6000 platform by employing a 151 bp paired-end read (Macrogen Inc., Seoul, Republic of Korea).

2.3. Genome Assembly and Annotation

The initial quality control checks on the raw sequence data were performed using FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc, accessed on 29 January 2024). The Trimmomatic 0.36 software [40] was employed to eliminate adapter sequences from the raw reads, followed by the assembly of the clean reads using NOVOPlasty 4.3.3 [41]. The assembled plastid genomes were annotated using GeSeq (https://chlorobox.mpimp-golm.mpg.de/geseq.html, accessed on 22 February 2024) [42] and based on comparisons with the plastid genomes of Salsola affinis (ON080842), Salsola abrotanoides (MW123092) [38], and Caroxylon passerinum (MW192441) [43]. A graphical map of the annotated circular plastid genome was generated using the OrganellarGenomeDRAW tool 1.3.1 (OGDRAW) [44]. The annotated plastome sequences of the analyzed species were submitted to GenBank.

2.4. Simple Sequence Repeats (SSRs) and Long Repetitive Sequence Analysis

SSRs were detected using the web-based tool MISA (https://webblast.ipk-gatersleben.de/misa/, accessed on 8 May 2024) [45], with the minimum repeat number set to 8, 4, 4, 3, 3, and 3 for mononucleotides, dinucleotides, trinucleotides, tetranucleotides, pentanucleotides, and hexanucleotides, respectively. Forward (F), reverse (R), and palindromic (P) repeats were identified using REPuter (https://bibiserv.cebitec.uni-bielefeld.de/reputer/, accessed on 8 May 2024) [46] with minimal repeats exceeding 30 bp and Hamming distances of less than 3 bp. Additionally, tandem repeats (T) were identified using the Tandem repeats finder (https://tandem.bu.edu/trf/trf.html, accessed on 10 May 2024) [47] tool with the default settings.

2.5. Nucleotide Diversity and Ka/Ks Ratio Analysis

Sequences of protein-coding genes were aligned using Geneious Prime® 2024.0.2 (https://www.geneious.com, accessed on 10 May 2024) to assess the nucleotide diversity (Pi) of the protein-coding genes across the 13 analyzed samples. Subsequently, a sliding window analysis was conducted to determine the nucleotide variability (Pi) values using DnaSP 6 [48], employing a window length of 600 bp and a step size of 200 bp. We separately isolated and aligned the 80 protein-coding genes of 13 samples to evaluate the synonymous (Ks) and nonsynonymous (Ka) substitution rates. The Ka/Ks ratios for each gene were then analyzed using DnaSP 6 [48].

2.6. Phylogenetic Analyses

A phylogenetic analysis was carried out using the nucleotide sequences of a complete plastid genome sourced from 13 samples of 12 species aligned using Geneious Prime® 2024.0.2 (https://www.geneious.com, accessed on 13 May 2024). The thirteen samples of Salsoleae s.l. were selected as ingroups, while two species, Suaeda glauca and Atriplex prostrata, were used as outgroups. A maximum likelihood (ML) phylogenetic tree was reconstructed using IQ-TREE 2.2.2.6 [49], and the best substitution model, TVM+F+I+R5, was based on the Bayesian information criterion (BIC). A Bayesian inference (BI) analysis was conducted using MrBayes 3.2.7 [50] with the parameters set as follows: ngen = 3,000,000, samplefreq = 200, and burninfrac = 0.25. The trees generated were visualized using FigTree (http://tree.bio.ed.ac.uk/software/figtree/, accessed on 15 May 2024).

3. Results

3.1. The Features of the S. foliosa, S. tragus, P. affinis, and X. richteri Plastid Genomes

A total of 24,156,836, 19,894,552, 29,541,702, and 32,239,736 filtered reads were obtained for the plastid genomes of the S. foliosa, S. tragus, P. affinis, and X. richteri species, respectively. The final yield of filtered data exceeded 16 GB for all four species. The quality values Q20 were 97.85%, 96.98%, 98.41%, and 97.58% for S. foliosa, S. tragus, P. affinis, and X. richteri, respectively. Correspondingly, the Q30 values for the same species were 92.77%, 90.74%, 94.2%, and 92.11%. The total lengths of the assembled S. foliosa, S. tragus, P. affinis, and X. richteri plastid genomes were 151,577 bp, 152,969 bp, 151,239 bp, and 151,177 bp, respectively (Figure 2).
The plastid genome sequences of the four species analyzed in this study have been submitted to the GenBank database at the NCBI (National Center for Biotechnology Information) with the following accession numbers: PP754487-PP754490. The examined plastid genome displayed a typical quadripartite structure consisting of paired inverted repeats (46,954–50,400 bp) separated by a large single-copy region (83,324–84,136 bp) and a small single-copy region (18,576–20,087). The inverted repeat regions (42.58–43.17%), large single-copy region (34.10–34.60%), and small single-copy region (28.75–29.76%) exhibited nearly identical GC content values across all four analyzed species. All four examined plastid genomes exhibited a comparable gene content and arrangement. Together, they encompassed 114 unique genes, including 80 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. Moreover, 19 duplicated genes were observed within the inverted region (Table 2).
Among the 114 unique genes identified, 17 contained introns. Specifically, there were eleven protein-coding genes (rps12, rps16, rpl16, rpoC1, atpF, ndhA, ndhB, petB, petD, clpP, and ycf3), and six tRNA genes (trnA-UGC, trnG-GCC, trnI-GAU, trnK-UUU, trnL-UAA, and trnV-UAC), as detailed in Table 3. The clpP and ycf3 genes each had two introns, while the remaining listed genes featured a single intron. Duplicated copies were detected for eight protein-coding genes (rps7, rps12, rpl2, rpl23, ndhB, ycf1, ycf2, and ycf15), seven tRNA genes (trnA-UGC, trnI-CAU, trnI-GAU, trnL-CAA, trnN-GUU, trnR-ACG, and trnV-GAC), and four rRNA genes (rrn4.5, rrn5, rrn16, and rrn23).

3.2. Repeat Sequences Analysis

A total of 239, 274, 215, and 259 SSRs were detected in the plastid genome of S. foliosa, S. tragus, P. affinis, and X. richteri, respectively. Comparing the frequency of SSRs in the analyzed nucleotide sequences of the plastid genomes revealed that mononucleotides (77.20%) were more prevalent across all the genomes, followed by dinucleotides (16.72%), tetranucleotides (3.34%), and trinucleotides (2.33%). Pentanucleotide repeats (0.20%) were exclusively found in P. affinis and X. richteri, while hexanucleotide repeats (0.20%) were present only in P. affinis and S. tragus. The majority of the mononucleotide repeats consisted of A/T, accounting for 97.64% or 744 repeats, with only 2.36% or 18 composed of C/G. Among the dinucleotide repeats, AT/AT sequences constituted 58.79% or 97 repeats, while AG/CT and AC/GT repeats constituted only 36.36% and 4.85%, respectively (Table 4).
Out of the total 987 SSRs identified, the majority were situated in the intergenic spacer regions (60.98%), followed by the protein-coding gene regions (33.33%) (Supplementary File S1). Additionally, most of the detected SSRs were found in the LSC regions rather than in the SSC or IR regions across the four plastid genomes (Figure 3).
Moreover, the sequences of the four plastid genomes revealed the presence of tandem, forward, reverse, and palindromic repeats. Among these, palindromic repeats were found to be the most prevalent type across the analyzed plastid genomes, while reverse repeats were the least common. In total, 254 repeats were identified, comprising 92 tandem repeats, 88 forward repeats, 100 palindromic repeats, and only 1 reverse repeat (Figure 4A). Repeats ranging from 30 to 39 bp were the most prevalent in all four plastid genomes, with counts of 18, 28, 17, and 26 for S. foliosa, S. tragus, P. affinis, and X. richteri, respectively. Repeats with a length of 89 base pairs or more were exclusively detected in the plastid genomes of S. foliosa, S. tragus, and X. richteri (Figure 4B).

3.3. Sliding Window Analysis

To identify the regions with a high variability among the thirteen samples of Salsoleae s.l., we compared the Pi values within coding regions using DnaSP 6. The average nucleotide diversity (Pi) across these species was calculated as 0.02426. We observed nucleotide diversity values ranging from 0 to 0.06000 within 600 base pairs through a sliding window analysis. Thirteen regions exhibited significant variability, characterized by Pi values exceeding 0.03. These regions included ndhC-ndhD, rps16-psbK, petD, rpoC2, ndhA, petB, clpP, atpF, ycf3, accD, ndhF-ndhG, matK, and rpl20-rpl22 (Figure 5). The rpl20-rpl22 region displayed the highest nucleotide variation (0.06000), as shown in Table 5.

3.4. Substitution Rate of Protein-Coding Genes of Salsoleae s.l. Species

Synonymous (Ks) and nonsynonymous (Ka) substitutions, along with their ratio (Ka/Ks), were calculated across the 80 protein-coding genes presented in the plastid genomes of the 13 samples of Salsoleae s.l. (Supplementary File S2). The average Ka/Ks ratio for the 80 protein-coding genes was 0.76. The analysis revealed a zero Ka/Ks ratio for seven genes. However, 55 genes exhibited a Ka/Ks ratio of less than 1, indicating purifying selection. The Ka/Ks ratios of 18 genes exceeded 1, suggesting positive selection (Figure 6).

3.5. Phylogenetic Analysis

The phylogenetic relationships among Salsoleae s.l. species were analyzed based on the nucleotide sequences of complete plastid genome sequences. A multiple sequence alignment was performed on the nucleotide sequences from thirteen Salsoleae s.l. samples and two outgroup species. Maximum likelihood (ML) and Bayesian inference (BI) methods were employed to construct the phylogenetic trees. The resulting trees based on the nucleotide sequences of the complete plastid genome showed a similar topology and clustered the Salsoleae s.l. species into several clusters, which was supported by strong bootstrap and posterior probability values. The analyzed species were divided into two main clusters, corresponding to the Salsoleae and Caroxyloneae clades, and four subclusters. The Salsoleae cluster comprised five species from the genera Oreosalsola Akhani, Soda (Dumort.) Fourr., Salsola, and Xylosalsola, while the Caroxyloneae cluster included seven species (or eight samples) from the genera Caroxylon Thunb. and Pyankovia. The species S. foliosa was clustered with S. abrotanoides in a separate subcluster. S. tragus and X. richteri, along with the GenBank sample S. collina, formed a second subcluster in the Salsoleae clade, indicating a close relationship among these species. The species from Caroxylon grouped together and made a third subcluster. P. affinis, the next species analyzed in this study, grouped with S. affinis and P. bachiata obtained from GenBank in the Caroxyloneae clade. Each clade was represented by two subclusters (Figure 7).

4. Discussion

In this study, we sequenced, assembled, and annotated the plastid genomes of Salsola foliosa, Salsola tragus, Pyankovia affinis, and Xylosalsola richteri species from Kazakhstan. These species’ complete plastid genome sizes ranged from 151,177 bp to 152,969 bp, which was consistent with the genome sizes of other Amaranthaceae species [51,52,53]. The total GC content varied slightly, ranging from 36.26% in P. affinis to 36.60% in S. tragus and X. richteri. A similar GC content has been observed in other angiosperm plastid genomes [54,55]. The plastid genomes of S. foliosa, S. tragus, P. affinis, and X. richteri are highly conserved, comprising four parts and featuring an identical gene content and gene order. The gene loss and genome rearrangement documented in various angiosperm species [56,57] were not observed in the plastomes of S. foliosa, S. tragus, P. affinis, and X. richteri. A total of 114 unique genes were annotated, including 80 protein-coding genes, 30 tRNA genes, and 4 rRNA genes (Table 3).
The thirteen regions displaying relatively high variability, specifically ndhC-ndhD, rps16-psbK, petD, rpoC2, ndhA, petB, clpP, atpF, ycf3, accD, ndhF-ndhG, matK, and rpl20-rpl22, were identified by Pi values exceeding 0.03 (Table 5). These regions exhibited a substantially higher nucleotide diversity when compared to psbB–psbH and rbcL, which were previously employed in phylogenetic analyses for the revised classification of Salsoleae s.l. [10,11]. In this study, the matK gene, recognized as a core barcode by the CBOL [58], was identified as a region demonstrating a relatively high variability, with a Pi value of 0.05478. The thirteen regions identified to exhibit a relatively high variability could serve as potential DNA barcoding markers and valuable resources in assessing the taxonomy of Salsoleae s.l.
The Ka/Ks ratio provides a valuable measure of selection pressure on individual protein-coding genes. A Ka/Ks ratio below 1 signifies negative or purifying selection, while a ratio of 1 indicates neutral selection, and a Ka/Ks ratio exceeding 1 indicates positive selection [59]. In this study, the Ka/Ks analysis suggested that the majority (55) of the genes were under negative selection, while 18 were identified as being under positive selection (Figure 6). These findings are consistent with reports indicating that the prevalence of protein-coding genes undergoing negative selection is common in angiosperms [55,60,61]. Notably, the ndhB gene exhibited the highest Ka/Ks ratio (4.9), suggesting significant positive selection in the analyzed Salsoleae s.l. species, a conclusion supported by earlier studies [60,61].
SSRs extracted from plastid genomes are variable molecular markers known for their high levels of polymorphism and stability, as documented in the literature [62,63]. Plastid genome SSRs are widely utilized in population genetics and evolutionary studies due to their effectiveness in revealing levels of genetic diversity and population structures [64,65]. In this study, a total of 987 SSRs were identified across the four analyzed plastid genomes (Table 4). Mononucleotide repeats were the most abundant, constituting 77.20% of the total SSRs, followed by dinucleotide, tetranucleotide, trinucleotide, pentanucleotide, and hexanucleotide repeats. Previous studies have reported similar results [66,67]. Most mononucleotide and dinucleotide repeats consisted of A/T (97.64%) and AT/AT (58.79%). The obtained results aligned with prior research indicating that plastid genome SSRs are primarily composed of polyadenine (poly A) or polythymine (poly T) repeats, with infrequent occurrences of tandem guanine (G) or cytosine (C) repeats [68,69]. The higher abundance of tetranucleotide repeats compared to trinucleotide, pentanucleotide, and hexanucleotide repeats is consistent with findings in other species [70]. Pentanucleotide and hexanucleotide repeats were rare across the plastid genomes of S. foliosa, S. tragus, P. affinis, and X. richteri, mirroring the results observed in Caroxylon species from Amaranthaceae [71]. These newly acquired plastid genome SSRs have the potential to bridge the gap caused by the lack of species-specific SSRs designed for the Salsola, Xylosalsola, and Pyankovia taxa.
Complete plastid genome nucleotide sequences offer invaluable data for elucidating the phylogenetic relationships among plants and are widely utilized for this purpose [72,73]. Plastid genome nucleotide sequences have been utilized to assess the phylogenetic relationships among Caroxylon species and other members of the tribe Salsoleae s.l., for which plastid genome data are accessible in GenBank [71]. Previously, extensive phylogenetic studies of Salsoleae s.l. have been conducted, employing both nuclear genes and several plastid genome genes to gain a comprehensive understanding of the evolutionary relationships within this tribe [10,11]. Despite these efforts, the taxonomy of genera such as Salsola, Pyankovia, and Xylosalsola still presents challenges and remains unclear. This study employed nucleotide sequences derived from complete plastid genomes to elucidate the phylogenetic relationships among the analyzed species. In total, thirteen Salsoleae s.l. plastid genomes, including four genomes reported in this study, were analyzed in addition to two outgroups (Figure 7). The dendrogram based on full plastid genomes formed two clear clusters and four subclusters. According to Akhani et al. [10], the first two clusters represented the Salsoleae tribe clade, and the other two represented the Caroxyloneae tribe clade. Within the Caroxyloneae tribe clade, the dendrogram clearly grouped five species from the genus Caroxylon (Figure 7). Previously, this group of species from this genus was assessed using full plastid sequences [71], confirming their taxonomic position within the Caroxyloneae clade. The other cluster of the clade consisted of three species of Pyankovia, including P. affinis. The distinct separation of P. affinis from three other species described in this study was congruent with a morphological description of these taxa (Figure 1). The morphology of P. affinis is drastically different from that of the other three species, as it has fleshy leaves with obtuse and semi-rolled forms (Figure 1G).
In the Salsoleae tribe clade, the species S. abrotanoides, which was accepted as Oreosalsola abrotanoides (Bunge) Akhani [74], and S. foliosa, which was recognized as Soda foliosa (L.) Akhani by Rudov et al. [75], were clearly distinguished from the subcluster consisting of X. richteri, S. tragus, and S. collina (Figure 7). The taxonomic position of S. foliosa within this clade and the separation from S. tragus and X. richteri also reflects a number of distinct morphological features of this species, such as bare and branched stems and leaves that are club-shaped and arched upwards (Figure 1E). The clustering of the three latter species of the clade agrees well with Wen and co-authors [11], as the topology of the two dendrograms for these species is similar. Still, the results provided by Akhani and co-authors [10] suggested that X. richteri belongs to the genus Xylosalsola. Therefore, the generated ML/BI dendrogram indicated that S. collina and S. tragus belong to the genus Salsola in the Kali clade and confirmed that X. richteri is distantly separated from these two species as a representative of the genus Xylosalsola. The separation of S. tragus and X. richteri is also easily visible morphologically. The bracts of S. tragus are pointed with a spine at the apex (Figure 1F), unlike in X. richteri, where the short-pointed leaves are filamentous or linear (Figure 1H). While our results provide some insights, additional plastid genome data from Salsola and Xylosalsola species are necessary to clarify the relationships between these genera thoroughly.
In general, the generated complete plastid sequences for these four species of Salsoleae s.l. may have important biological significance in further taxonomic, evolutionary, and conservation studies in these taxa. For instance, here, we identified more than 200 candidate SSR markers for each of the studied species, which can potentially be efficiently used for the extraction of information on most polymorphic SSRs as informative tools in genetic studies of these species, in the evaluation of the genetic diversity within and between populations, and in the identification of most diverse populations within a species for the development of robust strategies in germplasm conservation projects. Although our current study examined only a limited number of species, ongoing advancements in sequencing technologies are poised to significantly expand the pool of sequenced plastid genomes. Consequently, the data obtained in this study hold promise as a valuable resource for future phylogenetic analyses within the tribe of Salsoleae s.l.

5. Conclusions

The assembled plastid genomes of S. foliosa, S. tragus, P. affinis, and X. richteri had lengths of 151,577 bp, 152,969 bp, 151,239 bp, and 151,177 bp, respectively. Thirteen regions, including ndhC-ndhD, rps16-psbK, petD, rpoC2, ndhA, petB, clpP, atpF, ycf3, accD, ndhF-ndhG, matK, and rpl20-rpl22, exhibited a relatively high nucleotide diversity. These genes could serve as potential DNA-barcoding markers for Salsoleae tribe species. Among 55 genes, the Ka/Ks ratio was <1, indicating purifying selection, while for 18 genes, the ratio was >1, suggesting positive selection; the ndhB gene had the highest Ka/Ks ratio (4.9). The plastid genomes of S. foliosa, S. tragus, P. affinis, and X. richteri contained 239, 274, 215, and 259 SSRs, respectively, with the majority located in intergenic spacer regions (60.98%). The maximum likelihood and Bayesian inference phylogenetic tree separated thirteen plastid genomes into four distinct clusters corresponding to six genera in the Salsoleae and Caroxyloneae clades. It was suggested that the generated sequences of the four complete plastid genomes comprise a valuable resource for future phylogenetic analyses within Salsoleae s.l.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biom14080890/s1, Supplementary File S1: List of simple sequence repeats identified in the plastid genomes of S. foliosa, S. tragus, P. affinis, and X. richteri. Supplementary File S2: The Ka/Ks analysis of 80 protein-coding genes from Salsoleae s.l. plastid genomes.

Author Contributions

Conceptualization, S.A. (Shyryn Almerekova), Y.T. and S.A. (Saule Abugalieva); software, S.A. (Shyryn Almerekova); validation, S.A. (Shyryn Almerekova), M.Y. and B.O.; formal analysis, S.A. (Shyryn Almerekova) and Y.T.; investigation, S.A. (Shyryn Almerekova), M.Y., B.O. and P.V.; resources, S.A. (Shyryn Almerekova), M.Y., B.O. and P.V.; data curation, S.A. (Shyryn Almerekova) and Y.T.; writing—original draft preparation, S.A. (Shyryn Almerekova), Y.T. and S.A. (Saule Abugalieva); writing—review and editing, S.A. (Shyryn Almerekova) and S.A. (Saule Abugalieva); supervision, Y.T.; project administration, S.A. (Shyryn Almerekova); funding acquisition, S.A. (Shyryn Almerekova). All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Science Committee of the Ministry of Science and Higher Education of the Republic of Kazakhstan (grant No. AP14869593).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The plastid genome sequences of the four species analyzed in this study have been submitted to the GenBank database at the NCBI (National Center for Biotechnology Information) with the following accession numbers: PP754487-PP754490.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Kadereit, G.; Borsch, T.; Weising, K.; Freitag, H. Phylogeny of Amaranthaceae and Chenopodiaceae and the evolution of C4 photosynthesis. Int. J. Plant Sci. 2003, 164, 959–986. [Google Scholar] [CrossRef]
  2. Latałowa, M.; Kupryjanowicz, M.; Nalepka, D. Chenopodiaceae–Goosefoot family. In Late Glacial and Holocene History of Vegetation in Poland Based on Isopollen Maps; Szafer Institute of Botany, Polish Academy of Sciences: Kraków, Poland, 2004; pp. 273–281. [Google Scholar]
  3. Mucina, L. Caroxylon (Chenopodiaceae s. str.) in continental southern Africa and Madagascar: A preliminary nomenclatural synopsis and biogeographical considerations. Phytotaxa 2017, 312, 151–178. [Google Scholar] [CrossRef]
  4. Bochantsev, V.P. The genus Salsola L., a brief history of its development and settlement. Bot. J. 1969, 54, 989–1001. (In Russian) [Google Scholar]
  5. Kühn, U.; Bittrich, V.; Carolin, R.; Freitag, H.; Hedge, I.C.; Uotila, P.; Wilson, P.G. Chenopodiaceae. In Flowering Plants Dicotyledons; Springer: Berlin/Heidelberg, Germany, 1993; pp. 253–281. [Google Scholar]
  6. Freitag, H. Salsola L. (Chenopodiaceae). In Flora Iranica; Rechinger, K.N., Ed.; Akadesche Druck-u. Verlagsanstalt: Graz, Austria, 1997; Volume 173, pp. 154–255. [Google Scholar]
  7. Willis, J.C.; Airy Shaw, H.G. A Dictionary of the Flowering Plants and Ferns, 8th ed.; Cambridge University Press: Cambridge, UK, 1973; p. 209. [Google Scholar]
  8. Abtahi, M.; Zandi Esfahan, E. Effects of phenological stage on forage quality of halophyte species Salsola arbuscula Pall. in the central desert of Iran. Appl. Ecol. Environ. Res. 2017, 15, 3. [Google Scholar] [CrossRef]
  9. Murshid, S.S.; Atoum, D.; Abou-Hussein, D.R.; Abdallah, H.M.; Hareeri, R.H.; Almukadi, H.; Edrada-Ebel, R. Genus Salsola: Chemistry, biological activities and future prospective—A review. Plants 2022, 11, 714. [Google Scholar] [CrossRef] [PubMed]
  10. Akhani, H.; Edwards, G.; Roalson, E.H. Diversification of the old world Salsoleae sl (Chenopodiaceae): Molecular phylogenetic analysis of nuclear and chloroplast data sets and a revised classification. Int. J. Plant Sci. 2007, 168, 931–956. [Google Scholar] [CrossRef]
  11. Wen, Z.B.; Zhang, M.L.; Zhu, G.L.; Sanderson, S.C. Phylogeny of Salsoleae sl (Chenopodiaceae) based on DNA sequence data from ITS, psbB–psbH, and rbcL, with emphasis on taxa of northwestern China. Plant Syst. Evol. 2010, 288, 25–42. [Google Scholar] [CrossRef]
  12. POWO. Plants of the World Online. Facilitated by the Royal Botanic Gardens, Kew. Published on the Internet. Available online: http://www.plantsoftheworldonline.org/ (accessed on 11 May 2024).
  13. Wei, Y.; Dong, M.; Huang, Z.; Tan, D. Factors influencing seed germination of Salsola affinis (Chenopodiaceae), a dominant annual halophyte inhabiting the deserts of Xinjiang, China. Flora Morphol. Distrib. Funct. Ecol. Plants 2008, 203, 134–140. [Google Scholar] [CrossRef]
  14. Pirasteh-Anosheh, H.; Mirhosseini, A.; Akram, N.A.; Hasanuzzaman, M. Forage potential of Salsola species in arid-saline rangelands. Turk. J. Bot. 2021, 45, 203–215. [Google Scholar] [CrossRef]
  15. Dianati Tilaki, G.; Haidarian Aghakhani, M.; Filehkesh, E.; Naghipour Borj, A.A. Investigation on the effects of phenological stages on forage quality and soluble carbohydrates in Salsola arbuscula and Salsola richteri species in saline rangelands of sabzevar. Iran. J. Range Desert Res. 2012, 18, 652–661. [Google Scholar]
  16. Altay, V.; Ozturk, M. The genera Salsola and Suaeda (Amaranthaceae) and their value as fodder. In Handbook of Halophytes: From Molecules to Ecosystems towards Biosaline Agriculture; Springer: Berlin/Heidelberg, Germany, 2020; pp. 1–12. [Google Scholar]
  17. Loizzo, M.R.; Tundis, R.; Statti, G.A.; Passalacqua, N.G.; Peruzzi, L.; Menichini, F. In vitro angiotensin converting enzyme inhibiting activity of Salsola oppositifolia Desf., Salsola soda L. and Salsola tragus L. Nat. Prod. Res. 2007, 21, 846–851. [Google Scholar] [CrossRef] [PubMed]
  18. ElNaggar, M.H.; Eldehna, W.M.; Abourehab, M.A.; Abdel Bar, F.M. The old world Salsola as a source of valuable secondary metabolites endowed with diverse pharmacological activities: A review. J. Enzym. Inhib. Med. Chem. 2022, 37, 2036–2062. [Google Scholar] [CrossRef] [PubMed]
  19. Pyankov, V.I.; Artyusheva, E.G.; Edwards, G.E.; Black, C.C., Jr.; Soltis, P.S. Phylogenetic analysis of tribe Salsoleae (Chenopodiaceae) based on ribosomal ITS sequences: Implications for the evolution of photosynthesis types. Am. J. Bot. 2001, 88, 1189–1198. [Google Scholar] [CrossRef] [PubMed]
  20. Borger, C.P.; Yan, G.; Scott, J.K.; Walsh, M.J.; Powles, S.B. Salsola tragus or S. australis (Chenopodiaceae) in Australia—Untangling taxonomic confusion through molecular and cytological analyses. Aust. J. Bot. 2008, 56, 600–608. [Google Scholar] [CrossRef]
  21. Saeed, A.S.; Saeidi, H.; Baghestani, M.N.; Mirhosseini, A. The taxonomy of the genus Salsola (Chenopodiaceae) in Yazd province. J. Taxon. Biosist. 2016, 8, 51–60. [Google Scholar]
  22. Mosyakin, S.L. Taxonomic and nomenclatural notes on Pontic-Mediterranean coastal and some Australasian taxa of Salsola (Chenopodiaceae). Ukr. Bot. J. 2017, 74, 521–531. [Google Scholar] [CrossRef]
  23. Gao, T.P.; Gao, H.N.; Zhang, Y.; Xu, Y.; An, L.Z. Genetic diversity of Salsola passerina populations in northwestern China based on inter-simple sequence repeat (ISSR). J. Lanzhou Univ. Nat. Sci. 2009, 45, 66–74. [Google Scholar]
  24. Alotaibi, M.O.; Abd-Elgawad, M.E. ISSR and SCoT for evaluation of hereditary differences of 29 wild plants in Al Jubail Saudi Arabian. Saudi J. Biol. Sci. 2022, 29, 3223–3231. [Google Scholar] [CrossRef]
  25. Abdel-Hamid, A.M. Characterization of four Salsola species and their genetic relationship by AFLP. Pak. J. Bot. 2016, 48, 1183–1187. [Google Scholar]
  26. McGray, H.G.; Ayres, D.R.; Sloop, C.M.; Lee, A.K. Beta SSR loci cross-amplify in five Salsola taxa. Mol. Ecol. Resour. 2008, 8, 608–611. [Google Scholar] [CrossRef]
  27. Almerekova, S.; Favarisova, N.; Turuspekov, Y.; Abugalieva, S. Cross-Genera Transferability of Microsatellite Markers and Phylogenetic Assessment of Three Salsola Species from Western Kazakhstan. Proc. Latv. Acad. Sci. Sect. B Nat. Exact Appl. Sci. 2020, 74, 325–334. [Google Scholar] [CrossRef]
  28. Daniell, H.; Lin, C.S.; Yu, M.; Chang, W.J. Chloroplast genomes: Diversity, evolution, and applications in genetic engineering. Genome Biol. 2016, 17, 134. [Google Scholar] [CrossRef] [PubMed]
  29. Jansen, R.K.; Raubeson, L.A.; Boore, J.L.; DePamphilis, C.W.; Chumley, T.W.; Haberle, R.C.; Wyman, S.K.; Alverson, A.J.; Peery, R.; Herman, S.J.; et al. Methods for obtaining and analyzing whole chloroplast genome sequences. Methods Enzymol. 2005, 395, 348–384. [Google Scholar]
  30. Yu, X.Q.; Drew, B.T.; Yang, J.B.; Gao, L.M.; Li, D.Z. Comparative chloroplast genomes of eleven Schima (Theaceae) species: Insights into DNA barcoding and phylogeny. PLoS ONE 2017, 12, e0178026. [Google Scholar] [CrossRef]
  31. Ahmed, I.; Matthews, P.J.; Biggs, P.J.; Naeem, M.; McLenachan, P.A.; Lockhart, P.J. Identification of chloroplast genome loci suitable for high-resolution phylogeographic studies of Colocasia esculenta (L.) Schott (Araceae) and closely related taxa. Mol. Ecol. Resour. 2013, 13, 929–937. [Google Scholar] [CrossRef]
  32. Bilgen, B.B.; Kaya, N. Genetic diversity among Pinus sylvestris L. populations and its implications for genetic conservation: Comparison of nuclear and chloroplast microsatellite markers. Fresenius Environ. Bull. 2017, 26, 6873–6881. [Google Scholar]
  33. Jayaswall, K.; Sharma, H.; Jayaswal, D.; Sagar, R.; Bhandawat, A.; Kumar, A.; Singh, M. Development of chloroplast derived SSR markers for genus Allium and their characterization in the allies for genetic improvement of Alliums. S. Afr. J. Bot. 2023, 162, 304–313. [Google Scholar] [CrossRef]
  34. Ginwal, H.S.; Chauhan, P.; Barthwal, S.; Sharma, A.; Sharma, R. Short Note: Cross-Species Amplification and Characterization of Chloroplast Microsatellite Markers in Roxb. Silvae Genet. 2011, 60, 65–69. [Google Scholar] [CrossRef]
  35. Żukowska, W.B.; Wójkiewicz, B.; Litkowiec, M.; Wachowiak, W. Cross-amplification and multiplexing of cpSSRs and nSSRs in two closely related pine species (Pinus sylvestris L. and P. mugo Turra). Dendrobiology 2017, 77, 59–64. [Google Scholar] [CrossRef]
  36. Sharma, H.; Hyvönen, J.; Poczai, P. Development of chloroplast microsatellite markers for giant ragweed (Ambrosia trifida). Appl. Plant Sci. 2020, 8, 11313. [Google Scholar] [CrossRef]
  37. Terakami, S.; Matsumura, Y.; Kurita, K.; Kanamori, H.; Katayose, Y.; Yamamoto, T.; Katayama, H. Complete sequence of the chloroplast genome from pear (Pyrus pyrifolia): Genome structure and comparative analysis. Tree Genet. Genomes 2012, 8, 841–854. [Google Scholar] [CrossRef]
  38. Li, X.; Zhang, Q.; Duo, J.; Yang, Y.; Ju, X.; Duan, R.; Xiong, H. The complete chloroplast genome of Salsola abrotanoides (Chenopodiaceae), a desert halophyte shrub in China. Mitochondrial DNA Part B 2021, 6, 1152–1153. [Google Scholar] [CrossRef]
  39. Doyle, J.J.; Doyle, J.L. A Rapid DNA Isolation Procedure for Small Quantities of Fresh Leaf Tissue. Phytochem. Bull. 1987, 19, 11–15. [Google Scholar]
  40. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef]
  41. Dierckxsens, N.; Mardulyn, P.; Smits, G. NOVOPlasty: De novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 2017, 45, e18. [Google Scholar] [CrossRef]
  42. Tillich, M.; Lehwark, P.; Pellizzer, T.; Ulbricht-Jones, E.S.; Fischer, A.; Bock, R.; Greiner, S. GeSeq–versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 2017, 45, W6–W11. [Google Scholar] [CrossRef] [PubMed]
  43. Xie, W.; Zhang, C.; Wang, Y.; Zhang, Y. The complete chloroplast genome of Caroxylon passerinum (Chenopodiaceae), an annual desert plant. Mitochondrial DNA Part B 2022, 7, 426–427. [Google Scholar] [CrossRef] [PubMed]
  44. Lohse, M.; Drechsel, O.; Bock, R. OrganellarGenomeDRAW (OGDRAW): A tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes. Curr. Genet. 2007, 52, 267–274. [Google Scholar] [CrossRef] [PubMed]
  45. Beier, S.; Thiel, T.; Munch, T.; Scholz, U.; Mascher, M. MISA-web: A web server for microsatellite prediction. Bioinformatics 2017, 33, 2583–2585. [Google Scholar] [CrossRef]
  46. Kurtz, S.; Choudhuri, J.V.; Ohlebusch, E.; Schleiermacher, C.; Stoye, J.; Giegerich, R. Reputer: The manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 2001, 29, 4633–4642. [Google Scholar] [CrossRef]
  47. Benson, G. Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Res. 1999, 27, 573–580. [Google Scholar] [CrossRef] [PubMed]
  48. Rozas, J.; Ferrer-Mata, A.; Sánchez-DelBarrio, J.C.; Guirao-Rico, S.; Librado, P.; Ramos-Onsins, S.E.; Sánchez-Gracia, A. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol. Biol. Evol. 2017, 34, 3299–3302. [Google Scholar] [CrossRef] [PubMed]
  49. Nguyen, L.T.; Schmidt, H.A.; Von Haeseler, A.; Minh, B.Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 2015, 32, 268–274. [Google Scholar] [CrossRef] [PubMed]
  50. Ronquist, F.; Teslenko, M.; van der Mark, P.; Ayres, D.L.; Darling, A.; Höhna, S.; Larget, B.; Liu, L.; Suchard, M.A.; Huelsenbeck, J.P. MrBayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice Across a Large Model Space. Syst. Biol. 2012, 61, 539–542. [Google Scholar] [CrossRef] [PubMed]
  51. Park, J.; Min, J.; Kim, Y.; Chung, Y. The comparative analyses of six complete chloroplast genomes of morphologically diverse Chenopodium album L. (Amaranthaceae) collected in Korea. Int. J. Genom. 2021, 2021, 6643444. [Google Scholar] [CrossRef] [PubMed]
  52. Wariss, H.M.; Qu, X.J. The complete chloroplast genome of Chenopodium acuminatum Willd. (Amaranthaceae). Mitochondrial DNA Part B 2021, 6, 174–175. [Google Scholar] [CrossRef]
  53. Wei, Z.; Chen, F.; Ding, H.; Liu, W.; Yang, B.; Geng, J.; Guo, S. Comparative Analysis of Six Chloroplast Genomes in Chenopodium and Its Related Genera (Amaranthaceae): New Insights into Phylogenetic Relationships and the Development of Species-Specific Molecular Markers. Genes 2023, 14, 2183. [Google Scholar] [CrossRef] [PubMed]
  54. Zhang, Z.; Tao, M.; Shan, X.; Pan, Y.; Sun, C.; Song, L.; Dai, Z. Characterization of the complete chloroplast genome of Brassica oleracea var. italica and phylogenetic relationships in Brassicaceae. PLoS ONE 2022, 17, e0263310. [Google Scholar]
  55. Feng, L.; Zhao, G.; An, M.; Wang, C.; Yin, Y. Complete chloroplast genome sequences of the ornamental plant Prunus cistena and comparative and phylogenetic analyses with its closely related species. BMC Genom. 2023, 24, 739. [Google Scholar] [CrossRef]
  56. He, L.; Qian, J.; Li, X.; Sun, Z.; Xu, X.; Chen, S. Complete chloroplast genome of medicinal plant Lonicera japonica: Genome rearrangement, intron gain and loss, and implications for phylogenetic studies. Molecules 2017, 22, 249. [Google Scholar] [CrossRef]
  57. Claude, S.J.; Park, S.; Park, S. Gene loss, genome rearrangement, and accelerated substitution rates in plastid genome of Hypericum ascyron (Hypericaceae). BMC Plant Biol. 2022, 22, 135. [Google Scholar] [CrossRef] [PubMed]
  58. CBOL Plant Working Group. A DNA barcode for land plants. Proc. Natl. Acad. Sci. USA 2009, 106, 12794–12797. [Google Scholar] [CrossRef] [PubMed]
  59. Yang, Z.; Nielsen, R. Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. Mol. Biol. Evol. 2000, 17, 32–43. [Google Scholar] [CrossRef]
  60. Fu, P.C.; Zhang, Y.Z.; Geng, H.M.; Chen, S.L. The complete chloroplast genome sequence of Gentiana lawrencei var. farreri (Gentianaceae) and comparative analysis with its congeneric species. PeerJ 2016, 4, e2540. [Google Scholar]
  61. Gao, B.; Yuan, L.; Tang, T.; Hou, J.; Pan, K.; Wei, N. The complete chloroplast genome sequence of Alpinia oxyphylla Miq. and comparison analysis within the Zingiberaceae family. PLoS ONE 2019, 14, e0218817. [Google Scholar] [CrossRef]
  62. Ebert, D.; Peakall, R. Chloroplast simple sequence repeats (cpSSRs): Technical resources and recommendations for expanding cpSSR discovery and applications to a wide array of plant species. Mol. Ecol. Resour. 2009, 9, 673–690. [Google Scholar] [CrossRef]
  63. Makrickiene, E.; Danusevičius, D.; Brazaitis, G.; Manton, M. Morphological and genetic differentiation of wolf trees in Scots pine stands based on chloroplast microsatellite markers. Eur. J. For. Res. 2019, 138, 527–537. [Google Scholar] [CrossRef]
  64. Lācis, G.; Kārkliņa, K.; Bartulsons, T.; Stalažs, A.; Jundzis, M.; Baļķe, I.; Strautiņa, S. Genetic structure of a Ribes genetic resource collection: Inter-and intra-specific diversity revealed by chloroplast DNA simple sequence repeats (cpSSRs). Sci. Hortic. 2022, 304, 111285. [Google Scholar] [CrossRef]
  65. Guo, Q.; Xue, X.; Wang, D.; Zhang, L.; Liu, W.; Wang, E.; Hou, X. Genetic diversity and population genetic structure of Paeonia suffruticosa by chloroplast DNA simple sequence repeats (cpSSRs). Hortic. Plant J. 2024, in press. [Google Scholar] [CrossRef]
  66. Mehmetoğlu, E.; Kaymaz, Y.; Ateş, D.; Kahraman, A.; Tanyolaç, M.B. The complete chloroplast genome of Cicer reticulatum and comparative analysis against relative Cicer species. Sci. Rep. 2023, 13, 17871. [Google Scholar] [CrossRef]
  67. Jin, G.; Li, W.; Song, F.; Yang, L.; Wen, Z.; Feng, Y. Comparative analysis of complete Artemisia subgenus Seriphidium (Asteraceae: Anthemideae) chloroplast genomes: Insights into structural divergence and phylogenetic relationships. BMC Plant Biol. 2023, 23, 258. [Google Scholar] [CrossRef] [PubMed]
  68. Rono, P.C.; Dong, X.; Kirika, P.M.; Hu, G.W.; Wang, Q.F. Initial complete chloroplast genomes of Alchemilla (Rosaceae): Comparative analysis and phylogenetic relationships. Front. Genet. 2020, 11, 560368. [Google Scholar] [CrossRef] [PubMed]
  69. Yermagambetova, M.; Abugalieva, S.; Turuspekov, Y.; Almerekova, S. Illumina sequencing data of the complete chloroplast genome of rare species Juniperus seravschanica (Cupressaceae) from Kazakhstan. Data Brief 2023, 46, 108866. [Google Scholar] [CrossRef] [PubMed]
  70. Almerekova, S.; Yermagambetova, M.; Jumanov, S.; Abugalieva, S.; Turuspekov, Y. Comparative analysis of chloroplast genomes of seven Juniperus species from Kazakhstan. PLoS ONE 2024, 19, e0295550. [Google Scholar] [CrossRef] [PubMed]
  71. Almerekova, S.; Yermagambetova, M.; Osmonali, B.; Vesselova, P.; Abugalieva, S.; Turuspekov, Y. Characterization of the Plastid Genomes of Four Caroxylon Thunb. Species from Kazakhstan. Plants 2024, 13, 1332. [Google Scholar] [CrossRef] [PubMed]
  72. Chen, S.; Safiul Azam, F.M.; Akter, M.L.; Ao, L.; Zou, Y.; Qian, Y. The first complete chloroplast genome of Thalictrum fargesii: Insights into phylogeny and species identification. Front. Plant Sci. 2024, 15, 1356912. [Google Scholar] [CrossRef] [PubMed]
  73. Li, L.; Wu, Q.; Zhai, J.; Wu, K.; Fang, L.; Li, M.; Li, S. Comparative chloroplast genomics of 24 species shed light on the genome evolution and phylogeny of subtribe Coelogyninae (Orchidaceae). BMC Plant Biol. 2024, 24, 31. [Google Scholar] [CrossRef] [PubMed]
  74. Akhani, H.; Khoshravesh, R.; Malekmohammadi, M. Taxonomic novelties from Irano-Turanian region and NE Iran: Oreosalsola, a new segregate from Salsola s.l., two new species in Anabasis and Salvia, and two new combinations in Caroxylon and Seseli. Phytotaxa 2016, 249, 159–180. [Google Scholar] [CrossRef]
  75. Rudov, A.; Mashkour, M.; Djamali, M.; Akhani, H. A review of C4 plants in southwest Asia: An ecological, geographical and taxonomical analysis of a region with high diversity of C4 eudicots. Front. Plant Sci. 2020, 11, 546518. [Google Scholar] [CrossRef]
Figure 1. Pictures of Salsola foliosa, Salsola tragus, Pyankovia affinis, and Xylosalsola richteri in natural conditions ((AD), respectively) and close-up representations of S. foliosa, S. tragus, P. affinis, and X. richter ((EH), respectively).
Figure 1. Pictures of Salsola foliosa, Salsola tragus, Pyankovia affinis, and Xylosalsola richteri in natural conditions ((AD), respectively) and close-up representations of S. foliosa, S. tragus, P. affinis, and X. richter ((EH), respectively).
Biomolecules 14 00890 g001
Figure 2. A circular map illustrating the plastid genomes of the species S. foliosa, S. tragus, P. affinis, and X. richteri collected in Kazakhstan, with annotated genes color-coded based on their respective functions. Inside the circle, the darker grey shades represent the GC content, while the lighter grey shades represent the AT content. The boundaries of the plastid genome is delineated into LSC, SSC, IRA, and IRB regions.
Figure 2. A circular map illustrating the plastid genomes of the species S. foliosa, S. tragus, P. affinis, and X. richteri collected in Kazakhstan, with annotated genes color-coded based on their respective functions. Inside the circle, the darker grey shades represent the GC content, while the lighter grey shades represent the AT content. The boundaries of the plastid genome is delineated into LSC, SSC, IRA, and IRB regions.
Biomolecules 14 00890 g002
Figure 3. The distribution of identified SSRs across the large single-copy (LSC), small single-copy (SSC), and inverted repeat (IR) regions of the plastid genomes of S. foliosa, S. tragus, P. affinis, and X. richteri.
Figure 3. The distribution of identified SSRs across the large single-copy (LSC), small single-copy (SSC), and inverted repeat (IR) regions of the plastid genomes of S. foliosa, S. tragus, P. affinis, and X. richteri.
Biomolecules 14 00890 g003
Figure 4. Repeated sequences within the plastid genomes of S. foliosa, S. tragus, P. affinis, and X. richteri. (A) The overall count of tandem, forward, reverse, and palindromic repeat types. (B) The distribution of long repeats was categorized by their respective lengths.
Figure 4. Repeated sequences within the plastid genomes of S. foliosa, S. tragus, P. affinis, and X. richteri. (A) The overall count of tandem, forward, reverse, and palindromic repeat types. (B) The distribution of long repeats was categorized by their respective lengths.
Biomolecules 14 00890 g004
Figure 5. Nucleotide diversity (Pi) values among the protein-coding genes of the 13 species of Salsoleae s.l. based on the sliding window analysis (window length = 600 bp and step size = 200 bp). The vertical axis represents the nucleotide diversity for each window, and the horizontal axis indicates the midpoint position.
Figure 5. Nucleotide diversity (Pi) values among the protein-coding genes of the 13 species of Salsoleae s.l. based on the sliding window analysis (window length = 600 bp and step size = 200 bp). The vertical axis represents the nucleotide diversity for each window, and the horizontal axis indicates the midpoint position.
Biomolecules 14 00890 g005
Figure 6. The Ka/Ks ratios of 80 protein-coding genes from 13 samples of Salsoleae s.l. plastid genomes.
Figure 6. The Ka/Ks ratios of 80 protein-coding genes from 13 samples of Salsoleae s.l. plastid genomes.
Biomolecules 14 00890 g006
Figure 7. Phylogenetic tree inferred from the nucleotide sequences of the complete plastid genome of 13 samples of Salsoleae s.l. and outgroups, using maximum likelihood (ML) and Bayesian inference (BI) methods. The species collected in this study are highlighted in blue. The numbers in the branch nodes represent ML bootstrap values/BI posterior probabilities values.
Figure 7. Phylogenetic tree inferred from the nucleotide sequences of the complete plastid genome of 13 samples of Salsoleae s.l. and outgroups, using maximum likelihood (ML) and Bayesian inference (BI) methods. The species collected in this study are highlighted in blue. The numbers in the branch nodes represent ML bootstrap values/BI posterior probabilities values.
Biomolecules 14 00890 g007
Table 1. The information on the collected places of S. foliosa, S. tragus, P. affinis, and X. richteri plant materials.
Table 1. The information on the collected places of S. foliosa, S. tragus, P. affinis, and X. richteri plant materials.
Species S. foliosaS. tragusP. affinisX. richteri
Collected placeWest Kazakhstan region, Borly districtKyzylorda region, Zhalagash districtZhetysu region, Panfilov district Kyzylorda region, Zhanakorgan district
GPS coordinates51.2945.0844.1744.33
53.3664.7879.5366.21
82 m a.s.l.110 m a.s.l.890 m a.s.l.150 m a.s.l.
Table 2. Plastid genome features of S. foliosa, S. tragus, P. affinis, and X. richteri.
Table 2. Plastid genome features of S. foliosa, S. tragus, P. affinis, and X. richteri.
S. foliosaS. tragusP. affinisX. richteri
GenBank numbersPP754489PP754490PP754487PP754488
Genome size (bp)151,577152,969151,239151,177
LSC (bp)83,87183,99383,32484,136
SSC (bp)18,96218,57618,73920,087
IR (bp)48,74450,40049,17646,954
Number of total genes133133133133
Protein-coding genes80808080
tRNAs30303030
rRNAs4444
Total GC content (%)36.5536.6036.2636.60
LSC GC content (%)34.4534.6034.1034.57
SSC GC content (%)29.5929.3828.7529.76
IR GC content (%)42.8842.5842.7743.17
Table 3. Annotated genes within the S. foliosa, S. tragus, P. affinis, and X. richteri plastid genomes.
Table 3. Annotated genes within the S. foliosa, S. tragus, P. affinis, and X. richteri plastid genomes.
CategoryGroup of GenesName of Genes
Self-replicationRibosomal RNArrn4.5 (2), rrn5 (2), rrn16 (2), rrn23 (2)
Transfer RNAtrnA-UGC * (2), trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnfM-CAU, trnG-GCC *, trnG-UCC, trnH-GUG, trnI-CAU (2), trnI-GAU * (2), trnK-UUU *, trnL-CAA (2), trnL-UAA *, trnL-UAG, trnM-CAU, trnN-GUU (2), trnP-UGG, trnQ-UUG, trnR-ACG (2), trnR-UCU, trnS-GCU, trnS-GGA, trnS-UGA, trnT-GGU, trnT-UGU, trnV-GAC (2), trnV-UAC *, trnW-CCA, trnY-GUA
Small subunit of ribosomerps2, rps3, rps4, rps7 (2), rps8, rps11, rps12 * (2), rps14, rps15, rps16 *, rps18, rps19
Large subunit of ribosomerpl2 (2), rpl14, rpl16 *, rpl20, rpl22, rpl23 (2), rpl32, rpl33, rpl36
RNA polymeraserpoA, rpoB, rpoC1 *, rpoC2
Translation initiation factorinfA
PhotosynthesisATP synthaseatpA, atpB, atpE, atpF *, atpH, atpI
NADH dehydrogenasendhA *, ndhB * (2), ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK
Subunits of cytochromepetA, petB *, petD *, petG, petL, petN
Photosystem IpsaA, psaB, psaC, psaI, psaJ
Photosystem IIpsbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ
RubiscorbcL
Other genesMaturasematK
ProteaseclpP **
Envelope membrane proteincemA
Subunit of acetyl-CoA-carboxylaseaccD
C-type cytochrome synthesis geneccsA
Genes of unknown functionHypothetical chloroplast
reading frames
ycf1 (2), ycf2 (2), ycf3 **, ycf4, ycf15 (2)
* One-intron-containing genes; ** two-intron-containing genes; (2) duplicated genes.
Table 4. SSRs detected in the plastid genomes of S. foliosa, S. tragus, P. affinis, and X. richteri.
Table 4. SSRs detected in the plastid genomes of S. foliosa, S. tragus, P. affinis, and X. richteri.
TypeRepeatsS. foliosaS. tragusP. affinisX. richteriTotal%
MonoA/T18020716019774477.20
C/G365418
DiAC/GT2141816.72
AG/CT1614141660
AT/AT2824182797
TriAAG/CTT112152.33
AAT/ATT1122318
TetraAAAC/GTTT110133.34
AAAG/CTTT11215
AAAT/ATTT10113
AAGG/CCTT01012
AATG/ATTC01012
AATT/AATT323210
ACCT/AGGT22228
PentaAAATG/ATTTC001120.20
HexaAAAATT/AATTTT010010.20
AATCCG/ATTCGG00101
Total239274215259987100
Table 5. Highly variable regions in the protein-coding genes of the 13 Salsoleae s.l. species.
Table 5. Highly variable regions in the protein-coding genes of the 13 Salsoleae s.l. species.
Variable RegionLengthVariable SitesParsimony-Informative SitesNucleotide Diversity
ndhC-ndhD60075440.03947
rps16-psbK60982470.03778
petD68890440.03905
rpoC260686450.04046
ndhA86483450.04086
petB74498430.04238
clpP72396460.04262
atpF77987480.04278
ycf366996500.04564
accD729110530.05114
ndhF-ndhG606106630.05288
matK606106650.05478
rpl20-rpl22606133580.06000
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Almerekova, S.; Yermagambetova, M.; Osmonali, B.; Vesselova, P.; Turuspekov, Y.; Abugalieva, S. Complete Plastid Genome Sequences of Four Salsoleae s.l. Species: Comparative and Phylogenetic Analyses. Biomolecules 2024, 14, 890. https://doi.org/10.3390/biom14080890

AMA Style

Almerekova S, Yermagambetova M, Osmonali B, Vesselova P, Turuspekov Y, Abugalieva S. Complete Plastid Genome Sequences of Four Salsoleae s.l. Species: Comparative and Phylogenetic Analyses. Biomolecules. 2024; 14(8):890. https://doi.org/10.3390/biom14080890

Chicago/Turabian Style

Almerekova, Shyryn, Moldir Yermagambetova, Bektemir Osmonali, Polina Vesselova, Yerlan Turuspekov, and Saule Abugalieva. 2024. "Complete Plastid Genome Sequences of Four Salsoleae s.l. Species: Comparative and Phylogenetic Analyses" Biomolecules 14, no. 8: 890. https://doi.org/10.3390/biom14080890

APA Style

Almerekova, S., Yermagambetova, M., Osmonali, B., Vesselova, P., Turuspekov, Y., & Abugalieva, S. (2024). Complete Plastid Genome Sequences of Four Salsoleae s.l. Species: Comparative and Phylogenetic Analyses. Biomolecules, 14(8), 890. https://doi.org/10.3390/biom14080890

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop