Next Article in Journal
Influence of Plant Developmental Phase and Irrigation Level on Cultivable Microbiome of Maize Root
Previous Article in Journal
Nutritional Quality Fingerprinting of Wild and Farmed Cyprinus carpio: A UHPLC-MS/MS-Based Traceability Strategy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comprehensive Analysis of Mitochondrial Genomic Characteristics and Phylogenetic Analysis of the Plant Genus Ipomoea (Convolvulaceae) Species

1
College of Agriculture, Henan University of Science and Technology, Luoyang 471023, China
2
College of Animal Science and Technology, Henan University of Science and Technology, Luoyang 471000, China
3
Sweet Potato Industry Institute, Henan University of Science and Technology, Luoyang 471023, China
*
Authors to whom correspondence should be addressed.
Biology 2025, 14(12), 1696; https://doi.org/10.3390/biology14121696 (registering DOI)
Submission received: 15 November 2025 / Revised: 26 November 2025 / Accepted: 27 November 2025 / Published: 28 November 2025
(This article belongs to the Section Conservation Biology and Biodiversity)

Simple Summary

This study explored the genetic material inside the energy-producing parts of cells, known as mitochondrial genomes, in eight different species of the Ipomoea plant. Although these plants are valuable for food and the environment, little was known about their mitochondrial genomes. Our goal was to understand how these genomes are structured and how they have evolved. We found that the size and gene content of these genomes vary widely among species. Some genes are changing quickly, which may help the plants adapt to their environments, while others remain very stable. By comparing all the genomes, we were able to map out the family relationships between the species. This research provides a foundation for better understanding how these plants evolve and can help guide future efforts to improve crop varieties, which is important for ensuring food security and sustainable agriculture.

Abstract

The genus Ipomoea holds significant economic and ecological value, yet comparative genomic studies on its mitochondrial genomes remain limited. To clarify the genetic evolutionary characteristics of Ipomoea at the mitogenome level, this study conducted a systematic comparative analysis of eight Ipomoea mitogenomes, focusing on sequence structure, codon usage bias, repetitive sequences, genetic variation, and phylogenetic relationships. The results showed that the mitogenome sizes of the eight species ranged from 106.281 to 296.652 kb, with GC content of 43.89–44.82%, ORF numbers of 298–326, and RNA editing sites of 29–223. Repetitive sequence analysis revealed universal mononucleotide SSRs (P1), lineage-specific dispersed repeats (DNA/MIR in four species), and abundant short tandem repeats (0–50 bp). Ka/Ks ratio analysis showed 17 protein-coding genes (e.g., atp4, nad6) under positive selection (linked to adaptive evolution) and 23 genes (e.g., nad3) under purifying selection, ensuring functional conservation. Maximum likelihood phylogeny (with Cuscuta australis as outgroup) identified three primary clades: I. quamoclitI. pes-caprae, I. trilobaI. batatas, and I. bifloraI. aquatica, with I. trifida and I. nil as basal lineages. This study fills gaps in Ipomoea mitogenome research and provides a theoretical foundation for understanding its genomic evolution and guiding germplasm improvement.

1. Introduction

The genus Ipomoea, belonging to the order Solanales and the family Convolvulaceae, is the largest genus within its family, comprising approximately 500 species. Most of these species are distributed across subtropical and tropical regions and are predominantly annual or perennial herbaceous vines. Commonly known examples include morning glory (I. nil), sweet potato (I. batatas), water spinach (I. aquatica), and cypress vine (I. quamoclit), while a small number are shrubs or small trees [1,2]. In China, about 20 species of Ipomoea are found throughout the country, with the majority occurring in southern and southwestern regions [3]. Among these, I. batatas and I. aquatica. serve as important food and vegetable crops, while some species also possess ornamental value [4]. Additionally, Ipomoea species constitute major components of vegetation across various habitats, underscoring their significant economic and ecological importance.
Within plant cells, mitochondria serve as the primary energy-producing organelles, supplying the necessary energy for diverse cellular activities. Since the completion of the Arabidopsis thaliana mitochondrial genome sequencing in 1997, the NCBI-Genome database has included over 560 plant mitochondrial genomes to date [5,6]. Unlike animal mitochondrial genomes, plant mitochondrial genomes exhibit considerable variation in size, sequence composition, and the arrangement of functional genes [7,8]. The smallest plant mitochondrial genome known is that of Brassica napus, measuring only 221 kb, while the largest, reported to date, is that of Silene conoidea, with a genome size of 11.3 Mb [9]. Despite the substantial divergence in genome size among plant mitochondria, most protein-coding genes are highly conserved. These genes primarily consist of 24 core conserved genes and 17 variable genes, which can be categorized based on the types of proteins they encode into Complex I (nad), Complex II (sdh), Complex III (cob), Complex IV (cox), Complex V (atp), Cytochrome c biosynthesis genes (ccm), and transfer RNA (tRNA) genes, among others [10,11]. In higher plant mitochondrial genomes, genes other than those encoding Complex II, ribosomal proteins, and tRNAs are relatively conserved [12]. Unlike chloroplast genomes, plant mitochondrial genomes do not utilize unique genetic codes, plant mitochondrial genomes share a universal genetic code across species. In addition to being directly inherited from ancestral mitochondria, plant mitochondrial tRNAs also originate from the transfer of sequences from their own chloroplast genomes [13,14]. Due to the lack of a complete set of tRNA genes sufficient to recognize all codons, nearly all higher plants lack specific tRNA genes capable of recognizing Gly (glycine), Val (valine), Leu (leucine), Ala (alanine), Thr (threonine), and Arg (arginine) [15,16]. The coding genes in higher plant mitochondrial genomes also harbor abundant genetic variation information, making them ideal molecular markers for studying species origin, evolution, and population genetic diversity [17]. Mitochondrial genomes not only exhibit rapid evolutionary rates and high recombination rates but also possess advantages such as relatively small genome sizes and ease of sequencing, making them valuable tools for comparative genetics and phylogenetic studies across different plant species [18,19,20].
To date, studies employing comparative genomic approaches to analyze the genomic characteristics and phylogenetic relationships of mitochondrial genomes within this genus remain scarce. To gain deeper insights into the genetic evolutionary characteristics among Ipomoea species at the mitochondrial genome level, this study conducts a comparative analysis based on the nine available mitochondrial genomes of Ipomoea species in the NCBI-Genome database. The analysis focuses on sequence structure characteristics, codon usage bias, and phylogenetic relationships within the genus. The findings aim to provide a theoretical foundation for a comprehensive understanding of the genomic structural features and phylogenetic diversity among Ipomoea species while also laying the groundwork for future omics studies, biological functional analyses, and genomic evolutionary research in this genus.

2. Materials and Methods

2.1. Sample Source

Experimental samples were collected from the I. batatas trial field in Fandian Town, Luoyang City, Henan Province. Fresh and tender leaves in the vigorous growth stage were harvested, rinsed to remove soil and sand, treated with sterile water, and subsequently stored at -80 °C in an ultra-low temperature freezer for future use.

2.2. Sequencing

Total genomic DNA was extracted from the tissue using the modified CTAB method [21]. Second-generation sequencing (SGS) and third-generation sequencing (TGS) were performed utilizing the BGISEQ-500 platform and the BGI PromethION nanopore sequencer, respectively. The SGS workflow encompassed sample quality control, library construction, library quality assessment, and sequencing. The TGS procedure involved the following steps: large-fragment DNA was size-selected using the Blue Pippin automated nucleic acid recovery system; libraries were constructed; approximately 1 μg of DNA from each Blue Pippin-selected library was subjected to damage repair; the repaired DNA was purified and recovered using magnetic beads; its concentration was quantified and the recovery rate calculated using a Qubit fluorometer (ThermoFisher Scientific, Waltham, MA, USA); DNA was ligated to sequencing adapters provided in the kit, ensuring each DNA fragment contained identifiable unique adapter sequences; the ligated sample DNA was purified again using magnetic beads; its concentration was measured and the recovery rate calculated; finally, the libraries were loaded onto the sequencer for sequencing.

2.3. Mitochondrial Genome Assembly and Annotation

Initially, a 20 G dataset was extracted from the raw PacBio sequencing data using the ‘-s’ and ‘-p’ parameters of the seqkit software (https://github.com/shenwei356/seqkit accessed on 15 May 2025), followed by de novo assembly using Flye (v2.9.3-b1797) [22]. Subsequently, the assembled scaffold sequences were aligned using BLAST (https://blast.ncbi.nlm.nih.gov/Blast.cgi, accessed on 12 June 2025) to identify the draft mitochondrial genome of I. batatas. To enhance assembly quality, Samtools (v1.13) was employed to calculate coverage depth, and scaffolds with coverage greater than 20× were selected. Finally, a local database of the assembled sequences was constructed using the ‘makeblastdb’ command in BLAST, and conserved contig sequences were identified via BLAST to finalize the assembly. The mitochondrial genome of I. batatas. was annotated using the GeSeq website (https://chlorobox.mpimp-golm.mpg.de/geseq.html, accessed on 12 June 2025), with both the “Protein search identity” and “rRNA, tRNA, DNA search identity” parameters set to 85. Genes with suboptimal annotation results were manually corrected to ensure accuracy and completeness.

2.4. Analysis of Mitochondrial Genome Structural Features

The sequenced mitochondrial genome of I. batatas. along with mitochondrial genome data from seven other known Ipomoea species (that is, I. batatas (OM808941.1), I. biflora (MZ240723.1), I. nil (AP017303.1), I. pes-caprae (MZ240736.1), I. quamoclit (MZ240732.1), I. trifida (LC773296.1), I. triloba (BK059242.1)) were imported sequentially into Vector NTI 8.0 software. The imported sequence files were manipulated in different windows to statistically analyze structural characteristics, including mitochondrial genome size, (G + C) content, and the number of ORFs for each species. Editing sites in protein-coding genes were predicted using the online tool PREPACT (http://www.prepact.de/prepact-main.php, accessed on 18 June 2025), with the parameter threshold set to 0.8. Simple sequence repeats (SSRs), dispersed repeats, and tandem repeats were identified and analyzed using the tools MISA-web (https://webblast.ipk-gatersleben.de/misa/, accessed on 18 June 2025; parameters: 1-10, 2-6, 3-5, 4-5, 5-5, 6-5, and C-100), RepeatMasker (https://www.repeatmasker.org/; accessed on 18 June 2025, parameters: default), and Tandem Repeats Finder (https://tandem.bu.edu/trf/home; accessed on 18 June 2025; parameters: 2-7-7, 50, 500, 2-f-d-m), respectively.

2.5. Codon Usage Bias Analysis of Mitochondrial Protein-Coding Genes

The RSCU (Relative Synonymous Codon Usage) values were statistically estimated using CodonW software to evaluate the usage bias of each synonymous codon. The RSCU value is defined as the ratio of the observed frequency of a codon to its expected frequency within its synonymous codon family [23]. An RSCU value equal to 1 indicates no usage bias for that synonymous codon, meaning its usage frequency is close to the expected frequency. A codon with usage bias exhibits a higher (or lower) observed frequency than expected, resulting in an RSCU value greater than (or less than) 1.

2.6. Analysis of Genetic Variation in the Mitochondrial Genome

Multiple sequence alignment of conserved protein-coding gene sequences from the mitochondrial genomes of eight Ipomoea species was performed using Clustal X software. Genetic variation information, including the non-synonymous substitution rate (Ka) and synonymous substitution rate (Ks) for these conserved protein-coding genes across different species’ mitochondrial genomes, was calculated using KaKs_Calculator 3.0 software. Finally, the Ka/Ks ratio for each conserved protein-coding gene was summarized.

2.7. Phylogenetic Analysis

The multiple sequence alignment results of the conserved protein-coding gene sequences from the mitochondrial genomes of the eight Ipomoea species were imported into MEGA11 software to construct a phylogenetic tree.

3. Results

3.1. Characteristics of Mitochondrial Genomes in Ipomoea Species

The mitochondrial genome sizes of the eight Ipomoea species were observed to range from 106.281 to 296.652 kb, with the largest mitochondrial genome being identified in I. aquatica and the smallest in I. biflora (Table 1). The guanine-plus-cytosine (G + C) content of the mitochondrial genomes across these eight Ipomoea species was found to vary between 43.89% and 44.82%. The highest G + C content was exhibited by I. pes-caprae, while the lowest was recorded in I. triloba. A shared G + C content of 44.25% was detected in both I. quamoclit and I. batatas. An open reading frame (ORF) is defined as a DNA sequence that begins with a start codon and ends with a stop codon (excluding the stop codon itself), possessing the potential to encode a protein. The number of ORFs identified in the mitochondrial genomes of the eight Ipomoea species varied from 298 to 326. The highest number of ORFs was contained in I. quamoclit, whereas I. triloba was found to contain 298 ORFs. A total of 324 ORFs were identified in each of I. biflora and I. nil. Analysis of RNA editing sites, based on the protein-coding gene (PCG) sequences from these eight mitochondrial genomes, revealed that more than 200 RNA editing sites were present in I. quamoclit, I. biflora, I. batatas, and I. trifida. Among these, the highest number of RNA editing sites (223) was possessed by I. trifida. Between 150 and 200 RNA editing sites were identified in I. aquatica, I. nil, and I. triloba. In contrast, the number of RNA editing sites in I. pes-caprae is relatively small, with only 29.

3.2. Characteristics of Protein-Coding Genes in the Mitochondrial Genomes of the Genus Ipomoea

Variations in size and (G + C) content were observed among the mitochondrial genomes of plants within the genus Ipomoea. Differences in the products of protein-coding genes were also identified across these mitochondrial genomes, which were found to encode a total of 17 to 40 proteins (Figure 1). The protein-coding genes identified in the eight Ipomoea mitochondrial genomes were categorized as follows: Complex I genes, specifically NADH dehydrogenase genes (nad1nad7, nad4L, and nad9); a Complex II gene, the membrane anchor subunit 4 (sdh4); a Complex III gene, the cytochrome reductase gene (cob); Complex IV genes, namely cytochrome oxidase genes (cox1cox3); Complex V genes, the ATP synthase genes (atp1, atp4, atp6, atp8, and atp9); Cytochrome C biogenesis genes (ccmB, ccmC, ccmFC1, ccmFC2, and ccmFN); ribosomal protein genes (e.g., rpl, rps); the SecY-dependent transporter gene (mttB); and the intron maturase gene (matR). Among these, the mitochondrial genome of I. trifida was found to contain the highest diversity of protein-coding genes, with 40 types identified. Conversely, the mitochondrial genome of I. pes-caprae contained the fewest, with only 17 types. Variability in protein-coding genes was also noted among the different mitochondrial genomes. Twelve genes, including rpl5, rpl10, rpl16, and rps14, exhibited relatively high variability, with the rps12-14 region being particularly variable. The genes rps12 and rps13 were present in the mitochondrial genomes of all eight Ipomoea species, whereas rps14 was absent in I. pes-caprae and I. quamoclit but present in the other six species. Furthermore, multiple-copy phenomena for specific protein-coding genes were observed within the mitochondrial genomes of different Ipomoea species. For instance, single copies of atp9 and nad3 were identified in the mitochondrial genomes of I. batatas and I. trifida. In contrast, no copies of these genes were detected in the mitochondrial genomes of I. biflora, I. nil, I. pes-caprae, I. quamoclit, and I. triloba.
The amino acid preferences of the protein-coding genes were found to be similar across the mitochondrial genomes of the different Ipomoea species. All protein-coding genes in the mitochondria of every species were observed to encode 20 amino acids, including isoleucine (Ile), methionine (Met), and proline (Pro). However, the Relative Synonymous Codon Usage (RSCU) for the same protein-coding gene differed among the eight Ipomoea mitochondrial genomes. Among the various codons used for the 20 amino acids, GUU (Valine) and GCU (Alanine) were identified as the codons with relatively high usage frequencies. With the exception of I. nil, for which GUU was found to have the highest RSCU value, GCU was identified as the codon with the highest RSCU in each of the remaining Ipomoea species. In contrast, codons with relatively low RSCU values included UAG (Stop), UAC (Tyrosine), CAC (Histidine), CAG (Glutamine), GGC (Glycine), GAC (Aspartic acid), and CGC (Arginine). The RSCU values for these codons ranged from 0.36 to 0.63.

3.3. Analysis of Mitochondrial Genome tRNAs in Ipomoea Species

Among the mitochondrial genomes of the eight Ipomoea species, I. quamoclit was identified as containing the highest diversity of tRNAs, with 27 distinct types; conversely, I. pes-caprae possessed the fewest, with only 11 types. The mitochondrial genomes of I. nil and I. trifida were found to contain an identical number of tRNA types, each comprising 20. With the exception of I. pes-caprae, the mitochondrial genomes of the other seven Ipomoea species were observed to contain trnC, trnF, trnG, trnH, trnK, trnM, trnN, trnS, trnW, and trnY, which were considered relatively conserved tRNA types (Figure 2). In contrast, trnI and trnP exhibited considerable variability among the Ipomoea mitochondrial genomes, with trnP demonstrating the highest degree of variability. Multiple copies of the same tRNA type were also detected across different Ipomoea mitochondrial genomes. Except in I. nil and I. trifida, trnI was present as a single copy in the mitochondrial genomes of the other six Ipomoea species. The trnS gene was identified with three copies in the mitochondrial genomes of I. aquatica, I. biflora, I. quamoclit, and I. triloba, while two copies were observed in I. nil and I. trifida, and only one copy was found in I. pes-caprae. Regarding trnM, the highest copy number was recorded in the mitochondrial genome of I. quamoclit, with four copies. Two copies were detected in I. biflora, and one copy each was identified in I. aquatica and I. triloba. No copies were observed in the remaining species (Figure 2).

3.4. Analysis of Repetitive Sequences in the Mitochondrial Genomes of the Ipomoea Species

The types and quantities of simple sequence repeats (SSRs), interspersed repeats, and tandem repeats present in the mitochondrial genomes of the eight Ipomoea species are illustrated in Figure 3. Three types of SSRs—P1 (mononucleotide), P2 (dinucleotide), and P3 (trinucleotide)—were identified. Among these, P1 was detected in all eight mitochondrial genomes, whereas P3 was exclusively found in the mitochondrial genome of I. nil. Variations in the number of SSRs were observed among the mitochondrial genomes of different species. Although P1 was the most abundant type of SSR in all eight species, the maximum number of P2 was recorded in I. quamoclit (7), followed by I. aquatica (5). In contrast, only one P2 was identified in I. triloba, and no P2 were detected in I. pes-caprae. The remaining four Ipomoea species each contained three P2. The total number of interspersed repeats ranged from 41 in I. triloba to 353 in I. trifida. DNA/MIR elements were exclusively identified in the mitochondrial genomes of I. biflora, I. nil, I. quamoclit, and I. trifida. Similarly, L3/CR1 elements were only detected in I. trifida, I. pes-caprae, and I. biflora. Tandem repeat analysis revealed three length categories: 0–50 bp, 51–100 bp, and 101–200 bp. Tandem repeats within the 0–50 bp were present in all eight species and exhibited the highest relative abundance. The 51–100 bp category was most frequent in I. quamoclit (6), followed by I. trifida (3), I. aquatica (1), and two repeats in each of the remaining five species. In the 101–200 bp category, one repeat was found in I. quamoclit and I. nil, whereas none were detected in I. pes-caprae. The other five species each contained two repeats in this size range (Figure 4).

3.5. Genetic Variation Information in Mitochondrial Genomes of Ipomoea Species

To determine the conservation of protein-coding genes across different Ipomoea species, the ratio of nonsynonymous substitution rate (Ka) to synonymous substitution rate (Ks) was used to analyze the evolutionary rates of protein-coding genes in the mitochondrial genomes of Ipomoea species (Table 2). It was found that atp4, atp8, atp9, ccmFc, ccmFc1, ccmFc2, ccmFn, mttB, nad1, nad2, nad6, NADH, rpl5, rpl10, rps4, rps14, and sdh4 were genes with relatively rapid evolutionary rates, as their Ka/Ks ratios all exceeded 1. This indicated that these genes were subjected to varying degrees of positive selection. In contrast, the remaining 23 protein-coding genes exhibited Ka/Ks ratios below 1 (ranging from 0.7135 to 0.9878), with nad3 identified as the most slowly evolving gene. These findings suggested that these genes were influenced by varying degrees of negative selection (purifying selection). Harmful mutations were eliminated through selection, and protein sequences were conserved, thereby reflecting the evolutionary conservatism of Ipomoea species.

3.6. Phylogenetic Analysis

A maximum likelihood (ML) tree was constructed using the protein-coding gene (PCG) sequences from eight mitochondrial genomes of Ipomoea species, with Cuscuta australis (MN400576.1) of the same family but a different genus designated as the outgroup. The optimal substitution model was determined by ModelFinder to be GY-HKY. Based on 1000 bootstrap replicates, the ML phylogenetic tree revealed that I. quamoclit and I. pes-caprae, I. triloba and I. batatas, as well as I. biflora and I. aquatica, were initially clustered into distinct clades. The phylogenetic relationships between I. quamoclit and I. pes-caprae, and between I. triloba and I. batatas, were shown to be closer than that between I. biflora and I. aquatica. Subsequently, these six Ipomoea species formed a larger clade, which then clustered with I. trifida, and finally grouped with I. nil (Figure 5).

4. Discussion

The plant mitochondrial genome has been established as a significant subject for research in plant phylogenetics and genetic diversity, due to its complex structural characteristics, dynamic evolutionary processes, and central role in energy metabolism and cellular regulation [7,24]. When compared to animal mitochondrial genomes, those of plants were demonstrated to exhibit greater diversity and complexity in terms of size, structure, repetitive sequence composition, and gene arrangement order [9,25]. In the present study, a systematic comparative analysis of the mitochondrial genomes was conducted for the first time across eight species within the genus Ipomoea (including I. batatas). Characteristics of the genus regarding mitochondrial genome structure, codon usage patterns, repetitive sequence distribution, selective pressures, and phylogenetic relationships were revealed, thereby providing a crucial foundation for understanding the evolutionary history and functional genomics of Ipomoea species.
The mitochondrial genome sizes of the eight Ipomoea species were found to range from 106.281 kb (I. biflora) to 296.652 kb (I. aquatica). The mitochondrial genome size of I. batatas was determined to be 257.880 kb, which was considered intermediate within this range. This size variation was largely consistent with the previously reported range for higher plant mitochondrial genomes (200 kb–2.34 Mb) [8], reflecting the high plasticity observed in plant mitochondrial genome size. The GC content of the I. batatas mitochondrial genome was calculated as 44.25%, which was similar to that of other Ipomoea species (43.89–44.82%), indicating a relative conservation in the base composition of mitochondrial genomes within this genus. However, significant interspecific variations were observed in the number of ORFs (298–326) and RNA editing sites (29–223). I. trifida was identified with the highest number of RNA editing sites (223), whereas only 29 sites were detected in I. pes-caprae, suggesting potential significant divergence in mitochondrial gene expression regulatory mechanisms among species [26]. Notably, an identical number of ORFs was not possessed by all Ipomoea mitochondrial genomes, and the functions of the majority of ORFs remain uncharacterized, a phenomenon similar to the uneven ORF distribution reported in Poaceae mitochondrial genomes [24]. Furthermore, a considerable number of RNA editing sites (215) were identified in the I. batatas mitochondrial genome, implying a potentially important role in post-transcriptional regulation, particularly in adaptation to various environmental stresses [15,27,28].
In higher plants, the types of protein-coding genes encoded by the mitochondrial genome are relatively conserved, but their number, location, and arrangement order often vary among different species, and even among different varieties of the same species [29]. The mitochondrial genomes of the Ipomoea species were found to encode 17 to 40 types of proteins. The highest number of protein types (40) was encoded by I. trifida, while the lowest (17) was encoded by I. pes-caprae. I. batatas was found to share most core protein-coding genes with the other species, including genes associated with complexes I–V, cytochrome c synthase genes (the ccm family), and ribosomal protein genes (rpl, rps). However, the absence of certain genes, such as rps14, was noted in I. pes-caprae and I. quamoclit, whereas this gene was present in the other six species, indicating a potential specific loss during the evolution of the genus. Similar phenomena have been reported in Poaceae, where rps14 also exhibits high variability in some species [24]. Regarding gene copy number, single copies of atp9 and nad3 were identified in I. batatas and I. trifida, but these genes were completely absent in other species, suggesting independent gene duplication or loss events in certain lineages of Ipomoea. Such variations in gene copy number might be attributed to mitochondrial genome recombination or horizontal gene transfer [13,30,31], and could also be associated with species’ adaptive evolution. Analysis of codon usage bias revealed a general preference for A/U-ending codons, such as GUU (Val) and GCU (Ala), in the protein-coding genes of I. batatas and other Ipomoea species. GUU was the most preferred codon in I. nil, whereas GCU was the most preferred in the other species. Codons with low usage frequency included UAG (stop codon), UAC (Tyr), and CAC (His), with RSCU values ranging from 0.36 to 0.63. This codon usage bias was consistent with the A/T preference observed in other plants [24,32,33], possibly related to the high AT content of mitochondrial genomes, and reflecting optimization for translational efficiency and accuracy during evolution [23].
A high degree of diversity was observed in tRNA genes among the Ipomoea species. The greatest variety of tRNAs (27 types) was found in I. quamoclit, while the least (11 types) was found in I. pes-caprae. The I. batatas mitochondrial genome contained 20 types of tRNA, identical to I. nil and I. trifida. Ten tRNA types, including trnC, trnF, and trnG, were identified as the most conserved, being present in all species except I. pes-caprae, indicating their core functional role in the mitochondrial translation system of Ipomoea. In contrast, the distribution of trnI and trnP was highly variable; trnP, in particular, was present only in a few species, suggesting potential functional loss or replacement in specific lineages. Concerning copy number variation, trnS was present in three copies in I. aquatica and I. biflora, two copies in I. nil and I. trifida, and only one copy in I. pes-caprae. The highest copy number of trnM (4 copies) was observed in I. quamoclit, with fewer copies or absence in other species. These differences in tRNA copy number might influence mitochondrial translation efficiency, thereby affecting metabolic adaptation [34,35]. Repetitive sequences are considered major drivers of recombination and structural variation in plant mitochondrial genomes [8,36]. Three types of simple sequence repeats (SSRs) were identified in the Ipomoea species. P1 (mononucleotide repeats) were present in all species, while P3 (trinucleotide repeats) were detected only in I. nil. The total number of dispersed repeats was lowest in I. triloba (41) and highest in I. trifida (353). Certain types, such as DNA/MIRs and L3/CR1, were found only in specific species, indicating their potential involvement in lineage-specific genome reorganization events. Analysis of tandem repeats revealed that repeats in the 0–50 bp range were present in all species and were the most numerous. Repeats in the 51–100 bp and 101–200 bp ranges were most abundant in I. quamoclit. These long-fragment repeats might promote genome structure diversification through homologous recombination, consequently affecting gene arrangement and function [37]. Compared to Poaceae, the types of repetitive sequences in Ipomoea were relatively simplified, yet their role in genome structure evolution should not be overlooked. The evolutionary rates of mitochondrial protein-coding genes in Ipomoea species were assessed using the Ka/Ks ratio. The results indicated that Ka/Ks values were greater than 1 for 17 genes (atp4, atp8, atp9, ccmFc, mttB, nad1, nad2, nad6, rpl5, rpl10, rps4, rps14, sdh4), suggesting that these genes were under positive selection and may have played important roles in the adaptation of Ipomoea species to different ecological environments. Notably, the Ka/Ks values for atp9 and nad6 were 1.3230 and 1.3329, respectively, indicating they may have undergone adaptive evolution related to energy metabolism. The Ka/Ks values for the remaining 23 genes were all less than 1, with the lowest value (0.7135) observed for nad3, indicating this gene was under strong purifying selection, likely reflecting high functional conservation within the NADH dehydrogenase complex. This finding is consistent with the conclusion that most protein-coding genes in Poaceae are under purifying selection [24], underscoring the stability of core mitochondrial functional genes during evolution.
A maximum likelihood phylogenetic tree was constructed based on conserved mitochondrial protein-coding genes from the eight Ipomoea species. The tree revealed three distinct subclades: I. quamoclit and I. pes-caprae formed one subclade; I. triloba and I. batatas formed a second; and I. biflora and I. aquatica formed a third. The first two subclades were more closely related to each other. I. trifida and I. nil were positioned at the basal nodes of the tree, suggesting they may represent earlier diverging lineages within the genus. The phylogenetic relationships established in this study provide evidence at the mitochondrial genome level for the origin and dispersal of Ipomoea species and lay a foundation for future species identification and phylogenetic analysis using mitochondrial markers. Integrating analyses with chloroplast and nuclear genomic data in the future will help to further elucidate the evolutionary history of Ipomoea. In summary, this study represents the first systematic comparative analysis of the mitochondrial genomes of eight Ipomoea species, including the important crop I. batatas. Characteristics related to genome structure, gene content, repetitive sequences, codon usage, and selective pressure were revealed. The findings not only enrich the mitochondrial genome database for Convolvulaceae plants but also provide a theoretical basis for the evaluation of I. batatas germplasm resources, cultivar breeding, and evolutionary biology research.

5. Conclusions

Based on a comprehensive comparative analysis of the mitochondrial genomes of eight Ipomoea species, this study reveals significant interspecific variation in genome size, structure, gene content, and repetitive sequences. Key findings include a wide range of mitochondrial genome sizes (106–297 kb), divergent numbers of protein-coding genes (17–40) and tRNA types (11–27), and distinct patterns of codon usage bias and RNA editing. Evolutionary rate analysis (Ka/Ks) identified 17 protein-coding genes under positive selection, suggesting their role in adaptive evolution. Phylogenetic reconstruction provided robust insights into the evolutionary relationships within the genus. These results substantially enhance our understanding of mitochondrial genome evolution in Ipomoea and offer a valuable genomic foundation for future phylogenetic, functional, and breeding studies in this economically important genus.

Author Contributions

M.X.: Formal analysis; Software; Roles/Writing—original draft; C.Z.: Visualization; W.H. and Y.L.: Writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare that they have no competing interests.

References

  1. Yan, M.; Li, M.; Wang, Y.; Wang, X.; Moeinzadeh, M.; Quispe-Huamanquispe, D.; Fan, W.; Fang, Y.; Wang, Y.; Nie, H. Haplotype-based phylogenetic analysis and population genomics uncover the origin and domestication of sweetpotato. Mol. Plant 2024, 17, 277–296. [Google Scholar] [CrossRef]
  2. Austin, D. The Ipomoea batatas complex-I. taxonomy. Bull. Torrey Bot. Club 1978, 105, 114–129. [Google Scholar] [CrossRef]
  3. Shen, S.; Xu, G.; Li, D.; Clements, D.; Jin, G.; Liu, S.; Yang, Y.; Chen, A.; Zhang, F.; Kato-Noguchi, H. Allelopathic potential of sweet potato (Ipomoea batatas) germplasm resources of Yunnan Province in southwest China. Acta Ecol. Sin. 2018, 38, 444–449. [Google Scholar] [CrossRef]
  4. Zheng, W.; Clifford, M. Profiling the chlorogenic acids of sweet potato (Ipomoea batatas) from China. Food Chem. 2008, 106, 147–152. [Google Scholar] [CrossRef]
  5. Unseld, M.; Marienfeld, J.; Brandt, P.; Brennicke, A. The mitochondrial genome of Arabidopsis thaliana contains 57 genes in 366,924 nucleotides. Nat. Genet. 1997, 15, 57–61. [Google Scholar] [CrossRef] [PubMed]
  6. Sheng, W.; Kuang, Q.; Zhang, Q.; Chai, X.; Zhang, Y. The characteristic analysis of 560 plant mitochondrial genome registered in NCBI data. Mol. Plant Breed. 2023, 6, 1–9. [Google Scholar]
  7. Gualberto, J.; Mileshina, D.; Wallet, C.; Niazi, A.; Weber-Lotfi, F.; Dietrich, A. The plant mitochondrial genome: Dynamics and maintenance. Biochimie 2014, 100, 107–120. [Google Scholar] [CrossRef]
  8. Gualberto, J.; Newton, K. Plant mitochondrial genomes: Dynamics and mechanisms of mutation. Annu. Rev. Plant Biol. 2017, 68, 225–252. [Google Scholar] [CrossRef]
  9. Alverson, A.J.; Wei, X.; Rice, D.W.; Stern, D.B.; Barry, K.; Palmer, J.D. Insights into the evolution of mitochondrial genome size from complete sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae). Mol. Biol. Evol. 2010, 27, 1436–1448. [Google Scholar] [CrossRef]
  10. Xu, W. The Mitochondrial Genome of Cistanche genus in China. Ph.D. Thesis, Peking Union Medical College, Beijing, China, 2020. [Google Scholar]
  11. Moller, I.M.; Rasmusson, A.G.; Van Aken, O. Plant mitochondria—Past, present and future. Plant J. 2021, 108, 912–959. [Google Scholar] [CrossRef]
  12. Robles, P.; Quesada, V. Organelle genetics in plants 2.0. Int. J. Mol. Sci. 2023, 24, 12128. [Google Scholar] [CrossRef]
  13. Xiong, A.; Peng, R.; Zhuang, J.; Gao, F.; Zhu, B.; Fu, X.; Xue, Y.; Jin, X.; Tian, Y.; Zhao, W.; et al. Gene duplication and transfer events in plant mitochondria genome. Biochem. Biophys. Res. Commun. 2008, 376, 1–4. [Google Scholar] [CrossRef] [PubMed]
  14. Szandar, K.; Krawczyk, K.; Myszczynski, K.; Slipiko, M.; Sawicki, J.; Szczecinska, M. Breaking the limits-multichromosomal structure of an early eudicot Pulsatilla patens mitogenome reveals extensive RNA-editing, longest repeats and chloroplast derived regions among sequenced land plant mitogenomes. BMC Plant Biol. 2022, 22, 109. [Google Scholar] [CrossRef] [PubMed]
  15. Morley, S.; Nielsen, B. Plant mitochondrial DNA. Front. Biosci.-Landmark 2017, 22, 1023–1032. [Google Scholar] [CrossRef]
  16. Mower, J. Variation in protein gene and intron content among land plant mitogenomes. Mitochondrion 2020, 53, 203–213. [Google Scholar] [CrossRef]
  17. Mower, J.P.; Sloan, D.B.; Alverson, A.J. Plant Mitochondrial Genome Diversity: The Genomics Revolution; Springer: Vienna, Austria, 2012; pp. 123–144. [Google Scholar]
  18. Zhou, M.; Li, X. Analysis of synonymous codon usage patterns in different plant mitochondrial genomes. Mol. Biol. Rep. 2009, 36, 2039–2046. [Google Scholar] [CrossRef]
  19. Yang, J.; Ling, C.C.; Zhang, H.M.; Hussain, Q.; Lyu, S.H.; Zheng, G.H.; Liu, Y.S. A comparative genomics approach for analysis of complete mitogenomes of five Actinidiaceae plants. Genes 2022, 13, 1827. [Google Scholar] [CrossRef]
  20. Zervas, A.; Petersen, G.; Seberg, O. Mitochondrial genome evolution in parasitic plants. BMC Evol. Biol. 2019, 19, 87. [Google Scholar] [CrossRef]
  21. Schenk, J.; Becklund, L.; Carey, S.; Fabre, P. What is the “modified” CTAB protocol? Characterizing modifications to the CTAB DNA extraction protocol. Appl. Plant Sci. 2023, 11, e11517. [Google Scholar] [CrossRef]
  22. Kolmogorov, M.; Yuan, J.; Lin, Y.; Pevzner, P.A. Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol. 2019, 37, 540–546. [Google Scholar] [CrossRef]
  23. Mou, Y.; Wu, R.; Wang, Y. Patterns and influencing factors of synonymous codon usage in porcine circovirus. J. Anhui Agric. Sci. 2013, 41, 149–151. [Google Scholar] [CrossRef]
  24. Yang, S.; Liu, J.; Jiang, S.; Guo, B.; Guo, J.; Cao, L.; Zhang, W. Mitochondrial genomic characteristics and phylogenetic analysis of 21 species of poaceae. Genom. Appl. Biol. 2024, 43, 1533–1547. [Google Scholar]
  25. Wu, Z.Q.; Liao, X.Z.; Zhang, X.N.; Tembrock, L.R.; Broz, A. Genomic architectural variation of plant mitochondria—A review of multichromosomal structuring. J. Syst. Evol. 2022, 60, 160–168. [Google Scholar] [CrossRef]
  26. Small, I.; Melonek, J.; Bohne, A.; Nickelsen, J.; Schmitz-Linneweber, C. Plant organellar RNA maturation. Plant Cell 2023, 35, 1727–1751. [Google Scholar] [CrossRef]
  27. Sun, F.; Wang, C.; Lin, Y. Sweet potato gene editing and breeding technologies: Research progress and future prospects. Adv. Resour. Res. 2025, 5, 852–871. [Google Scholar] [CrossRef]
  28. Doré, G.; Barloy, D.; Barloy-Hubler, F. De novo hybrid assembly unveils multi-chromosomal mitochondrial genomes in Ludwigia species, highlighting genomic recombination, gene transfer, and RNA editing events. Int. J. Mol. Sci. 2024, 25, 7283. [Google Scholar] [CrossRef]
  29. Chen, B.; Li, X.; Tian, Y.; Xing, D.; Su, X.; Guan, C. Comparison and phylogenetic analysis of mitochondrial genome among Lycium barbarum and other Solanaceae plants. Genom. Appl. Biol. 2024, 43, 45–62. [Google Scholar] [CrossRef]
  30. Mower, J.P.; Stefanović, S.; Hao, W.; Gummow, J.S.; Jain, K.; Ahmed, D.; Palmer, J.D. Horizontal acquisition of multiple mitochondrial genes from a parasitic plant followed by gene conversion with host mitochondrial genes. BMC Biol. 2010, 8, 150. [Google Scholar] [CrossRef]
  31. Hastings, P.J.; Lupski, J.R.; Rosenberg, S.M.; Ira, G. Mechanisms of change in gene copy number. Nat. Rev. Genet. 2009, 10, 551–564. [Google Scholar] [CrossRef]
  32. Wang, B.; Yuan, J.; Liu, J.; Liang, J.; Chen, J.Q. Codon usage bias and determining forces in green plant mitochondrial genomes. J. Integr. Plant Biol. 2011, 54, 324–334. [Google Scholar] [CrossRef]
  33. Zhang, K.; Wang, Y.; Zhang, Y.; Shan, X. Codon usage characterization and phylogenetic analysis of the mitochondrial genome in Hemerocallis citrina. BMC Genom. Data 2024, 25, 6. [Google Scholar] [CrossRef]
  34. Du, M.Z.; Wei, W.; Qin, L.; Liu, S.; Zhang, A.Y.; Zhang, Y.; Zhou, H.; Guo, F.B. Co-adaption of tRNA gene copy number and amino acid usage influences translation rates in three life domains. DNA Res. 2017, 24, 623–633. [Google Scholar] [CrossRef] [PubMed]
  35. Victor, M.P.; Acharya, D.; Chakraborty, S.; Ghosh, T.C. The combined influence of codon composition and tRNA copy number regulates translational efficiency by influencing synonymous nucleotide substitution. Gene 2020, 745, 144640. [Google Scholar] [CrossRef]
  36. Backert, S.; Nielsen, B.L.; Börner, T. The mystery of the rings: Structure and replication of mitochondrial genomes from higher plants. Trends Plant Sci. 1997, 2, 477–483. [Google Scholar] [CrossRef]
  37. Nardi, F.; Carapelli, A.; Frati, F. Repeated regions in mitochondrial genomes: Distribution, origin and evolutionary significance. Mitochondrion 2012, 12, 483–491. [Google Scholar] [CrossRef]
Figure 1. The composition of mitochondrial protein-coding genes of the eight Ipomoea mitochondrion.
Figure 1. The composition of mitochondrial protein-coding genes of the eight Ipomoea mitochondrion.
Biology 14 01696 g001
Figure 2. The information of tRNAs in the eight Ipomoea mitochondrion.
Figure 2. The information of tRNAs in the eight Ipomoea mitochondrion.
Biology 14 01696 g002
Figure 3. Composition of SSR types in the eight Ipomoea mitochondrion.
Figure 3. Composition of SSR types in the eight Ipomoea mitochondrion.
Biology 14 01696 g003
Figure 4. Composition of the number of repeat sequence in the eight Ipomoea mitochondrion.
Figure 4. Composition of the number of repeat sequence in the eight Ipomoea mitochondrion.
Biology 14 01696 g004
Figure 5. The phylogenetic tree of eight Ipomoea mitochondrion based on the protein-coding genes.
Figure 5. The phylogenetic tree of eight Ipomoea mitochondrion based on the protein-coding genes.
Biology 14 01696 g005
Table 1. Basic characteristics of the eight Ipomoea mitochondrial genomes.
Table 1. Basic characteristics of the eight Ipomoea mitochondrial genomes.
SpeciesMitogenome Size/kbG + C Content/%Number of ORFNumber of RNA Editing Site
Ipomoea aquatica296.65244.13315162
Ipomoea quamoclit293.93944.25326206
Ipomoea biflora278.12144.16324207
Ipomoea batatas257.88044.25306215
Ipomoea nil265.76844.45324182
Ipomoea trifida264.69844.15310223
Ipomoea triloba160.30243.89298165
Ipomoea pes-caprae106.28144.8230529
Table 2. Evolutionary rate analysis of PCGs in the eight Ipomoea mitochondrion.
Table 2. Evolutionary rate analysis of PCGs in the eight Ipomoea mitochondrion.
GenesKaKsKa/Ks
atp10.97531.07880.9041
atp41.03030.90261.1414
atp60.96641.12450.8594
atp81.00800.96941.0398
atp91.07370.81161.3230
ccmB0.97371.08370.8985
ccmC0.97551.06800.9134
ccmFc1.00480.98281.0223
ccmFc11.02730.93301.1011
ccmFc21.00550.97941.0266
ccmFn1.01680.94481.0762
Cob0.99031.03350.9582
cox10.96161.11560.8619
cox20.96021.13750.8441
cox30.95541.13140.8444
matR0.98431.04880.9385
mttB1.04480.86941.2018
nad11.00680.98041.0268
nad21.00920.97211.0381
nad30.91601.28340.7135
nad40.98871.04010.9506
nad4l0.95801.14350.8378
nad50.98801.04270.9475
nad61.06720.80071.3329
nad70.93401.27970.7299
nad90.97571.09660.8897
rpl51.01870.94211.0813
rpl101.03350.88751.1644
rpl160.92171.28300.7184
rps10.98131.05520.9301
rps30.99731.00960.9878
rps41.001360.99481.0066
rps70.95921.14530.8374
rps100.96771.12240.8622
rps120.99161.03280.9601
rps130.99691.01470.9825
rps141.02880.89431.1504
rps190.94491.26180.7489
sdh41.01790.94511.0770
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xiao, M.; Zhang, C.; Hou, W.; Li, Y. Comprehensive Analysis of Mitochondrial Genomic Characteristics and Phylogenetic Analysis of the Plant Genus Ipomoea (Convolvulaceae) Species. Biology 2025, 14, 1696. https://doi.org/10.3390/biology14121696

AMA Style

Xiao M, Zhang C, Hou W, Li Y. Comprehensive Analysis of Mitochondrial Genomic Characteristics and Phylogenetic Analysis of the Plant Genus Ipomoea (Convolvulaceae) Species. Biology. 2025; 14(12):1696. https://doi.org/10.3390/biology14121696

Chicago/Turabian Style

Xiao, Mengya, Cheng Zhang, Wenbang Hou, and Youjun Li. 2025. "Comprehensive Analysis of Mitochondrial Genomic Characteristics and Phylogenetic Analysis of the Plant Genus Ipomoea (Convolvulaceae) Species" Biology 14, no. 12: 1696. https://doi.org/10.3390/biology14121696

APA Style

Xiao, M., Zhang, C., Hou, W., & Li, Y. (2025). Comprehensive Analysis of Mitochondrial Genomic Characteristics and Phylogenetic Analysis of the Plant Genus Ipomoea (Convolvulaceae) Species. Biology, 14(12), 1696. https://doi.org/10.3390/biology14121696

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop