Next Article in Journal
Divergent Metabolic Changes in Rhizomes of Lowland and Upland Switchgrass (Panicum virgatum) from Early Season through Dormancy Onset
Next Article in Special Issue
Dynamic Translational Landscape Revealed by Genome-Wide Ribosome Profiling under Drought and Heat Stress in Potato
Previous Article in Journal
Low-Cost Hyperspectral Imaging to Detect Drought Stress in High-Throughput Phenotyping
Previous Article in Special Issue
Analysis of Potato Physiological and Molecular Adaptation in Response to Different Water and Nitrogen Combined Regimes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Comparative Analysis of the R2R3-MYB Gene Family in Six Ipomoea Species and the Identification of Anthocyanin-Related Members in Sweet Potatoes

1
College of Agronomy and Biotechnology, Yunnan Agricultural University, Kunming 650201, China
2
Yunnan Engineering Research Center of Tuber and Root Crop Bio-Breeding and Healthy Seed Propagation, Yunnan Agricultural University, Kunming 650201, China
*
Author to whom correspondence should be addressed.
Plants 2023, 12(8), 1731; https://doi.org/10.3390/plants12081731
Submission received: 16 March 2023 / Revised: 17 April 2023 / Accepted: 19 April 2023 / Published: 21 April 2023

Abstract

:
Sweet potatoes (Ipomoea batatas) are one of the important tuberous root crops cultivated worldwide, and thier storage roots are rich in antioxidants, such as anthocyanins. R2R3-MYB is a large gene family involved in various biological processes, including anthocyanin biosynthesis. However, few reports about the R2R3-MYB gene family of sweet potatoes have been released to date. In the present study, a total of 695 typical R2R3-MYB genes were identified in six Ipomoea species, including 131 R2R3-MYB genes in sweet potatoes. A maximum likelihood phylogenetic analysis divided these genes into 36 clades, referring to the classification of 126 R2R3-MYB proteins of Arabidopsis. Clade C25(S12) has no members in six Ipomoea species, whereas four clades (i.e., clade C21, C26, C30, and C36), including 102 members, had no members in Arabidopsis, and they were identified as Ipomoea-specific clades. The identified R2R3-MYB genes were unevenly distributed on all chromosomes in six Ipomoea species genomes, and the collinearity analysis among hexaploid I. batatas and another five diploid Ipomoea species suggested that the sweet potato genome might have undergone a larger chromosome rearrangement during the evolution process. Further analyses of gene duplication events showed that whole-genome duplication, transposed duplication, and dispersed duplication events were the primary forces driving the R2R3-MYB gene family expansion of Ipomoea plants, and these duplicated genes experienced strong purifying selection because of their Ka/Ks ratio, which is less than 1. Additionally, the genomic sequence length of 131 IbR2R3-MYBs varied from 923 bp to ~12.9 kb with a mean of ~2.6 kb, and most of them had more than three exons. The Motif 1, 2, 3, and 4 formed typical R2 and R3 domains and were identified in all IbR2R3-MYB proteins. Finally, based on multiple RNA-seq datasets, two IbR2R3-MYB genes (IbMYB1/g17138.t1 and IbMYB113/g17108.t1) were relatively highly expressed in pigmented leaves and tuberous root flesh and skin, respectively; thus, they were identified to regulate tissue-specific anthocyanin accumulation in sweet potato. This study provides a basis for the evolution and function of the R2R3-MYB gene family in sweet potatoes and five other Ipomoea species.

1. Introduction

Sweet potato (Ipomoea batatas (L.) Lam., 2n = 6x = 90), of the genus Ipomoea from the Convolvulaceae family, is an important tuberous root crop cultivated globally [1,2]. Currently, China is the largest sweet potato cultivation and production region; China accounts for approximately 54% of the world’s production and more than 29% of the global area [3]. Sweet potato storage roots are rich in carbohydrates, dietary fibers, minerals, vitamins, and various antioxidants, such as carotenoids and anthocyanins, which are beneficial to human health [4]. Anthocyanins, belonging to flavonoids metabolites, are natural water-soluble pigments accumulated in vacuoles that are responsible for the red, purple, and blue coloration of plant tissues [5]. Anthocyanins are distributed among flowers, leaves, stems, and storage roots of sweet potatoes, such as the storage roots of purple-fleshed sweet potatoes, which are rich in anthocyanins [6,7,8]. Sweet potato tissue coloration involves complex biochemical changes due to genetic and environmental factors, and anthocyanin accumulation in different tissues exhibits no correlation. Hence, understanding the genetic factors controlling anthocyanin biosynthesis and accumulation is valuable for sweet potato breeding and germplasm resource utilization.
V-myb avian myeloblastosis viral oncogene homolog (MYB) proteins are the second largest transcription factor (TF) families in higher plants [9,10]. MYB TFs are composed of a highly conserved DNA-binding domain repeat (R) at the N-terminal and a relatively variable regulatory region at the C-terminal. Each MYB repeat (R) harbors approximately 52 amino acid residues, among which 3 regularly spaced tryptophan (W) residues form 3 α-helices [11]. Typically, according to the MYB repeats sequence similarity and repeat number, plant MYB proteins contain one to four imperfect MYB repeat(s) and are, therefore, classified as 1R-MYBs (R3-MYBs and MYB-related), 2R-MYBs (R2R3-MYBs), 3R-MYBs (R1R2R3-MYBs), and 4R-MYBs (R1R2R2R1/2-MYBs) [9,12,13]. The R2R3-MYB gene family members have expanded substantially during plant evolution and constitute the predominant family [10], which contributes to R2R3-MYB becoming an important research focus.
Plant R2R3-MYB genes perform crucial roles in various biological processes, including plant growth and development, cell differentiation, specialized secondary metabolism, and biotic and abiotic stress responses [14,15]. In particular, many R2R3-MYB genes have been thoroughly investigated and verified to regulate tissue-specific anthocyanin accumulation in various plants [16,17,18]. For example, the first R3R3-MYB plant gene, COLORED1 (C1), which was isolated from maize, encodes a regulatory protein to control anthocyanin pigment production in the aleurone layer of the endosperm [19,20]. In Arabidopsis, the overexpression of AtMYB75/PAP1 or AtMYB90/PAP2 produces enhanced anthocyanin pigmentation throughout the entire plant [21]. In addition, the overexpression of AtMYB113 or AtMYB114 results in substantial increases in pigment production in vegetative tissues [22]. In tomatoes, the dominant Anthocyanin fruit (Aft) locus encodes the R2R3-MYB TF named SlAN2-like, which acts as a functional anthocyanin activator that is responsible for light-dependent anthocyanin accumulation in the fruit skin of the purple tomato variety ‘Indigo Rose’ [23]. In apples, three R2R3-MYB TFs, namely, MdMYB1, MdMYBA, and MdMYB10, are the key regulators in apple fruit anthocyanin accumulation and coloration [24,25,26]. In carrots, two R2R3-MYB genes, namely, DcMYB7 and DcMYB113, perform important roles in regulating anthocyanin biosynthesis, of which, DcMYB7 has a high expression level in carrot root and petiole [27], but DcMYB113 is specifically expressed in carrot root [28]. In addition, some R2R3-MYB repressors were also reported, such as PhMYB27 in petunias [29], FaMYB1 in strawberries [30], and StMYB44 in potatoes [31]. These studies summarized that R2R3-MYB TFs are the essential regulators for anthocyanin biosynthesis in plants and that S6 (subgroup 6) R2R3-MYB TFs are conserved as anthocyanin activators in many plants referencing the Arabidopsis R2R3-MYB gene family [9].
The anthocyanin biosynthetic pathway is highly conserved and has been well-studied in many plants [22,32]. Several key enzyme-coding genes, including IbCHS, IbCHI, IbF3’H, IbDFR, IbANS, Ib3GGT, and IbGSTF4, have been linked to anthocyanin biosynthesis and accumulation within sweet potatoes [33,34,35,36,37,38]. An R2R3-MYB gene, IbMYB1-2, which is specifically highly expressed in purple-fleshed sweet potato storage root flesh, is considered the main factor for activating the anthocyanin biosynthesis and accumulation in tuberous roots [6,39,40,41]. IbMYB44, the R2R3-MYB repressor, competitively inhibits the IbMYB340-IbbHLH2-IbNAC56a/b complex’s formation to affect anthocyanin biosynthesis in purple-fleshed sweet potatoes [42]. Moreover, it was reported that multiple MYB TFs (i.e., activators IbMYB1/2/3, repressors IbMYB27, IbMYBx, and IbMYB4a/b/c) co-regulate the accumulation of anthocyanins in sweet potato leaves [43]. Despite preliminary exploration, the underlying transcriptional regulatory mechanism of R2R3-MYBs in the anthocyanin metabolism pathway has not been systematically investigated in sweet potatoes.
In recent decades, the increasing availability of plant genome sequencing data has made comprehensively characterizing the genome-wide R2R3-MYB gene family in the plant kingdom successful. Currently, this gene family has been identified and characterized in more than 75 plant species [13,14], including Arabidopsis (Arabidopsis thaliana) [9], rice (Oryza sativa) [44], potato (Solanum tuberosum) [45], tomato (Solanum lycopersicum) [46], wolfberry (Lycium barbarum) [47], and pea (Pisum sativum) [48]. Nevertheless, little is known about the information of the genome-wide R2R3-MYB gene family in sweet potatoes. Moreover, there are no reports about the comparative analysis of R2R3-MYB genes in the genus Ipomoea, which includes approximately 500 species and possesses the largest number of species in the Convolvulaceae family [49]. In recent years, the chromosome-level genomes of six Ipomoea species, including the hexaploid species Ipomoea batatas, and five other diploid species, including Ipomoea trifida, Ipomoea triloba, Ipomoea nil, Ipomoea purpurea, and Ipomoea aquatica, have been sequenced, assembled, annotated, and released [50,51,52,53,54]. The genome information on these wild diploid relatives has been used in genome-enabled breeding and the research of sweet potatoes. Therefore, these genomes sequence data are informative for a comparative analysis of the R2R3-MYB TF gene family among the Ipomoea species.
In this study, a genome-wide comparative analysis of the R2R3-MYB gene family was performed. First, a total of 695 R2R3-MYB genes were identified in 6 Ipomoea species. Next, the protein physicochemical properties, phylogenetic relationships, chromosome localization, syntenic links, gene duplication events, gene structures, and conserved motifs of identified R2R3-MYB genes were analyzed. The expression patterns of anthocyanin-related IbR2R3-MYBs in sweet potatoes with different colored tissues were also investigated based on public RNA-seq data. These findings will provide valuable information for further studies of the evolution and function of R2R3-MYB gene families in Ipomoea species.

2. Results

2.1. Identification and Characterization of R2R3-MYB Family Members in Six Ipomoea Species

Three strategies (i.e., BLASTP search, HMM search, and SMART database validation) were used to fully identify R2R3-MYB TFs in six Ipomoea species. A total of 695 typical R2R3-MYB TFs were identified, including 131 in sweet potato, 133 in I. trifida, 129 in I. triloba, 127 in I. nil, 124 in I. aquatica, and 51 in I. purpurea (Table 1). Next, to further characterize all R2R3-MYB proteins features, the physiochemical properties, including protein length, molecular weight (MW), isoelectric point (pI), instability index, aliphatic index, and grand average of hydropathicity (GRAVY), were analyzed (Figure 1) and the subcellular localization of proteins was also predicted (Table S1). These results showed that amino acid numbers in the 6 Ipomoea R2R3-MYB proteins ranged from 119 to 608, with an average of 318 amino acids (Figure 1a). The MW varied from 13.71 to 66.40 kDa, and itb04g32410.t2 had the longest length of 608 amino acids and the highest molecular weight of 66.40 kDa, while INIL06g37520.t1 had the shortest length of 119 amino acids and the lowest molecular weight of 13.71 kDa (Figure 1b). The pI ranged between 4.59 (g29760.t1) and 10.98 (INIL00g10723.t1), among which, 50.8% (353/695) of proteins are acidic proteins with a pI that is lower than 7, and the rest are basic proteins with a pI that is larger than 7 (Figure 1c). The instability index of 3.0% (221/695) of the proteins was less than 40, while the rest exhibited more than 40, suggesting that they are unstable proteins (Figure 1d). The aliphatic index was distributed from 44.45 (g48266.t1) to 100.16 (Ipurp_gene18717) (Figure 1e). The GRAVY of all proteins was less than −0.1, indicating that all R2R3-MYB proteins had negative hydrophobicity (Figure 1f). Moreover, the prediction of subcellular localization showed that all R2R3-MYB proteins were localized in the nucleus (Table S1), suggesting their critical role in regulatory functions.

2.2. Phylogenetic Analysis and Classification of R2R3-MYB Genes in Six Ipomoea Species

To investigate the evolutionary relationships and gene functions of R2R3-MYB genes in 6 Ipomoea species, a rooted maximum likelihood (ML) phylogenetic tree consisting of 126 members from Arabidopsis and 695 members from 6 Ipomoea was constructed via MAFFT software for multiple sequence alignment and IQ-tree software was used for phylogenetic tree clustering. According to the topology of the ML phylogenetic tree and referring to the classification of AtR2R3-MYB family proteins [9,14], all R2R3-MYB proteins were subdivided into 36 clades (named C1-C36) (Figure 2 and Table S2), except for 4 AtR2R3-MYB proteins (i.e., AtMYB123, AtMYB39, AtMYB47, and AtMYB95). Clade C25 had six members only in Arabidopsis and not in Ipomoea species, and they were Arabidopsis-specific clades. While clades C21, C26, C30, and C36 had no members in Arabidopsis, they were identified as Ipomoea-specific clades. However, there were no sweet potato-specific clades. Clade C29 had the highest number of R2R3-MYB genes, containing 9, 11, 10, 9, and 12 genes in sweet potatoes, I. trifida, I. triloba, I. nil, and I. aquatica, respectively. Meanwhile, some clades (i.e., C8, C22, and C30) had only one member for each Ipomoea species (Table 2). These results showed that the number of R2R3-MYB proteins among different clades was highly variable, suggesting R2R3-MYB gene diversity in Ipomoea species. Meanwhile, the number of R2R3-MYB members of sweet potatoes in each clade was almost similar with other Ipomoea species but different with Arabidopsis, such as C3–C6, suggesting R2R3-MYB gene conservation within the same clade of Ipomoea species.
Research progress related to R2R3-MYB gene function has shown that the members grouped in the same clade might exhibit a similar biological function, and these R2R3-MYB members are involved as activators or repressors in transcriptional regulations [14]. The C24(S6), C19(S4), C20(S7), and C23(S44) clades are known to participate in phenylpropanoid-derived flavonoids biosynthesis and the metabolism pathway, including the regulation of the anthocyanin’s biosynthesis pathway. For example, C24 clade members containing AtMYB75 (PAP1), ATMYB90 (PAP2), AtMYB113, and AtMYB114, were proven to positively regulate anthocyanin biosynthesis [21,22], indicating that these twenty-six R2R3-MYB TFs (i.e., 4, 6, 5, 6, and 5 in sweet potatoes, I. trifida, I. triloba, I. nil, and I. aquatica, respectively) may control anthocyanin biosynthesis and metabolism in Ipomoea plants.

2.3. Chromosomal Distribution and Synteny Analysis of R2R3-MYBs in Six Ipomoea Species

Based on the corresponding chromosome-level genome sequence and annotation information, the 131, 128, 128, 123, 124, and 51 R2R3-MYB genes were mapped throughout the 15 chromosomes of I. batatas, I. trifida, I. triloba, I. nil, I. aquatica, and I. purpurea, respectively (Figure 3). The remaining genes (5 in I. trifida, 1 in I. triloba, 4 in I. nil, and 15 in I. aquatica) were located on unanchored scaffolds and have not been mapped (Table S3). Chromosome numbering was assigned and named “IbChr”, “ItfChr”, “ItbChr”, “InChr”, “IaChr”, and “IpChr”. The chromosomal location results showed that the R2R3-MYB genes were distributed on all chromosomes in six Ipomoea species’ genomes, but the distribution appeared to be uneven. For the I. batatas genome, IbChr12 had the largest number of IbR2R3-MYB genes (n = 24), while IbChr06 had the smallest number (n = 3). Among the other chromosomes, 14 IbR2R3-MYBs were detected in IbChr05, 12 in IbChr07, 11 in IbChr02, 10 in IbChr09, 9 in IbChr08, 8 in IbChr11, 7 in IbChr14, 6 in IbChr03, IbChr04, IbChr10 and IbChr15, 5 in IbChr13, and 4 in IbChr01. For I. trifida and I. triloba genomes, the distribution of R2R3-MYBs was highly similar, Itf/ItbChr07 had the largest number (n = 20/18), and Itf/ItbChr15 had the smallest number (n = 4/4). However, R2R3-MYB genes were randomly distributed in the I. nil, I. aquatica, and I. purpurea genomes. For example, the highest numbers of R2R3-MYBs were distributed inInChr08 (n = 15), IaChr01 and IaChr15 (n = 15), and IpChr07 (n = 9) in I. nil, I. aquatica, and I. purpurea, respectively. These results indicated that the distribution of R2R3-MYBs is different and disproportionate on chromosomes in sweet potatoes and five other Ipomoea species.
This study identified further intraspecific synteny blocks for each species. The results showed that there were 301 syntenic gene pairs among the 6 Ipomoea species. Of these, sweet potato, I. trifida, I. triloba, I. nil, I. aquatica, and I. purpurea had 46, 60, 61, 41, 68, and 25 gene pairs, respectively (Figure 3 and Table S4). One syntenic gene pair on IbChr05 (g17105.t1-g17138.t1) in the sweet potato genome, the itf05g14440.t1-itf12g04070.t1 pair in I. trifida, the itb05g15100.t1-itb12g04280.t1 pair in I. triloba, and the GWHPABKX000429-GWHPABKX011664 pair in I. aquatica belong to clade C24, which refers to anthocyanin biosynthesis. A comparison of interspecific collinearity among sweet potatoes with Arabidopsis and five other Ipomoea species was also analyzed to further explore the evolution of R2R3-MYB genes. There were 114, 175, 194, 171, 126, and 103 collinear blocks of R2R3-MYB genes containing 115, 242, 263, 245, 189, and 167 IbR2R3-MYB homologous gene pairs between I. batatas and A. thaliana, I. trifida, I. triloba, I. nil, I. aquatica, and I. purpurea, respectively (Figure 4 and Table S5). Interestingly, I. batatas vs. I. trifida and I. batatas vs. I. triloba gave a highly similar collinearity relationship (Figure 4b,c); and I. batatas vs. I. nil, and I. batatas vs. I. purpurea also exhibited a highly similar collinearity relationship (Figure 4d,f). However, the chromosomal correspondence between I. batatas and five other Ipomoea species and Arabidopsis was poor; this was mainly observed in different combinations of the chromosomes of one species with others. These results indicated that the I. batatas genome might have a larger chromosome level variation during the evolutionary process.

2.4. Gene Duplication Events and Ka/Ks Analysis of R2R3-MYB Genes in Six Ipomoea

To identify different modes of R2R3-MYB family gene duplication in six Ipomoea plants, five gene duplication events, including WGD, TD, PD, TRD, and DSD, were analyzed by running the DupGen_finder pipeline [55]. The results showed that a total of 737 R2R3-MYB gene duplication events were detected in these Ipomoea plant genomes (Figure 5 and Table S6), and the number of different gene duplication modes varied greatly. The WGD mode had the maximum number of duplication gene pairs with 284, followed by the TRD mode with 259 and the DSD mode with 121, indicating that the expansion of the R2R3-MYB gene family during evolution processes was mostly involved in WGD, TRD, and DSD events in Ipomoea species. In contrast, the TD mode, with 53, and the PD mode, with 20, were the minimum numbers of R2R3-MYB gene pairs. In addition, the number of WGDs and TRDs in each species was higher than TDs, PDs, and DSDs, except for I. purpurea. Furthermore, the distribution of five gene duplication events was largely similar in I. batatas, I. trifida, and I. triloba, and in I. nil and I. purpurea, which is consistent with the phylogenetic relationships of six Ipomoea species. These results indicated the importance of WGD and TRD modes for the R2R3-MYB gene family expansion in Ipomoea plants.
The non-synonymous substitution (Ka) to synonymous substitution (Ks) ratio (Ka/Ks) is an informative value of selection pressure during evolutionary history. Generally, Ka/Ks > 1 represents a positive selection, and Ka/Ks = 1 represents a neutral selection, while Ka/Ks < 1 represents a negative or purifying selection [48,55]. In this study, the Ka/Ks value of duplicated R2R3-MYB gene pairs was calculated and tested statistically, and all duplicated genes had a Ka/Ks ratio that was lower than 1 (Table S6), which indicated that these R2R3-MYB gene pairs experienced strong purifying evolutionary selection with limited functional divergence after duplication events. For the sweet potato R2R3-MYB gene family, the average Ka/Ks value of WGD, TD, PD, TRD, and DSD gene pairs were 024, 0.52, 0.46, 0.32, and 0.22, respectively. The TD gene pairs had the highest Ka/Ks value, suggesting that TD events had a higher evolution rate than other gene duplication events.

2.5. Phylogenetic Relationships, Gene Structures, and Conserved Motifs Analysis of 131 IbR2R3-MYB Genes

In this study, the phylogenetic tree, gene structure, and conserved motifs of 131 IbR2R3-MYB genes were combined to investigate sequence and structure characteristics. The results showed that the patterns of the gene exon-intron structures and protein amino acid motifs were strikingly similar among members within one clade but distinct between different clades (Figure 6). For example, the exon-intron structure analysis showed that the number of exons in IbR2R3-MYB genes varied from 1 to 12, with a mean of 3 (Figure 6b). Among them, 71 IbR2R3-MYB genes had 3 exons accounting for approximately 54%, 25 genes had 4 exons, 16 genes had 2 exons, 4 (g46362.t1, g49847.t1, g33626.t1, and g24684.t1) had 5 exons, 1 (g17983.t1) had 6 exons, 1 (g30901.t1) had 7 exons, and 2 (g41253.t1 and g47242.t1) of clade C1 had a maximum number of exons with 12, while 9 genes from clade C5 only had 1 exon. In addition, the lengths of most IbR2R3-MYB genomic sequences were less than 4.0 kb with a mean of ~2.6 kb, and g61376.t1 had the highest length of ~12.9 kb, while g25499.t1 had only the lowest length of 923 bp. In brief, these results suggested that IbR2R3-MYB genes have evolutionary diversity between different phylogenetic clades and have evolutionary conservation within the same clade in terms of gene structure and length.
In this study, 10 conserved motifs, among 131 IbR2R3-MYB proteins, were further identified and named Motif 1 to Motif 10 (Figure 6c and Figure S2). R2R3-MYB proteins clustering together within the same clade generally have similar motif compositions (Figure 6c). For example, almost all IbR2R3-MYB proteins contained Motif 1, 2, 3, and 4, which form MYB DNA-binding domains. Motif 3 and Motif 1 formed the R2 domain, and Motif 4 and Motif 2 formed the R3 domain, which is located in the N-terminal of proteins. R2 and R3 domains were also identified and visualized (Figure 6c). Moreover, the number and arrangement of motifs varied in different clades, which is probably required for clade-specific functions. For instance, Motif 6 coupled with Motif 7 in C32 and C35 may be related to development, and Motif 8 alone in C5 or Motif 10 combined with Motif 8 in C27 may be related to abiotic stress responses. However, Motif groups 6/7/8/9 in C36 may suggest versatile biological functions. These results indicated the conservation and diversification of IbR2R3-MYB’s gene function.

2.6. Identification of R2R3-MYBs Related to Anthocyanin Biosynthesis in Sweet Potato

To investigate the key anthocyanin-related IbR2R3-MYBs in sweet potatoes, multiple public RNA-seq datasets representing different color variants in sweet potato leaf and tuberous root flesh and skin were analyzed [8,43,56,57]. These datasets focused on searching all potential candidate genes of anthocyanin biosynthesis in different sweet potato tissues. An integrated and comparative analysis was performed to identify the core anthocyanin-related IbR2R3-MYBs and anthocyanin biosynthesis genes (ABGs). The differentially expressed IbR2R3-MYBs and ABGs were determined via a Venn diagram analysis (Figure S3 and Table S7), and their relative expression profiles heatmaps were generated using TBtools software [58]. A total of 12 IbR2R3-MYB DEGs were identified and expressed in different color tissues of sweet potatoes (Figure 7). In purple tuberous root flesh, 4 R2R3-MYBs, 2 bHLHs, and 23 ABGs were identified, compared with white and yellow flesh (Figure 7a). The g17138.t1 was identified as IbMYB1 by BLASTP analyses, which belongs to the anthocyanin-activated C24(S6) clade. The g50106.t1(C19), g8510.t1(C19), and g48266.t1(C10) were identified as IbMYB27, IbMYB4c, and IbMYB59, respectively. IbMYB1/g17138.t1, IbMYB27/g50206.t1, and IbMYB4c/g8510.t1 had similar expression patterns and showed a higher expression level in the purple flesh of ‘DZ88’ and ‘DZ54’, together with two bHLH2 genes (g9534.t1 and g9535.t1) and anthocyanin biosynthesis enzyme-coding genes, such as 4CL/g60727.t1, CHI/g3524.t1, F3H/g29398.t1, ANS/g58862.t1, and UFGT/g54580.t1. In red tuberous root skin, three IbR2R3-MYB genes, IbMYB113/g17108.t1, IbMYB62/g52884.t1, and IbMYB15/g25699.t1, had relatively high or stronger expression levels in the red skin of ‘Sushu8’ and ‘Zheshu81’, while two genes (i.e., IbMYB44-1/g34152.t1 and IbMYB59/g48266.t1) showed weak expression levels (Figure 7b, c). In purple leaf, IbMYB1/g17138.t1, IbMYB27/g50106.t1, and IbMYB14/g8292.t1 showed a higher expression level in the purple leaf of ‘D7*CIP1’ and ‘Purple leaf’, which is comparable to the green leaf of ‘Xiangshu99’ and ‘XCSNo.2’ (Figure 7d). Coincidentally, IbMYB1/g17138.t1, IbMYB27/g50106.t1, and IbMYB61/g35476.t1 were detected at a stronger expression level in the purple leaf of ‘1625’ while IbMYB59/g48266.t1, IbMYB308/g39254.t1, and IbMYB58/g61376.t1 showed low expression levels (Figure 7e).
Furthermore, the expression levels of the above IbR2R3-MYBs and ABGs were used to perform a Pearson’s correlation analysis (Figure 8). The results showed that IbMYB1/g17138.t1 and IbMYB27/g50206.t1 were positively and significantly correlated to almost all ABGs, and they were negatively and significantly correlated to two UFGTs (g42594.t1 and g42595.t1) in purple fleshes. Conversely, IbMYB59/g48266.t1 had a significant positive correlation with g42595.t1, but it had no significant correlation with other ABGs (Figure 8a). A significant positive correlation between IbMYB113/g17108.t1, IbMYB62/g52884.t1, and IbMYB15/g25699.t1 relative to 9 ABGs were observed in the red skin of ‘Zheshu81’, while IbMYB59/g48266.t1 and IbMYB44-1/g34152.t1 had significant negative correlations with 12 ABGs (Figure 8c). However, IbMYB113/g17108.t1 also had a significant positive correlation with three ABGs (g42593.t1, g30882.t1, and g63863.t1) in the red skin of ‘Sushu8’ (Figure 8b). Similarly, IbMYB1/g17138.t1 and IbMYB27/g50206.t1 were positively and significantly correlated to at least five ABGs in purple leaves (Figure 8d,e). These results suggested that the identified IbR2R3-MYBs perform a core role in regulating the anthocyanin biosynthesis of sweet potatoes.
IbMYB1/g17138.t1, which was shown to be a transcriptional activator for enhancing anthocyanin accumulation in purple-fleshed sweet potatoes in many studies [8,47,48,49], was identified as a fundamental R2R3-MYB TF responsible for the anthocyanin biosynthesis of tuberous root flesh and leaves in sweet potatoes in this study. IbMYB113/g17108.t1 had a relatively high expression level in red tuberous root skin but a very low or no expression in leaves and flesh, and it was positively and significantly correlated to most ABGs; thus, it was identified as a potential anthocyanin activator. IbMYB62/g52884.t1(C11) and IbMYB15/g25699.t1(C18) exhibited upregulated expression in pigmented tissues in sweet potatoes, suggesting their potential role in anthocyanin biosynthesis. However, the IbMYB59/g48266.t1 was downregulated expression in pigmented leaves and tuberous root flesh and skin, but it had no significant correlations with ABGs, suggesting that it might not be involved in the anthocyanin biosynthesis pathway. In addition, IbMYB27/g50206.t1(C19), IbMYB44-1/g34152.t1(C5), and IbMYB4c/g8510.t1(C19) had been identified as R2R3-MYB repressor regulating the anthocyanin biosynthesis [51]. These results summarized that multiple IbR2R3-MYBs co-regulate anthocyanin biosynthesis in different sweet potato tissues, of which, IbMYB1/g17138.t1 and IbMYB113/g17108.t1 were anthocyanin-related core regulators.

3. Discussion

Sweet potato is an important food crop due to its strong adaptability, high yield potential, and high nutritional value [1]. The highly heterozygous hexaploid genome and the lack of knowledge of this genome limit the genetic studies of sweet potatoes [2]. A haplotype-resolved genome of the I. batatas cultivar ‘Taizhong 6’ was assembled [50] and used for molecular breeding. Two wild diploid species, I.trifida and I. triloba, had been reported as the closest relative of the hexaploid sweet potato, and they possessed a high-quality genome assembly [51]. In addition, three other Ipomoea species, I. nil, I. purpurea, and I. aquatica, were also sequenced [52,53,54]. These genome data contribute to the genetic study of sweet potatoes and genome evolution analysis in the Ipomoea species. In a previous study, some important enzyme-encoding genes (such as XTH, NBS, CDPK, SWEET, and SPL) [59,60,61,62,63] and TFs (such as GRAS, WRKY, bZIP, and MADS-box) [64,65,66,67,68] gene families were investigated in tuberous root development and the stress responses of Ipomoea species. However, a comprehensive and comparative molecular, evolutionary, and functional analysis of R2R3-MYB genes, which are indispensable for several developmental pathways and stress responses, is lacking in the Ipomoea species, even though the R2R3-MYB gene family has been identified in I. aquatica [69] and I. nil [70]. In the current study, six Ipomoea plants’ genomes information was employed in the identification and characterization of R2R3-MYB TFs, including protein physicochemical properties, phylogenetic relationships, chromosome localization, syntenic links, and gene duplication events. On this basis, the expression patterns of anthocyanin-related IbR2R3-MYBs in different colored sweet potato tissues were analyzed. The genome-wide comparative analysis of the R2R3-MYB TFs will facilitate the further study of their function and molecular breeding of sweet potatoes.
In this study, a total of 695 R2R3-MYB genes from the genomes of 6 Ipomoea plants were identified, and the number of identified R2R3-MYBs varied from 124 in I. aquatica to 133 in I. trifida, excluding 51 in I. purpurea (Table 1). There was no correlation between the R2R3-MYB genes and species phylogeny or the genome size that has been reported by previous studies [71]. Therefore, the difference may be due to the genome assembly strategy and quality of six Ipomoea species [2,72], and further improvements in these genome sequences are urgently needed. Furthermore, the identified R2R3-MYB proteins were clustered into 36 clades via an ML phylogenetic analysis (Figure 3), and the clade C25(S12) was Arabidopsis-specific, which was consistent with previous findings in five Solanaceae species. A total of 4 clades, including 102 R2R3-MYB proteins, which have no homologous protein in Arabidopsis, were identified as Ipomoea-specific clades, suggesting their species-specific role in Ipomoea, and their biological functions need to be further studied. Chromosomal localizations and distributions showed that the R2R3-MYB genes were unevenly distributed on all chromosomes in six Ipomoea species’ genomes, and some genes were located on unanchored scaffolds. An interspecific collinearity analysis of R2R3-MYBs showed that a total of 1223 collinearity gene pairs were detected among hexaploid I. batatas and 5 other diploid Ipomoea species, but their collinearity relationships were not as good as expected. Given their species’ phylogenetic relationships (Figure 5a), one reasonable explanation is that the hexaploid I. batatas genome might have undergone a larger chromosome rearrangement during the evolutionary process [60].
Gene and genome duplication processes are central to the species’ evolution and gene family expansion [55]. The present study showed that TRD was predominant in I. batatas and I. nil, and WGD was predominant in I. trifida, I. triloba, and I. aquatica, while DSD occurred in I. purpurea, suggesting that WGD, TRD, and DSD events play key roles in the expansion of R2R3MYB gene families in the Ipomoea species. Moreover, the Ka/Ks analysis showed that all duplicated R2R3-MYB gene pairs had a Ka/Ks ratio that was less than 1, which indicated that these genes experienced strong purifying selections in the Ipomoea species’ evolution process. Therefore, these results provided useful information for future research to understand the evolution of the R2R3-MYB gene family.
R2R3-MYBs are the major determinant regulators of the MBW(MYB-bHLH-WD40) complex, which fine-tune the spatial and temporal localization of anthocyanins by regulating the expression level of flavonoid biosynthesis pathway genes [17,73]. In Arabidopsis thaliana, four subgroups of R2R3-MYBs related to flavonoid syntheses have been identified, including S4, S5, S6, and S7 [9]. The S6 members contain the conserved KPRPR[S/T]F motif as anthocyanin activators to regulate anthocyanins biosynthesis at the transcription level, while the S4 members harbor repressive motifs, such as LIsrGIDPxT/SHRxI/L, pdLNLD/ELxiG/S, EAR, SID, and TLLLFR motifs, in C-terminals and can directly repress the flavonoid’s biosynthetic genes [74]. In this study, the phylogenetic analysis identified 13 IbR2R3-MYB members belonging to the C19(S4), C20(S7), and C24(S6) clades, and the FPKM values of g49847.t1(C20) and g15127.t1(C24) in all transcriptome data were less than 1 (Figure S4 and Table S8), indicating their nonfunction. Furthermore, R2R3-MYBs and the anthocyanin-related enzyme-encoding DEGs of colorful and colorless tissues in sweet potatoes were analyzed. In tuberous root flesh, IbMYB1/g17138.t1 and IbbHLH2 (g9534.t1 and g9535.t1) form the transcriptional complex to activate downstream anthocyanin-related genes expression, such as 4CL/g60727.t1, CHI/g3524.t1, F3H/g29398.t1, ANS/g58862.t1, UFGT/g54580.t1, and GST/g29615.t1 (Figure 7a and Figure 8a), thus promoting the accumulation of anthocyanin. Meanwhile, IbMYB1/g17138.t1 also performs an essential role in the anthocyanin biosynthesis of sweet potato purple leaves (Figure 7d,e and Figure 8d,e). In previous comparative transcriptome analyses of the red skin of ‘Sushu8’ [56] and ‘Zheshu81’ [8], g17108.t1 had not been identified as anthocyanin-associated R2R3-MYB TF. A genome-wide comparative analysis of the R2R3-MYB gene family identified the IbMYB113/g17108.t1 as an R2R3-MYB in this study. IbMYB113/g17108.t1 may function in the activation of 4CL(g63863.t1 and g60727.t1), DFR-A/g17019.t1, DFR-C/g17021.t1, UFGT(g42593.t1 and g54584.t1), and AT/g30882.t1 (Figure 7b,c and Figure 8b,c), thus specifically regulating root skin coloration. Interestingly, multiple R2R3-MYB repressors may carry out competitive inhibition to prevent the excessive accumulation of anthocyanins [43], such as IbMYB27/g50106.t1, IbMYB4c/g8510.t1, and IbMYB44-1/g34152.t1, which suggests widespread negative feedback regulation mechanisms in the sweet potato anthocyanin metabolism. Based on these findings, it is speculated that the regulatory genes of anthocyanins in tuberous root flesh, skin, and leaves may be different and that IbMYB1/g17138.t1 and IbMYB113/g17108.t1 may regulate the spatiotemporal expression levels of anthocyanin-associated genes, thus resulting in tissue-specific anthocyanin accumulation in sweet potatoes. However, further molecular and genetic identification remains necessary in order to verify these genes’ true functions.

4. Materials and Methods

4.1. Genome-Wide Identification of R2R3-MYB Genes in Six Ipomoea Species

The whole genome sequences and genome annotation files of Ipomoea batatas, Ipomoea trifida, and Ipomoea triloba were downloaded from the Ipomoea Genome Hub (http://sweetpotao.com/, accessed on 1 September 2022) [50] and Sweetpotato Genomics Resource (http://sweetpotato.plantbiology.msu.edu/, accessed on 1 September 2022) [51]. However, the genome information of Ipomoea nil was downloaded from the following website: http://viewer.shigen.info/asagao/, accessed on 1 September 2022 [52]. Additionally, the genome databases of Ipomoea purpurea and Ipomoea aquatica were downloaded from the CoGe platform (https://genomevolution.org/coge/GenomeInfo.pl?gid=58735, accessed on 1 September 2022) [53] and National Genomics Data Center (https://ngdc.cncb.ac.cn/search/?dbId=gwh&q=PRJCA002216&page=1, accessed on 1 September 2022) [54], respectively. The longest coding sequence (CDS) and corresponding amino acid sequence of each gene were obtained by removing any alternatively spliced sequences by using the TBtools software [58].
To accurately identify all R2R3-MYB family members the six Ipomoea species, three different screening strategies were combined. First, the sequences of 126 AtR2R3-MYB proteins retrieved from TAIR (https://www.arabidopsis.org/, accessed on 1 September 2022) were used as queries for the BLASTP (E-value < 1 × 10−5) search by using TBtools software [58]. Subsequently, the MYB-like DNA-binding domain’s (PF00249) Hidden Markov Model (HMM) profile obtained from the Pfam database (http://pfam.xfam.org/, accessed on 3 September 2022) was used to search candidate proteins with conserved MYB domains by using the Advanced HMMER Search program in TBtools software [58]. Finally, all putative protein sequences further confirmed the presence of the R2R3 domain in three databases, including SMART (http://smart.embl-heidelberg.de/, accessed on 3 September 2022), Pfam (http://pfam.xfam.org/, accessed on 3 September 2022), and NCBI CD-search (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi, accessed on 3 September 2022). It should be noted that the protein sequences that did not have a complete R2R3 domain and those exhibiting redundant sequences were manually removed or corrected. The physicochemical properties of all identified R2R3-MYB members, including protein length, molecular weight (MW), isoelectric point (pI), instability index, aliphatic index, and grand average of hydropathicity (GRAVY) were analyzed via the ProtParam tool of the Expasy website (https://web.expasy.org/protparam/, accessed on 14 September 2022), and their subcellular localization was also predicted by using the WoLF PSORT website (https://wolfpsort.hgc.jp/, accessed on 14 September 2022).

4.2. Multiple Sequences Alignment and Phylogenetic Evolution Analysis of R2R3-MYBs

All R2R3-MYB protein full-length sequences from six Ipomoea species and Arabidopsis thaliana containing the AtCDC5 outgroup protein were used to perform phylogenetic evolution analyses. Multiple sequence alignments were carried out using the MAFFT v7.505 program [75] with default parameters, and a maximum-likelihood (ML) phylogenetic tree was constructed using the IQ-TREE v1.6.12 program [76] with auto parameters and 1000 bootstraps. The 131 IbR2R3-MYB proteins also were used to generate phylogenetic trees by using the same methods. The phylogenetic trees were subsequently visualized using the ChiPlot online tool (https://www.chiplot.online/index.html, accessed on 27 February 2023).

4.3. Chromosome Distribution and Syntenic Analysis of R2R3-MYBs in Six Ipomoea Species

The physical position and chromosomal distribution information of six Ipomoea species R2R3-MYB genes were provided in the corresponding genome annotation files (GFF3). The syntenic relationships of R2R3-MYB genes in six Ipomoea intraspecies and interspecies were analyzed using the One Step MCScanX program in the TBtools software with default parameters [58]. Furthermore, the Advanced Circos function in TBtools software was used to visualize the distribution information and syntenic results of six Ipomoea species R2R3-MYB genes across all chromosomes. It is worth noting that the genes distributed on unanchored scaffolds were excluded.

4.4. Gene Duplication Events and Ka/Ks Analysis of R2R3-MYBs in Six Ipomoea Species

OrthoFinder v2.5.4 [77] constructed the taxonomic tree of the six Ipomoea species. The gene duplication events of six Ipomoea species R2R3-MYB duplicated gene pairs derived from whole-genome duplication (WGD), tandem duplication (TD), proximal duplication (PD), transposed duplication (TRD), and dispersed duplication (DSD) were detected by running the DupGen_finder pipeline, in which Arabidopsis thaliana was selected as the outgroup in order to identify duplicated gene pairs. In addition, the nonsynonymous (Ka) and synonymous substitution rates (Ks) of syntenic gene pairs were calculated using Tbtools with the Nei–Gojobori (NG) method [58].

4.5. Gene Structure and Conserved Protein Motif Analysis of R2R3-MYBs in Sweet potato

The GFF3 annotation file of the I. batatas genome containing information about the gene structure was used to identify the exons and introns of 131 IbR2R3-MYB genes. The MEME online tools (https://meme-suite.org/meme/, accessed on 28 September 2022) were used to identify the conserved motifs of IbR2R3-MYB proteins, with a maximum number of motifs to 10 and other default parameters. The gene structure and conserved motifs were subsequently visualized by TBtools software [58].

4.6. Identification of Anthocyanin-Related R2R3-MYBs Via RNA-Seq Data in Sweet potato

Multiple public RNA-seq datasets were downloaded from the sequence read archive (SRA) of NCBI, and these datasets were employed to explore the expression patterns of anthocyanin-related IbR2R3-MYB genes in sweet potato different tissues. Of which, projects PRJNA562409 and PRJNA772025 comprised the transcriptome analysis of coloration mutation of root skin in sweet potato cultivar ‘Sushu8’ [56] and ‘Zheshu81’ [8], respectively. PRJNA721067 was a transcriptome analysis that included many sweet potato cultivars, and they present a typical color variant of purple leaves in the aerial part or purplish rings/spots in the storage root flesh [7]. In addition, our previous transcriptome study of four sweet potato cultivars with different pigmentation phenotypes in root flesh (PRJNA881010, PRJNA881014, PRJNA881013, and PRJNA881012) [57] was also used to identify the expression levels of IbR2R3-MYB genes.
The Hisat2 software [78] was used to align clean reads to the sweet potato genome. The fragments per kilobase million (FPKM) was measured to estimate IbR2R3-MYB gene expression levels by using the StringTie program [78]. Differentially expressed genes (DEGs) were identified by using the DESeq2 R package with an adjusted p value < 0.01 and absolute log2-fold change > 1 as the criteria [78]. First, the differentially expressed genes in each comparation and 131 identified IbR2R3-MYBs were selected to identify the candidate IbR2R3-MYBs and anthocyanin biosynthesis genes (ABGs) via a Venn diagram analysis. Next, the heatmaps of the expression levels of candidate IbR2R3-MYBs and anthocyanin biosynthesis genes (ABGs) were visualized in different colored sweet potato tissues using the TBtools software [58]. Furthermore, Pearson’s correlation analyses were also performed and visualized using the corrplot R package.

5. Conclusions

In summary, this study presented a comprehensive genome-wide analysis of the R2R3-MYB gene family in six Ipomoea plants. A total of 695 typical R2R3-MYB genes were identified, including 131 IbR2R3-MYB genes of the sweet potato. An ML phylogenetic analysis divided these genes into 36 clades, and 4 clades, including 102 members, were identified as Ipomoea-specific clades. WGD, TRD, and DSD events were the primary forces driving the R2R3-MYB gene family expansion of Ipomoea plants, and these duplicated genes experienced strong purifying selections. Furthermore, based on multiple RNA-seq data, two core IbR2R3-MYB genes (IbMYB1/g17138.t1 and IbMYB113/g17108.t1) were identified as responsible for regulating tissue-specific anthocyanin accumulation in sweet potato. These results can enhance our understanding of the R2R3-MYB gene family of sweet potato and provide novel R2R3-MYB gene information for anthocyanin metabolic engineering.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants12081731/s1. Figure S1. The sequence logo of R2 (a) and R3 (b) MYB domain; Figure S2. The sequence logos of conserved motifs (Motif 1-10) of the 131 IbR2R3-MYB proteins; Figure S3. The identification of anthocyanin-related IbR2R3-MYB DEGs in sweetpotato pigmented tissues via Venn diagram analysis; Figure S4. A heatmap of expression levels of 13 IbR2R3-MYB genes belong to C19(S4), C20(S7), and C24(S6) clades in sweetpotato pigmented tissues; Table S1. Physicochemical properties and subcellular localization of all identified R2R3-MYB proteins in six Ipomoea species; Table S2. Sequences and clades of all identified R2R3-MYB proteins in six Ipomoea species and Arabidopsis; Table S3. Gene feature of all identified R2R3-MYB genes in six Ipomoea species; Table S4. Syntenic R2R3-MYB gene pairs among each Ipomoea species; Table S5. Collinear R2R3-MYB gene pairs between sweetpotato and Arabidopsis and other five Ipomoea species; Table S6. R2R3-MYB duplicated gene pairs and Ka/Ks value of six Ipomoea species; Table S7. FPKM value of IbR2R3-MYB and ABG DEGs in purple tuberous root flesh, red skin, and purple leaf; Table S8. FPKM value of 13 IbR2R3-MYB genes belong to C19(S4), C20(S7), and C24(S6) clades.

Author Contributions

H.G. and M.L. conceived the idea and designed the research. M.L. performed the bioinformatics analysis and wrote the manuscript. Y.Z. and K.L. contributed to the visualized data. H.G. reviewed the manuscript of the paper and approved the final manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

The work was supported by the earmarked fund for CARS (No. CARS-09-P15), the Science and Technology Projects in Yunnan Province (202102AE090018), and the National Natural Science Foundation of China (32260543).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare that they have no conflict of interest.

References

  1. Liu, Q. Improvement for agronomically important traits by gene engineering in sweetpotato. Breed Sci. 2017, 67, 15–26. [Google Scholar] [CrossRef] [PubMed]
  2. Yan, M.; Nie, H.; Wang, Y.; Wang, X.; Jarret, R.; Zhao, J.; Wang, H.; Yang, J. Exploring and exploiting genetics and genomics for sweetpotato improvement: Status and perspectives. Plant Commun. 2022, 3, 100332. [Google Scholar] [CrossRef] [PubMed]
  3. FAOSTAT. Food and Agricultural Organization from the United Nations. Available online: http://www.fao.org/faostat/en/#data/QC (accessed on 14 April 2023).
  4. Mohanraj, R. Sweet Potato: Bioactive Compounds and Health Benefits. In Bioactive Molecules in Food; Mérillon, J.-M., Ramawat, K.G., Eds.; Reference Series in Phytochemistry; Springer International Publishing: Cham, Switerland, 2018; pp. 1–16. [Google Scholar]
  5. He, J.; Giusti, M.M. Anthocyanins: Natural colorants with health-promoting properties. Annu. Rev. Food Sci. Technol. 2010, 1, 163–187. [Google Scholar] [CrossRef] [PubMed]
  6. Mano, H.; Ogasawara, F.; Sato, K.; Higo, H.; Minobe, Y. Isolation of a regulatory gene of anthocyanin biosynthesis in tuberous roots of purple-fleshed sweet potato. Plant Physiol. 2007, 143, 1252–1268. [Google Scholar] [CrossRef]
  7. Zhang, D.; Tan, Y.; Dong, F.; Zhang, Y.; Huang, Y.; Zhou, Y.; Zhao, Z.; Yin, Q.; Xie, X.; Gao, X.; et al. The Expression of IbMYB1 Is Essential to Maintain the Purple Color of Leaf and Storage Root in Sweet Potato [Ipomoea batatas (L.) Lam]. Front. Plant Sci. 2021, 12, 688707. [Google Scholar] [CrossRef]
  8. Zhao, D.; Zhao, L.; Liu, Y.; Zhang, A.; Xiao, S.; Dai, X.; Yuan, R.; Zhou, Z.; Cao, Q. Metabolomic and Transcriptomic Analyses of the Flavonoid Biosynthetic Pathway for the Accumulation of Anthocyanins and Other Flavonoids in Sweetpotato Root Skin and Leaf Vein Base. J. Agric. Food Chem. 2022, 70, 2574–2588. [Google Scholar] [CrossRef]
  9. Dubos, C.; Stracke, R.; Grotewold, E.; Weisshaar, B.; Martin, C.; Lepiniec, L. MYB transcription factors in Arabidopsis. Trends. Plant Sci. 2010, 15, 573–581. [Google Scholar] [CrossRef]
  10. Jiang, C.K.; Rao, G.Y. Insights into the Diversification and Evolution of R2R3-MYB Transcription Factors in Plants. Plant Physiol. 2020, 183, 637–655. [Google Scholar] [CrossRef]
  11. Ogata, K.; Hojo, H.; Aimoto, S.; Nakai, T.; Nakamura, H.; Sarai, A.; Ishii, S.; Nishimura, Y. Solution structure of a DNA-binding unit of Myb: A helix-turn-helix-related motif with conserved tryptophans forming a hydrophobic core. Proc. Natl. Acad. Sci. USA 1992, 89, 6428–6432. [Google Scholar] [CrossRef]
  12. Katiyar, A.; Smita, S.; Lenka, S.K.; Rajwanshi, R.; Chinnusamy, V.; Bansal, K.C. Genome-wide classification and expression analysis of MYB transcription factor families in rice and Arabidopsis. BMC Genom. 2012, 13, 544. [Google Scholar] [CrossRef]
  13. Du, H.; Liang, Z.; Zhao, S.; Nan, M.G.; Tran, L.S.; Lu, K.; Huang, Y.B.; Li, J.N. The Evolutionary History of R2R3-MYB Proteins Across 50 Eukaryotes: New Insights Into Subfamily Classification and Expansion. Sci. Rep. 2015, 5, 11037. [Google Scholar] [CrossRef] [PubMed]
  14. Wu, Y.; Wen, J.; Xia, Y.; Zhang, L.; Du, H. Evolution and functional diversification of R2R3-MYB transcription factors in plants. Hortic. Res. 2022, 9, uhac058. [Google Scholar] [CrossRef] [PubMed]
  15. Pratyusha, D.S.; Sarada, D.V.L. MYB transcription factors-master regulators of phenylpropanoid biosynthesis and diverse developmental and stress responses. Plant Cell Rep. 2022, 41, 2245–2260. [Google Scholar] [CrossRef] [PubMed]
  16. Allan, A.C.; Hellens, R.P.; Laing, W.A. MYB transcription factors that colour our fruit. Trends. Plant Sci. 2008, 13, 99–102. [Google Scholar] [CrossRef]
  17. Yang, J.; Chen, Y.; Xiao, Z.; Shen, H.; Li, Y.; Wang, Y. Multilevel regulation of anthocyanin-promoting R2R3-MYB transcription factors in plants. Front. Plant Sci. 2022, 13, 1008829. [Google Scholar] [CrossRef] [PubMed]
  18. Liu, J.; Osbourn, A.; Ma, P. MYB Transcription Factors as Regulators of Phenylpropanoid Metabolism in Plants. Mol. Plant 2015, 8, 689–708. [Google Scholar] [CrossRef]
  19. Cone, K.C.; Burr, F.A.; Burr, B. Molecular analysis of the maize anthocyanin regulatory locus C1. Proc. Natl. Acad. Sci. USA 1986, 83, 9631–9635. [Google Scholar] [CrossRef]
  20. Paz-Ares, J.; Ghosal, D.; Wienand, U.; Peterson, P.A.; Saedler, H. The regulatory c1 locus of Zea mays encodes a protein with homology to myb proto-oncogene products and with structural similarities to transcriptional activators. EMBO J. 1987, 6, 3553–3558. [Google Scholar] [CrossRef]
  21. Borevitz, J.O.; Xia, Y.; Blount, J.; Dixon, R.A.; Lamb, C. Activation tagging identifies a conserved MYB regulator of phenylpropanoid biosynthesis. Plant Cell 2000, 12, 2383–2394. [Google Scholar] [CrossRef]
  22. Gonzalez, A.; Zhao, M.; Leavitt, J.M.; Lloyd, A.M. Regulation of the anthocyanin biosynthetic pathway by the TTG1/bHLH/Myb transcriptional complex in Arabidopsis seedlings. Plant J. 2008, 53, 814–827. [Google Scholar] [CrossRef]
  23. Sun, C.; Deng, L.; Du, M.; Zhao, J.; Chen, Q.; Huang, T.; Jiang, H.; Li, C.B.; Li, C. A Transcriptional Network Promotes Anthocyanin Biosynthesis in Tomato Flesh. Mol. Plant 2020, 13, 42–58. [Google Scholar] [CrossRef] [PubMed]
  24. Takos, A.M.; Jaffe, F.W.; Jacob, S.R.; Bogs, J.; Robinson, S.P.; Walker, A.R. Light-induced expression of a MYB gene regulates anthocyanin biosynthesis in red apples. Plant Physiol. 2006, 142, 1216–1232. [Google Scholar] [CrossRef] [PubMed]
  25. Ban, Y.; Honda, C.; Hatsuyama, Y.; Igarashi, M.; Bessho, H.; Moriguchi, T. Isolation and functional analysis of a MYB transcription factor gene that is a key regulator for the development of red coloration in apple skin. Plant Cell Physiol. 2007, 48, 958–970. [Google Scholar] [CrossRef]
  26. Espley, R.V.; Hellens, R.P.; Putterill, J.; Stevenson, D.E.; Kutty-Amma, S.; Allan, A.C. Red colouration in apple fruit is due to the activity of the MYB transcription factor, MdMYB10. Plant J. 2007, 49, 414–427. [Google Scholar] [CrossRef] [PubMed]
  27. Xu, Z.S.; Yang, Q.Q.; Feng, K.; Xiong, A.S. Changing Carrot Color: Insertions in DcMYB7 Alter the Regulation of Anthocyanin Biosynthesis and Modification. Plant Physiol. 2019, 181, 195–207. [Google Scholar] [CrossRef]
  28. Xu, Z.S.; Yang, Q.Q.; Feng, K.; Yu, X.; Xiong, A.S. DcMYB113, a root-specific R2R3-MYB, conditions anthocyanin biosynthesis and modification in carrot. Plant Biotechnol. J. 2020, 18, 1585–1597. [Google Scholar] [CrossRef]
  29. Albert, N.W.; Lewis, D.H.; Zhang, H.; Schwinn, K.E.; Jameson, P.E.; Davies, K.M. Members of an R2R3-MYB transcription factor family in Petunia are developmentally and environmentally regulated to control complex floral and vegetative pigmentation patterning. Plant J. 2011, 65, 771–784. [Google Scholar] [CrossRef]
  30. Aharoni, A.; De Vos, C.H.; Wein, M.; Sun, Z.; Greco, R.; Kroon, A.; Mol, J.N.; O’Connell, A.P. The strawberry FaMYB1 transcription factor suppresses anthocyanin and flavonol accumulation in transgenic tobacco. Plant J. 2001, 28, 319–332. [Google Scholar] [CrossRef]
  31. Liu, Y.; Lin-Wang, K.; Espley, R.V.; Wang, L.; Li, Y.; Liu, Z.; Zhou, P.; Zeng, L.; Zhang, X.; Zhang, J.; et al. StMYB44 negatively regulates anthocyanin biosynthesis at high temperatures in tuber flesh of potato. J. Exp. Bot. 2019, 70, 3809–3824. [Google Scholar] [CrossRef]
  32. Holton, T.A.; Cornish, E.C. Genetics and Biochemistry of Anthocyanin Biosynthesis. Plant Cell 1995, 7, 1071–1083. [Google Scholar] [CrossRef]
  33. Guo, J.; Zhou, W.; Lu, Z.; Li, H.; Li, H.; Gao, F. Isolation and Functional Analysis of Chalcone Isomerase Gene from Purple-Fleshed Sweet Potato. Plant Mol. Biol. Report. 2015, 33, 1451–1463. [Google Scholar] [CrossRef]
  34. Zhou, W.; Gong, Y.; Lu, X.; Huang, C.; Gao, F. Molecular cloning and characterization of a flavonoid 3’-hydroxylase gene from purple-fleshed sweet potato (Ipomoea batatas). Mol. Biol. Rep. 2012, 39, 295–302. [Google Scholar] [CrossRef]
  35. Wang, H.; Fan, W.; Li, H.; Yang, J.; Huang, J.; Zhang, P. Functional characterization of Dihydroflavonol-4-reductase in anthocyanin biosynthesis of purple sweet potato underlies the direct evidence of anthocyanins function against abiotic stresses. PLoS ONE 2013, 8, e78484. [Google Scholar] [CrossRef]
  36. Zhou, W.; Huang, C.; Gong, Y.; Feng, Q.; Gao, F. Molecular Cloning and Expression Analysis of an ANS Gene Encoding Anthocyanidin Synthase from Purple-Fleshed Sweet Potato [Ipomoea batatas (L.) Lam]. Plant Mol. Biol. Rep. 2009, 28, 112–121. [Google Scholar] [CrossRef]
  37. Wang, H.; Wang, C.; Fan, W.; Yang, J.; Appelhagen, I.; Wu, Y.; Zhang, P. A novel glycosyltransferase catalyses the transfer of glucose to glucosylated anthocyanins in purple sweet potato. J. Exp. Bot. 2018, 69, 5444–5459. [Google Scholar] [CrossRef]
  38. Kou, M.; Liu, Y.J.; Li, Z.Y.; Zhang, Y.G.; Tang, W.; Yan, H.; Wang, X.; Chen, X.G.; Su, Z.X.; Arisha, M.H.; et al. A novel glutathione S-transferase gene from sweetpotato, IbGSTF4, is involved in anthocyanin sequestration. Plant Physiol. Biochem. 2019, 135, 395–403. [Google Scholar] [CrossRef] [PubMed]
  39. Tanaka, M.; Takahata, Y.; Kurata, R.; Nakayama, H.; Yoshinaga, M. Structural and functional characterization of IbMYB1 genes in recent Japanese purple-fleshed sweetpotato cultivars. Mol. Breed. 2011, 29, 565–574. [Google Scholar] [CrossRef]
  40. Chu, H.; Jeong, J.C.; Kim, W.J.; Chung, D.M.; Jeon, H.K.; Ahn, Y.O.; Kim, S.H.; Lee, H.S.; Kwak, S.S.; Kim, C.Y. Expression of the sweetpotato R2R3-type IbMYB1a gene induces anthocyanin accumulation in Arabidopsis. Physiol. Plant 2013, 148, 189–199. [Google Scholar] [CrossRef]
  41. Zhang, L.; Yu, Y.; Shi, T.; Kou, M.; Sun, J.; Xu, T.; Li, Q.; Wu, S.; Cao, Q.; Hou, W.; et al. Genome-wide analysis of expression quantitative trait loci (eQTLs) reveals the regulatory architecture of gene expression variation in the storage roots of sweet potato. Hortic. Res. 2020, 7, 90. [Google Scholar] [CrossRef]
  42. Wei, Z.Z.; Hu, K.D.; Zhao, D.L.; Tang, J.; Huang, Z.Q.; Jin, P.; Li, Y.H.; Han, Z.; Hu, L.Y.; Yao, G.F.; et al. MYB44 competitively inhibits the formation of the MYB340-bHLH2-NAC56 complex to regulate anthocyanin biosynthesis in purple-fleshed sweet potato. BMC Plant Biol. 2020, 20, 258. [Google Scholar] [CrossRef]
  43. Deng, J.; Wu, D.; Shi, J.; Balfour, K.; Wang, H.; Zhu, G.; Liu, Y.; Wang, J.; Zhu, Z. Multiple MYB Activators and Repressors Collaboratively Regulate the Juvenile Red Fading in Leaves of Sweetpotato. Front. Plant Sci. 2020, 11, 941. [Google Scholar] [CrossRef]
  44. Kang, L.; Teng, Y.; Cen, Q.; Fang, Y.; Tian, Q.; Zhang, X.; Wang, H.; Zhang, X.; Xue, D. Genome-Wide Identification of R2R3-MYB Transcription Factor and Expression Analysis under Abiotic Stress in Rice. Plants 2022, 11, 1928. [Google Scholar] [CrossRef] [PubMed]
  45. Li, Y.; Lin-Wang, K.; Liu, Z.; Allan, A.C.; Qin, S.; Zhang, J.; Liu, Y. Genome-wide analysis and expression profiles of the StR2R3-MYB transcription factor superfamily in potato (Solanum tuberosum L.). Int. J. Biol. Macromol. 2020, 148, 817–832. [Google Scholar] [CrossRef] [PubMed]
  46. Li, Z.; Peng, R.; Tian, Y.; Han, H.; Xu, J.; Yao, Q. Genome-Wide Identification and Analysis of the MYB Transcription Factor Superfamily in Solanum lycopersicum. Plant Cell Physiol. 2016, 57, 1657–1677. [Google Scholar] [CrossRef]
  47. Yin, Y.; Guo, C.; Shi, H.; Zhao, J.; Ma, F.; An, W.; He, X.; Luo, Q.; Cao, Y.; Zhan, X. Genome-Wide Comparative Analysis of the R2R3-MYB Gene Family in Five Solanaceae Species and Identification of Members Regulating Carotenoid Biosynthesis in Wolfberry. Int. J. Mol. Sci. 2022, 23, 2259. [Google Scholar] [CrossRef]
  48. Yang, Y.; Yuan, Z.; Ning, C.; Zhao, B.; Wang, R.; Zheng, X.; Liu, Y.; Chen, J.; He, L. The Pea R2R3-MYB Gene Family and Its Role in Anthocyanin Biosynthesis in Flowers. Front. Genet. 2022, 13, 936051. [Google Scholar] [CrossRef] [PubMed]
  49. Austin, D.F.; Huáman, Z. A synopsis of Ipomoea (Convolvulaceae) in the Americas. Taxon 2019, 45, 3–38. [Google Scholar] [CrossRef]
  50. Yang, J.; Moeinzadeh, M.H.; Kuhl, H.; Helmuth, J.; Xiao, P.; Haas, S.; Liu, G.; Zheng, J.; Sun, Z.; Fan, W.; et al. Haplotype-resolved sweet potato genome traces back its hexaploidization history. Nat. Plants 2017, 3, 696–703. [Google Scholar] [CrossRef]
  51. Wu, S.; Lau, K.H.; Cao, Q.; Hamilton, J.P.; Sun, H.; Zhou, C.; Eserman, L.; Gemenet, D.C.; Olukolu, B.A.; Wang, H.; et al. Genome sequences of two diploid wild relatives of cultivated sweetpotato reveal targets for genetic improvement. Nat. Commun. 2018, 9, 4580. [Google Scholar] [CrossRef]
  52. Hoshino, A.; Jayakumar, V.; Nitasaka, E.; Toyoda, A.; Noguchi, H.; Itoh, T.; Shin, I.T.; Minakuchi, Y.; Koda, Y.; Nagano, A.J.; et al. Genome sequence and analysis of the Japanese morning glory Ipomoea nil. Nat. Commun. 2016, 7, 13295. [Google Scholar] [CrossRef]
  53. Gupta, S.; Harkess, A.; Soble, A.; Van Etten, M.; Leebens-Mack, J.; Baucom, R.S. Interchromosomal linkage disequilibrium and linked fitness cost loci associated with selection for herbicide resistance. New Phytol. 2023, 238, 1263–1277. [Google Scholar] [CrossRef] [PubMed]
  54. Hao, Y.; Bao, W.; Li, G.; Gagoshidze, Z.; Shu, H.; Yang, Z.; Cheng, S.; Zhu, G.; Wang, Z. The chromosome-based genome provides insights into the evolution in water spinach. Sci. Hortic. 2021, 289, 110501. [Google Scholar] [CrossRef]
  55. Qiao, X.; Li, Q.; Yin, H.; Qi, K.; Li, L.; Wang, R.; Zhang, S.; Paterson, A.H. Gene duplication and evolution in recurring polyploidization-diploidization cycles in plants. Genome Biol. 2019, 20, 38. [Google Scholar] [CrossRef] [PubMed]
  56. Yang, Y.; Shi, D.; Wang, Y.; Zhang, L.; Chen, X.; Yang, X.; Xiong, H.; Bhattarai, G.; Ravelombola, W.; Olaoye, D.; et al. Transcript profiling for regulation of sweet potato skin color in Sushu8 and its mutant Zhengshu20. Plant Physiol. Biochem. 2020, 148, 1–9. [Google Scholar] [CrossRef] [PubMed]
  57. Xiao, J.; Xu, X.; Li, M.; Wu, X.; Guo, H. Regulatory network characterization of anthocyanin metabolites in purple sweetpotato via joint transcriptomics and metabolomics. Front. Plant Sci. 2023, 14, 1030236. [Google Scholar] [CrossRef]
  58. Chen, C.; Chen, H.; Zhang, Y.; Thomas, H.R.; Frank, M.H.; He, Y.; Xia, R. TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data. Mol. Plant 2020, 13, 1194–1202. [Google Scholar] [CrossRef]
  59. Zhang, J.Z.; He, P.W.; Xu, X.M.; Lu, Z.F.; Cui, P.; George, M.S.; Lu, G.Q. Genome-Wide Identification and Expression Analysis of the Xyloglucan Endotransglucosylase/Hydrolase Gene Family in Sweet Potato [Ipomoea batatas (L.) Lam]. Int. J. Mol. Sci. 2023, 24, 775. [Google Scholar] [CrossRef]
  60. Si, Z.; Wang, L.; Qiao, Y.; Roychowdhury, R.; Ji, Z.; Zhang, K.; Han, J. Genome-wide comparative analysis of the nucleotide-binding site-encoding genes in four Ipomoea species. Front. Plant Sci. 2022, 13, 960723. [Google Scholar] [CrossRef]
  61. Li, X.; Zhao, L.; Zhang, H.; Liu, Q.; Zhai, H.; Zhao, N.; Gao, S.; He, S. Genome-Wide Identification and Characterization of CDPK Family Reveal Their Involvements in Growth and Development and Abiotic Stress in Sweet Potato and Its Two Diploid Relatives. Int. J. Mol. Sci. 2022, 23, 3088. [Google Scholar] [CrossRef]
  62. Dai, Z.; Yan, P.; He, S.; Jia, L.; Wang, Y.; Liu, Q.; Zhai, H.; Zhao, N.; Gao, S.; Zhang, H. Genome-Wide Identification and Expression Analysis of SWEET Family Genes in Sweet Potato and Its Two Diploid Relatives. Int. J. Mol. Sci. 2022, 23, 15848. [Google Scholar] [CrossRef]
  63. Sun, H.; Mei, J.; Zhao, W.; Hou, W.; Zhang, Y.; Xu, T.; Wu, S.; Zhang, L. Phylogenetic Analysis of the SQUAMOSA Promoter-Binding Protein-Like Genes in Four Ipomoea Species and Expression Profiling of the IbSPLs During Storage Root Development in Sweet Potato (Ipomoea batatas). Front. Plant Sci. 2021, 12, 801061. [Google Scholar] [CrossRef] [PubMed]
  64. Chen, Y.; Zhu, P.; Wu, S.; Lu, Y.; Sun, J.; Cao, Q.; Li, Z.; Xu, T. Identification and expression analysis of GRAS transcription factors in the wild relative of sweet potato Ipomoea trifida. BMC Genom. 2019, 20, 911. [Google Scholar] [CrossRef] [PubMed]
  65. Yang, Z.; Sun, J.; Chen, Y.; Zhu, P.; Zhang, L.; Wu, S.; Ma, D.; Cao, Q.; Li, Z.; Xu, T. Genome-wide identification, structural and gene expression analysis of the bZIP transcription factor family in sweet potato wild relative Ipomoea trifida. BMC Genet. 2019, 20, 41. [Google Scholar] [CrossRef]
  66. Li, Y.; Zhang, L.; Zhu, P.; Cao, Q.; Sun, J.; Li, Z.; Xu, T. Genome-wide identification, characterisation and functional evaluation of WRKY genes in the sweet potato wild ancestor Ipomoea trifida (H.B.K.) G. Don. under abiotic stresses. BMC Genet. 2019, 20, 90. [Google Scholar] [CrossRef]
  67. Liu, S.; Zhang, C.; Guo, F.; Sun, Q.; Yu, J.; Dong, T.; Wang, X.; Song, W.; Li, Z.; Meng, X.; et al. A systematical genome-wide analysis and screening of WRKY transcription factor family engaged in abiotic stress response in sweetpotato. BMC Plant Biol. 2022, 22, 616. [Google Scholar] [CrossRef]
  68. Shao, Z.; He, M.; Zeng, Z.; Chen, Y.; Hanna, A.D.; Zhu, H. Genome-Wide Identification and Expression Analysis of the MADS-Box Gene Family in Sweet Potato [Ipomoea batatas (L.) Lam]. Front. Genet. 2021, 12, 750137. [Google Scholar] [CrossRef] [PubMed]
  69. Liu, Z.; Zhang, Y.; Altaf, M.A.; Hao, Y.; Zhou, G.; Li, X.; Zhu, J.; Ma, W.; Wang, Z.; Bao, W. Genome-wide identification of myeloblastosis gene family and its response to cadmium stress in Ipomoea aquatica. Front. Plant Sci. 2022, 13, 979988. [Google Scholar] [CrossRef]
  70. Komatsuzaki, A.; Hoshino, A.; Otagaki, S.; Matsumoto, S.; Shiratake, K. Genome-wide analysis of R2R3-MYB transcription factors in Japanese morning glory. PLoS ONE 2022, 17, e0271012. [Google Scholar] [CrossRef] [PubMed]
  71. Yang, J.; Zhang, B.; Gu, G.; Yuan, J.; Shen, S.; Jin, L.; Lin, Z.; Lin, J.; Xie, X. Genome-wide identification and expression analysis of the R2R3-MYB gene family in tobacco (Nicotiana tabacum L.). BMC Genom. 2022, 23, 432. [Google Scholar] [CrossRef]
  72. Isobe, S.; Shirasawa, K.; Hirakawa, H. Challenges to genome sequence dissection in sweetpotato. Breed Sci. 2017, 67, 35–40. [Google Scholar] [CrossRef]
  73. Yan, H.; Pei, X.; Zhang, H.; Li, X.; Zhang, X.; Zhao, M.; Chiang, V.L.; Sederoff, R.R.; Zhao, X. MYB-Mediated Regulation of Anthocyanin Biosynthesis. Int. J. Mol. Sci. 2021, 22, 3103. [Google Scholar] [CrossRef] [PubMed]
  74. Ma, D.; Constabel, C.P. MYB Repressors as Regulators of Phenylpropanoid Metabolism in Plants. Trends. Plant Sci. 2019, 24, 275–289. [Google Scholar] [CrossRef] [PubMed]
  75. Rozewicki, J.; Li, S.; Amada, K.M.; Standley, D.M.; Katoh, K. MAFFT-DASH: Integrated protein sequence and structural alignment. Nucleic. Acids Res. 2019, 47, W5–W10. [Google Scholar] [CrossRef] [PubMed]
  76. Nguyen, L.T.; Schmidt, H.A.; von Haeseler, A.; Minh, B.Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 2015, 32, 268–274. [Google Scholar] [CrossRef]
  77. Emms, D.M.; Kelly, S. OrthoFinder: Phylogenetic orthology inference for comparative genomics. Genome Biol. 2019, 20, 238. [Google Scholar] [CrossRef] [PubMed]
  78. Gill, N.; Dhillon, B. RNA-seq Data Analysis for Differential Expression. In Fusarium wilt: Methods and Protocols; Coleman, J., Ed.; Springer: New York, NY, USA, 2022; pp. 45–54. [Google Scholar]
Figure 1. Physicochemical properties of all identified R2R3-MYB proteins in six Ipomoea species. (a) Protein length; (b) Molecular weight (MW); (c) Isoelectric point (pI); (d) Instability index; (e) Aliphatic index; (f) Grand average of hydropathicity (GRAVY).
Figure 1. Physicochemical properties of all identified R2R3-MYB proteins in six Ipomoea species. (a) Protein length; (b) Molecular weight (MW); (c) Isoelectric point (pI); (d) Instability index; (e) Aliphatic index; (f) Grand average of hydropathicity (GRAVY).
Plants 12 01731 g001
Figure 2. Phylogenetic tree of R2R3-MYB members in Arabidopsis and six Ipomoea species. All identified R2R3-MYB members from I. batatas (131), I. trifida (133), I. triloba (129), I. nil (127), I. purpurea (51), I. aquatica (124), and Arabidopsis (126) were used. The full-length amino acid sequences of R2R3-MYB proteins were aligned using MAFFT, and the tree was constructed by the maximum-likelihood (ML) method of IQ-tree using the JTT + F + R10 model. These R2R3-MYB proteins are clustered into 36 clades (designated as C1 to C36). The AtCDC5 was rooted as the outgroup, and 4 proteins did not fit well into the clade. The genes that belonged to the same organism were marked in the same shape and color. The dots on the branches represent bootstrap values based on 1000 replications, and bootstrap values >90% and ≥70% are shown as black and gray dots in the phylogenetic tree, respectively, while those <70% are not shown.
Figure 2. Phylogenetic tree of R2R3-MYB members in Arabidopsis and six Ipomoea species. All identified R2R3-MYB members from I. batatas (131), I. trifida (133), I. triloba (129), I. nil (127), I. purpurea (51), I. aquatica (124), and Arabidopsis (126) were used. The full-length amino acid sequences of R2R3-MYB proteins were aligned using MAFFT, and the tree was constructed by the maximum-likelihood (ML) method of IQ-tree using the JTT + F + R10 model. These R2R3-MYB proteins are clustered into 36 clades (designated as C1 to C36). The AtCDC5 was rooted as the outgroup, and 4 proteins did not fit well into the clade. The genes that belonged to the same organism were marked in the same shape and color. The dots on the branches represent bootstrap values based on 1000 replications, and bootstrap values >90% and ≥70% are shown as black and gray dots in the phylogenetic tree, respectively, while those <70% are not shown.
Plants 12 01731 g002
Figure 3. Chromosomal distribution and intraspecies synteny analysis of R2R3-MYBs in six Ipomoea species. (a) I. batatas; (b) I. trifida; (c) I. triloba; (d) I. nil; (e) I. aquatica; (f) I. purpurea. The circle represents 15 chromosomes of each Ipomoea species. The R2R3-MYBs are unevenly distributed on corresponding all chromosomes in six Ipomoea species’ genomes. The blue lines show syntenic gene pairs of R2R3-MYBs, and the red lines show syntenic gene pairs of R2R3-MYBs belonging to clade C24(S6). The detailed information is listed in Table S4.
Figure 3. Chromosomal distribution and intraspecies synteny analysis of R2R3-MYBs in six Ipomoea species. (a) I. batatas; (b) I. trifida; (c) I. triloba; (d) I. nil; (e) I. aquatica; (f) I. purpurea. The circle represents 15 chromosomes of each Ipomoea species. The R2R3-MYBs are unevenly distributed on corresponding all chromosomes in six Ipomoea species’ genomes. The blue lines show syntenic gene pairs of R2R3-MYBs, and the red lines show syntenic gene pairs of R2R3-MYBs belonging to clade C24(S6). The detailed information is listed in Table S4.
Plants 12 01731 g003
Figure 4. Interspecies collinear analysis of R2R3-MYBs between I. batatas and Arabidopsis, and other five Ipomoea species. (a) Collinear analysis of IbR2R3-MYBs between AtR2R3-MYBs. (b) Collinear analysis of IbR2R3-MYBs between ItfR2R3-MYBs. (c) Collinear analysis of IbR2R3-MYBs between ItbR2R3-MYBs. (d) Collinear analysis of IbR2R3-MYBs between InR2R3-MYBs. (e) Collinear analysis of IbR2R3-MYBs between IaR2R3-MYBs. (f) Collinear analysis of IbR2R3-MYBs between IpR2R3-MYBs. The gray line links show the collinear block with I.batatas and six other plant species’ genomes. The blue lines show collinear gene pairs related to R2R3-MYBs, while the red lines show collinear gene pairs related to R2R3-MYBs belonging to clade C24(S6). The detailed information is listed in Table S5.
Figure 4. Interspecies collinear analysis of R2R3-MYBs between I. batatas and Arabidopsis, and other five Ipomoea species. (a) Collinear analysis of IbR2R3-MYBs between AtR2R3-MYBs. (b) Collinear analysis of IbR2R3-MYBs between ItfR2R3-MYBs. (c) Collinear analysis of IbR2R3-MYBs between ItbR2R3-MYBs. (d) Collinear analysis of IbR2R3-MYBs between InR2R3-MYBs. (e) Collinear analysis of IbR2R3-MYBs between IaR2R3-MYBs. (f) Collinear analysis of IbR2R3-MYBs between IpR2R3-MYBs. The gray line links show the collinear block with I.batatas and six other plant species’ genomes. The blue lines show collinear gene pairs related to R2R3-MYBs, while the red lines show collinear gene pairs related to R2R3-MYBs belonging to clade C24(S6). The detailed information is listed in Table S5.
Plants 12 01731 g004
Figure 5. The number of R2R3-MYB gene pairs derived from different gene duplication events in the six Ipomoea species. (a) The phylogenetic relationship among the six Ipomoea species. The species tree was inferred from a concatenated alignment matrix of 9835 single-copy ortholog sequences across 6 Ipomoea species genomes. (b) The number of five models of duplicated gene pairs in each species. The gene duplication events include whole-genome duplication (WGD), tandem duplication (TD), proximal duplication (PD), transposed duplication (TRD), and dispersed duplication (DSD). The x-axis represents the number of duplicated gene pairs. The y-axis represents species.
Figure 5. The number of R2R3-MYB gene pairs derived from different gene duplication events in the six Ipomoea species. (a) The phylogenetic relationship among the six Ipomoea species. The species tree was inferred from a concatenated alignment matrix of 9835 single-copy ortholog sequences across 6 Ipomoea species genomes. (b) The number of five models of duplicated gene pairs in each species. The gene duplication events include whole-genome duplication (WGD), tandem duplication (TD), proximal duplication (PD), transposed duplication (TRD), and dispersed duplication (DSD). The x-axis represents the number of duplicated gene pairs. The y-axis represents species.
Plants 12 01731 g005
Figure 6. Phylogenetic relationships, exon-intron structures, and conserved motifs of the R2R3-MYB genes in sweet potatoes. (a) The phylogenetic tree of the 131 IbR2R3-MYB proteins. The tree was constructed by the maximum-like (ML) method of IQ-tree with 1000 bootstrap replicates. (b) The gene structure of the 131 IbR2R3-MYB genes. The yellow boxes, green ellipses, and black lines represent exons, untranslated regions (UTR), and introns, respectively. The bar scale at the bottom shows the length of the gene. (c) The conserved motifs of the 131 IbR2R3-MYB proteins. The boxes with different colors represent Motif 1–10, and the black solid line represents non-conserved regions. While the dashed boxes represent the R2 and R3 domains, their sequences logo shown in Figure S1. The bar scale at the bottom shows the length of the amino acids.
Figure 6. Phylogenetic relationships, exon-intron structures, and conserved motifs of the R2R3-MYB genes in sweet potatoes. (a) The phylogenetic tree of the 131 IbR2R3-MYB proteins. The tree was constructed by the maximum-like (ML) method of IQ-tree with 1000 bootstrap replicates. (b) The gene structure of the 131 IbR2R3-MYB genes. The yellow boxes, green ellipses, and black lines represent exons, untranslated regions (UTR), and introns, respectively. The bar scale at the bottom shows the length of the gene. (c) The conserved motifs of the 131 IbR2R3-MYB proteins. The boxes with different colors represent Motif 1–10, and the black solid line represents non-conserved regions. While the dashed boxes represent the R2 and R3 domains, their sequences logo shown in Figure S1. The bar scale at the bottom shows the length of the amino acids.
Plants 12 01731 g006
Figure 7. Expression patterns analysis of differentially expressed genes (DEGs) of IbR2R3-MYB and anthocyanin biosynthesis genes (ABGs) in different pigmented tissues of sweet potatoes. (a) IbR2R3-MYBs and ABGs in white, yellow, and purple fleshes. (b,c) IbR2R3-MYBs and ABGs in red and yellow skins. (d,e) IbR2R3-MYBs and ABGs in green and purple leaves. The Log2(FPKM + 1) values were row scaled and displayed according to the color code. The red and blue colors represent the highest and lowest expression levels, respectively. The hierarchical clustering and green boxes in each heatmap show similar expression patterns.
Figure 7. Expression patterns analysis of differentially expressed genes (DEGs) of IbR2R3-MYB and anthocyanin biosynthesis genes (ABGs) in different pigmented tissues of sweet potatoes. (a) IbR2R3-MYBs and ABGs in white, yellow, and purple fleshes. (b,c) IbR2R3-MYBs and ABGs in red and yellow skins. (d,e) IbR2R3-MYBs and ABGs in green and purple leaves. The Log2(FPKM + 1) values were row scaled and displayed according to the color code. The red and blue colors represent the highest and lowest expression levels, respectively. The hierarchical clustering and green boxes in each heatmap show similar expression patterns.
Plants 12 01731 g007
Figure 8. Pearson’s correlation analysis of differentially expressed genes (DEGs) of IbR2R3-MYB and anthocyanin biosynthesis genes (ABGs) in different pigmented tissues of sweet potatoes. (a) IbR2R3-MYBs and ABGs in white, yellow, and purple fleshes. (b,c) IbR2R3-MYBs and ABGs in red and yellow skins. (d,e) IbR2R3-MYBs and ABGs in green and purple leaves. The pie charts show the Pearson correlation coefficient values. The red and blue colors represent the positive and negative correlation, respectively. The asterisk shows the p-value, * p < 0.05, ** p < 0.01, and *** p < 0.001.
Figure 8. Pearson’s correlation analysis of differentially expressed genes (DEGs) of IbR2R3-MYB and anthocyanin biosynthesis genes (ABGs) in different pigmented tissues of sweet potatoes. (a) IbR2R3-MYBs and ABGs in white, yellow, and purple fleshes. (b,c) IbR2R3-MYBs and ABGs in red and yellow skins. (d,e) IbR2R3-MYBs and ABGs in green and purple leaves. The pie charts show the Pearson correlation coefficient values. The red and blue colors represent the positive and negative correlation, respectively. The asterisk shows the p-value, * p < 0.05, ** p < 0.01, and *** p < 0.001.
Plants 12 01731 g008
Table 1. Genomic information and identified R2R3-MYB gene numbers in six Ipomoea species.
Table 1. Genomic information and identified R2R3-MYB gene numbers in six Ipomoea species.
Scientific NameCommon NamePloidyChromosome NumberGenome SizeGenome Gene NumberR2r3-Myb Genes Number
I. batatasSweet potatohexaploid15451 Mb64,295131
I. trifida diploid15477 Mb32,301133
I. trilobaTrilobed morning glorydiploid15447 Mb31,426129
I. nilJapanese morning glorydiploid15707 Mb42,783127
I. purpureaCommon morning glorydiploid15581 Mb53,97951
I. aquaticaWater spinachdiploid15494 Mb29,606124
Note: Genome size and gene number are estimated based on the final assembly version used in this study.
Table 2. The R2R3-MYB members number of each clade in each species and their major functions of each clade.
Table 2. The R2R3-MYB members number of each clade in each species and their major functions of each clade.
CladeClassification in ArabidopsisFunction AnnotationAtIbItfItbInIaIpTotal
C1S28development/stomatal formation222222214
C2S25development/female gametogenesis7553861145
C3S21development/specialized metabolism/cell wall thickening856555438
C4S23abiotic stress/salt stress322222114
C5S22abiotic stress/drought, salt, and cold stress4988881055
C6S26development/sperm cell formation/pollen formation122222213
C7S18 and S77development/stamen development724443327
C8S27development/leaf patterning11111117
C9S17abiotic stress343333120
C10S78abiotic stress/potassium stress341211214
C11S19 and S20development/abiotic stress945565438
C12S8secondary wall/lignin biosynthesis/abiotic stress656666035
C13S16development/hypocotyl elongation322222013
C14S32phenylpropanoid-derived secondary metabolites/secondary cell wall biosynthesis211212110
C15S13lignin/triterpenoid/cellulose biosynthesis423333018
C16S30 and S31starch biosynthesis/fructan synthesis/secondary cell wall biosynthesis368878040
C17S3phenylpropanoid-derived secondary metabolites/flavonoid/lignin biosynthesis412111010
C18S2abiotic stress355545128
C19S4phenylpropanoid-derived secondary metabolites/proanthocyanin/lignin biosynthesis666767240
C20S7phenylpropanoid-derived secondary metabolites/flavonol biosynthesis333223016
C21NOfunction unknown054444021
C22S15development/root hair development41111109
C23S44phenylpropanoid-derived secondary metabolites/proanthocyanin biosynthesis101132311
C24S6phenylpropanoid-derived secondary metabolites/anthocyanin biosynthesis446565030
C25S12glucosinolate biosynthesis60000006
C26NOfunction unknown089865238
C27S1abiotic stress/specialized metabolism554444026
C28S33 and S46development/anther/pollen development212122010
C29S14abiotic stress/specialized metabolism/development691110912057
C30NOfunction unknown022222010
C31S11abiotic stress453344023
C32S36development/inflorescence development/seed germination133333016
C33S10suberin biosynthesis21111118
C34S24development/lateral root development331222013
C35S9development/trichome branching/cuticle formation222212011
C36NOfunction unknown0118950033
Note: The classification and function annotation of R2R3-MYB members were obtained from references [9,14].
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, M.; Zhou, Y.; Li, K.; Guo, H. Genome-Wide Comparative Analysis of the R2R3-MYB Gene Family in Six Ipomoea Species and the Identification of Anthocyanin-Related Members in Sweet Potatoes. Plants 2023, 12, 1731. https://doi.org/10.3390/plants12081731

AMA Style

Li M, Zhou Y, Li K, Guo H. Genome-Wide Comparative Analysis of the R2R3-MYB Gene Family in Six Ipomoea Species and the Identification of Anthocyanin-Related Members in Sweet Potatoes. Plants. 2023; 12(8):1731. https://doi.org/10.3390/plants12081731

Chicago/Turabian Style

Li, Maoxing, Yuanping Zhou, Kaifeng Li, and Huachun Guo. 2023. "Genome-Wide Comparative Analysis of the R2R3-MYB Gene Family in Six Ipomoea Species and the Identification of Anthocyanin-Related Members in Sweet Potatoes" Plants 12, no. 8: 1731. https://doi.org/10.3390/plants12081731

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop