Next Article in Journal
Novel KCNQ2 Variants Related to a Variable Phenotypic Spectrum Ranging from Epilepsy with Auditory Features to Severe Developmental and Epileptic Encephalopathies
Next Article in Special Issue
Reynoutria japonica Houtt. Transformed Hairy Root Cultures as an Effective Platform for Producing Phenolic Compounds with Strong Bactericidal Properties
Previous Article in Journal
Leukocyte Telomere Length as a Marker of Chronic Complications in Type 2 Diabetes Patients: A Risk Assessment Study
Previous Article in Special Issue
Characterization of the IAA-Producing and -Degrading Pseudomonas Strains Regulating Growth of the Common Duckweed (Lemna minor L.)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Metabolomic and Transcriptomic Analyses of Flavonoid Biosynthesis in Different Colors of Soybean Seed Coats

1
Crop Research Institute, Sichuan Academy of Agricultural Sciences, Chengdu 610066, China
2
Environment-Friendly Crop Germplasm Innovation and Genetic Improvement Key Laboratory of Sichuan Province, Chengdu 610066, China
3
College of Life Sciences, Shandong Agricultural University, Taian 271018, China
4
Sichuan Seed Station, Chengdu 610041, China
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2025, 26(1), 294; https://doi.org/10.3390/ijms26010294
Submission received: 19 December 2024 / Accepted: 29 December 2024 / Published: 31 December 2024
(This article belongs to the Special Issue Developing Methods and Molecular Basis in Plant Biotechnology)

Abstract

Soybean has outstanding nutritional and medicinal value because of its abundant protein, oil, and flavonoid contents. This crop has rich seed coat colors, such as yellow, green, black, brown, and red, as well as bicolor variants. However, there are limited reports on the synthesis of flavonoids in the soybean seed coats of different colors. Thus, the seed coat metabolomes and transcriptomes of five soybean germplasms with yellow (S141), red (S26), brown (S62), green (S100), and black (S124) seed coats were measured. In this study, 1645 metabolites were detected in the soybean seed coat, including 426 flavonoid compounds. The flavonoids differed among the different-colored seed coats of soybean germplasms, and flavonoids were distributed in all varieties. Procyanidins A1, B1, B6, C1, and B2, cyanidin 3-O-(6″-malonyl-arabinoside), petunidin 3-(6″-p-coumaryl-glucoside) 5-glucoside, and malvidin 3-laminaribioside were significantly upregulated in S26_vs._S141, S62_vs._S141, S100_vs._S141, and S124_vs._S141 groups, with a variation of 1.43–2.97 × 1013 in terms of fold. The differences in the contents of cyanidin 3-O-(6″-malonyl-arabinoside) and proanthocyanidin A1 relate to the seed coat color differences of red soybean. Malvidin 3-laminaribioside, petunidin 3-(6″-p-coumaryl-glucoside) 5-glucoside, cyanidin 3-O-(6″-malonyl-arabinoside), and proanthocyanidin A1 affect the color of black soybean. The difference in the contents of procyanidin B1 and malvidin 3-glucoside-4-vinylphenol might be related to the seed coat color differences of brown soybeans. Cyanidin 3-gentiobioside affects the color of green soybean. The metabolomic–transcriptomic combined analysis showed that flavonoid biosynthesis is the key synthesis pathway for soybean seed color formation. Transcriptome analysis revealed that the upregulation of most flavonoid biosynthesis genes was observed in all groups, except for S62_vs._S141, and promoted flavonoid accumulation. Furthermore, CHS, CHI, DFR, FG3, ANR, FLS, LAR, and UGT88F4 exhibited differential expression in all groups. This study broadens our understanding of the metabolic and transcriptomic changes in soybean seed coats of different colors and provides new insights into developing bioactive substances from soybean seed coats.

1. Introduction

Soybean [Glycine max (Linn.) Merr.] is an essential nutritional and commercial crop that provides major plant proteins and oils for people and animals worldwide [1,2]. As a physical barrier, a soybean seed coat controls the metabolism of seed development and dormancy, disease resistance, and nutritional metabolism [3]. A soybean seed coat contains several active substances, such as dietary fiber, polyphenols, and flavonoids. These compounds are the focus of extensive research because of their remarkable antioxidant properties and ability to influence the flavor characteristics of the seeds [4]. In addition, these compounds play a crucial role in preventing cancer [5], inflammation, and cardiovascular diseases [6].
The soybean seed coat has evolved to be black, brown, green, and bicolor via artificial selection and domestication [7]. Flavonoids and anthocyanins are the main substances affecting the color of the soybean seed coat [8]. Flavonoids are widely distributed in higher plants, with flavones, chalcones, anthoxanthins, and proanthocyanidins present [9,10]. These substances play major roles in various biological processes, imparting various colors, antioxidant properties, and nutritional value [11,12]. Anthocyanins are important flavonoids that influence the coloration of the seed coat, which can cause plant organs to exhibit different color phenotypes [13]. Anthocyanins are synthesized through the flavonoid pathway and regulated by many structural genes, such as chalcone synthase (CHS), flavanone 3-hydroxylase (F3H), and anthocyanidin synthase (ANS) [12,14]. In soybean seeds, a large number of color changes and pigmentation patterns are observed [15].
Currently, 23 color-regulating genes have been reported for soybean seeds, and 19 genes are on the flavonoid synthesis pathway, such as CHS, chalcone isomerase (CHI), and UDP-glucose flavonoid 3-glucosyltransferase (UFGT) [16]. CHS regulates the content and activity of chalcone synthase, while the rate-limiting step governs the entire metabolic pathway. The transcription factor MYB at the R site regulates the content of anthocyanidin by controlling the content and activity of ANS, affecting the color of the soybean seed coat [17]. Anthocyanidins are catalyzed by anthocyanidin reductase (ANR) and UDP-glucose: flavonoid 3-O-glucosyltransferase (UF3GT), forming proanthocyanins and anthocyanins [7]. The proportion of proanthocyanins and anthocyanins determines the color of the seed coat.
With the rapid development of sequencing technology, multi-group approaches have become the primary means by which researchers explore previously uncharted areas of cellular biology. Metabolomics is used to determine the metabolic profile of crops under specific conditions and is considered a bridge between genotypes and phenotypes [18]. The transcriptome typically clarifies the regulatory mechanisms of a trait of interest by comprehensively analysing gene expression. The combined analysis of transcriptomics and metabolomics explores the causal relationship between genes and metabolites. Combined with the functional annotation of metabolic pathways and biological functional analysis, plant regulatory mechanisms can be systematically analyzed [19]. Multi-omics research, such as transcriptomics, proteomics and metabolomics, is widely performed on plant leaves [20,21] and seeds [6,22]. However, limited metabolomic research focuses on detecting metabolites in the soybean seed coat.
The soybean seed coat color is a main agronomic feature that determines seed quality. The seed coat color is closely related to appearance quality and nutritional value [23]. Compared with yellow seed coat soybean, the black seed coat soybean germplasm has a higher nutritional value in breeding programs [24]. Nevertheless, little is known about the metabolites in soybean seed coats of different colors within a wide range of soybean seed germplasm resources. Therefore, the purpose of this study was to detect and quantify the composition and content of flavonoids in five soybean germplasms with different seed coat colors via metabolome-based methods. The related genes controlling seed coat color were explored by transcriptome sequencing, and the candidate genes regulating the coloring mechanism of soybean seed coat were screened to provide important information regarding the genetic improvement of high-quality, nutritious soybean varieties.

2. Results

2.1. Metabolic Differences Among the Seed Coat of Soybean Germplasms

Five soybean cultivars with different seed coat colors and phenotypes of soybean germplasms were analyzed, and they were found to be in the mature stage (Figure 1 and Table 1). The systematic metabolic profiling of soybean seed coats with different colors was performed. The yellow seed coat soybean S141 was used as the control, and the results for S26, S62, S100, and S124 were labelled as S26_vs._S141, S62_vs._S141, S100_vs._S141, and S124_vs._S141, respectively (Figure 2). The PCA results showed the excellent reproducibility of the sample data. The scattering patterns of the five soybean germplasms in the PCA plot showed evident differences among the metabolite profiles (Figure 2a). A total of 475, 422, 328, and 420 DAMs had differential expressions among S26_vs._S141, S62_vs._S141, S100_vs._S141, and S124_vs._S141, respectively (Figure 2b). In addition, 475 DAMs were screened out in S26_vs._S141, of which 298 were upregulated, and 177 were downregulated (Figure 2b). Moreover, 422 DAMs were screened out in S62_vs._S141, of which 278 were upregulated and 144 were downregulated (Figure 2b). A total of 328 DAMs were screened out in S100_vs._S141, of which 129 were upregulated and 199 were downregulated (Figure 2b). A total of 420 DAMs were screened out in S124_vs._S141, of which 310 were upregulated, and 110 were downregulated (Figure 2b). The DAMs caused by different seed coat colors were described by volcano plots in order to visually assess the overall distribution of DAMs in all comparison groups (Figure 2c–f).

2.2. Analysis of the DAMs in the Flavonoid Biosynthetic Pathway

A total of 1645 DAMs were detected in the seed coat of five soybean germplasms (Table S2). These DAMs were divided into 19 categories, and the remaining 140 DAMs were assigned to the “others” group (Figure 3a). The dominant DAMs were flavonoids, prenol lipids, organooxygen compounds, carboxylic acids and their derivatives, and isoflavonoids. These DAMs accounted for 72.2% of the total DAMs. In total, 426 flavonoids were identified from these four sample groups (Figure 3a). The annotation of these flavonoids is detailed in Table S2. The metabolomics data show that 143, 108, 33, and 142 flavonoids were identified among the S26_vs._S141, S62_vs._S141, S100_vs._S141, and S124_vs._S141 groups, respectively (Figure 3b). Compared with S141, 130, 93, 9, and 133 flavonoids were upregulated and 13, 15, 24, and 9 flavonoids were downregulated in S26, S62, S100, and S124, respectively (Table S3). The highest number of flavonoids was detected in the S26_vs._S141 group, demonstrating that soybeans with red seed coats exhibited a different metabolic profile than those with yellow seed coats. The lowest number of flavonoids was detected in the S100_vs._S141 group, indicating that soybeans with green seed coats exhibited a similar metabolic profile to those with yellow seed coats. In addition, a high number of flavonoids was detected in the S62_vs._S141 and S124_vs._S141 groups, indicating that soybeans with black and brown seed coats exhibited a different metabolic profile compared with soybeans with yellow seed coats (Figure 3b). Overall, our results showed the abundant diversity of metabolites in local soybean germplasm seed coats.
Based on the metabolome data, a total of 426 flavonoids, with significant differences among the S26_vs._S141, S62_vs._S141, S100_vs._S141, and S124_vs._S141 groups, were composed of 365 upregulated and 61 downregulated metabolites (Table S3). The number of differentially expressed flavonoids between the S26_vs._S141 and S124_vs._S141 groups was almost identical, with the accumulation of 143 and 142 flavonoids upregulated in the seed coat. In S26 and S124, 130 and 133 flavonoids, including quercetin and its derivatives, procyanidins, kaempferol and its derivatives, epicatechin, and epigallocatechin derivatives, were upregulated in the soybean seed coat. These flavonoids included nine procyanidins (procyanidin, procyanidin B1, procyanidin B2, procyanidin B3, procyanidin B6, procyanidin C1, proanthocyanidin A1, proanthocyanidin A2, and prodelphinidin B) and nine anthocyanins (Table S3). The anthocyanins included three cyanidin derivatives (cyanidin 3-galactoside, cyanidin 3-gentiobioside, cyanidin3-O-(6″-malonyl-arabinoside), three malvidin derivatives (malvidin 3-glucoside-4-vinylguaiacol, malvidin 3-glucoside-4-vinylphenol, malvidin 3-laminaribioside), peonidin-3-glucoside, petunidin 3-(6″-p-coumaryl-glucoside) 5-glucoside), and pelargonidin 3-rhamnoside 5-glucoside (Table S3). The high levels of flavonoids in soybean seed coats have long been reported to provide additional nutritional value to soybean food and feed [13], indicating that black and red seed coats of soybeans may have higher nutritional value.
Among these DAMs, five, three, one, and five types of anthocyanins were upregulated in S26, S62, S100, and S124, respectively, compared with S141, and these anthocyanins might be the key metabolites influencing seed coat coloration in soybean. Three kinds of anthocyanins, namely, cyanidin 3-O-(6″-malonyl-arabinoside), malvidin 3-laminaribioside, and malvidin 3-glucoside-4-vinylphenol, exhibited 2.97 × 1013-fold, 2.57 × 1012-fold, and 1.12 × 103-fold, as well as 2.17 × 1012-fold, 5.40 × 1012-fold, and 1.00 × 103-fold, increments in S26_vs._S141 and S124_vs._S141, respectively (Table S3). Proanthocyanidin A1 exhibited 2.0 × 1013-fold and 2.2 × 1012-fold increments in S26_vs._S141 and S124_vs._S141, respectively, which are greater increments than those of proanthocyanidin A2 and other procyanidin. Therefore, specific anthocyanins such as cyanidin 3-O-(6″-malonyl-arabinoside), malvidin 3-laminaribioside, malvidin 3-glucoside-4-vinylphenol, and proanthocyanidin A1 can serve as landmark metabolites of red and black seed coat soybeans. Notably, petunidin 3-(6″-p-coumaryl-glucoside) 5-glucoside, cyanidin 3-galactoside, astilbin, 3-galloylgallocatechin, and kaempferol 3-(2-p-coumaroylsophoroside) 7-glucoside accumulated massively in S124. The presence of these compounds was 1.29 × 106–6.79 × 1012 times higher in S124 than in S141 (Table S3). Therefore, we speculated that these DAMs might be key metabolites affecting the black seed coat color of S124. Peonidin-3-glucoside was 4.99-, 1.3-, 2.58, and 1.84-fold lower in S26_vs._S141, S62_vs._S141, S100_vs._S141, and S124_vs._S141, respectively. By contrast, procyanidin, procyanidin B1, procyanidin B2, procyanidin B3, procyanidin B6, procyanidin C1, proanthocyanidin A1, proanthocyanidin A2, and prodelphinidin B were increased in S26_vs._S141, S62_vs._S141, and S124_vs._S141, respectively (Table S3). Previous studies have shown that the dark seed coat mainly contains anthocyanins and procyanidins, and the composition ratio and distribution of the two determine the coloring degree of the seed coat [24]. Similar results were also reported for cyanidin 3-O-glucoside in some plants with red flowers or fruit [25,26].
Three malvidin derivatives, one pelargonidin derivative, one petunidin derivative, and two cyanidin derivatives were accumulated in deep-color seed coat soybeans (red, brown, and black). Among these, cyanidin 3-O-(6″-malonyl-arabinoside) and malvidin 3-laminaribioside were the most abundant in S26 and S124, indicating that they were the primary sources of the red and black pigments. However, petunidin 3-(6″-p-coumaryl-glucoside) 5-glucoside was almost undetectable in S26, which indicated that petunidin 3-(6″-p-coumaryl-glucoside) 5-glucoside is the primary source of black phenotypes. Moreover, kaempferol and its derivatives, quercetin and its derivatives, and epicatechin were identified in S26_vs._S141. In particular, the presence of trifolin, quercetin, epicatechin-(4beta->8)-epigallocatechin 3-O-gallate isorhamnetin 3-(6″-malonylglucoside), and kaempferol 4′-glucoside 7-rhamnoside was 2.17 × 1012–2.97 × 1013 times higher in S26 than in S141 (Table S3), implying that these flavonoid compounds play essential roles in deepening the red color of soybean seed coats.

2.3. Transcriptomic Profiling of Soybean Seed Coat

RNA-seq analysis was performed to investigate the transcriptome of soybean germplasms with different seed coat colors and to determine the molecular basis of the metabolic differences. In this study, we obtained 101.51 G of clean data, and the compelling data of each sample were distributed between 6.16 and 7.16 G. Moreover, the Q30 value (sequencing accuracy of 99.9%) was more than 93.30% (Table S4). The reads were aligned to the reference genome, and the ratio of successfully mapped reads ranged from 87.02% to 97.09%. PCA obtained excellent sample data reproducibility (Figure 4a). The PCA results showed large variations in S26_vs._S141 and S100_vs._S141, and less variation was observed in the S62_vs._S141 and S124_vs._S141 groups (Figure 4a). The DEGs, which emerged due to soybean germplasms with different seed coat colors, were described by volcano plots (Figure 4c–f). The comparison of S26_vs._S141 resulted in 14,904 DEGs, including 8424 upregulated and 6480 downregulated genes. The comparison of S62_vs._S141 resulted in 5425 DEGs, including 1959 upregulated and 3466 downregulated genes. The comparison of S100_vs._S141 resulted in 17,581 DEGs, including 10,012 upregulated and 7569 downregulated genes. The comparison of S124_vs._S141 resulted in 1924 DEGs, including 1151 upregulated and 773 downregulated genes (Figure 4b). The results indicated that these DEGs might play a key role in the expression of soybeans with different seed coat colors.

2.4. KEGG Pathway Enrichment Analysis of the Expected Changes in DAMs and DEGs

All DAMs were mapped to the KEGG (Kyoto Encyclopedia of Genes and Genomes) database, and a total of 43, 41, 46, and 42 DAMs in the S26_vs._S141, S62_vs._S141, S100_vs._S141, and S124_vs._S141 groups were mapped to the KEGG pathways, respectively (Table S5). Similarly, KEGG enrichment analysis was conducted on the DEGs, and the results showed that a total of 3161, 1210, 3773, and 401 DEGs in the S26_vs._S141, S62_vs._S141, S100_vs._S141, and S124_vs._S141 groups were mapped to the KEGG pathway, respectively (Table S6). A large number of DAMs and DEGs were enriched into these pathways, which were associated with phenylpropanoid biosynthesis (gmx00940), flavonoid biosynthesis (gmx00941), flavone and flavonol biosynthesis (gmx00944), carotenoid biosynthesis (gmx00906), porphyrin and chlorophyll metabolism (gmx00860), and anthocyanin biosynthesis (gmx00942, Figure 5).
Mapping the differentially accumulated flavonoid metabolites among the five soybean germplasm seed coats showed that S26 had the highest relative content of all flavonoid metabolites (Table S5). Compared with S141, 10 compounds were detected that involved flavonoid biosynthesis, including quercetin, isoquercitrin, phloretin, 3,7-dimethylquercetin, luteoforol, catechin, phlorizin, quercetin 3-(6″-malonyl-glucoside), isowertin 2″-rhamnoside, and (+)-gallocatechin, which were significantly upregulated in S26 (Table S5). The results showed that there were significant differences in metabolites among the four groups with different seed coat colors, including 11 (10 upregulated and 1 downregulated), 9 (6 upregulated and 3 downregulated), 4 (1 upregulated and 3 downregulated), and 16 (13 upregulated and 3 downregulated) DAMs in the S26_vs._S141, S62_vs._S141, S100_vs._S141, and S124_vs._S141 groups, respectively (Table S5). Moreover, the presence of DAMs in the flavonoid biosynthetic pathways of the S26_vs._S141, S62_vs._S141, and S124_vs._S141 groups was significantly increased, indicating that these groups were related to the seed coat color changes of soybean, especially phloretin, luteoforol, catechin, and phlorizin (Table S5). Therefore, we hypothesized that those DAMs produced in the flavonoid biosynthetic pathways might be key metabolites in the deep-colored seed coat of soybean germplasm. Two flavonoids that comprised quercetin and isoquercitrin were identified from 40 flavonoid metabolites, and they were only found in S26 soybean. Quercetin accounted for the most significant proportion of upregulated metabolites, followed by isoquercitrin and phloretin (Table S5). This suggests that quercetin, isoquercitrin and phloretin may also play important roles in the formation of the red soybean seed coat color.

2.5. Candidate Genes Related to the Flavonoid Biosynthetic Pathway

We annotated the functions of 13 DEGs based on the GO and KEGG databases to anchor the candidate genes for seed coat color in soybean. The results indicated that the genes SoyZH13_11G010700 (CHS), SoyZH13_08G103500 (CHS1), SoyZH13_02G120600 (CHS6), SoyZH13_10G269700 (CHI1B2), SoyZH13_02G046100 (FHT), SoyZH13_14G066200 (DFR), SoyZH13_02G147700 (DFR), SoyZH13_20G171400 (LAR), SoyZH13_01G196800 (ANT17), SoyZH13_08G058800 (ANR), SoyZH13_05G078700 (FLS), SoyZH13_12G118300 (FG3), SoyZH13_20G225300 (CHI2-A), and SoyZH13_06G290900 (UGT88F4) might be involved in the response to seed coat color in soybean (Figure 6). They were involved in the flavonoid biosynthetic pathway. Compared with S141, the genes FG3, CHS, CHS1, CHS6, CHI2-A, UGT88F4, FHT, DFR, ANT17, LAR, and FLS were upregulated in S26, whereas the gene ANR was downregulated. The genes FG3 and LAR showed 78.61- and 18.8-fold increments in S26 compared with S141, whereas the gene ANR showed 0.04-fold degression (Table 2). The genes FHT, ANT17, ANR, UGT88F4, and FLS were downregulated in S62 compared with S141, whereas CYP75B2 showed a 9.28-fold degression (Table 2). The expression levels of CHS6, CHS1, UGT88F4, FHT, DFR, ANT17, LAR, and FLS were upregulated in S100 compared with S141 (Table 2). CHS6, CHS1, and ANT17 were upregulated in S124, whereas CHS, ANR, LAR, UGT88F4, and FLS were downregulated compared with S141. The genes CHS6, CHS1, FG3, and ANT17 were significantly and highly expressed in S124 compared with S141, and their levels were 11.24-, 10.3-, 6.81-, and 12.17-fold higher than those in S141, respectively (Table 2).
We randomly selected and measured six genes by qRT-PCR to verify the identified differentially expressed genes. These genes exhibited an expression pattern similar to the profiles of differentially expressed genes in soybean seed coats identified through RNA-seq analysis (Figure 7).

3. Discussion

3.1. Identification of Flavonoid Compounds in Soybean Seed Coat

The transcriptome and metabolome have enabled the development of effective methods to determine gene and metabolite components [27]. At present, few omics studies have been conducted on soybean seed coats. In this research, metabolic profiling and transcriptome analyses were used to explore the changes in relevant flavonoid compounds of soybean seed coats and their respective genes with different seed coat colors. Our results show that 1645 metabolites were identified in soybean seed coats using a non-targeted method. Compared with yellow seed coat soybean, more flavonoid metabolites were detected in red, brown, and black seed coat soybean, whereas fewer flavonoid metabolites were detected in green soybean seed coats in this research (Figure 3b). Consequently, the rich flavonoid metabolites identified in this research provide valuable reference information about soybeans with high nutritional values.
Flavonoids are natural phenolic secondary metabolites widely distributed in higher plants, including quercetin, kaempferol, flavonol, flavone, chalcones, anthoxanthins, and proanthocyanidin [9,10]. The phenylpropanoid biosynthetic pathway forms flavonoids, and they play major roles in various biological processes because of their effect on several characteristics of plant organs, imparting various colors, antioxidant properties, and nutritional value [11,12]. In plants, the accumulation of flavonoid compounds results in the different colors of plant tissues and is closely related to the types and amounts of flavonoids [7,10]. In our study, 426 flavonoids were detected in five soybeans with different seed coat colors. We found five flavonoids, including cyanidin 3-O-(6″-malonyl-arabinoside), proanthocyanidin A1, quercetin, kaempferol 4′-glucoside 7-rhamnoside, and kaempferol 3-rhamnoside 7-xyloside, to be enriched in the deep-colored seed coat of soybean germplasm. The content of these flavonoids was highest in red seed coats. We also found a higher content of kaempferol 3-(2-p-coumaroylsophoroside) 7-glucoside, cyanidin 3-O-(6″-malonyl-arabinoside), and proanthocyanidin A1 in black seed coats (Table S3). The chemical traits of anthocyanins have been widely investigated [10]. Anthocyanins and their contents are the key factors influencing the coloration of plant tissues, and they comprise pelargonidin, cyanidin, and delphinidin [28]. Cyanidin contributes to the red color of the soybean seed coat [29]. Our study detected cyanidin 3-O-(6″-malonyl-arabinoside) in the red seed coat. Compared with the yellow seed coat, the content of cyanidin 3-O-(6″-malonyl-arabinoside) in the red seed coat increased 2.97 × 1013-fold, indicating that cyanidin 3-O-(6″-malonyl-arabinoside) may play a key role in the coloration of the red seed coat of soybean (Table S3). This finding is similar to the results of Fu et al., who suggested that cyanidin 3-O-(6″-malonyl) glucoside is the major source of red phenotypes [12].
We found that the accumulation of flavonoids in the soybean seed coat of different colors was inconsistent. Interestingly, the flavonoids between the red and yellow seed coat of soybean showed that quercetin, quercetin 4′-glucoside, quercetin 3-beta-laminaribioside, and kaempferol contents were significantly higher in red seed coats than in yellow seed coats (Table S3). Some studies show that seed metabolites are directly related to seed quality and nutritional values and that the metabolites fluctuate greatly with seed coat color in a wide range of soybean seed germplasm resources [21]. The contents of isoflavones and the composition of fatty acids were affected by the seed coat color [30]. More in-depth studies of the seed coat metabolomes of red soybeans would help us to understand their unique metabolism and nutritional values better.
Meanwhile, we found that the expression level of genes related to quercetin and kaempferol was increased, indicating that quercetin and kaempferol compounds were regulated at the transcriptional level. A few studies reported that peonidin is distributed in purple sweet potato and black grain, and it is a crucial flavonoid component that changes the color of the grain from green to black [9,28]. However, peonidin-3-glucoside is downregulated in red, brown, green, and black soybean seed coats among these flavones compared with yellow seed coat soybean. This result was different from a previous study on purple sweet potato and black grain, suggesting that this flavonoid metabolite expression level was related to the type and variety of plant tissue.

3.2. DEGs Related to the Biosynthesis of Flavonoids in the Seed Coat of Soybean

The transcriptome is widely used to analyze the molecular mechanism of metabolite accumulation [31,32]. In this paper, we found that a large number of DEGs are enriched during phenylpropanoid biosynthesis, flavonoid biosynthesis, and anthocyanin biosynthesis in KEGG annotation, such as CHS, FHT, ANT17, ANR, UGT88F4, and FLS, and these genes exhibit differential expression (Table 2). CHS6, CHS, CHS1, UGT88F4, FHT, DFR, ANT17, LAR, FLS, FG3, and CHI2-A were significantly upregulated in S26_vs._S141, whereas ANR was significantly downregulated. Similar results were also exhibited in S100_vs._S141 (Table 2). The biosynthesis of flavonoids was significantly enriched in the transcriptome comparison of S26_vs._S141, S100_vs._S141, and S124_vs._S141 and was enriched in the corresponding metabolome comparisons of S26_vs._S141, S62_vs._S141, and S124_vs._S141 (Table 2 and Table S5). Although the biosynthesis of flavonoids was not significantly enriched in the transcriptome comparison of S62_vs._S141, it was enriched in the metabolome comparison of S62_vs._S141, indicating the presence of posttranscriptional regulatory activity, and similar results were reported in cassava flesh [18,33]. DEGs (CHS1, UGT88F4, FHT, DFR, ANT17, LAR, and FLS) in S62_vs._S141 showed a contrary expression pattern compared with those in S100_vs._S141. UGT88F4, DFR, LAR, and FLS in S124_vs._S141 also showed a contrary expression pattern compared with those in S100_vs._(Table 2). Flavonoids and DEGs were significantly enriched, and they showed a similar expression pattern in the transcriptome and metabolome comparisons of S26_vs._S141. These results suggest that soybean seed red coat pigmentation is potentially regulated at the transcriptional level.
Previous research reported that upregulating the expression level of FLS improved anthocyanin accumulation and changed the flower color of P. suffruticosa and C. nitidissima [12,34]. CHS and FLS play vital roles in the formation of the golden flowers of C. nitidissima [35]. When the expression of ANR1 and ANR2 was inhibited, the soybean seed coat showed a red-brown color [36]. In this study, the expression level of the CHS and FLS genes was upregulated (Table 2), which was in accordance with the significant increment in the content of cyanidin 3-O-(6″-malonyl-arabinoside) and malvidin 3-laminaribioside in S26_vs._S141. As shown in Table 2, comparison-based analysis revealed that the downregulation of genes encoding UGT88F4, FLS, ANR and LAR was found in S62_vs._S141 and S124_vs._S141, which was inconsistent with the significant increment in the content of pelargonidin 3-rhamnoside 5-glucoside, malvidin 3-glucoside-4-vinylphenol, cyanidin 3-O-(6″-malonyl-arabinoside), petunidin 3-(6″-p-coumaryl-glucoside) 5-glucoside, and malvidin 3-laminaribioside (Table 2). Anthocyanin biosynthesis involves multiple enzymes encoded by early biosynthesis genes (CHS, CHI) and anthocyanin-specific biosynthesis genes (DFR, ANS, and UFGT) [34]. In this study, the expression levels of LAR, DFR, CHS6, and FLS in S26 were higher than those in S141, whereas the transcriptional levels of ANR were lower than those of S141 (Table 2). ANR can convert anthocyanins into epicatechins and promote the biosynthesis of proanthocyanins [37]. A low level of expression of ANR genes leads to the abundance of anthocyanidins in the flavonoid biosynthesis of S26. We found that the cyanidin 3-gentiobioside content of S100 is considerably higher than that of S141. Thus, we speculated that the high expression of DFR in S100 can be related to the accumulation of cyanidin 3-gentiobioside. FLS encodes a flavonol synthase and can affect plant pigment, and anthocyanin content is reduced during FLS over-expression lines in Arabidopsis. This study found that an FG3 gene was differentially expressed and upregulated 78.61-fold in the red seed coats. We deduced that the higher transcription level of FG3 red seed coats may redirect the biosynthesis of cyanidin 3-O-(6′-malonyl-arabinoside) and proanthocyanidin A1. These results indicate that the genes mentioned earlier might play essential roles in the color diversity of soybean seed coats. We will see this in the next step of this research.

4. Materials and Methods

4.1. Plant Materials and Experimental Site

Field experiments were conducted in 2021 at the Modern Agricultural Science and Technology Innovation Demonstration Park of Sichuan Academy of Agricultural Sciences in Xindu District, Sichuan, China (30°40′ N, 103°54′ E). The seed coat colors of the soybean germplasms “Huangdou” (S141), “Hongpixiangdou” (S26), “Bendihuangdou” (S62), “Bendiluhuangdou” (S100), and “Heipidadou” (S124) were yellow, red, brown, green, and black, respectively (Figure 1). A randomized block design was adopted. Soybean seeds were planted on 17 June 2021 and harvested on 25 October 2021. The row-to-row and plant-to-plant distances of soybeans were 50 and 20 cm, respectively. The fertilization scheme and field management followed Yang’s method [38]. At the R7 (beginning of maturity, one normal pod on the main stem has reached its mature pod color) stage of soybean growth, the seed coat of soybean germplasm was sampled and frozen in liquid nitrogen. The soybean seed coat samples were stored in a freezer at −80 °C until metabolome and transcriptome analyses. The analysis was performed with three biological replicates.

4.2. Metabolome Measurement

The metabolites in the soybean seed coat were detected by liquid chromatography–mass spectrometry (LC-MS). Using the method described by Xiao et al. [18], the metabolites were extracted, detected, identified, quantified, and analyzed by Shanghai Luming Biological Technology Co., Ltd. (Shanghai, China).
Then, 80 mg of soybean seed coat powder was weighed, and 20 μL of internal standard (2-chloro-L-phenylalanine, 0.06 mg/mL; prepared in methanol), 1 mL of a methanol–water mixture (7/3, v/v), and two small steel balls were added. The mixture was precooled at −20 °C and then ground. Ultrasonic extraction was performed in an ice-water bath for 30 min, followed by overnight storage at −20 °C. The samples were centrifuged at 13,000 rpm and 4 °C for 10 min. A total of 150 µL of supernatant was collected using a syringe, filtered through a 0.22 μm microporous membrane, transferred to an LC injection vial, and stored at −80 °C until LC-MS analysis [39].
The extraction solution was analyzed using an ACQUITY UPLC I-Class plus machine (Waters Corporation, Milford, CT, USA) connected to a Q-Exactive mass spectrometer (Thermo Fisher Scientific, Waltham, MA, USA) operating with both positive and negative ion modes within an electrospray ionization source (ESI). The liquid chromatography conditions were as follows: an ACQUITY UPLC HSS T3 column (1.8 μm, 100 × 2.1 mm) was used. The mobile phase consisted of two components: solvent A (0.1% formic acid in water, v/v) and solvent B (0.1% formic acid in acetonitrile, v/v). Sample measurements were made using an elution, the gradient of which is shown in Table 3. The flow rate was maintained at 0.35 mL/min throughout the analysis. The injection volume was 2 μL, and the column temperature was set to 45 °C. Mass spectrometry data acquisition was performed in both positive and negative ion scanning modes. Mass spectrum parameters are shown in Table 4.
The raw LC-MS data were processed using Progenesis QI V2.0 software (Waters Corporation, Milford, CT, USA). This involved baseline filtering, peak identification, integration, retention time correction, peak alignment, and normalization. Compounds were identified using the Human Metabolome Database (HDBM), Lipidmaps (V2.3) and a self-built database (Luming Biotech CO., Ltd., Shanghai, China). For the obtained data, we removed ion peaks with a missing value (0 value) > 50% in the group, replaced a 0 value with half of the minimum value, and screened qualitative compounds according to the qualitative result score. Compound identification was performed based on exact molecular weight matching in MS/MS (20 points), fragment matching in MS/MS (20 points), and isotope distribution matching (20 points), with a total score of 60 points. Compounds scoring below 36 points were excluded as inaccurate identifications. Quality control (QC) samples were prepared by mixing in equal volumes the extracts from all samples. Positive and negative ion data were combined into a data matrix for further analysis [40].
Principal component analysis (PCA) was performed in R to determine the relationships among the sample replicates. Orthogonal partial least-squares discriminant analysis (OPLS-DA) and partial least-squares discriminant analysis (PLS-DA) were used to distinguish the differentially accumulated metabolites (DAMs) among the groups. Model quality was evaluated using seven-fold cross-validation and 200 permutations of response testing. Metabolites with variable importance in projection (VIP) scores > 1 and a p-value < 0.05 were chosen as significant DAMs. Pathway analysis of DAMs was conducted using the KEGG database. Differential metabolites were further used for the enrichment analysis of the KEGG pathway (http://www.genome.jp/kegg/, accessed on 1 June 2023). Metabolomics data were deposited in the EMBL-EBI MetaboLights database (DOI: 10.1093/nar/gkad1045, PMID:37971328, accessed on 7 November 2024) with the identifier MTBLS11608.

4.3. Transcriptome Analysis

All freeze-dried soybean seed coats were ground by liquid nitrogen, and total RNA was extracted using the TRIzol reagent (Invitrogen, Carlsbad, CA, USA). RNA purity and quantification were evaluated using the NanoDrop 2000 spectrophotometer (Thermo Scientific, Waltham, MA, USA). RNA integrity was assessed using the Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA). Then, the libraries were constructed using the VAHTS Universal V6 RNA-seq Library Prep Kit following the manufacturer’s instructions. Transcriptome sequencing and analysis were conducted by OE Biotech Co., Ltd. (Shanghai, China). The libraries were sequenced on an Illumina Novaseq 6000 platform, generating 150 bp paired-end reads. The clean reads were mapped to the reference genome using HISAT2. The FPKM of each gene was calculated, and the read counts of each gene were obtained using the HTSeq-count.
PCA was performed using R (v 3.2.0) to evaluate the biological duplication of samples. Differential expression analysis was performed using DESeq2. We set a Q value of <0.05 and a foldchange of >2 or <0.5 as the threshold for significantly and differentially expressed genes (DEGs). Based on the hypergeometric distribution, GO and KEGG pathway enrichment analysis of DEGs was performed to screen significantly enriched terms. Three biological replicates were tested for transcriptome sequencing. The raw sequence data reported in this paper were deposited in the Genome Sequence Archive [41] of the National Genomics Data Center, China National Center for Bioinformation/Beijing Institute of Genomics, Chinese Academy of Sciences (GSA: CRA014031), and are publicly accessible at https://ngdc.cncb.ac.cn/gsa (accessed on 5 March 2024).

4.4. Quantitative Real-Time Polymerase Chain Reaction Analysis

Six genes were randomly selected and measured for quantitative real-time polymerase chain reaction. Total RNA was extracted using the TRIzol reagent (Invitrogen, Carlsbad, CA, USA). Gene-specific primers were designed using primer 5. The primers are listed in Table S1. Actin was used as the reference gene.

5. Conclusions

We assessed metabolomes and transcriptomes and compared DAMs and DEGs in soybeans with different seed coat colors. In this study, soybeans with different seed coat colors exhibited distinct metabolic profile. Compared with yellow seed coat soybean, more flavonoid metabolites were detected in red, brown, and black seed coats, whereas less flavonoid metabolites were detected in green seed coat, especially procyanidins, procyanidin, malvidin and its derivatives, petunidin and its derivatives, and cyanidin and its derivatives. Cyanidin 3-O-(6″-malonyl-arabinoside), malvidin 3-laminaribioside, and petunidin 3-(6″-p-coumaryl-glucoside) 5-glucoside were the main anthocyanin components in the soybean seed coat. In addition, CHS, CHI, DFR, FG3, ANR, FLS, LAR, UGT88F4, and UGT88F5 were identified as candidate genes contributing to the color diversity of soybean seed coats. These results broaden the knowledge of the metabolic and transcriptomic changes in soybean seed coat caused by color changes and provide new insights into the development of bioactive substances from soybean seed coats.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms26010294/s1.

Author Contributions

C.X. conceived the idea of the study; Y.F. and S.H. performed all experiments and drafted the manuscript; X.W., M.Y., X.Z., L.T., J.L. and Y.Z. performed statistical analysis some figures. All authors have read and agreed to the published version of the manuscript.

Funding

Natural Science Foundation of Sichuan (2023NSFSC1176); The Third National Campaign of Crop Germplasm Census and Collection (111821301354052034); Sichuan Academy of Agricultural Sciences project “1-9 unveiling the list” (1+9 KJGG001); The Accurate Identification Project of Crop Germplasm from Sichuan Provincial Finance Department (1+3 ZYGG001); Crop Germplasm Census (51000023T000009852064); Research and application of key technologies for high quality and efficient potato and soybeans (51000024Y000010717763).

Data Availability Statement

All data are fully available without restriction.

Conflicts of Interest

The authors declare that they have no competing interests.

References

  1. Whipps, J.M.; Davies, K.G. Success in Biological Control of Plant Pathogens and Nematodes by Microorganisms; Springer: Amsterdam, The Netherlands, 2000. [Google Scholar]
  2. Desta, K.T.; Hur, O.S.; Lee, S.; Yoon, H.; Shin, M.J.; Yi, J.; Lee, Y.; Ro, N.Y.; Wang, X.; Choi, Y.M. Origin and seed coat color differently affect the concentrations of metabolites and antioxidant activities in soybean (Glycine max (L.) Merrill) seeds. Food Chem. 2022, 381, 132249. [Google Scholar] [CrossRef] [PubMed]
  3. Mcclean, P.E.; Lee, R.K.; Otto, C.; Gepts, P.; Bassett, M.J. Molecular and Phenotypic Mapping of Genes Controlling Seed Coat Pattern and Color in Common Bean (Phaseolus vulgaris L.). J. Hered. 2002, 2, 148. [Google Scholar] [CrossRef] [PubMed]
  4. Kim, Y.J.; Lee, S.J.; Lee, H.M.; Lee, B.W.; Ha, T.J.; Bae, D.W.; Son, B.Y.; Kim, Y.H.; Baek, S.B.; Kim, Y.C.; et al. Comparative proteomics analysis of seed coat from two black colored soybean cultivars during seed development. Plant Omics 2013, 6, 456–463. [Google Scholar]
  5. Toda, K.; Yang, D.; Yamanaka, N.; Watanabe, S.; Harada, K.; Takahashi, R. A single-base deletion in soybean flavonoid 3′-hydroxylase gene is associated with gray pubescence color. Plant Mol. Biol. 2002, 50, 187–196. [Google Scholar] [CrossRef]
  6. Bi, W.; Zhao, G.; Zhou, Y.; Xia, X.; Wang, J.; Wang, G.; Lu, S.; He, W.; Bi, T.; Li, J. Metabonomics analysis of flavonoids in seeds and sprouts of two Chinese soybean cultivars. Sci. Rep. 2022, 12, 5541. [Google Scholar] [CrossRef]
  7. Lee, J.; Hwang, Y.S.; Kim, S.T.; Yoon, W.B.; Han, W.Y.; Kang, I.K.; Choung, M.G. Seed coat color and seed weight contribute differential responses of targeted metabolites in soybean seeds. Food Chem. 2017, 214, 248–258. [Google Scholar] [CrossRef] [PubMed]
  8. Lepiniec, L.; Debeaujon, I.; Routaboul, J.M.; Baudry, A.; Pourcel, L.; Nesi, N.; Caboche, M. Flavonoids and the regulation of seed size in Arabidopsis. Biochem. Soc. Trans. 2014, 42, 364–369. [Google Scholar] [CrossRef]
  9. Hou, S.; Du, W.; Hao, Y.; Han, Y.; Li, H.; Liu, L.; Zhang, K.; Zhou, M.; Sun, Z. Elucidation of the Regulatory Network of Flavonoid Biosynthesis by Profiling the Metabolome and Transcriptome in Tartary Buckwheat. J. Agric. Food Chem. 2021, 69, 7218–7229. [Google Scholar] [CrossRef]
  10. Liu, C.; Yu, Q.; Li, Z.; Jin, X.; Xing, W. Metabolic and transcriptomic analysis related to flavonoid biosynthesis during the color formation of Michelia crassipes tepal. Plant Physiol. Biochem. 2020, 155, 938–951. [Google Scholar] [CrossRef]
  11. Qi, Y.; Li, C.; Duan, C.; Gu, C.; Zhang, Q. Integrated Metabolomic and Transcriptomic Analysis Reveals the Flavonoid Regulatory Network by Eutrema EsMYB90. Int. J. Mol. Sci. 2021, 22, 8751. [Google Scholar] [CrossRef]
  12. Fu, M.; Yang, X.; Zheng, J.; Wang, L.; Yang, X.; Tu, Y.; Ye, J.; Zhang, W.; Liao, Y.; Cheng, S.; et al. Unraveling the Regulatory Mechanism of Color Diversity in Camellia japonica Petals by Integrative Transcriptome and Metabolome Analysis. Front. Plant Sci. 2021, 12, 685136. [Google Scholar] [CrossRef] [PubMed]
  13. Li, Y.; Shan, X.; Gao, R.; Han, T.; Zhang, J.; Wang, Y.; Kimani, S.; Wang, L.; Gao, X. MYB repressors and MBW activation complex collaborate to fine-tune flower coloration in Freesia hybrida. Commun. Biol. 2020, 3, 396. [Google Scholar] [CrossRef]
  14. Abbas, M.; Saeed, F.; Anjum, F.M.; Afzaal, M.; Tufail, T.; Bashir, M.S.; Ishtiaq, A.; Hussain, S.; Suleria, H.A. Natural polyphenols: An overview. Int. J. Food Prop. 2017, 20, 1689–1699. [Google Scholar] [CrossRef]
  15. Gao, R.; Han, T.; Xun, H.; Zeng, X.; Li, P.; Li, Y.; Wang, Y.; Shao, Y.; Cheng, X.; Feng, X.; et al. MYB transcription factors GmMYBA2 and GmMYBR function in a feedback loop to control pigmentation of seed coat in soybean. J. Exp. Bot. 2021, 72, 4401–4418. [Google Scholar] [CrossRef]
  16. Zhang, A.; Zheng, J.; Chen, X.; Shi, X.; Wang, H.; Fu, Q. Comprehensive Analysis of Transcriptome and Metabolome Reveals the Flavonoid Metabolic Pathway Is Associated with Fruit Peel Coloration of Melon. Molecules 2021, 26, 2830. [Google Scholar] [CrossRef] [PubMed]
  17. Lim, Y.J.; Kwon, S.-J.; Qu, S.; Kim, D.-G.; Eom, S. Antioxidant Contributors in Seed, Seed Coat, and Cotyledon of γ-ray-Induced Soybean Mutant Lines with Different Seed Coat Colors. Antioxidants 2021, 10, 353. [Google Scholar] [CrossRef]
  18. Xiao, L.; Cao, S.; Shang, X.; Xie, X.; Zeng, W.; Lu, L.; Kong, Q.; Yan, H. Metabolomic and transcriptomic profiling reveals distinct nutritional properties of cassavas with different flesh colors. Food Chem. Mol. Sci. 2021, 2, 100016. [Google Scholar] [CrossRef]
  19. Tian, T.; Qiao, G.; Wen, Z.; Deng, B.; Qiu, Z.; Hong, Y.; Wen, X. Comparative transcriptome analysis reveals the molecular regulation underlying the adaptive mechanism of cherry (Cerasus pseudocerasus Lindl.) to shelter covering. BMC Plant Biol. 2020, 20, 27. [Google Scholar] [CrossRef] [PubMed]
  20. Qin, J.; Zhang, J.; Liu, D.; Yin, C.; Wang, F.; Chen, P.; Chen, H.; Ma, J.; Zhang, B.; Xu, J.; et al. iTRAQ-based analysis of developmental dynamics in the soybean leaf proteome reveals pathways associated with leaf photosynthetic rate. Mol. Genet. Genom. 2016, 291, 1595–1605. [Google Scholar] [CrossRef] [PubMed]
  21. Lin, H.; Rao, J.; Shi, J.; Hu, C.; Cheng, F.; Wilson, Z.A.; Zhang, D.; Quan, S. Seed metabolomic study reveals significant metabolite variations and correlations among different soybean cultivars. J. Integr. Plant Biol. 2014, 56, 826–836. [Google Scholar] [CrossRef]
  22. Yang, Y.; Zhao, T.; Wang, F.; Liu, L.; Liu, B.; Zhang, K.; Qin, J.; Yang, C.; Qiao, Y. Identification of candidate genes for soybean seed coat-related traits using QTL mapping and GWAS. Front. Plant Sci. 2023, 14, 1190503. [Google Scholar] [CrossRef] [PubMed]
  23. Kofsky, J.; Zhang, H.; Song, B.H. The Untapped Genetic Reservoir: The Past, Current, and Future Applications of the Wild Soybean (Glycine soja). Front. Plant Sci. 2018, 9, 949. [Google Scholar] [CrossRef] [PubMed]
  24. Ohnishi, S.; Funatsuki, H.; Kasai, A.; Kurauchi, T.; Yamaguchi, N.; Takeuchi, T.; Yamazaki, H.; Kurosaki, H.; Shirai, S.; Miyoshi, T.; et al. Variation of GmIRCHS (Glycine max inverted-repeat CHS pseudogene) is related to tolerance of low temperature-induced seed coat discoloration in yellow soybean. TAG Theor. Appl. Genet. 2011, 122, 633–642. [Google Scholar] [CrossRef]
  25. Wang, R.; Yang, X.; Wang, T.; Li, B.; Li, P.; Zhang, Q. Integration of Metabolomic and Transcriptomic Analyses Reveals the Molecular Mechanisms of Flower Color Formation in Prunus mume. Plants 2024, 13, 1077. [Google Scholar] [CrossRef]
  26. Zhao, C.; Wang, Z.; Liao, Z.; Liu, X.; Li, Y.; Zhou, C.; Sun, C.; Wang, Y.; Cao, J.; Sun, C. Integrated Metabolomic-Transcriptomic Analyses of Flavonoid Accumulation in Citrus Fruit under Exogenous Melatonin Treatment. Int. J. Mol. Sci. 2024, 25, 6632. [Google Scholar] [CrossRef] [PubMed]
  27. Zheng, S.; Hao, Y.; Fan, S.; Cai, J.; Chen, W.; Li, X.; Zhu, X. Metabolomic and Transcriptomic Profiling Provide Novel Insights into Fruit Ripening and Ripening Disorder Caused by 1-MCP Treatments in Papaya. Int. J. Mol. Sci. 2021, 22, 916. [Google Scholar] [CrossRef] [PubMed]
  28. Hu, Y.; Deng, L.; Chen, J.; Zhou, S.; Liu, S.; Fu, Y.; Yang, C.; Liao, Z.; Chen, M. An analytical pipeline to compare and characterise the anthocyanin antioxidant activities of purple sweet potato cultivars. Food Chem. 2016, 194, 46–54. [Google Scholar] [CrossRef] [PubMed]
  29. Khoo, H.E.; Azlan, A.; Tang, S.T.; Lim, S.M. Anthocyanidins and anthocyanins: Colored pigments as food, pharmaceutical ingredients, and the potential health benefits. Food Nutr. Res. 2017, 61, 1361779. [Google Scholar] [CrossRef] [PubMed]
  30. Azam, M.; Zhang, S.; Qi, J.; Abdelghany, A.M.; Shaibu, A.S.; Feng, Y.; Ghosh, S.; Agyenim-Boateng, K.G.; Liu, Y.; Yao, L.; et al. Effect of Origin, Seed Coat Color, and Maturity Group on Seed Isoflavones in Diverse Soybean Germplasm. Plants 2024, 13, 1774. [Google Scholar] [CrossRef] [PubMed]
  31. Song, L.; Luo, H.; Jiang, L.; Hou, J.; Zhang, T.; Dai, L.; Yu, Z. Integrative analysis of transcriptome and metabolome reveals the possible mechanism of leaf yellowing in pak choi (Brassica rapa subsp. chinensis) with 1-methylcyclopropene treatment during storage at 20 °C. Postharvest Biol. Technol. 2020, 169, 111300. [Google Scholar] [CrossRef]
  32. Wang, J.; Zhang, S.; Fu, Y.; He, T.; Wang, X. Analysis of Dynamic Global Transcriptional Atlas Reveals Common Regulatory Networks of Hormones and Photosynthesis Across Nicotiana Varieties in Response to Long-Term Drought. Front. Plant Sci. 2020, 11, 672. [Google Scholar] [CrossRef] [PubMed]
  33. Jung, J.W.; Oh, S.D.; Park, S.Y.; Jang, Y.; Lee, S.K.; Yun, D.W.; Chang, A.; Park, S.U.; Ha, S.H.; Kim, J.K. Metabolic profiling and antioxidant properties of hybrid soybeans with different seed coat colors, obtained by crossing β-carotene-enhanced (Glycine max) and wild (Glycine soja) soybeans. Plant Biotechnol. Rep. 2022, 16, 449–463. [Google Scholar] [CrossRef]
  34. Zhang, Y.; Cheng, Y.; Xu, S.; Ma, H.; Han, J.; Zhang, Y. Tree peony variegated flowers show a small insertion in the F3′H gene of the acyanic flower parts. BMC Plant Biol. 2020, 20, 211. [Google Scholar] [CrossRef] [PubMed]
  35. Zhou, X.; Li, J.; Zhu, Y.; Ni, S.; Chen, J.; Feng, X.; Zhang, Y.; Li, S.; Zhu, H.; Wen, Y. De novo Assembly of the Camellia nitidissima Transcriptome Reveals Key Genes of Flower Pigment Biosynthesis. Front. Plant Sci. 2017, 8, 1545. [Google Scholar] [CrossRef]
  36. Kovinich, N.; Saleem, A.; Rintoul, T.L.; Brown, D.C.; Arnason, J.T.; Miki, B. Coloring genetically modified soybean grains with anthocyanins by suppression of the proanthocyanidin genes ANR1 and ANR2. Transgenic Res. 2012, 21, 757–771. [Google Scholar] [CrossRef]
  37. Rothenberg, D.O.; Yang, H.; Chen, M.; Zhang, W.; Zhang, L. Metabolome and Transcriptome Sequencing Analysis Reveals Anthocyanin Metabolism in Pink Flowers of Anthocyanin-Rich Tea (Camellia sinensis). Molecules 2019, 24, 1064. [Google Scholar] [CrossRef] [PubMed]
  38. Yang, F.; Liao, D.; Wu, X.; Gao, R.; Fan, Y.; Raza, M.A.; Wang, X.; Yong, T.; Liu, W.; Liu, J.; et al. Effect of aboveground and belowground interactions on the intercrop yields in maize-soybean relay intercropping systems. Field Crops Res. 2017, 203, 16–23. [Google Scholar] [CrossRef]
  39. Zhang, Y.; Yang, L.; Yang, J.; Hu, H.; Wei, G.; Cui, J.; Xu, J. Transcriptome and Metabolome Analyses Reveal Differences in Terpenoid and Flavonoid Biosynthesis in Cryptomeria fortunei Needles Across Different Seasons. Front. Plant Sci. 2022, 13, 862746. [Google Scholar] [CrossRef]
  40. Wang, T.; Xing, L.; Song, H.; Wei, Y.; Li, P.; Lu, Q.; Hu, N.; Liu, Y.; Zhao, Y.; Liu, J.; et al. Large-scale metabolome analysis reveals dynamic changes of metabolites during foxtail millet grain filling. Food Res. Int. 2023, 165, 112516. [Google Scholar] [CrossRef] [PubMed]
  41. Wang, Y.; Song, F.; Zhu, J.; Zhang, S.; Yang, Y.; Chen, T.; Tang, B.; Dong, L.; Ding, N.; Zhang, Q.; et al. GSA: Genome Sequence Archive. Genom. Proteom. Bioinform. 2017, 15, 14–18. [Google Scholar] [CrossRef]
Figure 1. Phenotypes of soybean germplasms. S141: yellow seed coat; S26: red seed coat; S62: brown seed coat; S100: green seed coat; S124: black seed coat.
Figure 1. Phenotypes of soybean germplasms. S141: yellow seed coat; S26: red seed coat; S62: brown seed coat; S100: green seed coat; S124: black seed coat.
Ijms 26 00294 g001
Figure 2. Differentially accumulated metabolites of the metabolome in soybeans with different seed coat colors. (a). Principal component analysis of the metabolites detected in the soybean seed coat using three biological replicates. (b). Numbers of differentially expressed metabolites among S26, S62, S100, and S124. (cf). Volcano plots of differentially expressed metabolites among S26_vs._S141, S62_vs._S141, S100_vs._S141, and S124_vs._S141. Red dots indicate the upregulation of metabolites, blue dots indicate the downregulation of metabolites, and grey dots indicate that there is no significant difference in the metabolites.
Figure 2. Differentially accumulated metabolites of the metabolome in soybeans with different seed coat colors. (a). Principal component analysis of the metabolites detected in the soybean seed coat using three biological replicates. (b). Numbers of differentially expressed metabolites among S26, S62, S100, and S124. (cf). Volcano plots of differentially expressed metabolites among S26_vs._S141, S62_vs._S141, S100_vs._S141, and S124_vs._S141. Red dots indicate the upregulation of metabolites, blue dots indicate the downregulation of metabolites, and grey dots indicate that there is no significant difference in the metabolites.
Ijms 26 00294 g002
Figure 3. The classification of all the identified metabolites in the soybean seed coat. (a) classification of all the identified metabolites; (b) number of the DAMs.
Figure 3. The classification of all the identified metabolites in the soybean seed coat. (a) classification of all the identified metabolites; (b) number of the DAMs.
Ijms 26 00294 g003
Figure 4. Analysis of transcriptomics data of soybeans with different seed coat colors. (a) Principal component analysis of the genes detected in the soybean seed coat with three biological replicates. (b) Number of differentially expressed genes among S26, S62, S100, and S124. (cf). Volcano plots of differentially expressed genes among S26_vs._S141, S62_vs._S141, S100_vs._S141, and S124_vs._S141. Red dots indicate the upregulation of gene expression, green dots indicate the downregulation of gene expression, and grey dots indicate an absence of a significant difference in gene expression.
Figure 4. Analysis of transcriptomics data of soybeans with different seed coat colors. (a) Principal component analysis of the genes detected in the soybean seed coat with three biological replicates. (b) Number of differentially expressed genes among S26, S62, S100, and S124. (cf). Volcano plots of differentially expressed genes among S26_vs._S141, S62_vs._S141, S100_vs._S141, and S124_vs._S141. Red dots indicate the upregulation of gene expression, green dots indicate the downregulation of gene expression, and grey dots indicate an absence of a significant difference in gene expression.
Ijms 26 00294 g004
Figure 5. KEGG enrichment analysis of differentially accumulated metabolites and differentially expressed genes in soybeans with different seed coat colors. (ad). KEGG enrichment analysis of differentially accumulated metabolites. The x- and y-axes represent the enrichment factor and pathway term, respectively. The colors and sizes of the dots represent the significance and number of metabolites, respectively. (e). KEGG enrichment analysis of differentially expressed genes.
Figure 5. KEGG enrichment analysis of differentially accumulated metabolites and differentially expressed genes in soybeans with different seed coat colors. (ad). KEGG enrichment analysis of differentially accumulated metabolites. The x- and y-axes represent the enrichment factor and pathway term, respectively. The colors and sizes of the dots represent the significance and number of metabolites, respectively. (e). KEGG enrichment analysis of differentially expressed genes.
Ijms 26 00294 g005
Figure 6. Flavonoid pathway in soybean with different seed coat colors. CHS: chalcone synthase; CHS6: chalcone synthase 6; CHS1: chalcone synthase 1; UGT88F4: UDP-glycosyltransferase 88F4; FTH: Naringenin,2-oxoglutarate 3-dioxygenase (fragment); DFR: bifunctional dihydroflavonol 4-reductase/flavanone 4-reductase; ANR: anthocyanidin reductase ((2S)-flavan-3-ol-forming), LAR: leucoanthocyanidin reductase. The colored rectangle represents the up- or downregulation of S141, S26, S62, S100, and S124, respectively, and the red rectangle represents a high FPKM value compared with the blue rectangle.
Figure 6. Flavonoid pathway in soybean with different seed coat colors. CHS: chalcone synthase; CHS6: chalcone synthase 6; CHS1: chalcone synthase 1; UGT88F4: UDP-glycosyltransferase 88F4; FTH: Naringenin,2-oxoglutarate 3-dioxygenase (fragment); DFR: bifunctional dihydroflavonol 4-reductase/flavanone 4-reductase; ANR: anthocyanidin reductase ((2S)-flavan-3-ol-forming), LAR: leucoanthocyanidin reductase. The colored rectangle represents the up- or downregulation of S141, S26, S62, S100, and S124, respectively, and the red rectangle represents a high FPKM value compared with the blue rectangle.
Ijms 26 00294 g006
Figure 7. qRT-PCR detection of 6 key differentially expressed genes. The x-axis represents the relative expression of samples in qRT-PCR, and the y-axis represents the FPKM value in transcriptomics. The error bars indicate the SDs of three biological replicates. SoyZH13 14G066200 (DFR); SoyZH13 02G147700 (DFR); SoyZH13 01G196800 (ANT17); SoyZH13 08G058800 (ANR); SoyZH13 02G046100 (FHT); SoyZH13 20G171400 (LAR).
Figure 7. qRT-PCR detection of 6 key differentially expressed genes. The x-axis represents the relative expression of samples in qRT-PCR, and the y-axis represents the FPKM value in transcriptomics. The error bars indicate the SDs of three biological replicates. SoyZH13 14G066200 (DFR); SoyZH13 02G147700 (DFR); SoyZH13 01G196800 (ANT17); SoyZH13 08G058800 (ANR); SoyZH13 02G046100 (FHT); SoyZH13 20G171400 (LAR).
Ijms 26 00294 g007
Table 1. General information on soybean germplasms.
Table 1. General information on soybean germplasms.
CodeNameSeed Coat Color100-Seed Weight (g)
S141HuangdouYellow16.4
S26HongpixiangdouRed36.8
S62BendihuangdouBrown19.7
S100BendilvhuangdouGreen20.5
S124HeipidadouBlack20.7
Table 2. DEGs related to the flavonoid biosynthetic pathway.
Table 2. DEGs related to the flavonoid biosynthetic pathway.
S26_vs._S141S62_vs._S141S100_vs._S141S124_vs._S141
Fold
Change
p-ValueRegu-lationFold
Change
p-ValueRegu-lationFold
Change
p-ValueRegu-lationFold
Change
p-ValueRegu
lation
SoyZH13_02G120600(CHS6)8.650.010up1.540.660up17.300.000up11.240.000up
SoyZH13_11G010700(CHS)2.830.000up1.360.580up0.950.860down0.540.010down
SoyZH13_08G103500(CHS1)10.750.000up0.230.060down18.760.000up10.300.000up
SoyZH13_06G290900(UGT88F4)1.510.014up0.410.000down1.750.000up0.500.001down
SoyZH13_02G046100(FHT)5.350.000up0.120.000down7.710.000up1.510.260up
SoyZH13_14G066200(DFR)10.470.000up0.210.200down33.060.000up0.720.730down
SoyZH13_02G147700(DFR)7.510.000up0.570.220down5.870.000up1.710.100up
SoyZH13_01G196800(ANT17)3.170.000up0.080.010down16.610.000up12.170.000up
SoyZH13_08G058800(ANR)0.040.010down0.050.030down0.630.550down0.080.030down
SoyZH13_20G171400(LAR)18.820.000up0.680.580down9.390.000up0.190.030down
SoyZH13_05G078700(FLS)7.760.000up0.260.020down4.390.000up0.050.000down
SoyZH13_12G118300(FG3)78.610.000up 26.240.000up6.810.000up
SoyZH13_20G225300(CHI2-A)7.420.000up 7.420.000up
SoyZH13_05G020900(CYP75B2) 9.2750.002up
SoyZH13_12G061300(CYP93B1) 21.2650.001up
CHS6: chalcone synthase 6; CHS: chalcone synthase; CHS1: chalcone synthase 1; UGT88F4: UDP-glycosyltransferase 88F4; FTH: Naringenin,2-oxoglutarate 3-dioxygenase (fragment); DFR: bifunctional dihydroflavonol 4-reductase/flavanone 4-reductase; ANT17: leucoanthocyanidin dioxygenase; ANR: anthocyanidin reductase ((2S)-flavan-3-ol-forming); LAR: leucoanthocyanidin reductase; FLS: flavonol synthase/flavanone 3-hydroxylase; FG3: UDP-glycosyltransferase 79B30; CHI2-A: chalcone–flavonone isomerase 2-A; CYP75B2: Flavonoid 3′-monooxygenase; CYP93B1: licodione synthase.
Table 3. The elution gradient.
Table 3. The elution gradient.
Time (min)A%B%
0.01955
2955
47030
85050
102080
140100
150100
15.1955
16955
A: 0.1% formic acid in water, v/v; B: 0.1% formic acid in acetonitrile, v/v.
Table 4. Mass spectrum parameters.
Table 4. Mass spectrum parameters.
ParametersPositive Ion ModeNegative Ion Mode
Mass scan range (m/z)100–1200100–1200
Resolution (full scan)70,00070,000
Resolution (HCD MS/MS scans)17,50017,500
Spray voltage (V)3800−3000
Sheath gas flow rate (Arb)4035
Aux gas flow rate (Arb)108
Capillary temperature (°C)320320
Auxiliary gas heater temperature (°C)350350
S-lens radio frequency (RF) level5050
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fan, Y.; Hussain, S.; Wang, X.; Yang, M.; Zhong, X.; Tao, L.; Li, J.; Zhou, Y.; Xiang, C. Metabolomic and Transcriptomic Analyses of Flavonoid Biosynthesis in Different Colors of Soybean Seed Coats. Int. J. Mol. Sci. 2025, 26, 294. https://doi.org/10.3390/ijms26010294

AMA Style

Fan Y, Hussain S, Wang X, Yang M, Zhong X, Tao L, Li J, Zhou Y, Xiang C. Metabolomic and Transcriptomic Analyses of Flavonoid Biosynthesis in Different Colors of Soybean Seed Coats. International Journal of Molecular Sciences. 2025; 26(1):294. https://doi.org/10.3390/ijms26010294

Chicago/Turabian Style

Fan, Yuanfang, Sajad Hussain, Xianshu Wang, Mei Yang, Xiaojuan Zhong, Lei Tao, Jing Li, Yonghang Zhou, and Chao Xiang. 2025. "Metabolomic and Transcriptomic Analyses of Flavonoid Biosynthesis in Different Colors of Soybean Seed Coats" International Journal of Molecular Sciences 26, no. 1: 294. https://doi.org/10.3390/ijms26010294

APA Style

Fan, Y., Hussain, S., Wang, X., Yang, M., Zhong, X., Tao, L., Li, J., Zhou, Y., & Xiang, C. (2025). Metabolomic and Transcriptomic Analyses of Flavonoid Biosynthesis in Different Colors of Soybean Seed Coats. International Journal of Molecular Sciences, 26(1), 294. https://doi.org/10.3390/ijms26010294

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop