Next Article in Journal
Metagenomic Sequencing Revealed the Effects of Different Potassium Sulfate Application Rates on Soil Microbial Community, Functional Genes, and Yield in Korla Fragrant Pear Orchard
Previous Article in Journal
Limited Short-Term Impact of Annual Cover Crops on Soil Carbon and Soil Enzyme Activity in Subtropical Tree Crop Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Combining Transcriptome Analysis and Comparative Genomics to Identify Key Components of the Lignin Biosynthesis Gene Network in Sorghum bicolor

1
Institute of Sorghum, Shanxi Agricultural University, Jinzhong 030600, China
2
Shanxi Hou Ji Laboratory, Taiyuan 030031, China
3
Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
4
College of Agriculture, Shanxi Agricultural University, Jinzhong 030801, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Agronomy 2025, 15(7), 1751; https://doi.org/10.3390/agronomy15071751
Submission received: 17 June 2025 / Revised: 8 July 2025 / Accepted: 17 July 2025 / Published: 21 July 2025
(This article belongs to the Section Plant-Crop Biology and Biochemistry)

Abstract

Sorghum is a versatile crop that serves as a major source of food, feed, fodder and biofuel globally. Lignin content in sorghum affects multiple important traits, including lodging resistance, forage digestibility and the efficiency of bioenergy production. However, the genetic regulation of lignin content in sorghum remains poorly understood. In this study, we combined transcriptomic and comparative genomic approaches to uncover the genetic network underlying lignin biosynthesis in sorghum. Through comparative genomic analysis, we identified 104 candidate genes involved in lignin biosynthesis. Transcriptome analysis of four sorghum accessions with contrasting lignin contents identified 6132 differentially expressed genes with an enrichment of genes related to phenylpropanoid biosynthesis and cell wall biogenesis. The 104 lignin biosynthesis candidates were significantly enriched (p-value < 0.01) in these differentially expressed genes, with most differentially expressed candidate genes related to monolignol biosynthesis and polymerization being up-regulated in high-lignin accessions. These up-regulated genes are related to all key enzymes involved in lignin biosynthesis, suggesting that the elevated lignin content in these accessions results from a collective increase in enzyme activity. Sequence analysis revealed a significant reduction in genetic diversity across lignin biosynthesis genes in cultivated sorghum compared to wild sorghum. Moreover, selection signals during domestication were identified in 30 lignin biosynthesis genes, 22 of which were differentially expressed, further supporting the functional relevance of these differentially expressed genes in lignin biosynthesis. Overall, our findings uncover the lignin biosynthesis gene network in sorghum and offer potential targets for future functional studies and trait manipulation.

1. Introduction

Sorghum is a multi-purpose crop providing a major source of food, feed, fodder and bio-fuel globally [1]. Known for its adaptation to hot and drought environments, sorghum thrives in semiarid and arid areas, providing a staple food for over half a billion people in sub-Saharan Africa and South Asia [2]. The ability of sorghum to produce a reasonable yield in marginal land with limited water and fertilizer makes it an ideal choice for smallholder farmers in resource-constrained areas [3]. Consequently, the importance of sorghum in global agricultural systems is expected to increase as climate change and population growth place greater demands on food and water resources.
Lignin is a complex phenylpropanoid polymer that plays a critical role in the growth and development of sorghum. As a major component of the plant cell wall, lignin enhances the strength and rigidity of stems and leaves, providing mechanical support, facilitating water transport and helping the plant manage both abiotic and biotic stresses [4,5]. High lignin content is particularly beneficial for grain sorghum, where it improves resistance to lodging. However, excessive lignin content negatively affects the digestibility of forage sorghum and the refinery efficiency of bioenergy sorghum, as the branched polyphenolic structure is difficult for both animals and microbes to break down [6]. Therefore, achieving an optimal lignin content tailored to different types of sorghum is essential for their end-use, which in turn requires a clear understanding of the gene network controlling this trait. Generally, low-lignin accessions are preferred maintainer lines in forage sorghum breeding.
Lignin content varies widely among different sorghum genotypes, ranging from approximately 2% to 11% of dry matter (Silva et al., 2022) [1]. This variation is largely driven by the genetic makeup of each genotype [7]. A previous study using a diversity panel of 206 sorghum accessions identified nine quantitative trait loci (QTLs) associated with lignin content [8]. The brown midrib (bmr) mutants, which exhibit brown vascular tissue in the leaves and stems, have shown reduced lignin content [9]. Molecular experiments using these mutants have identified key genes involved in lignin biosynthesis [10,11,12,13]. For instance, the bmr12 mutant is caused by a premature stop codon in the caffeic acid O-methyltransferase (COMT) gene, leading to reduced lignin content and altered lignin composition in the cell walls [11]. Despite these advances, a comprehensive understanding of genes involved in lignin biosynthesis and their genetic variation in sorghum remains elusive. Combining transcriptomic analyses, comparative genomics and population genetics could shed light on this topic.
Lignin biosynthesis has been well-characterized in model species [14]. The key steps in lignin production involve the biosynthesis of monolignols and their subsequent polymerization. While enzymes involved in monolignol biosynthesis are well understood, peroxidases and laccases are the primary enzymes known to catalyze the polymerization of monolignols into lignin [15]. The transcriptional regulation of lignin biosynthesis is largely mediated by MYB transcription factors that bind to the promoters of lignin biosynthetic genes [15,16]. Given that the lignin biosynthesis pathway is highly conserved across vascular plants [17], knowledge of lignin biosynthesis from model species can be used to understand the genetic control of lignin content in sorghum. Moreover, the availability of the sorghum reference genome enables the utilization of comparative genomic approaches to identify the candidate genes involved in lignin biosynthesis [18,19,20]. The release of the sorghum pan-genome and resequencing data from diverse accessions further facilitates the analysis of genetic variation in these candidate genes [2,21,22].
This study aims to (1) combine transcriptomic analysis and comparative genomics to investigate the gene network controlling lignin content in sorghum and (2) identify sequence variations and selection signals in genes related to lignin content during sorghum domestication. The results will enhance our understanding of the genetic regulation of lignin content in sorghum and accelerate genetic improvement of this important trait.

2. Materials and Methods

2.1. Plant Materials

Four sorghum accessions, including two accessions (SXSDC, SXHLJC) with high lignin content (average lignin content of 8.26%) and two accessions (SXZM12, SXZM6) with low lignin content (average lignin content of 4.15%), were used in this study. The two sorghum accessions with low lignin content originate from America and serve as key maintainer lines for breeding forage sorghum, while the two accessions with high lignin content originate from China and Russia, respectively. These sorghum accessions were grown in a field experiment at Yuci, Shanxi, China, during the 2020 growing season. The sorghum was planted in two row plots with a completely randomized design. Each row was 1.8 m long and 0.65 m wide. Standard field management was applied. At the jointing stage, stem tissue at the eighth node above ground was harvested from middle plants in the plots with three biological replicates per genotype, resulting in a total of 12 samples. Lignin content of these sorghum accessions was determined using an FOSS 5000 near-infrared (NIR) analyzer (FOSS, Hillerød, Denmark) following the manufacturer’s instructions. A total of 100 g of stem samples per accession was collected, dried to less than 10% moisture, milled to fine powder and sieved with a 1 mm sieve. The sieved powder was collected for spectral scanning with an FOSS 5000 analyzer using the wavelength range of 1100 to 2500 with 2 nm data spacing. Each sample was scanned two times, and the average spectrum was calculated as the sample spectrum for lignin content estimation, as described previously [8].

2.2. RNA Extraction and Sequencing

The collected stem tissue samples were ground together with liquid nitrogen into powder. Total RNA was then extracted using the Plant RNA Kit (TIANGEN Biotech (Beijing) Co., Ltd. (Beijing, China)) following the manufacturer’s instructions. Agarose gel (1%) electrophoresis was used to assess contamination and degradation of extracted RNA. The concentration and purity of RNA were examined with a Nano Photometer® spectrophotometer (IMPLEN, Westlake Village, CA, USA). The qualified RNA samples were used to construct paired-end sequencing libraries, which were sequenced with the Illumina HiSeq2000 sequencing platform (San Diego, CA, USA). The construction of sequencing libraries and sequencing were carried out by Shanghai Personal Bio Technology Co., Ltd. (Shanghai, China).

2.3. Analysis of RNA-Seq Data

Illumina sequencing of the pooled RNA-seq libraries yielded 12 FASTQ files with a total of 48.67 million raw reads. FastQC was employed to assess the quality of raw reads (http://www.bioinformatics.babraham.ac.uk/projects/fastqc, accessed on 20 June 2020), and Trimmomatic [23] was used to filter out adapter sequences and low-quality reads with an average quality score < Q20. The resultant clean reads were aligned to the reference genome (BTx623_V3.1) using HISAT2 software (version 2.2.1) [24]. SAMtools was used to convert the sam files to bam files [25]. The R package Rsubread [26] was used to assemble the transcript and calculate normalized gene expression values based on the trimmed mean of M-values (TMM) method [27].
The TMM-normalized gene expression data was used to conduct hierarchical clustering, principal component analysis (PCA) and correlation analysis of samples in R with packages FactoMineR, pheatmap and corrplot, respectively. Differentially expressed genes (DEGs) between low and high-lignin samples were identified using the R package DESeq2. Expressed genes meeting the criteria: p-value < 0.05 and |log2FoldChange| > 1 between low-lignin samples and high-lignin samples were identified as DEGs. Visualization of DEGs was conducted using the package ggplot2 in R (version 4.3.3).

2.4. Functional Enrichment Analysis

GO enrichment analysis was performed using the functional annotation files downloaded from AnnotationHub (org.Sorghum_bicolor.eg.sqlite) as background files. KEGG pathway enrichment analysis was conducted with KofamKOALA [28]. The significance of each pathway was calculated using the hypergeometric distribution test. Pathways with p-value < 0.05 were defined as significantly enriched in GO and KEGG. Identification and classification of transcription factors and protein kinases were performed using iTAK [29].

2.5. Analysis of Gene Co-Expression Network

The determination of the soft thresholding power was based on the scale-free topology model fit (R2) ≥ 0.8 using the pickSoftThreshold function. Subsequently, the automatic network construction function (blockwiseModules) was used to complete the network construction and module detection to obtain highly correlated modules. TOMType was signed, the minModuleSize was 20, the soft threshold (power) was 8 and the mergeCutHeight was 0.25. Co-expression and transcriptional regulatory networks were plotted using Cytoscape (version 3.7.1) [30].
The WGCNA R package was used to identify co-expression modules related to lignin content based on our expression data. A total of 34,211 genes were used as input for the signed WGCNA network construction. For standard WGCNA networks, the soft power was set to 8, the minimum module size was 20 and the merge cut height value was 0.25. Correlations between modules and lignin traits were analyzed with all genes in each module. Significant trait-related modules were identified based on high correlation values. Candidate target genes were visualized using Cytoscape and the MCODE plugin (version 2.0.0).

2.6. Identification of Sorghum Othologs of Lignin Biosynthesis Genes

Known genes related to lignin biosynthesis were collected from previous studies (Table S1). The sorghum orthologs of these genes were extracted from the database in Ensembl (https://plants.ensembl.org/Sorghum_bicolor, accessed on 16 June 2025). Sequence similarities between these lignin biosynthesis genes and their sorghum counterparts were further tested using BLASTP to exclude any pair with sequence similarity < 20% or E-values > 1 × 10−5.

2.7. Sequence Variation and Selection Signature of Genes Related to Lignin Content in Sorghum

The published sorghum resequencing data of 44 sorghum accessions [2] was used to investigate sequence variation and domestication signals of these identified lignin biosynthesis candidate genes. Sequence variation was identified by mapping the resequencing data to the reference genome (BTx623_V3.1) using BWA [25]. Sequence variations were called using GATK [31] and filtered on minor allele frequency > 0.05 and missing rate < 0.2. The 44 accessions included seven wild sorghum accessions, 32 cultivated (improved lines and landrace) accessions and 5 unclassified accessions. The investigation of selection signatures during sorghum domestication was conducted between wild and cultivated accessions. Population statistics, including θπ [32], Tajima’s D [33] and FST [34] were calculated using a BioPerl module and an in-house perl script. Signatures of selection were identified based on the following criteria: (1) reduction of sequence diversity greater than the average RoD of 129 neutral loci identified in previous study; (2) FST great than the average of 129 neutral loci; (3) negative Tajima’s D. The criteria signals were combined from the change of diversity, population difference and allele frequency to avoid false positives [35].

3. Results

3.1. Identification of Candidate Genes Related to Lignin Biosynthesis

Through a comprehensive literature review [36,37,38,39], we summarized 19 genes involved in monolignol biosynthesis, 12 genes related to monolignol polymerization and 42 genes regulating the transcription of lignin biosynthesis from model plants (Table S1, Figures S1 and S2). Using these genes as references, we identified 104 sorghum orthologs, which constituted our list of candidate genes related to lignin biosynthesis (Table 1). The 104 candidate genes included 43 involved in monolignol biosynthesis, 21 in monolignol polymerization and 40 in the transcriptional regulation of lignin biosynthesis. These genes were distributed unevenly across the 10 sorghum chromosomes, with the number of genes on each chromosome ranging from a single gene on Chromosome 5 to 18 genes on Chromosome 2.

3.2. Transcriptomic Data of Sorghum Accessions with Contrasting Lignin Content

To investigate the gene expression network regulating lignin content, four sorghum accessions with contrasting lignin levels were selected for transcriptome analysis. Lignin content measurements confirmed the distinction between low and high-lignin content accessions (Table S2). A total of 12 stem tissue samples were collected at the jointing stage, with 3 biological replicates per genotype for RNA sequencing (RNA-seq).
In total, 693.83 million raw paired-end reads were generated, with 98.24% of sequence bases exhibiting a quality score greater than 20. Low-quality reads and adaptor sequences were removed, resulting in 6.31–7.86 Gb of clean data per sample (Table S3). On average, 94.05% of the raw sequences were retained as clean reads, indicating high-quality RNA-seq data. These clean reads were then mapped to the sorghum reference genome (BTx623_V3.1) with an average mapping rate of 95.67%, and most of the reads mapped to 28,463 gene regions.
Gene expression data was calculated based on RNA-seq mapping results. The gene expression data revealed strong correlations between samples within the same lignin content category compared to between categories. Specifically, the average correlation coefficient for high-lignin accessions was 0.87, and for low-lignin accessions was 0.73, whereas the average correlation coefficient between high and low-lignin accessions was 0.53 (Figure 1A). This suggests that lignin content is a major factor influencing gene expression patterns in this dataset. Principal component analysis of the gene expression data (Figure 1B) showed a clear separation between high and low-lignin accessions, particularly along the first principal component (PC1), which explained 48% of the variance in the gene expression data (Figure 1B). Notably, SbCOMT (Sobic.007G047300), encoding the caffeic acid O-methyltransferase, a critical enzyme in monolignol biosynthesis, was the most influential gene on PC1 (Figure 1C). These results confirm the reliability of our RNA-seq data and its utility in elucidating the gene network controlling lignin content in sorghum.

3.3. Differentially Expressed Genes Between High and Low-Lignin Sorghum Accessions

We identified 28,463 genes expressed in the RNA-seq data, with gene counts ranging from 21,660 to 24,348 per sample. A total of 6132 differentially expressed genes (DEGs) were detected between high and low-lignin accessions, with 2209 up-regulated and 3923 down-regulated genes in the high-lignin accessions (Figure 2A,B).
The 104 candidate genes related to lignin biosynthesis were significantly enriched in the 6132 DEGs, with 57 of these genes differentially expressed between high and low-lignin accessions (p-value < 0.01, chi-square test, Table S4). Notably, 43 of these 57 genes were up-regulated in the high-lignin accessions. Genes related to monolignol biosynthesis had the highest percentage of DEGs (65%), while about half of the genes related to monolignol polymerization and transcriptional regulation were also differentially expressed. Overall, genes involved in monolignol biosynthesis and polymerization were predominantly up-regulated (Figure 2C). In contrast, 40% of the transcriptional regulation genes were down-regulated in the high-lignin accessions.
Of the 37 DEGs related to monolignol biosynthesis and polymerization, 31 were up-regulated, and 6 were down-regulated. The up-regulated genes covered all key enzymes involved in monolignol biosynthesis and two enzymes involved in monolignol polymerization, suggesting that the elevated lignin content in these accessions results from a collective increase in the activity of these enzymes. For instance, the expression of SbCOMT was approximately eight times higher in high-lignin accessions than in low-lignin accessions, and SbCAD2 (Sobic.004G071000), which encodes the final enzyme (a cinnamyl alcohol dehydrogenase) in monolignol biosynthesis, was nearly five times more highly expressed in high-lignin accessions compared to low-lignin accessions. The six down-regulated genes related to monolignol biosynthesis and polymerization had homologous up-regulation in high-lignin accessions. In contrast, a total of 20 candidate genes involved in transcriptional regulation were differentially expressed, including 12 up-regulated genes and eight down-regulated genes.

3.4. Functional Enrichment of Differentially Expressed Genes

To better understand the functions of the DEGs, we performed Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses of these DEGs. The 6132 DEGs were significantly enriched in all three GO categories: biological processes, molecular functions and cellular components (Table S5). Among the biological process GO terms, polysaccharide biosynthetic processes, cell wall organization or biogenesis and phenylpropanoid metabolic processes were the most highly enriched (Figure 2D). In the molecular function category, GO terms related to microtubule cytoskeleton, extracellular regions and plasmodesma were enriched, while hydrolase activity and microtubule binding dominated the enriched terms in the cellular component category.
Enrichment analysis of KEGG pathways revealed five metabolic subgroups enriched in our DEGs, including phenylpropanoid biosynthesis, flavonoid biosynthesis and cutin, suberine and wax biosynthesis (Table S6). The phenylpropanoid biosynthesis pathway was particularly enriched, with 72 genes in the pathway being differentially expressed, of which 42 were up-regulated and 30 were down-regulated in high-lignin accessions (Figure 3). These findings underscore the significant role of phenylpropanoid biosynthesis and cell wall biogenesis in determining lignin content in sorghum.

3.5. Analysis of Gene Co-Expression Network

Genes with similar expression profiles often perform related functions. To identify co-expression modules associated with lignin content, we performed weighted correlation network analysis (WGCNA) using the RNA-seq data. A total of 29 co-expression modules were identified (Figure 4A). Two co-expression modules, green and turquoise, showed a significant correlation with lignin content. Gene networks based on these modules were constructed, with the green module containing 1813 genes and the turquoise module containing 9167 genes.
Using the MCODE plugin in Cytoscape, we identified five gene clusters within the green module, with cluster 1 exhibiting the highest score (59.6). This cluster comprised 71 nodes and 2071 edges, indicating a dense network structure (Figure 4B). Notably, four lignin biosynthesis candidate genes, including SbLAC2 (Sobic.001G422300), SbCCoAOMT7 (Sobic.007G218800), SbMYB103 (Sobic.007G039100) and SbHCT (Sobic.004G212300), were present in this cluster. The seed gene (Sobic.006G220800) with the highest MCODE score of 42 in this cluster might play a central role in lignin biosynthesis.

3.6. Genetic Variation of Lignin Biosynthesis Genes

Understanding the sequence variation of key genes involved in lignin biosynthesis is crucial for functional validation and genetic improvement of lignin content. Using previously reported resequencing data from 44 sorghum accessions [2], we analyzed the genetic diversity and selection signals of the 104 candidate genes (Table S7). Of these, 10 genes showed no sequence variation. The remaining 94 genes exhibited substantial sequence diversity in both cultivated and wild sorghum groups. Notably, the sequence diversity of these genes was significantly lower in cultivated sorghum compared to wild genotypes (p-value < 0.01, paired t-test, Figure 5A). These lignin biosynthesis candidate genes also showed higher Fst values between wild and cultivated sorghum compared to the neutral genes (p-value < 0.01, Wilcoxon test, Table S8, Figure 5B). Together, the evidence suggests that the genes may have been targeted during domestication
Selection signals were detected in 30 out of the 94 genes analyzed, including eight genes related to monolignol biosynthesis, seven genes involved in monolignol polymerization, and 15 genes associated with transcriptional regulation. Among these 30 genes under selection, 22 genes were differentially expressed, including 17 up-regulated genes and five down-regulated genes in high-lignin accessions, suggesting an enrichment (p-value < 0.01, chi-squared test). Of the 15 genes involved in monolignol biosynthesis and polymerization under selection, the majority (11) were up-regulated in high-lignin accessions (Figure 5C). In contrast, only six out of the 15 genes associated with transcriptional regulation were up-regulated. These findings further support the idea that up-regulated genes involved in monolignol biosynthesis and polymerization play a more prominent role in lignin biosynthesis.

4. Discussion

Lignin content is directly linked to lodging in grain sorghum, digestibility in forage sorghum and refinery efficiency in bio-energy sorghum [1]. Understanding the gene network that regulates lignin content and its genetic variation is essential for manipulating lignin levels to improve these traits [40]. Building upon the established lignin biosynthesis network in model species, this study identified 104 candidate genes involved in lignin biosynthesis through a comparative genomic approach. Transcriptomic analysis of four sorghum accessions with contrasting lignin contents revealed significant enrichment of these candidate genes in differentially expressed genes, with most monolignol biosynthesis and polymerization genes up-regulated in high-lignin accessions. Furthermore, we observed a significant reduction in genetic diversity across these lignin biosynthesis candidate genes in cultivated sorghum compared to wild sorghum, with 30 genes showing signs of selection during sorghum domestication (Figure 5A). Of these 30 genes under selection, 22 were differentially expressed between high and low-lignin accessions, suggesting that differentially expressed genes are more likely to be functional in lignin biosynthesis. This study integrates evidence from comparative genomics, transcriptomics and population genetics to provide new insights into the lignin biosynthesis gene network in sorghum. The identified candidate genes, with their expression differences and sequence variations, offer promising targets for further functional investigation and trait manipulation.
Transcriptomic analyses have been performed to identify the genes underlying lignin variation in crop species. In barley, RNA-seq analysis of genotypes with different lignin content identified over ten thousand DEGs, including 47 genes in the monolignol pathway, providing targets for genetic improvement [41]. Likewise, over 2000 DEGs, including three genes in the monolignol pathway, were identified in transcriptome analysis of two sugarcane genotypes, contrasting the lignin content [42]. While these studies generally focused on a pair of genotypes, our analysis used four genotypes to better control background noise. As a result, a significant enrichment of lignin biosynthesis in DEGs was identified in our analysis, with 57 out of 6132 DEGs related to lignin biosynthesis.
Comparative genomics is a widely used approach to gain insights into gene function in non-model plants by leveraging the conservation of gene function among orthologs [35,43,44]. A total of 117 genes and 90 genes corresponding to 10 key enzymes in lignin biosynthesis have been reported in maize and rice, respectively [45,46]. These gene families showed high copy numbers that often exist as clusters. Our analysis focuses on both key enzymes in lignin biosynthesis and known transcription regulators in sorghum to gain a broad picture of the lignin biosynthesis pathway. Similar to maize and rice, these 10 key enzymes in lignin biosynthesis also displayed multiple copies of clusters in sorghum, indicating a general pattern in cereal crops. However, functional divergence of orthologs can occur through processes like neofunctionalization, which is particularly common for genes with multiple copies in a given species [47]. Therefore, further analysis is needed to determine which specific genes are responsible for lignin biosynthesis. The expression patterns of these genes are crucial for understanding their molecular function. Given that these 12 enzymes are key catalysts in lignin biosynthesis, up-regulated genes in high-lignin accessions are more likely to be functionally involved in lignin biosynthesis. Supporting this hypothesis, 31 out of the 37 differentially expressed genes related to monolignol biosynthesis and polymerization were up-regulated in the high-lignin accessions.
Transcriptional regulation adds another layer of complexity to lignin biosynthesis. Unlike these enzymes, transcriptional regulators can either positively or negatively influence lignin biosynthesis. This is reflected in the nearly equal proportion of up-regulated and down-regulated genes in differentially expressed transcriptional regulation genes. The balanced expression of these two types of genes further validates the high quality of our expression data.
Sorghum underwent significant morphological changes during domestication, leading to the formation of traits collectively known as “domestication syndrome” [48,49]. Key domestication traits in sorghum include the loss of seed shattering, increased grain size, reduced tillering and an expanded plant canopy [22,50,51]. To support increased grain yield and an enlarged canopy, cultivated sorghum requires stronger stems with higher lignin content. This suggests that lignin content in sorghum may have been selected during domestication. This hypothesis is further supported by the identification of selection signals in lignin biosynthesis candidate genes. Notably, differentially expressed genes were enriched in genes under selection, reaffirming that they are likely to be functionally involved in lignin biosynthesis.
In summary, this study identified 104 candidate genes within the lignin biosynthesis network in sorghum. By integrating transcriptomics and population genetics data, the expression pattern and sequence diversity of these genes were investigated, leading to 57 DEGs related to lignin biosynthesis being identified. Further investigation of the impact of their sequence variation on lignin content in sorghum could help pinpoint natural variation underlying these important traits [52]. Genetic modification of the lignin biosynthetic pathway has been successfully applied in plant species [53,54,55,56]. The findings of this study provide targets for functional investigation and trait manipulation.

5. Conclusions

In this study, we identified 104 candidate genes involved in lignin biosynthesis using comparative genomics. Transcriptomic analysis of four sorghum accessions with contrasting lignin contents revealed 6132 differentially expressed genes with an enrichment of genes related to phenylpropanoid biosynthesis and cell wall biogenesis. Out of these candidate genes, 57 were differentially expressed between high and low-lignin sorghum accessions, indicating a significant enrichment. These differentially expressed candidate genes were more likely to be functional in lignin biosynthesis, with most differentially expressed candidate genes related to monolignol biosynthesis and polymerization being up-regulated in high-lignin accessions. These up-regulated genes covered all key enzymes involved in lignin biosynthesis, suggesting that the elevated lignin content in these accessions results from a collective increase in enzyme activity. Sequence analysis revealed a significant reduction in genetic diversity across lignin biosynthesis genes in cultivated sorghum compared to wild sorghum. Moreover, selection signals were identified in 30 lignin biosynthesis genes, 22 of which were differentially expressed, further supporting the functional relevance of these genes in lignin biosynthesis. The findings enhance our understanding of the lignin biosynthesis network in sorghum and provide potential targets for future functional studies and trait manipulation.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/agronomy15071751/s1, Table S1 list of genes invovled in lignin biosythsis. Table S2 summary of sorghum samples in this study. Table S3 summary of RNA-seq data. Table S4 expression change of 104 lignin biosythsis candidate genes. Table S5 GO enrichment of DEGs between low lignin content accessions and high lignin content accessions. Table S6 KEGG enrichment of DEGs between low lignin content accessions and high lignin content accessions. Table S7 population statistics of 104 lignin biosynthesis candidate genes. Table S8 list of 129 neutral genes and their Nucleotide diversity (θπ), Tajima’s D and Fst values. Figure S1 Genetic regulatory network of lignin. Figure S2 Three lignin monomer biosynthetic pathways.

Author Contributions

J.P. and Y.T. designed the project. H.N., Y.W. (Yanbo Wang), R.L., X.C., Y.W. (Yao Wang), Y.W. (Yubin Wang), X.L., F.F., L.J., J.C., H.Y., H.W., H.C. and Y.Z. performed transcriptome and comparative genomic analyses. J.P., Y.T. and H.N. supervised the work. H.N., Y.T., J.P. and Y.W. (Yanbo Wang) wrote the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Biological Breeding Engineering Project of Shanxi Agricultural University (No. YZGC059), Key Research and Development Project of Shanxi Province (No. 202102140601008), Shanxi HouJi Laboratory Project (No. 202304010930003-18), China Agriculture Research System of MOF and MARA (CARS-06), Natural Science Foundation of China (No. 32241042) and the National Natural Science Foundation of China (No. 32372176).

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Silva, T.N.; Thomas, J.B.; Dahlberg, J.; Rhee, S.Y.; Mortimer, J.C. Progress and challenges in sorghum biotechnology, a multipurpose feedstock for the bioeconomy. J. Exp. Bot. 2022, 73, 646–664. [Google Scholar] [CrossRef] [PubMed]
  2. Mace, E.S.; Tai, S.; Gilding, E.K.; Li, Y.; Prentis, P.J.; Bian, L.; Campbell, B.C.; Hu, W.; Innes, D.J.; Han, X.; et al. Whole-genome sequencing reveals untapped genetic potential in Africa’s indigenous cereal crop sorghum. Nat. Commun. 2013, 4, 2320. [Google Scholar] [CrossRef] [PubMed]
  3. Yuan, J.; Cao, H.; Qin, W.; Yang, S.; Zhang, D.; Zhu, L.; Song, H.; Zhang, Q. Genomic and modern biotechnological strategies for enhancing salt tolerance in crops. New Crops 2025, 2, 100057. [Google Scholar] [CrossRef]
  4. Michael, F. Lignin: Characterization of a Multifaceted Crop Component. Sci. World J. 2013, 2013, 436517. [Google Scholar] [CrossRef] [PubMed]
  5. Sree Rayanoothala, P.; Dweh, T.J.; Mahapatra, S.; Kayastha, S. Unveiling the protective role of chitosan in plant defense: A comprehensive review with emphasis on abiotic stress management. Crop Des. 2024, 3, 100076. [Google Scholar] [CrossRef]
  6. Dien, B.S.; Sarath, G.; Pedersen, J.F.; Sattler, S.E.; Chen, H.; Funnell-Harris, D.L.; Nichols, N.N.; Cotta, M.A. Improved Sugar Conversion and Ethanol Yield for Forage Sorghum (Sorghum bicolor L. Moench) Lines with Reduced Lignin Contents. Bioenergy Res. 2009, 2, 153–164. [Google Scholar] [CrossRef]
  7. Brenton, Z.W.; Cooper, E.A.; Myers, M.T.; Boyles, R.E.; Shakoor, N.; Zielinski, K.J.; Rauh, B.L.; Bridges, W.C.; Morris, G.P.; Kresovich, S. A Genomic Resource for the Development, Improvement, and Exploitation of Sorghum for Bioenergy. Genetics 2016, 204, 21–33. [Google Scholar] [CrossRef] [PubMed]
  8. Niu, H.; Han, Y.; Ping, J.; Wang, Y.; Lv, X.; Chu, J. Genome wide association analysis of acid detergent fiber content of 206 forage sorghum (Sorghum bicolor (L.) Moench) accessions. Genet. Resour. Crop Evol. 2022, 69, 1941–1951. [Google Scholar] [CrossRef]
  9. Ferreira, S.S.; Goeminne, G.; Simões, M.S.; Pina, A.V.d.A.; Lima, L.G.A.d.; Pezard, J.; Gutiérrez, A.; Rencoret, J.; Mortimer, J.C.; del Río, J.C.; et al. Transcriptional and metabolic changes associated with internode development and reduced cinnamyl alcohol dehydrogenase activity in sorghum. J. Exp. Bot. 2022, 73, 6307–6333. [Google Scholar] [CrossRef] [PubMed]
  10. Adeyanju, A.O.; Sattler, S.E.; Rich, P.J.; Rivera-Burgos, L.A.; Ejeta, G. Sorghum Brown Midrib19 (Bmr19) Gene Links Lignin Biosynthesis to Folate Metabolism. Genes 2021, 12, 660. [Google Scholar] [CrossRef] [PubMed]
  11. Bout, S.; Vermerris, W. A candidate-gene approach to clone the sorghum Brown midrib gene encoding caffeic acid O-methyltransferase. Mol. Genet. Genom. 2003, 269, 205–214. [Google Scholar] [CrossRef] [PubMed]
  12. Saballos, A.; Sattler, S.E.; Sanchez, E.; Foster, T.P.; Vermerris, W. Brown midrib2 (Bmr2) encodes the major 4-coumarate:coenzyme A ligase involved in lignin biosynthesis in sorghum (Sorghum bicolor (L.) Moench). Plant J. 2012, 70, 818–830. [Google Scholar] [CrossRef] [PubMed]
  13. Sattler, S.E.; Saathoff, A.J.; Haas, E.J.; Palmer, N.A.; Funnell-Harris, D.L.; Sarath, G.; Pedersen, J.F. A Nonsense Mutation in a Cinnamyl Alcohol Dehydrogenase Gene Is Responsible for the Sorghum brown midrib6 Phenotype. Plant Physiol. 2009, 150, 584–595. [Google Scholar] [CrossRef] [PubMed]
  14. Bonawitz, N.D.; Chapple, C. The genetics of lignin biosynthesis: Connecting genotype to phenotype. Annu. Rev. Genet. 2010, 44, 337–363. [Google Scholar] [CrossRef] [PubMed]
  15. Xie, M.; Zhang, J.; Tschaplinski, T.J.; Tuskan, G.A.; Chen, J.G.; Muchero, W. Regulation of Lignin Biosynthesis and Its Role in Growth-Defense Tradeoffs. Front. Plant Sci. 2018, 9, 1427. [Google Scholar] [CrossRef] [PubMed]
  16. Zhao, Q.; Dixon, R.A. Transcriptional networks for lignin biosynthesis: More complex than we thought? Trends Plant Sci. 2011, 16, 227–233. [Google Scholar] [CrossRef] [PubMed]
  17. Xu, Z.; Zhang, D.; Hu, J.; Zhou, X.; Ye, X.; Reichel, K.L.; Stewart, N.R.; Syrenne, R.D.; Yang, X.; Gao, P.; et al. Comparative genome analysis of lignin biosynthesis gene families across the plant kingdom. BMC Bioinform. 2009, 10, S3. [Google Scholar] [CrossRef] [PubMed]
  18. McCormick, R.F.; Truong, S.K.; Sreedasyam, A.; Jenkins, J.; Shu, S.; Sims, D.; Kennedy, M.; Amirebrahimi, M.; Weers, B.D.; McKinley, B.; et al. The Sorghum bicolor reference genome: Improved assembly, gene annotations, a transcriptome atlas, and signatures of genome organization. Plant J. 2018, 93, 338–354. [Google Scholar] [CrossRef] [PubMed]
  19. Paterson, A.H.; Bowers, J.E.; Bruggmann, R.; Dubchak, I.; Grimwood, J.; Gundlach, H.; Haberer, G.; Hellsten, U.; Mitros, T.; Poliakov, A.; et al. The Sorghum bicolor genome and the diversification of grasses. Nature 2009, 457, 551–556. [Google Scholar] [CrossRef] [PubMed]
  20. Wei, C.; Gao, L.; Xiao, R.; Wang, Y.; Chen, B.; Zou, W.; Li, J.; Mace, E.; Jordan, D.; Tao, Y. Complete telomere-to-telomere assemblies of two sorghum genomes to guide biological discovery. iMeta 2024, 3, e193. [Google Scholar] [CrossRef] [PubMed]
  21. Tao, Y.; Luo, H.; Xu, J.; Cruickshank, A.; Zhao, X.; Teng, F.; Hathorn, A.; Wu, X.; Liu, Y.; Shatte, T.; et al. Extensive variation within the pan-genome of cultivated and wild sorghum. Nat. Plants 2021, 7, 766–773. [Google Scholar] [CrossRef] [PubMed]
  22. Wu, X.; Liu, Y.; Luo, H.; Shang, L.; Leng, C.; Liu, Z.; Li, Z.; Lu, X.; Cai, H.; Hao, H.; et al. Genomic footprints of sorghum domestication and breeding selection for multiple end uses. Mol. Plant 2022, 15, 537–551. [Google Scholar] [CrossRef] [PubMed]
  23. Bolger, A.M.; Marc, L.; Bjoern, U. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [PubMed]
  24. Kim, D.; Paggi, J.M.; Park, C.; Bennett, C.; Salzberg, S. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 2019, 37, 907–915. [Google Scholar] [CrossRef] [PubMed]
  25. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef] [PubMed]
  26. Liao, Y.; Smyth, G.K.; Shi, W. The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads. Nucleic Acids Res. 2019, 47, e47. [Google Scholar] [CrossRef] [PubMed]
  27. Robinson, M.D.; Oshlack, A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010, 11, R25. [Google Scholar] [CrossRef] [PubMed]
  28. Aramaki, T.; Blanc-Mathieu, R.; Endo, H.; Ohkubo, K.; Ogata, H. KofamKOALA: KEGG ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics 2019, 36, 2251–2252. [Google Scholar] [CrossRef] [PubMed]
  29. Zheng, Y.; Jiao, C.; Sun, H.; Rosli, H.G.; Pombo, M.A.; Zhang, P.; Banf, M.; Dai, X.; Martin, G.B.; Giovannoni, J.J.; et al. iTAK: A Program for Genome-wide Prediction and Classification of Plant Transcription Factors, Transcriptional Regulators, and Protein Kinases. Mol. Plant 2016, 9, 1667–1670. [Google Scholar] [CrossRef] [PubMed]
  30. Otasek, D.; Morris, J.H.; Bouças, J.; Pico, A.R.; Demchak, B. Cytoscape Automation: Empowering workflow-based network analysis. Genome Biol. 2019, 20, 185. [Google Scholar] [CrossRef] [PubMed]
  31. Mckenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef] [PubMed]
  32. Nei, M.; Li, W.H. Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc. Natl. Acad. Sci. USA 1979, 76, 5269–5273. [Google Scholar] [CrossRef] [PubMed]
  33. Tajima, F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 1989, 123, 585–595. [Google Scholar] [CrossRef] [PubMed]
  34. Hudson, R.R.; Slatkin, M.; Maddison, W.P. Estimation of levels of gene flow from DNA sequence data. Genetics 1992, 132, 583. [Google Scholar] [CrossRef] [PubMed]
  35. Tao, Y.; Mace, E.S.; Tai, S.; Cruickshank, A.; Campbell, B.C.; Zhao, X.; Van Oosterom, E.J.; Godwin, I.D.; Botella, J.R.; Jordan, D.R. Whole-Genome Analysis of Candidate genes Associated with Seed Size and Weight in Sorghum bicolor Reveals Signatures of Artificial Selection and Insights into Parallel Domestication in Cereal Crops. Front. Plant Sci. 2017, 8, 1237. [Google Scholar] [CrossRef] [PubMed]
  36. Baucher, M.; Halpin, C.; Petit-Conil, M.; Boerjan, W. Lignin: Genetic Engineering and Impact on Pulping. Crit. Rev. Biochem. Mol. Biol. 2003, 38, 305–350. [Google Scholar] [CrossRef] [PubMed]
  37. Dong, N.Q.; Lin, H.X. Contribution of phenylpropanoid metabolism to plant development and plant–environment interactionsFA. J. Integr. Plant Biol. 2021, 63, 30. [Google Scholar] [CrossRef] [PubMed]
  38. Rao, X.; Dixon, R.A. Current Models for Transcriptional Regulation of Secondary Cell Wall Biosynthesis in Grasses. Front. Plant Sci. 2018, 9, 399. [Google Scholar] [CrossRef] [PubMed]
  39. Zhong, R.; Ye, Z.-H. Secondary Cell Walls: Biosynthesis, Patterned Deposition and Transcriptional Regulation. Plant Cell Physiol. 2014, 56, 195–214. [Google Scholar] [CrossRef] [PubMed]
  40. Carpita, N.C.; McCann, M.C. Maize and sorghum: Genetic resources for bioenergy grasses. Trends Plant Sci. 2008, 13, 415–420. [Google Scholar] [CrossRef] [PubMed]
  41. Chen, X.; Sun, H.; Zhao, B.; Wang, R.; Ji, C.; Zhu, B. Transcriptome analysis identifies differentially expressed genes involved in lignin biosynthesis in barley. Int. J. Biol. Macromol. 2023, 236, 123940. [Google Scholar] [CrossRef] [PubMed]
  42. Amancio, S.; Vicentini, R.; Bottcher, A.; Brito, M.d.S.; dos Santos, A.B.; Creste, S.; Landell, M.G.d.A.; Cesarino, I.; Mazzafera, P. Large-Scale Transcriptome Analysis of Two Sugarcane Genotypes Contrasting for Lignin Content. PLoS ONE 2015, 10, e0137698. [Google Scholar]
  43. Caicedo, A.L.; Purugganan, M.D. Comparative Plant Genomics. Frontiers and Prospects. Plant Physiol. 2005, 138, 545–547. [Google Scholar] [CrossRef] [PubMed]
  44. Tao, Y.; Zhao, X.; Wang, X.; Hathorn, A.; Hunt, C.; Cruickshank, A.W.; van Oosterom, E.J.; Godwin, I.D.; Mace, E.S.; Jordan, D.R. Large-scale GWAS in sorghum reveals common genetic control of grain size among cereals. Plant Biotechnol. J. 2020, 18, 1093–1105. [Google Scholar] [CrossRef] [PubMed]
  45. Shad, M.A.; Li, X.; Rao, M.J.; Luo, Z.; Li, X.; Ali, A.; Wang, L. Exploring Lignin Biosynthesis Genes in Rice: Evolution, Function, and Expression. Int. J. Mol. Sci. 2024, 25, 10001. [Google Scholar] [CrossRef] [PubMed]
  46. Wang, S.; Wang, X.; Yue, L.; Li, H.; Zhu, L.; Dong, Z.; Long, Y. Genome-Wide Identification and Characterization of Lignin Synthesis Genes in Maize. Int. J. Mol. Sci. 2024, 25, 6710. [Google Scholar] [CrossRef] [PubMed]
  47. Clark, J.W.; Donoghue, P.C.J. Whole-Genome Duplication and Plant Macroevolution. Trends Plant Sci. 2018, 23, 933–945. [Google Scholar] [CrossRef] [PubMed]
  48. Fuller, D.Q.; Stevens, C.J. Sorghum Domestication and Diversification: A Current Archaeobotanical Perspective. In Plants and People in the African Past: Progress in African Archaeobotany; Mercuri, A.M., D’Andrea, A.C., Fornaciari, R., Höhn, A., Eds.; Springer International Publishing: Cham, Switzerland, 2018; pp. 427–452. [Google Scholar]
  49. Ge, F.; Xie, P.; Wu, Y.; Xie, Q. Genetic architecture and molecular regulation of sorghum domestication. aBIOTECH 2023, 4, 57–71. [Google Scholar] [CrossRef] [PubMed]
  50. Lin, Z.; Li, X.; Shannon, L.M.; Yeh, C.T.; Wang, M.L.; Bai, G.; Peng, Z.; Li, J.; Trick, H.N.; Clemente, T.E. Parallel domestication of the Shattering1 genes in cereals. Nat. Genet. 2012, 44, 720–724. [Google Scholar] [CrossRef] [PubMed]
  51. Tang, H.; Cuevas, H.E.; Das, S.; Sezen, U.U.; Zhou, C.; Guo, H.; Goff, V.H.; Ge, Z.; Clemente, T.E.; Paterson, A.H. Seed shattering in a wild sorghum is conferred by a locus unrelated to domestication. Proc. Natl. Acad. Sci. USA 2013, 110, 15824–15829. [Google Scholar] [CrossRef] [PubMed]
  52. Tao, Y.; Trusov, Y.; Zhao, X.; Wang, X.; Cruickshank, A.W.; Hunt, C.; van Oosterom, E.J.; Hathorn, A.; Liu, G.; Godwin, I.D.; et al. Manipulating assimilate availability provides insight into the genes controlling grain size in sorghum. Plant J. 2021, 108, 231–243. [Google Scholar] [CrossRef] [PubMed]
  53. Eudes, A.; Liang, Y.; Mitra, P.; Loqué, D. Lignin bioengineering. Curr. Opin. Biotechnol. 2014, 26, 189–198. [Google Scholar] [CrossRef] [PubMed]
  54. Fu, C.; Mielenz, J.R.; Xiao, X.; Ge, Y.; Hamilton, C.Y.; Rodriguez, M.; Chen, F.; Foston, M.; Ragauskas, A.; Bouton, J. Genetic manipulation of lignin reduces recalcitrance and improves ethanol production from switchgrass. Proc. Natl. Acad. Sci. USA 2011, 108, 3803–3808. [Google Scholar] [CrossRef] [PubMed]
  55. Wilkerson, C.G.; Mansfield, S.D.; Lu, F.; Withers, S.; Park, J.-Y.; Karlen, S.D.; Gonzales-Vigil, E.; Padmakshan, D.; Unda, F.; Rencoret, J.; et al. Monolignol Ferulate Transferase Introduces Chemically Labile Linkages into the Lignin Backbone. Science 2014, 344, 90–93. [Google Scholar] [CrossRef] [PubMed]
  56. Yan, J.; Aznar, A.; Chalvin, C.; Birdseye, D.S.; Baidoo, E.E.K.; Eudes, A.; Shih, P.M.; Loqué, D.; Zhang, A.; Scheller, H.V. Increased drought tolerance in plants engineered for low lignin and low xylan content. Biotechnol. Biofuels 2018, 11, 195. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Correlation and clustering analysis results between different sorghum accessions. (A) Correlation analysis among samples based on gene expression. (B) PCA analysis between different lines. (C) The PCA loading values of genes that explain the top 5% of variance from PC1 to PC5.
Figure 1. Correlation and clustering analysis results between different sorghum accessions. (A) Correlation analysis among samples based on gene expression. (B) PCA analysis between different lines. (C) The PCA loading values of genes that explain the top 5% of variance from PC1 to PC5.
Agronomy 15 01751 g001
Figure 2. Differential Expression Analysis between low and high lignin content in the lines. (A) Heatmap of differentially expressed genes. (B) Volcano plot of differentially expressed genes. (C) Proportion of up and down-regulated genes in different types of lignin biosynthesis. MP, Monolignol Polymerization genes, MB, Monolignol Biosynthesis genes, TR, Transcriptional Regulation genes. WG, Whole Genome genes. (D) GO enrichment.
Figure 2. Differential Expression Analysis between low and high lignin content in the lines. (A) Heatmap of differentially expressed genes. (B) Volcano plot of differentially expressed genes. (C) Proportion of up and down-regulated genes in different types of lignin biosynthesis. MP, Monolignol Polymerization genes, MB, Monolignol Biosynthesis genes, TR, Transcriptional Regulation genes. WG, Whole Genome genes. (D) GO enrichment.
Agronomy 15 01751 g002
Figure 3. Expression changes of each synthetase gene in phenylpropanoid biosynthesis pathway.
Figure 3. Expression changes of each synthetase gene in phenylpropanoid biosynthesis pathway.
Agronomy 15 01751 g003
Figure 4. Co-expression Modules and Gene Networks Associated with Lignin Content. (A) Correlation between lignin content and each module. (B) The gene expression networks of the green modules were clustered by MCODE.
Figure 4. Co-expression Modules and Gene Networks Associated with Lignin Content. (A) Correlation between lignin content and each module. (B) The gene expression networks of the green modules were clustered by MCODE.
Agronomy 15 01751 g004
Figure 5. Genetic diversity and selection signatures associated with lignin biosynthesis in sorghum. (A) Nucleotide diversity (π) of candidate lignin genes compared with genome-wide genes in wild and cultivated sorghum. (B) Fst comparison between lignin-pathway candidate genes and genome-wide neutral loci. (C) Proportions of up- and down-regulated genes under selection within three functional classes of the lignin pathway. MP, Monolignol Polymerization genes, MB, Monolignol Biosynthesis genes, TR, Transcriptional Regulation genes.
Figure 5. Genetic diversity and selection signatures associated with lignin biosynthesis in sorghum. (A) Nucleotide diversity (π) of candidate lignin genes compared with genome-wide genes in wild and cultivated sorghum. (B) Fst comparison between lignin-pathway candidate genes and genome-wide neutral loci. (C) Proportions of up- and down-regulated genes under selection within three functional classes of the lignin pathway. MP, Monolignol Polymerization genes, MB, Monolignol Biosynthesis genes, TR, Transcriptional Regulation genes.
Agronomy 15 01751 g005
Table 1. Summary of lignin biosynthesis candidate genes.
Table 1. Summary of lignin biosynthesis candidate genes.
Gene IDChrStartEndGene NameLignin Biosynthesis Process Expression ChangeSelection Signal
Sobic.001G110900Chr018,614,9158,616,973MYB52/MYB54transcriptional regulationupNo
Sobic.001G137100Chr0110,870,88710,877,123BLH6transcriptional regulationnsNo
Sobic.001G160500Chr0113,186,39213,188,966PAL1/PAL2/PAL3/PAL4monolignol biosynthesisNANo
Sobic.001G196300Chr0117,685,32617,690,280F5H1monolignol biosynthesisupYes
Sobic.001G261700Chr0145,967,55045,969,583LBD18/LBD30transcriptional regulationNANo
Sobic.001G316800Chr0160,505,05360,512,997VND1/VND2/VND3transcriptional regulationNAYes
Sobic.001G346800Chr0163,612,86563,614,287COMTmonolignol biosynthesisupNA
Sobic.001G358900Chr0164,882,02664,883,966MYB46transcriptional regulationdownNo
Sobic.001G422300Chr0170,308,57770,311,856LAC2monolignol polymerizationupYes
Sobic.001G437300Chr0171,566,22471,569,045LBD18/LBD30transcriptional regulationnsNA
Sobic.001G516600Chr0178,328,40178,335,0844CL8monolignol biosynthesisnsNo
Sobic.001G522700Chr0178,781,46778,788,810VND1/VND2/VND3transcriptional regulationnsYes
Sobic.001G526200Chr0179,021,00479,026,139KNAT7transcriptional regulationupNo
Sobic.002G003200Chr02425,796427,548AtPRX2monolignol polymerizationdownNo
Sobic.002G003400Chr02445,651447,224AtPRX2monolignol polymerizationNANA
Sobic.002G003500Chr02450,973452,852AtPRX2monolignol polymerizationNANA
Sobic.002G003700Chr02460,085461,869AtPRX2monolignol polymerizationnsNo
Sobic.002G029500Chr022,740,7542,742,865F5H1monolignol biosynthesisNANo
Sobic.002G126600Chr0217,292,18417,296,557C4Hmonolignol biosynthesisupNo
Sobic.002G146000Chr0229,037,64829,043,667CCR1monolignol biosynthesisdownNo
Sobic.002G195400Chr0258,394,42458,396,434CAD6monolignol biosynthesisNANo
Sobic.002G195500Chr0258,401,42458,401,985CAD6monolignol biosynthesisNANA
Sobic.002G195600Chr0258,404,43358,406,373CAD6monolignol biosynthesisdownNo
Sobic.002G195700Chr0258,408,61958,410,895CAD6monolignol biosynthesisdownNo
Sobic.002G196000Chr0258,453,17558,454,976MYB20transcriptional regulationnsNo
Sobic.002G196100Chr0258,500,45858,502,066MYB20transcriptional regulationnsNo
Sobic.002G242300Chr0263,154,94763,156,519CCoAOMT7monolignol biosynthesisnsNo
Sobic.002G275500Chr0265,814,95265,825,791MYB42/MYB85transcriptional regulationupNo
Sobic.002G279100Chr0266,107,62366,109,361MYB4transcriptional regulationnsNo
Sobic.002G325000Chr0269,554,96069,556,417COMTmonolignol biosynthesisupYes
Sobic.002G388700Chr0274,246,29474,248,187MYB46transcriptional regulationnsNo
Sobic.003G050300Chr034,577,4004,579,054AtPRX25monolignol polymerizationupNo
Sobic.003G121000Chr0311,015,23311,017,556AtPRX72monolignol polymerizationdownYes
Sobic.003G183300Chr0348,690,86648,695,241AtPRX72monolignol polymerizationnsYes
Sobic.003G231400Chr0357,067,03757,068,905LAC17monolignol polymerizationNANo
Sobic.003G244400Chr0358,382,28258,387,226OsFBK1transcriptional regulationnsNo
Sobic.003G251800Chr0358,989,84858,995,158SND2transcriptional regulationupNo
Sobic.003G281500Chr0361,655,47861,664,524PDR1/AtABCG29monolignol polymerizationnsNo
Sobic.003G320800Chr0364,842,03464,843,978AtPRX2monolignol polymerizationupNo
Sobic.003G327800Chr0365,336,67065,340,440C3Hmonolignol biosynthesisnsNo
Sobic.003G337400Chr0366,049,42666,052,170C4Hmonolignol biosynthesisupYes
Sobic.003G353200Chr0367,210,78767,213,484LAC2monolignol polymerizationupNo
Sobic.004G062500Chr045,041,2605,046,4464CL1/4CL2monolignol biosynthesisupNo
Sobic.004G065600Chr045,305,3965,308,691CCR1monolignol biosynthesisupNo
Sobic.004G071000Chr045,728,6045,734,362CAD3monolignol biosynthesisupNo
Sobic.004G123200Chr0413,843,96113,858,621MYB69transcriptional regulationdownYes
Sobic.004G123400Chr0413,903,17313,921,822MYB69transcriptional regulationnsNo
Sobic.004G176600Chr0452,879,27552,880,557XND1transcriptional regulationdownNo
Sobic.004G212300Chr0456,194,95856,201,627HCTmonolignol biosynthesisupNo
Sobic.004G220300Chr0457,051,38357,055,340PAL1/PAL2/PAL3/PAL4monolignol biosynthesisupNo
Sobic.004G220400Chr0457,064,91457,069,724PAL1/PAL2/PAL3/PAL4monolignol biosynthesisnsNo
Sobic.004G220500Chr0457,075,10657,077,250PAL1/PAL2/PAL3/PAL4monolignol biosynthesisupNA
Sobic.004G220600Chr0457,083,77157,086,639PAL1/PAL2/PAL3/PAL4monolignol biosynthesisupNA
Sobic.004G220700Chr0457,099,42257,101,566PAL1/PAL2/PAL3/PAL4monolignol biosynthesisupNA
Sobic.004G248700Chr0459,576,46459,578,405MYB20transcriptional regulationnsYes
Sobic.004G272700Chr0461,636,90261,641,3374CL3monolignol biosynthesisupYes
Sobic.004G273800Chr0461,765,78461,767,930MYB58/MYB63transcriptional regulationnsNo
Sobic.004G298400Chr0463,778,46063,782,424WRKY12transcriptional regulationdownYes
Sobic.004G302400Chr0464,103,33664,112,306VND4/VND5/VND6transcriptional regulationdownYes
Sobic.005G088400Chr0512,613,41712,615,757F5H1monolignol biosynthesisNANo
Sobic.006G008800Chr061,302,6491,307,415E2Fctranscriptional regulationdownNo
Sobic.006G079300Chr0644,550,69544,552,325AtPRX72monolignol polymerizationNANo
Sobic.006G086000Chr0645,545,06545,546,863XND1transcriptional regulationnsNo
Sobic.006G136800Chr0649,923,68849,929,393HCTmonolignol biosynthesisupNo
Sobic.006G148800Chr0651,039,81651,042,905PAL1/PAL2/PAL3/PAL4monolignol biosynthesisupNo
Sobic.006G148900Chr0651,053,47751,056,309PAL1/PAL2/PAL3/PAL4monolignol biosynthesisnsNo
Sobic.006G160900Chr0651,876,06651,882,814VND4/VND5/VND6transcriptional regulationNANo
Sobic.006G166300Chr0652,377,82852,383,758WRKY12transcriptional regulationnsNo
Sobic.006G199800Chr0655,156,81655,158,923MYB58/MYB63transcriptional regulationupYes
Sobic.006G211900Chr0656,109,08656,113,719CAD6monolignol biosynthesisdownNo
Sobic.006G224500Chr0657,040,31057,042,534AtPRX2monolignol polymerizationNAYes
Sobic.006G279400Chr0660,968,19360,970,485AtVND7transcriptional regulationnsNA
Sobic.007G003000Chr07277,399280,393AtVND7transcriptional regulationdownYes
Sobic.007G018100Chr071,663,0321,668,132NST1/NST2/NST3transcriptional regulationupYes
Sobic.007G039100Chr073,776,1043,779,183MYB103transcriptional regulationupNo
Sobic.007G047300Chr074,721,0734,724,503COMTmonolignol biosynthesisupNo
Sobic.007G076000Chr078,729,7108,733,771CAD6monolignol biosynthesisnsNo
Sobic.007G089900Chr0712,684,97912,699,4034CL1/4CL2monolignol biosynthesisnsNo
Sobic.007G132600Chr0755,272,19455,274,367MYB20transcriptional regulationupYes
Sobic.007G141200Chr0757,060,87357,068,026CCR1monolignol biosynthesisupNo
Sobic.007G145600Chr0757,553,94057,556,3334CL1/4CL2monolignol biosynthesisupYes
Sobic.007G177100Chr0761,085,88761,087,423MYB4transcriptional regulationnsNo
Sobic.007G178300Chr0761,150,83161,158,248MYB42/MYB85transcriptional regulationupYes
Sobic.007G217200Chr0764,554,56164,555,105CCoAOMT7monolignol biosynthesisNANA
Sobic.007G218500Chr0764,677,32264,678,679CCoAOMT7monolignol biosynthesisNANo
Sobic.007G218700Chr0764,689,20664,692,655CCoAOMT7monolignol biosynthesisupNo
Sobic.007G218800Chr0764,709,89764,711,491CCoAOMT7monolignol biosynthesisupNo
Sobic.008G112200Chr0851,883,85451,887,072MYB83transcriptional regulationnsYes
Sobic.008G188900Chr0862,311,40362,316,497BLH6transcriptional regulationupYes
Sobic.009G162300Chr0951,911,38851,914,098LAC2monolignol polymerizationupYes
Sobic.009G181800Chr0953,566,76553,571,063C3Hmonolignol biosynthesisupYes
Sobic.009G186500Chr0953,933,81653,935,359AtPRX2monolignol polymerizationNANo
Sobic.009G186600Chr0953,940,27653,942,152AtPRX2monolignol polymerizationupYes
Sobic.009G231600Chr0957,151,98757,154,230SND2transcriptional regulationupYes
Sobic.010G002900Chr10263,346268,317VND4/VND5/VND6transcriptional regulationdownNo
Sobic.010G022400Chr101,840,6901,844,258NST1/NST2/NST3transcriptional regulationupNo
Sobic.010G052200Chr104,071,5784,073,478CCoAOMT1monolignol biosynthesisupYes
Sobic.010G066000Chr105,255,6905,259,051CCR1monolignol biosynthesisupNo
Sobic.010G106601Chr1010,257,92610,259,640MYB20transcriptional regulationNAYes
Sobic.010G128300Chr1016,621,90216,623,207AtPRX2monolignol polymerizationNANo
Sobic.010G128400Chr1016,881,60516,882,960AtPRX2monolignol polymerizationNANo
Sobic.010G128700Chr1016,968,97616,970,381AtPRX2monolignol polymerizationNANo
Sobic.010G165500Chr1048,920,36948,930,191PDR1/AtABCG29monolignol polymerizationupYes
Sobic.010G214900Chr1055,777,18455,784,7164CL1/4CL2monolignol biosynthesisnsYes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Niu, H.; Wang, Y.; Liu, R.; Cheng, X.; Wang, Y.; Wang, Y.; Lv, X.; Fan, F.; Ju, L.; Chu, J.; et al. Combining Transcriptome Analysis and Comparative Genomics to Identify Key Components of the Lignin Biosynthesis Gene Network in Sorghum bicolor. Agronomy 2025, 15, 1751. https://doi.org/10.3390/agronomy15071751

AMA Style

Niu H, Wang Y, Liu R, Cheng X, Wang Y, Wang Y, Lv X, Fan F, Ju L, Chu J, et al. Combining Transcriptome Analysis and Comparative Genomics to Identify Key Components of the Lignin Biosynthesis Gene Network in Sorghum bicolor. Agronomy. 2025; 15(7):1751. https://doi.org/10.3390/agronomy15071751

Chicago/Turabian Style

Niu, Hao, Yanbo Wang, Ruizhen Liu, Xiaoqiang Cheng, Yao Wang, Yubin Wang, Xin Lv, Fangfang Fan, Lan Ju, Jianqiang Chu, and et al. 2025. "Combining Transcriptome Analysis and Comparative Genomics to Identify Key Components of the Lignin Biosynthesis Gene Network in Sorghum bicolor" Agronomy 15, no. 7: 1751. https://doi.org/10.3390/agronomy15071751

APA Style

Niu, H., Wang, Y., Liu, R., Cheng, X., Wang, Y., Wang, Y., Lv, X., Fan, F., Ju, L., Chu, J., Yan, H., Wang, H., Chang, H., Zhang, Y., Tao, Y., & Ping, J. (2025). Combining Transcriptome Analysis and Comparative Genomics to Identify Key Components of the Lignin Biosynthesis Gene Network in Sorghum bicolor. Agronomy, 15(7), 1751. https://doi.org/10.3390/agronomy15071751

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop