1. Introduction
Lentil (
Lens culinaris Medik.) which belongs to the Fabaceae family, is among the earliest domesticated plant species. Lentil is one of the most important legumes for human and animal nutrition [
1] and is also useful for soil enrichment with nitrogen as it enhances nitrogen fixation [
2]. Despite lentils’ high nutritional value and positive health benefits, their content in certain antinutritional factors such as phytic acid, oxalates, antitrypsin inhibitors, and polyphenols can significantly affect the absorbance of mineral elements by the human body. These compounds can create complexes with essential mineral elements, significantly diminishing their solubility within the intestinal lumen and, consequently, impeding their absorption by the human body [
3,
4]. Effective approaches to drastically limit these antinutritional factors and improve the overall taste of pulses are cooking, which promotes water uptake from the seeds, and dehulling [
5]. Cooking time is determined by the seed’s hardness, which controls the seed’s ability to absorb water. Reduced seed hardness leads to faster water uptake which ensures less cooking time [
6]. The hard seed coat of lentils is a feature of domestication, and it is attributed to environmental and genetic factors [
7]. Despite similarities to other legumes’ seed coats, lentils’ seed coat microstructure tends to lead to less hard seed coats [
8]. Lentil’s transparent and thin seed coat is controlled by a recessive gene (
tan) which moderates the tannin precursors that are responsible for seed darkening during cooking and the preservation period [
9]. It is worth noting that both findings of our research group [
10,
11,
12,
13] and those of other research teams (for a review see Polidoros et al. [
1]) indicate that the genetic diversity present within and among lentil populations is rich enough to provide material for genetic studies and manipulation of this trait. Phenolic acids, stilbenes and flavonoids such as flavanones, flavones, dihydroflavonols, flavonols, flavan-3-ols, anthocyanidins and proanthocyanidins, also known as condensed tannins, are some of the derivatives of polyphenolic compounds that play an important role in the properties of the seed coat [
14]. The biosynthesis of tannins begins with the conversion of L-phenylalanine to trans-cinnamic acid, which is catalyzed by phenylalanine ammonia-lyase (PAL). Chalcone synthase (CHS) is the key enzyme that catalyzes the first step in the branch that leads to the flavonoid pathway [
15]. Chalcone isomerase (CHI) then speeds up the chemical reaction that converts the yellow-colored chalcone into the colorless naringenin. Following this isomerization, flavanone 3-hydroxylase (F3H) transforms naringenin into dihydrokaempferol (DHK) which is hydroxylated by flavonoid 3’-hydroxylase (F3′H) to form dihydromyricetin (DHM) [
16]. Transcription factors, such as basic helix-loop helix (bHLH), that interfere with the pathway determine the regulation of several biosynthetic enzymes such us, F3H, F3′H, flavonoid-3′-5′-hydroxylase (F3′5′H), flavonol synthase (FLS), dihydroflavonol 4-reductase (DFR), anthocyanidin synthase (ANS), anthocyanidin reductase (ANR) and leucoanthocyanidin reductase (LAR) that are considered to affect the pigmentation in the lentil seed coat. These enzymes are reported as key enzymes for the biosynthesis of flavan-3-ols, which are attributed as the monomers of polymerized proanthocyanidins [
15,
16,
17,
18].
While the impact of tannin content on seed coat hardness is recognized, there is a limited understanding of the factors influencing their accumulation. Feedback from local farmers underscores the substantial influence of environmental variables. Their observations reveal that for certain lentil varieties, within the same variety and field, the percentage of hard seeds that are resistant to boiling can vary from year to year in a seemingly random manner. This prompts a worthwhile investigation into the role of diverse field factors that may influence tannin accumulation during lentil seed development. One such factor, susceptible to fluctuations due to environmental conditions, is the availability of phosphorus (P) in the soil. P stands as a vital macronutrient crucial for the growth and development of plants. It plays a pivotal role in essential metabolic processes, including the synthesis of adenosine triphosphate (ATP) and nicotinamide adenine dinucleotide phosphate hydrogen (NADPH), it is involved in the formation of nucleic acids and phospholipids [
19]. It is also well known that soil P dynamics are affected by the frequency and intensity of soil drying and rewatering [
20]. P deficiency is reported to induce the increase in certain phenolic substances in all the vegetative parts of the lentil plants [
21]. However, our knowledge of how P levels, above or below optimal doses, affect transcriptional regulation and gene expression during seed development is limited.
In recent years, next-generation sequencing technologies have facilitated the study of lentil’s genome (~4.3 Gbp), and the most recent effort in developing lentil genome assemblies for both the cultivated lentil (
Lens culinaris) and its wild relative (
Lens ervoides) was effectuated by Ramsay et al. [
22]. Numerous studies utilizing transcriptome sequencing on lentil have investigated topics such as lentil’s response to biotic and abiotic stresses by studying its defense mechanisms against pathogens, drought and heat stress [
23,
24,
25,
26,
27,
28]. Transcriptome analysis offers several advantages, including the discovery of stress-resilient genes and the identification of transcription factors and stress-responsive pathways [
26]. In another transcriptomic analysis conducted on both small and large-seeded genotypes, the study examined the involvement of pathways related to hormone biosynthesis and crucial genes encoding kinases, transcription factors (TF) and enzymes responsible for cell wall formation. These findings suggest that the control of seed size in lentils operates through the regulation of cell division, involving both cell expansion and the overall seed growth [
29]. In a recent transcriptomic analysis of lentil embryos, seed coats and whole seeds, specific genes and co-expressed gene sets were found to be prevalent in distinct tissues and developmental stages, distinguishing the expression patterns of the embryo from those of the seed coat [
30].The availability of the current lentil’s genome assemblies that include the cultivated lentil (
Lens culinaris) and the wild relative (
Lens ervoides) has facilitated the potential to gain knowledge regarding mechanisms regulating even complex traits [
22].
Evidence supporting the significance of seed coat hardness in the quality of lentil products and, consequently, their market value [
31] underscores the critical need to explore the factors governing tannin accumulation in lentil seeds, as tannins are the primary determinants of seed coat hardness [
9,
32]. As hard seed coat appears to be determined both by genetic and environmental factors [
7], to gain a better understanding of this phenomenon, it is crucial to examine the lentil’s innate molecular mechanisms that control tannin production, such as the seed maturity stage, but also the effect of environmental features such as P, a principal nutrient involved in many metabolic processes. This study examined the impact of P and the seed maturity stage on both the overall gene expression patterns and those of the phenylpropanoid genes in lentil, the pathway responsible for tannin biosynthesis. Insights derived from this study may have the potential to provide a plausible explanation for differences in seed coat hardness within the same lentil variety stemming from differences in P fertilization levels. These findings can inform innovative agricultural strategies designed to enhance lentil product quality.
4. Discussion
Lentil is one of the earliest domesticated species and one of the most consumed pulses, especially in the Mediterranean territory. One of the key elements in enhancing lentil crop productivity and ensuring its successful adaptation is optimal P fertilization. As reported in other studies, low soil productivity in cultivated areas is frequently linked to P deficiency. Unlike nitrogen, which can be introduced into the soil through nitrogen-fixing bacteria in the roots of leguminous plants like lentil [
40], atmospheric microbes cannot augment P availability. The incorporation of P in the soil has been noted to enhance the microbial population, biomass and the activity of certain enzymes in the soil, collectively contributing to an increase in soil respiration, which is a desirable soil trait for improving the overall soil productivity [
41]. In the context of lentils, the addition of P in the right doses is essential to achieve the anticipated crop yield [
42]. Vital biological activities such as nodulation, nitrogen fixation and nutrient absorbance from the soil and the environment of the rhizosphere all require P sufficiency [
19]. Adequate P in soil promotes the lateral root length and the total root mass. This is one of the mechanisms that ensures lentil plants’ adaptation to P deficiency conditions since the plants are forced to search for water and nutrients in deeper soil layers [
43].
In this study, the lower P treatment (P0) presented the highest number of unique and total DEGs. The differential expression analysis of genes did not demonstrate any particular clustering pattern in the different P treatments but revealed some groups of DEGs within noted areas in
Figure 5’s heatmap (Elements 1–5) that appear to be influenced by the level of P fertilization. The utilization of GO enrichment analysis and KEGG pathway analysis facilitated a comprehensive examination of these DEGs, and their expression patterns appear to be influenced by the different levels of P fertilization. Through the analysis, the involvement of the DEGs in specific biological processes and molecular functions was elucidated, thereby enhancing the scientific understanding of their functional roles. The genes of interest that were found within the highlighted regions (Elements 1–5) in
Figure 5’s heatmap, primarily fall within biological processes associated with macromolecule metabolism, nitrogen compound metabolism, biological regulation and localization. Their molecular functions encompass enzyme activities such as hydrolases, transferases and oxidoreductases, as well as roles such as transporter proteins and binding-associated functions.
The analysis of DEGs possibly influenced by P levels yielded intriguing findings within more specific GOs, revealing genes closely associated with stress signaling, abiotic and biotic stress resistance, and gene and protein regulation. Terms that evidently support the genes’ connection to stress signaling are “signal transduction” (GO:0007165), “response to biotic stimuli” (GO:0009607), “cellular response to stress” (GO:0033554), and “heat shock protein binding” (GO:0031072). Plant heat shock proteins are widely recognized for their crucial role in enhancing abiotic and biotic stress tolerance, which constitutes an integral part of the plant’s defense mechanism [
44]. According to the bibliography, in low P conditions, plants activate certain signaling and stress-related pathways that include the expression of phosphatases, MAP kinase, protein kinases, purple acid phosphatases (PAPs), S-glutathione transferases and the PHT1 family, which is comprised of phosphate transporters in order to improve P transport and utilization [
45].
The term “pyrophosphatase activity” (GO:0016462) is linked to plant growth, development and stress tolerance. Pyrophosphatases, a class of enzymes, play a vital role in catalyzing the degradation of pyrophosphate. This enzymatic process provides plants with essential energy, particularly in low-oxygen conditions when ATP synthesis is not feasible [
46]. The data exhibited a decrease in the expression of genes linked to “cellular response to stress”, and “pyrophosphatase activity” as the levels of P fertilization increased. This observation implies that lower P levels may serve as a stressor, leading to the activation of stress-related genes.
The term “GTP binding” (GO:0005525) is also pertinent to signaling, as GTP-binding proteins, particularly the Rop GTPase family, participate in the regulation of numerous biological processes by utilizing an on/off mechanism. These processes include signal transduction, cell polarity, cell shape, hormone responses and pathogen defense [
47].
The terms “methylation” (GO:0032259) and “methyltransferase activity” (GO:0008168) are linked to the regulation of gene activity while “phosphorylation” (GO:0016310) is connected to the regulation of protein activity. Notably, DNA methylation, an epigenetic mechanism mediated by methyltransferases, plays a pivotal role in controlling gene expression, thereby influencing processes related to growth, development and stress tolerance in plants [
48]. In accordance with the study of Chu et al., where it was observed that in soybean that low P conditions increased the level of DNA methylation [
49], the results showed that the expression levels of genes that belong to the ”methyltransferase activity” function decreased in higher P treatments. An elevated expression of these genes under low phosphorus (P) conditions may indicate increased methylation levels, which are associated with broader gene regulatory processes. Over the past decade, the concept of “epitranscriptomics” has gained prominence, drawing significant attention from the scientific community. Researchers have increasingly focused on post-translational modifications of RNA, particularly RNA methyl modifications, which exhibit a strong correlation with plant survival and fitness [
50].
Meanwhile, phosphorylation serves as a fundamental reversible mechanism for post-translational modifications that govern protein activity and its interactions with other molecules. This biological process is mediated by kinases that introduce phosphate groups to proteins, while phosphatases control dephosphorylation by removing these phosphate groups. This reversible system functions as a signal transduction mechanism and has been conclusively demonstrated to hold significant importance in the regulation of phytohormones and the enhancement of cold stress tolerance [
51].
Through the analysis, several genes have been uncovered and identified that are associated with GO terms such as “GTPase activity” (GO:0003924), “small GTPase regulator activity” (GO:0005083), “methyltransferase activity” (GO:0008168), “heat shock protein binding” (GO:0031072), “pyrophosphatase activity” (GO:0016462), “kinase activity” (GO:0016301) and “phosphatase activity” (GO:0016791). In the current context of climate change, genes linked to stress signaling and stress tolerance, and in this case genes whose expression correlates with the extent of P fertilization, hold significant potential as promising candidates for further investigation. Such research endeavors may yield valuable outcomes that can be applied to breeding programs and the development of effective cultivation techniques.
Moreover, P fertilization seems to affect the seed coat properties, without causing any anatomic alterations in the seed [
52]. One of the most decisive traits for the market value of lentils is their seeds’ physical properties. Seed coat hardness is considered a limiting factor for farmers and consumers. Hardness is attributed to the pigment and, mainly, tannin (proanthocyanidins) accumulation that happens from the embryo development in the seed right after fertilization [
53].
The results from the differential expression analysis showed that the expression patterns observed in both the overall heatmap and the individual heatmaps demonstrate that the seed maturity stage plays a significant role in influencing gene expression. GO enrichment analysis highlighted that the S1–S3 comparison exhibited, in most cases, the greatest number of upregulated and downregulated genes in GO terms associated with phosphorus metabolic processes. More precisely, this is confirmed in annotations like “phosphorus metabolic process” (GO:0006793) and “phosphate-containing compound metabolic process” (GO:0006796) in all three P treatments. An additional noteworthy observation from the GO analysis, particularly within the annotations related to the “phosphorus metabolic process”, is that the majority of enriched DEGs were found to be in a downregulated state.
Functional analysis of the DEGs by GO analysis revealed that the majority of DEGs related to P metabolic processes mainly consist of downregulated genes in all three P treatments. The influence of P in the downregulated DEGs was observed in the annotations “phosphorus metabolic process”, “phosphate-containing compound metabolic process” and “phosphorylation” as the total number of downregulated DEGs decreased while the level of P fertilization increased. According to studies, lentils’, faba beans’ and peas’ increased yield has been ascribed to higher P fertilization [
54]. On the other hand, P deficiency induces the increase in certain phenolic substances in all the vegetative parts of the lentil plants [
21]. Lentil, like other legumes, has a high demand for P for both seed quality and plant growth [
43], and unpublished data from our team certainly confirms this statement.
Through KEGG pathway analysis it was observed that among the upregulated DEGs, the following pathways exhibited the highest number of enriched DEGs: the spliceosome pathway for the P0 treatment, the protein processing in the endoplasmic reticulum pathway for the P1 treatment, and the ribosome pathway for the P2 treatment. In the case of downregulated DEGs, it was consistently observed that the ribosome pathway had the highest number of DEGs across all treatments.
RNA sequencing results were investigated further to reveal how the P treatment and the seed maturity stage inside the pod influence the expression levels of specific genes in the phenylpropanoid pathway. The selected genes seem to be highly expressed in S1, slightly less expressed in S2 and even less expressed in S3. This indicates that the gene expression decreases as the seed matures, while the expression levels remain unaffected by P treatments. The developmental stage appears to be a crucial factor affecting the gene expression at pivotal points within the pathway. A specific step in the phenylpropanoid pathway is blocked in low-tannin lentils at the stage of formation of anthocyanidins and proanthocyanidins from dihydromyrecitin, whereas a slighter blockage in this step has been found in gray genotypes. Additionally, in low-tannin seeds a lack of dihydroflavonols is observed, which are precursors for anthocyanidin and proanthocyanidin generation [
55]. The FLS and F3H genes’ expression, though, does not follow the same pattern as the rest of the studied genes of the phenylpropanoid pathway. The expression levels of the FLS gene appear to decrease as the P level increases. The F3H expression seems to be influenced by the P level when considering each seed maturity stage separately. The pattern of the P influence, though, is not consistent among the P treatments, which may suggest the existence of additional factors regulating the expression of F3H. Yu et al. reported that two genes in the FLS family, FLS1 and FLS2, are highly expressed in the early stages of seed development, while another gene in the FLS family, FLS3, is highly expressed in the later stages of seed development, suggesting that the FLS genes may play different roles in regulating seed development [
30]. In the study conducted by Jia et al. in tobacco leaves, it was observed that in higher P treatment there was a reduction in the transcript levels of NtCHS (chalcone synthase), NtCHI (chalcone isomerase), NtF3H (flavanone 3-hydroxylase), NtF30H (flavanone 30-hydroxylase) and NtFLS (flavonol synthase), while NtFLS transcript levels were increased in both juvenile and mature leaves in the lower P treatment. Pi (inorganic P) deficiency led to a lack of bHLH transcription factor and negatively affected the key gene for anthocyanidin biosynthesis (NtDFR) [
56]. In Arabidopsis, a mutation in the regulator of the flavonol biosynthesis AtMYB12, which encodes the R2R3-MYB TF and influences the expression of CHS, CHI, F3H and FLS, causes remarkable downregulation of these genes. In maize, the P1 regulator, which encodes the R2R3-MYB TF, induces the expression of these genes in a similar way except for the F3H and genes that are only found in the anthocyanidin branch of the phenylpropanoid pathway [
57,
58]. The bHLH transcription factor and F3′H are identified among the genes occupying pivotal positions in the phenylpropanoid pathway. The expression of the bHLH gene exhibited a significant reduction, particularly in the S3 maturity stage of P0. In general, the results suggest that the maturity stage played a central role in the downregulation of bHLH, with a similar expression pattern observed in the variations of F3′H expression. Plant responses to P alterations often involve changes in gene expression at the post-transcriptional level. In Arabidopsis, rice and soybean for example, the miRNA339 is induced by P stress and regulates phosphate homeostasis by suppressing the activity of the PHO
2 enzyme; additionally, P stresses can activate genes due to alterations in transcription factor expression, such as TF of the bHLH families [
59]. Alterations in F3′H expression in common bean seeds affected the concentration of dihydroquercetin-related compounds and proanthocyanidins [
60]. In tobacco leaves, P treatments in both low and high concentrations affected the regulation of the transcripts involved in the flavonoid pathway, which is a branch of the phenylpropanoid.
KEGG pathway analysis indicated an increase in the upregulated genes in P2 treatment in comparison with P0, while the downregulated genes showcased a decrease only in P2 treatment. On the other hand, the upregulated genes in the flavonoid biosynthesis pathway exhibited a one-gene increase in P2, whereas the downregulated genes rose in both P1 and P2 treatments. The alterations observed in enriched DEGs as a result of changes in P levels do not align with the selected genes within the pathway, except for F3′H and FLS. Consequently, the results of the KEGG analysis may be associated with other genes expressed in the broader phenylpropanoid pathway. In a comparative transcriptomic study conducted on apple seedlings under different P treatments, DEG analysis revealed significant changes in flavonoids and anthocyanidin concentration in the treatments, while the KEGG enrichment analysis showed that the flavonoid and phenylpropanoid biosynthesis pathways were highly enhanced in different P treatments [
61]. In tobacco leaves, P treatments of both low and high concentrations, affected the regulation of the transcripts involved in the flavonoid pathway, which is a branch of the phenylpropanoid [
56]. Excess P (P2) increased the DFR expression, but the seed maturity stage negatively affected the expression since in every P treatment there was a remarkable reduction in the expression in S3 compared to S2. The downregulation of the genes of the phenylpropanoid pathway and the consistent even lower expression in most cases in P1 and P2 treatments suggest that excess P is a limiting factor for proanthocyanidin (tannin) formation.
Ongoing research aims to determine if the transcriptional downregulation of phenylpropanoid gene expression correlates with lower seed tannin accumulation. This may explain the observed variability in lentil seed coat hardness from year to year in the same field, as P availability—affected by soil drying and rewatering fluctuations—influences the genes responsible for tannin accumulation in the phenylpropanoid pathway. Future studies involving a broader range of genotypes will provide clearer insights into the role of P levels in tannin biosynthesis and help elucidate why periodic seed hardness occurs in specific genotypes rather than across all lentil varieties.