Next Article in Journal
Identification and Characterization of LINE and SINE Retrotransposons in the African Hedgehog (Atelerix albiventris, Erinaceidae) and Their Association with 3D Genome Organization and Gene Expression
Previous Article in Journal
Association Analysis of ENPP1 Tissue Expression, Polymorphism, and Growth Traits in Xiangsu Pigs
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Transcriptome Analysis Reveals Key Genes Involved in Fatty Acid and Triacylglycerol Accumulation in Developing Sunflower Seeds

College of Life Science, Shihezi University, Shihezi 832000, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Genes 2025, 16(4), 393; https://doi.org/10.3390/genes16040393
Submission received: 24 February 2025 / Revised: 23 March 2025 / Accepted: 27 March 2025 / Published: 29 March 2025
(This article belongs to the Section Plant Genetics and Genomics)

Abstract

:
Background/Objectives: Sunflower (Helianthus annuus L.) is one of the four major global oilseed crops. Understanding the molecular mechanisms regulating fatty acid synthesis and triacylglycerol (TAG) accumulation is crucial for improving oil yield and quality. In this study, the oilseed sunflower cultivar ‘T302’, which was wild-cultivated in the northwestern region of China, was analyzed for fatty acid content by targeted lipidomic analysis. RNA sequencing (RNA-seq) was performed on 15 cDNA libraries from sunflower embryos at five developmental stages (10, 17, 24, 31, and 38 days after flowering) to investigate gene expression patterns during oil accumulation. Differentially expressed genes (DEGs) related to fatty acid and triacylglycerol accumulation in developing sunflower seeds were identified. WGCNA was used to gain deeper insights into the mechanisms underlying lipid metabolism. Results: The oil composition of ‘T302’ consisted of 86.61% unsaturated fatty acids (UFA), mainly linoleic acid (48.47%) and oleic acid (37.25%). Saturated fatty acids (SFAs) accounted for 13.39%, with palmitic acid (7.46%) and stearic acid (5.04%) being the most abundant. A total of 81,676 unigenes were generated from RNA-seq data, and 91 DEGs associated with lipid metabolism were identified, including key enzymes such as FAD2-1, SAD, FATA, LACS, PDAT2, and DGAT2. In addition, we identified several novel candidate transcription factor genes, including WRI1, LEC1, FUS3, and ABI3, which were found to regulate TAG synthesis during seed maturation and are worthy of further investigation. This study provides valuable insights into the molecular mechanisms of seed oil biosynthesis in oilseed sunflower. The identified key genes and transcription factors provide potential targets for molecular breeding strategies to increase oil content and modify fatty acid compositions in sunflower and other oilseed crops.

1. Introduction

Sunflower (Helianthus annuus L.) is one of the four major oil crops worldwide and plays a significant role in global agriculture. Sunflower seeds contain abundant vegetable oils and proteins that are essential for human nutrition and are widely used in the food sector [1,2]. The fatty acid (FA) composition of traditional sunflower seed oil is characterized by a high concentration of unsaturated fatty acids (UFAs) (approximately 85%), predominantly oleic acid (OA, C18:1; 14–43%) and linoleic acid (LA, C18:2; 44–75%). The remaining 15% consists of saturated FAs, primarily palmitic acid (PA, C16:0) and stearic acid (SA, C18:0) [3]. In recent years, sunflower breeding has resulted in the development of high-quality oils with different OA levels, including medium OA (43.1–71.8%) and high OA (75–90.7%) sunflower hybrids, which contain more OA than traditional sunflower varieties [4]. Edible oils rich in UFAs are healthy for humans, which can lower blood pressure and lipid levels, thus providing protective benefits for cardiovascular and cerebral vessels [5,6]. Therefore, increasing seed oil content and optimizing FA profiles are key objectives in sunflower breeding programs to improve crop quality.
Lipid biosynthesis in plants consists of four interrelated processes: the de novo synthesis of FAs, acyl elongation and modification, triacylglycerol (TAG) formation, and oil droplet production [7,8]. De novo FA biosynthesis takes place in plastids through a series of reactions, including condensation, reduction, and dehydration processes, facilitated by fatty acid synthases (FAS). As a result, free FAs are generated, including PA, SA, and OA [9]. A number of enzymes participate in this process, such as acetyl-CoA carboxylase (ACC), 3-ketoacyl-ACP synthase (KAS), β-ketoacyl-ACP reductase (KAR), β-hydroxyacyl-ACP dehydratase (HAD), and enoyl-ACP reductase (ENR). Additionally, acyl-ACP thioesterases (FATA/B) and stearoyl-ACP desaturase (SAD) play crucial roles in FA modification and release [10,11]. After synthesis, long-chain free FAs (C16→18) are exported from the plastid to the endoplasmic reticulum (ER), where they undergo further elongation, acyl modification, and lipid assembly [12]. At the outer envelope of the plastid, long-chain acyl-CoA synthetases (LACS) convert free FAs into fatty acyl-CoA, which is the essential substrate for glycerolipid assembly [7].
In plants, TAGs are produced in the ER through two main pathways: an acyl-CoA-dependent pathway and an acyl-CoA-independent pathway [13]. The acyl-CoA-dependent pathway, or Kennedy pathway, utilizes acyl-CoA as a substrate to assemble TAG. Several key enzymes are involved in this process, including glycerol-3-phosphate acyltransferase (GPAT), lysophosphatidic acid acyltransferase (LPAAT), phosphatidic acid phosphatase (PAP), and diacylglycerol acyltransferase (DGAT), which work together to acylate the sn-3 position of diacylglycerol (DAG), leading to the formation of TAG [14,15]. In the acyl-CoA-independent pathway, phospholipid: diacylglycerol acyltransferase (PDAT) catalyzes the transfer of an acyl group from phosphatidylcholine (PC) to DAG, leading to the formation of TAG [16]. The relative contributions of DGAT and PDAT to TAG biosynthesis remain unclear. During the final stages of oil synthesis, key oil body-associated proteins, including oleosin, caleosin, and steroleosin, bind to TAG molecules to form oil bodies [17].
In the lipid biosynthetic pathway of plants, the content and catalytic activity of fatty acid desaturases (FADs) are critical determinants of the composition and ratio of UFAs [18]. FADs can be classified into two main groups based on their localization and associated cofactors: soluble desaturases and membrane-bound desaturases. Soluble desaturases, such as SAD, are the only known enzymes that convert stearic acid (18:0-ACP) to oleic acid (18:1-ACP), a key step in UFA synthesis [19]. Membrane-bound desaturases, attached to plastid or ER membranes, consist of seven main types: FAD2, FAD3, FAD4, FAD5, FAD6, FAD7, and FAD8 [12]. The ω-6 desaturases, FAD2 and FAD6, add double bonds at the ω-6 (Δ-12) position, converting OA into LA. On the other hand, ω-3 desaturases, including FAD3, FAD7, and FAD8, introduce double bonds at the ω-3 (Δ-15) position, converting LA into linolenic acid (ALA, 18:3) [15].
In addition, several transcription factors (TFs), including WRI1 (WRINKLED1), LEC1 (Leafy cotyledon 1), LEC2, ABI3 (abscisic acid insensitive 3), and FUS3 (FUSCA3), play key roles in regulating embryo development [20]. WRI1, a TF of the APETALA2/ethylene-responsive element binding factor (AP2/ERF) [21], plays a crucial role in regulating FA synthesis. In Arabidopsis thaliana, AtWRI1 controls oil accumulation by regulating the carbon flux from sugars to oils [22,23]. LEC1, a member of the NF-YB family, contributes to FA biosynthesis through synergistic interactions with TFs such as ABI3 and WRI1 [24]. FUS3 activates the expression of storage proteins, including oleosin, and lipid synthesis genes, by physically cooperatively with LEC1 and LEC2, which belong to the B3 family, in a cooperative manner [25]. Alterations such as mutations, overexpression, or heterologous expression of these TFs greatly influence seed formation, FA synthesis, and oil content [26,27,28].
In recent years, significant progress has been made in the improvement of the quality of sunflower oil. An integrated quantitative genetics approach has established a gene network for sunflower oil metabolism that includes 429 genes linked to 125 reactions in 12 pathways. Notably, 46 oil-related genes were identified within 32 genomic regions, facilitating the discovery of candidate genes for enhancing sunflower oil quality [29]. Seven quantitative trait loci (QTLs) related to OA, total oil, and LA content were identified, explaining over 10% of the phenotypic variation in sunflower traits [30]. In addition, genetic variation related to FA content has been identified in Russian sunflower collections through genome-wide association studies (GWAS) combined with high-throughput lipidome phenotyping [31]. Transcriptome studies on UFA biosynthesis suggest that OA synthesis plays a dominant role in UFA biosynthetic pathways in seed embryos [32].
Despite advancements in sunflower oil metabolism, several gaps remain in current research. Seed oil yield and FA profile are complex traits influenced by genetic, physiological, biochemical, and environmental factors. However, the precise molecular mechanisms underlying TAG accumulation and unsaturated FA biosynthesis in oilseed sunflower are still poorly understood. Additionally, the transcriptional regulatory mechanisms of these processes remain unclear, limiting the potential for optimizing sunflower oil production and quality through genetic engineering. Therefore, the study of the total gene expression of RNA transcripts in sunflower seeds is crucial for advancing our understanding of the molecular mechanisms involved in FA biosynthesis. In this study, seeds of sunflower cultivar ‘T302’ within five developmental stages were used for transcriptome analysis by Illumina sequencing technology. The key genes and transcription factors involved in FA biosynthesis were identified. The results will increase our knowledge of the molecular mechanisms of FA synthesis and oil accumulation in sunflower seeds and provide a basis for future advances in sunflower breeding and metabolic engineering.

2. Materials and Methods

2.1. Plant Materials

The hybrid oilseed sunflower cultivar ‘T302’, which is widely cultivated in the northwestern region of China, was used as the plant material. This culture is curated by the College of Life Sciences, Shihezi University, Xinjiang, China. The seeds were grown in May 2020 in the experimental field of Shihezi (44°18′ N, 86°00′ E), on soils composed mainly of gray desert soil and irrigated soil, with summer daylight lasting 12–14 h and regular irrigation. Nylon mesh bags were used to cover the flower heads before flowering to ensure that the seeds obtained were self-pollinated. Embryo samples were collected from 10 days after anthesis (DAF), three flower heads were sampled every 7 days until 38 DAF, and seeds were collected from the outer three whorls of each flower head. At each developmental stage, the seed coats were carefully removed from the embryos, with a total of 20 embryos collected per plant, immediately frozen in liquid nitrogen, and stored at −80 °C.

2.2. Lipid Extraction and Targeted Lipidomic Analysis

Forty fatty acid methyl ester (FAME) standards were prepared in ten concentration gradients (0.5–1000 mg/L), with each gradient representing the total concentration of all components. In the 40 FAME standards, the concentration of each component was either 2% or 4% of the total, with 30 components at 2% and 10 components at 4%. For sample analysis, 500 μL of the FAME standard was combined with 25 μL of methyl nonadecanoate (500 ppm) as an internal standard. The mixture was injected (1.0 μL) into a Gas Chromatography–Mass Spectrometry (GC-MS) system with a 10:1 split ratio.
Mature ‘T302’ sunflower seeds (50 mg) were placed in 2.0 mL glass centrifuge tubes, and 1.0 mL chloroform-methanol (2:1, v/v) was added and sonicated for 30 min. The supernatant was collected and methylated by adding 2.0 mL of a 1% sulfuric acid–methanol solution, then incubated at 80 °C for 30 min. After cooling, 1.0 mL of hexane was added to extract the lipids, and the organic phase was washed with 5.0 mL of deionized water. A 500 μL portion of the supernatant was mixed with 25 μL of methyl salicylate (internal standard) and transferred to a vial. Finally, 1.0 μL of the sample was injected into the GC-MS system with a 10:1 split ratio.
GC-MS analysis was performed by Shanghai Zhongke Xinlife Biotechnology Co., Ltd. (Shanghai, China) using an Agilent 7890/5975C GC-MS system. The samples were analyzed using an Agilent DB-WAX capillary column (30 m × 0.25 mm ID × 0.25 μm) in a gas chromatography system. The initial temperature was set at 50 °C and held for 3 min, then ramped to 220 C at a rate of 10 °C/min and held for 5 min. Helium was used as the carrier gas at a flow rate of 1.0 mL/min. A quality control (QC) sample was analyzed periodically to evaluate system stability and reproducibility. The injector temperature was set at 280 °C and electron ionization (EI) was performed at 70 eV. Data were acquired using MSD ChemStation software (version E.02.00 Keysight Technologies) to measure peak areas and retention times. A standard curve was constructed for the quantification of long chain fatty acids (FAs), and three biological replicates were analyzed to determine the FA profile of sunflower cultivar ‘T302’.

2.3. RNA Extraction, cDNA Library Construction, and Transcriptome Sequencing

RNA extraction, cDNA library construction, and sequencing were carried out by Shanghai Meiji Biomedical Technology Co., Ltd (No. 3, Lane 3399, Kangxin Road, Pudong New Area, Shanghai, China). Total RNA was isolated from sunflower embryos with TRIzol reagent (Invitrogen, Carlsbad, CA, USA) and then treated with DNase I to eliminate genomic DNA contamination. Five developmental stages of embryos were selected for RNA sequencing (RNA-seq): 10, 17, 24, 31, and 38 DAF, designated S1 (stage 1), S2 (stage 2), S3 (stage 3), S4 (stage 4), and S5 (stage 5), respectively. RNA samples were sequenced on the NovaSeq 6000 Sequencing System (Illumina), with three independent biological replicates.
Raw data were cleaned by removing reads containing adapters, poly-N sequences, and low-quality reads. Analysis of the cleaned data was performed using HISAT2 (v2.0.5) [33] and StringTie [34]. Sequences were aligned to the sunflower genome [29] (https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_002127325.2/, accessed on 12 November 2022) using the HISAT2 transcript program. The raw sequencing data have been deposited in the Sequence Read Archive (SRA) under the NCBI BioProject, with accession number PRJNA1220589.

2.4. Sequence Alignment and Functional Annotation

The clean reads, obtained after quality control, were aligned to the H. annuus reference genome (https://www.sunflowergenome.org/annotations-data/ accessed on 22 March 2025). The mapped data were used for subsequent transcript assembly, expression quantification, and alignment against six databases: NR (NCBI non-redundant protein sequences), Pfam (protein family), Swiss-Prot (annotated and reviewed protein sequence database), COG (Clusters of Orthologous Groups of Proteins), GO (Gene Ontology), and KEGG (Kyoto Encyclopedia of Genes and Genomes). Unigenes identified by BLASTx (version 2.9.0) (e-value < 1 × 10−5) were annotated using BLAST2GO to describe biological processes (BP), cellular components (CC), and molecular functions (MF).

2.5. Differentially Expressed Genes (DEGs) Analysis

Clean reads were aligned to assembled unigenes using Bowtie2 (version 2.4.1) [35], and gene expression was quantified in FPKM (fragments per million reads per mapped kilobase). DEGs were identified using DESeq2 (version 1.24.0) [36] with an adjusted p-value < 0.05 and |log2FC| ≥ 1 as thresholds. GO and KEGG enrichment analyses were performed on DEGs, with FDR ≤ 0.001 considered significant. TFs were predicted by PlantTFDB 5.0 (http://planttfdb.gao-lab.org/ accessed on 22 March 2025).

2.6. Co-Expression Network Construction

Gene co-expression networks were constructed using WGCNA with hybrid network types (https://horvath.genetics.ucla.edu/html/CoexpressionNetwork/Rpackages/WGCNA/ accessed on 22 March 2025). Module eigengene (ME), representing the first principal component of each module, was used to evaluate the correlation with oil content by calculating the values for each module.

2.7. Quantitative Reverse Transcriptase Polymerase Chain Reaction (RT-qPCR) Analysis

RT-qPCR was used to validate gene expression changes at three developmental stages: 10, 24, and 38 DAF. Five micrograms of total RNA was synthesized into cDNA using PrimeScript RT reagents (Kusatsu City, Japan, Takara Bio Inc.). Gene expression analysis was performed using SYBR Green I (CWBIO) on a LightCycler 480 system with three biological replicates. The sunflower 18S rRNA gene (AF1057577) was used as an internal control [37]. The specific primer sequences used are listed in Table S1. Relative gene expression levels were calculated using the 2−ΔΔCT method [38].

2.8. Statistical Analysis

Statistical analysis was performed using SPSS 25.0 software. Data are presented as mean ± standard deviation (SD). One-way ANOVA followed by Tukey’s post hoc test was used to evaluate differences between groups, with a significance threshold of α = 0.05. Groups with different letters (e.g., ‘a’ and ‘b’) indicate statistically significant differences (p < 0.05). Before analysis, normality was assessed using the Shapiro–Wilk test and homogeneity of variance was confirmed using Levene’s test.

3. Results

3.1. Fatty Acid Composition in Mature Sunflower Seeds

The oil composition of sunflower cultivar ‘T302’ showed a high percentage of UFAs, accounting for 86.61% of the total oil (Table 1). UFAs were further divided into monounsaturated fatty acids (MUFAs) and polyunsaturated fatty acids (PUFAs). LA was the predominant PUFA (48.47%), and OA was the predominant MUFA (37.25%). Small amounts of several UFAs were also present. Saturated fatty acids (SFAs) were present at 13.39%, with PA (7.46%) and SA (5.04%) being the most abundant SFAs. In addition, trace amounts of nine other SFAs were detected.

3.2. Illumina Sequencing and De Novo Assembly

A total of 105.15 GB of raw paired-end data were obtained from the transcriptome sequencing. Quality control analysis revealed a low error rate of 0.02%, with over 97% of reads achieving Q20 and over 92% achieving Q30, along with an average GC content of 47.36%. After filtering out low-quality reads, adapter contamination, and sequences with a high number of unknown bases (N), a total of 81,676 unique transcripts were obtained (Table S2), with clean reads accounting for over 81.65% of the total. Among these, 27,796 unigenes were longer than 1800 base pairs (bp) (Figure S1).

3.3. Gene Functional Annotation

The coding sequences (CDS) and corresponding amino acid sequences of 81,676 non-redundant unigenes were annotated. BLAST alignments (E-value ≤ 10−5) were performed against six public databases: NR, GO, KEGG, COG, Pfam, and Swiss-Prot. The results showed that 12,361 unigenes (20.50%) were annotated in the GO database; 16,022 (26.58%) in KEGG; 40,205 (66.69%) in COG; 43,843 (72.72%) in NR; 31,691 (52.57%) in Swiss-Prot; and 36,749 (60.96%) in Pfam (Figure S2).
To assess the functional roles of the unigenes, we performed a classification using the COG database. A total of 63,478 unigenes were classified into 23 functional categories. Excluding the category “[S] Function unknown”, the largest group was “[L] Replication, recombination, and repair” (10,081 unigenes, 15.88%), followed by “[O] Post-translational modification, protein turnover, chaperones” (2951 unigenes, 4.65%). Notably, 795 unigenes (1.25%) were categorized under “[I] Lipid transport and metabolism”, suggesting that these candidate unigenes may be involved in lipid metabolism in sunflower seeds (Figure S3).
To investigate their functions, a total of 34,781 unigenes were annotated in the GO database. They were grouped into three main categories: molecular functions, biological processes, and cellular components (Figure S4), and further classified into 11, 15, and 14 subcategories, respectively. In the molecular functions category (13,887 unigenes), the most prominent subcategory was “binding” (5749 unigenes, 41.40%), followed by “catalytic activity” (6352 unigenes, 45.74%). In the biological process category (17,897 unigenes), “metabolic process” (6231 unigenes, 34.82%) and “cellular process” (5740 unigenes, 32.07%) were the dominant subcategories. Within the cellular component category (16,884 unigenes), the most abundant subcategory was “cell part” (5288 unigenes, 31.32%). These results suggest that metabolic and enzymatic activities play an important role during sunflower seed development.
To further explore the potential pathways of these unigenes, a KEGG pathway analysis was performed (Figure 1). A total of 15,550 unigenes were assigned to 20 subcategories in six KEGG categories, corresponding to 136 KEGG pathways. “Metabolism” was the largest category with 5922 unigenes, followed by “Genetic Information Processing” (3669 unigenes). The most prominent subcategories were “carbohydrate metabolism” (1419 unigenes) and “translation” (1253 unigenes). Notably, 751 unigenes were annotated under “lipid metabolism”, which was similar to the COG results of 795 unigenes, providing valuable insights into fatty acid and oil accumulation during sunflower seed development. The 20 pathways were grouped into six major categories, with three key pathways standing out: “replication and repair” (2859 unigenes), “carbohydrate metabolism” (2017 unigenes), and “translation” (1343 unigenes). Of the total 15,550 unigenes, approximately 897 were involved in 14 key pathways of lipid metabolism. Among these, “glycerophospholipid metabolism” contained the largest number of unigenes, followed by “α-linolenic acid metabolism”, “FA degradation”, and “FA biosynthesis”. Other lipid metabolism-related pathways included “LA metabolism”; “FA elongation”; “UFA biosynthesis”; and “keratin, chondroitin, and wax biosynthesis”. These findings highlight unigenes that are crucial for identifying key genes involved in FA and TAG biosynthesis in sunflower seeds, providing a basis for further investigation of lipid accumulation processes.

3.4. Analysis of Differentially Expressed Genes in Developing Sunflower Seeds

A total of 81,676 unigenes were identified from the transcriptome data. Principal component analysis (PCA) of the 15 samples based on RNA-seq FPKM values showed that two principal components explained 71.74% of the total variance (59.63% for PC1 and 12.11% for PC2). The samples were clearly clustered into five groups, each corresponding to a specific developmental stage, demonstrating consistency among the three biological replicates collected for each stage (Figure 2A). The number of genes expressed at each stage was determined based on gene expression levels (Figure 2B). A total of 23,396 DEGs were identified by pairwise comparisons of expression levels between samples at each time point (Table S3). Using the samples at 10, 17, and 24 DAF as controls, the number of DEGs identified at different time points can be observed in Figure 2C. The highest number of DEGs was observed at 10 DAF vs. 38 DAF, with 7425 up- and 9621 down-regulated unigenes. The second largest difference occurred at 17 DAF vs. 38 DAF with 6550 up- and 8379 down-regulated unigenes (p-value < 0.05).
To explore the DEGs related to FA synthesis and triacylglycerol accumulation during sunflower seed development, KEGG and GO enrichment analyses were performed on the 23,396 DEGs, and 522 DEGs related to lipid metabolism were identified. These genes participated in various pathways, such as glycerolipid metabolism, glycerophospholipid metabolism, sphingolipid metabolism, FA biosynthesis, elongation, degradation, and UFA biosynthesis, including α-linolenic acid, arachidonic acid, and LA metabolism (Table S4; Figure S5).

3.5. Weighted Gene Co-Expression Network Analysis

To gain deeper insights into the mechanisms underlying lipid metabolism in sunflower, we analyzed the trends in DEGs. The transcriptomic changes were evaluated using weighted gene co-expression network analysis (WGCNA), which revealed six distinct gene modules (Figure 3A). For each module, the number of DEGs and the corresponding KEGG pathways were characterized. KEGG enrichment analysis identified 282 DEGs associated with lipid metabolism by WGCNA (Table S5).
The turquoise-colored module, the largest identified, contained 8280 DEGs with peak expression at 38 DAF. This module included genes involved in ribosome function, RNA transport, glycerophospholipid metabolism, glycolysis, and glycerolipid metabolism. The second largest module, colored blue, contained 4241 DEGs with peak expression at 10 DAF. Genes in this module were associated with glycolysis, amino and nucleotide sugar metabolism, starch and sucrose metabolism, and phytohormone signaling. The brown-colored module contained 607 DEGs that were up-regulated at 31 DAF and down-regulated at 38 DAF. These genes are related to glycerophospholipid metabolism, phytohormone signaling, and plant–pathogen interactions. The yellow-colored module, consisting of 586 DEGs, showed peak expression at 10 DAF. This module included genes involved in ribosome function, glycolysis, protein processing in the ER, and purine metabolism. The gray-colored module contained 534 DEGs with peak expression at 31 DAF. These genes are involved in phytohormone signaling, the MAPK pathway, plant–pathogen interactions, and cysteine and methionine metabolism. The smallest module, the green-colored module, contained 469 DEGs that were up-regulated from 17 to 31 DAF. These genes are associated with phenylpropanoid biosynthesis, FA elongation, plant–pathogen interactions, glycerolipid metabolism, glycolysis, and FA degradation.
To explore the relationship between module gene expression patterns and physiological traits, WGCNA was performed (Figure 3B). The accumulation rates of PA and SA were strongly correlated with the blue- and yellow-colored modules, respectively, while OA content showed a significant association with the turquoise-colored module. This module was also closely related to seed weight and oil content, while LA content showed a predominant correlation with the yellow-colored module. Together with KEGG pathway and module–trait correlation analyses, these results suggest that the turquoise-, blue-, and yellow-colored modules play key roles in FA biosynthesis.

3.6. Identification of Genes Involved in Lipid Biosynthesis

In this study, we identified 138 DEGs related to FA and TAG biosynthesis from a total of 522 lipid metabolism-related DEGs, as determined by GO and KEGG analysis. These genes are involved in processes such as FA initiation, elongation, desaturation, thioesterase activity, acyl-CoA activation, Kennedy pathway, TAG synthesis, phospholipase activity, TAG lipase activity, and oil body formation (Table S6). Subsequently, by integrating 282 lipid metabolism-related DEGs identified by WGCNA, we finally identified 91 critical DEGs related to FA and TAG biosynthesis and oil bodies (Table 2). A model of lipid biosynthesis in the sunflower embryo is shown in Figure 4.

3.6.1. Genes Involved in Fatty Acid Biosynthesis in Plastids

FA biosynthesis in plants begins in the plastids, where pyruvate is converted to acetyl-CoA, the primary substrate for FA synthesis. Acetyl-CoA carboxylase (ACC) catalyzes the carboxylation of acetyl-CoA to malonyl-CoA, the first rate-limiting step in the FA biosynthetic pathway. ACC is a multi-subunit enzyme consisting of biotin carboxyl carrier protein (BCCP), biotin carboxylase (BC), α-carboxyltransferase (α-CT), and β-carboxyltransferase (β-CT). Our analysis showed that BC, BCCP, and α-CT were predominantly highly expressed during the early to mid-stages of sunflower embryo development, with expression levels decreasing in the late stages of embryo development (Figure 4 and Figure 5A; Table 2).
Plastid FA synthesis involves three KAS enzymes, each with a different role. KASIII initiates the condensation reaction by combining acetyl-CoA and malonyl-ACP to form 4:0-ACP. KASI elongates the carbon chain from six to sixteen carbons, while KASII continues the elongation, converting 16:0-ACP to 18:0-ACP [39]. Each enzyme is specialized for specific chain length ranges during elongation. Our data indicate that KASIII was highly expressed during early seed development (10–17 DAF) and subsequently down-regulated (Figure 4 and Figure 5B; Table 2). The expression patterns of KASI and KASII were similar to that of KASIII, with high expression observed during early to mid-seed development (10–24 DAF), followed by down-regulation at later stages of seed development (31–38 DAF).
FATA and FATB are two different thioesterases with different functions: FATA primarily releases UFAs (e.g., C18:1) from ACP, whereas FATB terminates the biosynthesis of saturated or medium-chain FAs (e.g., C16:0) during lipid production [11,40]. In this study, we found that the expression of FATA was significantly higher (by approximately 10-fold) compared to FATB across all five developmental stages (Figure 5C), which likely promotes the accumulation of OA.
In addition, LACS catalyzes the formation of acyl-CoA from free FAs. A total of 10 LACS isoforms were identified, where LACS7 (Ug10240) showed high expression at 31–38 DAF, LACS9 (Ug46285) showed higher expression levels at the early stage (10–24 DAF), and LACS7 (Ug10240) was up-regulated at the late stage (31–38 DAF) (Figure 5D, Table 2). The differential expression patterns of LACS suggest that the LACS gene family in sunflower may catalyze different substrates, regulate acyl-CoA synthesis, and facilitate the translocation of FAs from the plastid to the cytoplasm or ER.

3.6.2. Genes Related to TAG and Oil Bodied in the Endoplasmic Reticulum

Diacylglycerol Acyltransferase (DGAT) catalyzes the final and key step in the Kennedy pathway of TAG synthesis. Based on their function and subcellular localization, DGAT enzymes are grouped into four subfamilies: DGAT1, DGAT2, DGAT3, and wax ester synthase (WS)/DGAT [41]. DGAT1 and DGAT2 are membrane-bound enzymes normally located in the ER, whereas DGAT3 is a soluble enzyme involved in a cytosolic pathway for TAG biosynthesis [42,43]. In this study, we identified three DGAT isoforms: DGAT1, DGAT2, and DGAT3. Among them, DGAT2 (Ug63054) and DGAT3 (Ug21368) showed higher expression levels at 38 DAF, while the other DGAT2 (Ug69246) showed increased expression during the 10–24 DAF period (Table 2). Another enzyme, PDAT, catalyzes the acyl-CoA-independent synthesis of TAG. In Arabidopsis seeds, PDAT partially compensates for DGAT activity. Our results indicated that PDAT2 (Ug64137) was expressed at significantly higher levels than DGAT1, DGAT2, and DGAT3, with peak expression observed at 10–24 DAF (Figure 5E), implying that PDAT activity may play a more critical role in the early stages.
In mature seeds, TAG is stored in specialized organelles called oil bodies. Oleosin (OLE), the predominant membrane protein in oil bodies, is essential for maintaining their shape and size by preventing fusion, thus ensuring their stability during seed development [44,45]. Caleosin (CLO), on the other hand, is involved in the formation and metabolism of oil bodies [46]. In this study, seven differentially expressed OLE genes were detected, five of which (Ug67185, Ug63945, Ug29184, Ug31436, and Ug77732) showed high expression levels during sunflower seed development (FPKM > 1000, with a maximum of 14,402.46). In addition, one CLO-encoding gene (Ug391) was consistently highly expressed throughout seed development (FPKM >1000 at 10 DAF, and FPKM >14,000 at 24 DAF) (Table 2).

3.6.3. Genes Involved in Unsaturated Fatty Acid Synthesis

The composition of mature sunflower embryos in this study shows that OA accounts for 54.72%, while LA accounts for 35.99%. The activity of FAD enzymes is critical in regulating OA levels. In addition, SAD is crucial for the de novo production of UFAs in plants [47]. Together, these enzymes modulate the proportion of UFAs in plant lipid biosynthesis. Transcriptomic analysis identified 19 FAD genes from sunflower RNA-seq data, including 13 FAD2, 2 SAD, 3 FAD7/8, and 1 FAD3 (Figure 6A). Phylogenetic classification revealed distinct subfamily distributions: Δ9 desaturase (two members), ω-6/Δ12 desaturase (thirteen members), and ω-3/Δ15 desaturase (four members) (Figure 6B). Notably, two SAD genes (Ug3372 and Ug48724) exhibited significant transcriptional activity between 10 and 24 DAF, with the Ug3372 transcript showing particularly high expression levels (FPKM > 1000 at 10 DAF and FPKM > 500 at 17 DAF) (Table 2; Figure 5G). This period coincides with the accelerated phase of OA biosynthesis.
Thirteen FAD2 unigenes were identified in the sunflower RNA-seq data, with the FAD2-1 (Ug64287) showing significantly higher expression levels between 10 and 24 DAF (FPKM > 1000 at both 10 DAF and 24 DAF) compared to the other FAD2 genes (Figure 5G; Table 2). This suggests that FAD2-1 may play a key role in the biosynthesis of LA in sunflower. Previous studies have indicated that the low ALA content in sunflower seeds is closely related to the transcriptional regulation of Δ15-FAD during seed development [48]. In our study, the expression of FAD3 (Ug30216) was significantly lower than that of FAD2-1 during sunflower seed development (Figure 5E and Figure 6A,C). Since FAD3 encodes the Δ15-FAD responsible for the conversion of LA to ALA, the reduced expression of FAD3 likely limits the conversion of LA, thereby contributing to the low ALA content observed in sunflower seeds. In addition, FAD2-1 is involved in the conversion of OA to LA, and its higher expression compared to FAD3 may further influence FA composition by increasing LA levels and limiting the accumulation of ALA.

3.7. Transcription Factors Related to Lipid Synthesis

TFs are crucial for regulating multiple biological processes throughout plant growth and development. In this study, 1031 DEGs belonging to 46 different TF families were identified as TFs. Among them, nineteen AP2, forty B3, seventy-six bHLH, sixty-eight bZIP, ninety-six ERF, sixty-nine MYB, forty-eight NAC, six NF-YA, one NF-YB, and eighteen TCP members were potentially associated with lipid biosynthesis (Figure 7A; Table S7). Among them, eight TFs were identified as key regulators of seed development and oil accumulation, including LEC1, LEC2, FUS3, ABI3, and WRI1. The transcriptional regulatory network of oil biosynthesis in sunflower seeds is shown in Figure 7B. The LEC2 gene, a B3 domain TF essential for embryo development [49], was identified, but its expression (Ug68124 and Ug68126) was nearly undetectable throughout sunflower seed development (Table 3; Figure 7C), suggesting that LEC2 may have a minimal effect on seed oil accumulation.
The expression of LEC1 (Ug73129), an NF-YB family member that regulates FA biosynthesis, peaked during early to mid-seed development (10–24 DAF) and declined at later stages (31–38 DAF), mirroring the trend of FA accumulation. Similarly, the AP2 TF WRI1 (Ug63453), a key regulator of glycolysis and FA metabolism, was up-regulated during 10–24 DAF, coinciding with rapid seed oil accumulation. The B3 TFs ABI3 and FUS3 were also identified, with FUS3 (Ug67251) showing significantly higher expression during 10–31 DAF. Notably, FUS3 exhibited much higher expression than LEC1, LEC2, WRI1, and ABI3 (Figure 5H), suggesting its critical role in sunflower seed development and TAG synthesis.
In addition to the well-known TFs regulating TAG biosynthesis, several other TFs, including MYBs, ERFs, bZIPs, bHLHs, WRKYs, and TCPs, showed high expression during sunflower seed development (Figure 7C). This suggests their potential involvement in FA biosynthesis.

3.8. qPCR Analysis of DEGs Involved in Lipid Metabolism

To confirm the expression patterns at different developmental stages revealed by the transcriptomic analysis, RT-qPCR was performed on 16 DEGs involved in lipid metabolic pathways. These genes included those involved in FA biosynthesis (PDHA, ACC, KASI, FATA, FATB, SAD, LACS, FAD2), TAG assembly (GPAT, LPAAT, DGAT, PDAT), and lipid storage regulation (OLE). RNA samples were collected from sunflower seeds at three critical developmental stages: early embryogenesis (10 DAF), mid-maturation (24 DAF), and late maturation (38 DAF). The RT-qPCR results for these genes closely matched the RNA-seq data (Figure 8), confirming the accuracy of the transcriptome analysis.

4. Discussion

4.1. Role of Key De Novo Biosynthetic Enzymes in the Regulation of Fatty Acid Composition in Sunflower Seeds

During early seed development (10–17 DAF), several genes involved in de novo FA biosynthesis were found to have significantly up-regulated expression, including the plastidial FAS complex—ACC, ENR, MCAT, HAD, FATA, KASI, KASⅡ, and KAS III (Table 2). The ACC enzyme initiates this process by catalyzing the ATP-dependent conversion of acetyl-CoA to malonyl-CoA, a rate-limiting step regulated by biotinylation [50]. The enzymes KASI, KASII, and KASIII are involved in acyl chain elongation, whereas HAD stabilizes acyl-ACP intermediates to ensure efficient FA biosynthesis. Overexpression of JcKASII from Jatropha curcas in Arabidopsis caused a decline in 16-carbon FAs and a rise in 18-carbon FAs in both leaves and seeds [51]. Similarly, overexpression of JcKASIII in Arabidopsis resulted in higher PA content and an increased proportion of C16-C18 FAs in seeds [52]. Overall, these findings indicate that both JcKASII and JcKASIII play critical roles in regulating FA composition, with JcKASII primarily affecting the balance between C16 and C18 FAs, whereas JcKASIII promotes the accumulation of PA in seeds. Our data showed that the expression of KASI, KASII, and KASIII was significantly higher during the early stages of sunflower seed development, suggesting their important roles in the early accumulation of FAs in sunflower seeds. However, their functions and mechanisms require further investigation.
FATA encodes fatty acyl-ACP thioesterase A, which primarily catalyzes the substrate C18:1-ACP, whereas FATB has a higher catalytic activity towards saturated acyl-ACPs [11,37]. In Arabidopsis fata2 mutants, except for a slight decrease in C24:0 and no change in C22:0 levels, all other FAs increased by 10% to 60% [53]. In Arabidopsis double mutants in which both FATA gene copies were knocked out, the TAG content of the seeds was reduced, while the percentage of ALA and erucic acid in the total FA composition increased [54]. In this study, the sunflower FATA gene was highly expressed throughout seed development (Figure 5C), while the FATB gene showed lower expression levels. This suggests that sunflower FATA may play a crucial role in de novo FA biosynthesis, contributing to the increased production of OA.

4.2. Role of FAD and SAD Enzymes in Regulating Sunflower Seed Fatty Acid Desaturation and Composition

In plant lipid metabolism, the synthesis of PUFAs mainly occurs in the plastids and the ER. In plastids, two desaturase systems catalyze sequential reactions: FAD6 introduces a double bond at the Δ12 position of OA, converting it to LA, while FAD7/FAD8 further desaturates LA at the Δ15 position to produce α-ALA [18]. Both of these reactions use ferredoxin as the primary electron donor [55]. In the ER, both FAD2 and FAD3 catalyze the desaturation of OA to LA, and then further to ALA, with cytochrome b5 as the primary electron donor [56]. Regulation of FAD activity significantly affects the accumulation of OA, LA, and ALA in the seeds of several plant species, such as Camelina sativa [57], Glycine max [58], Paeonia section [59], Zanthoxylum bungeanum [60], Linum usitatissimum [61], and Eucommia ulmoides [62]. In this study, 13 FAD2 genes were determined to be active in the transcriptome of the sunflower seed, with FAD2-1 (Ug64287) returning significantly higher levels of expression than the other identified FAD2 genes. This suggests that FAD2-1 is the major contributor to FAD2 enzyme activity in sunflower seeds to highlight its critical role in converting OA to LA. In addition, the expression levels of sunflower FAD3 and FAD7/FAD8 were extremely low during seed development and FA accumulation, which probably explains the minimal content of ALA in mature sunflower seeds.
SAD enzymes are responsible for the initial desaturation step, converting SFAs to the MUFA (C18:1), which can be further converted into PUFAs. Therefore, SADs are crucial for the regulation of seed oil content and composition. In Arabidopsis, seven candidate SAD genes were identified, four of which (AAD1, AAD5, AAD6, and FAB2) showed expression patterns related to the accumulation of OA and its derivatives during seed development [63]. In this study, two SAD genes were expressed in developing sunflower seeds, with one gene (Ug3372) showing particularly high expression between 10 and 24 DAF (Table 2; Figure 5G), suggesting its key role in UFA biosynthesis. Given the established role of SAD genes in converting SFAs to MUFAs [48,64,65], it is reasonable to propose that the increased expression of SAD genes could enhance enzymatic activity, thereby facilitating a greater accumulation of MUFAs.

4.3. Key Enzymes and Transcription Factors Regulating Lipid Accumulation in Sunflower Seed

In the transcriptomic data of developing sunflower seeds, we identified key genetic factors that regulate TAG biosynthesis, shedding light on stage-specific regulatory mechanisms that control lipid accumulation. In particular, genes such as LACS7, LPCAT, DGAT2, and PDAT2 appear to play critical roles in TAG synthesis. LACS catalyzes the activation of FAs to form acyl-CoA, a key precursor in TAG synthesis. LPCAT facilitates the acylation of lysophosphatidylcholine to phosphatidylcholine, a critical step in the formation of DAG. DGAT is recognized as a critical enzyme in the conversion of DAG to TAG; therefore, it is thought to be the rate-limiting enzyme in the accumulation of storage lipids in plants [66]. Several studies have shown that DGAT is critical in regulating both the amount and type of FAs incorporated into storage TAGs [67,68,69]. As a result, genes encoding DGAT-like proteins have become key targets in biotechnological strategies aimed at improving oil content and FA profiles in oilseed crops.
The DGAT enzyme family consists of three major isoforms (DGAT1, DGAT2, DGAT3) with different functions in plant lipid metabolism. In most plants, DGAT1 is more extensively involved in lipid synthesis, whereas DGAT2 primarily regulates the accumulation of specialized FAs. However, their functions are not mutually exclusive [70]. DGAT2 acts upstream of the DGAT1 and influences the synthesis and storage of TAG [71]. Heterologous expression of Neurospora crassa DGAT2 with a shorter form (S-NcDGAT2) in maize increased oil content, altered FA composition, increased OA content, and decreased LA content [72]. In our study, DGAT1 expression remained low throughout the development of sunflower seeds. In contrast, DGAT2 (Ug69246) showed higher expression during the early and middle stages (10–24 DAF), while DGAT2 (Ug63054) was more highly expressed during the later stages (31–38 DAF) (Table 2). These expression patterns suggest that DGAT2 may play a more critical role in oil accumulation than DGAT1. However, further studies are needed to fully understand the functional differences between these two genes.
The PDAT enzyme in plants catalyzes the transfer of acyl groups from phospholipids to DAG, providing an alternative pathway for TAG biosynthesis. This pathway complements the conventional DGAT-mediated pathway and plays a critical role in lipid remodeling and storage, especially under stress conditions or during seed development [73,74,75]. DGAT1 and PDAT1 have overlapping functions in TAG biosynthesis in Arabidopsis, and both enzymes are crucial for the proper development of pollen and seeds [76]. However, PDAT might have a more significant function in other plant species. In seeds that produce highly unsaturated oils, such as sunflower, safflower [77], flax [78], and cotton [79], the capacity of PDAT to use phosphatidylcholine, a substrate involved in desaturation, as a direct precursor for TAG synthesis is particularly beneficial. In this study, sunflower PDAT2 exhibited significantly higher expression than the DGAT isoforms, with the highest expression observed during early and mid-stages (10–24 DAF), which corresponds to the period of rapid TAG accumulation in seeds. This suggests that the PDAT-mediated pathway of TAG synthesis plays a critical role in lipid accumulation in sunflower seeds.
In addition, several regulatory factors, including WRI1, LEC1, ABI3, and FUS3, were identified as up-regulated during the early and mid-stages (10–24 DAF), which coincides with the rapid accumulation of oil in sunflower seeds, suggesting their important roles in regulating lipid biosynthesis and storage. WRI1 is known to activate genes involved in FA and TAG biosynthesis, while LEC1, ABI3, and FUS3 are key regulators of seed development and lipid accumulation, controlling genes responsible for lipid metabolism [28,80,81,82]. These TFs coordinate the pathways that drive the accumulation of storage lipids in seeds. In addition, other TF families, including MYBs, ERFs, bZIPs, bHLHs, WRKYs, and TCPs, also showed higher expression during sunflower seed development. These candidate TFs may play important roles in sunflower lipid metabolism and provide novel regulatory mechanisms for oil accumulation. The identification of these TFs provides new targets for improving oil yield and quality in sunflower through biotechnology and breeding strategies.
This study provides valuable insights into the molecular mechanisms of fatty acid synthesis and triacylglycerol (TAG) accumulation in sunflower seeds, with the aim of improving oil yield and quality. Key genes such as FAD2-1, SAD, and FATA play important roles in the regulation of fatty acid composition and contribute to the high content of unsaturated FAs in ‘T302’ oil, especially LA and OA. The discovery of key transcription factor genes such as WRI1, LEC1, and ABI3, which regulate TAG synthesis during seed maturation, provides new targets for genetic modification. These findings have improved our understanding of lipid metabolism and provide potential genetic targets for improving oil content and modifying FA profiles in sunflower and other oilseed crop breeding.

5. Conclusions

In conclusion, this study provides valuable insights into the molecular mechanisms of lipid biosynthesis and oil accumulation in oilseed sunflower seeds. We have identified key genes and transcription factors involved in fatty acid synthesis, triacylglycerol formation, and lipid storage, highlighting potential targets for improving oil yield and quality. Future research should focus on the functional roles of these genes and transcription factors and explore their potential for optimizing lipid accumulation under different environmental conditions. While this study presents significant findings, it is important to acknowledge that we used only the five developmental stages of sunflower seeds, and environmental factors may influence the results, and further studies are needed to validate these findings under different conditions and larger sample sets.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes16040393/s1, Supplementary figures: Figure S1: Size distribution of sunflower Illumina reads. Figure S2: Function annotation statistics table. Figure S3: COG classification of the identified unigenes. Figure S4: GO functional analysis of the DEGs. Figure S5: Bubble charts of GO (A) and KEGG (B) analyses. Table S1: Primers used in this study. Table S2: Annotation information of 81676 genes. Table S3: Expression of 23,396 differentially expressed genes per period and differential expression. Table S4: Enrichment pathways and number of genes associated with lipid metabolism classification in sunflower seeds. Table S5: KEGG enrichment analysis of each module in WGCNA. Table S6: Enzymes/proteins related to lipid accumulation in sunflower seeds. Table S7: Transcription factors associated with lipid accumulation in sunflower seeds.

Author Contributions

L.S. and W.M. conceived and designed the experiments; W.M. and L.Z. performed the research; X.Y. and D.C. analyzed the data; W.M., L.Z. and L.S. wrote the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (grant nos. 32360501 and 31760064).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All the data in this study are included in this manuscript. The raw RNA-seq reads have been deposited in the NCBI database (accession PRJNA1220589).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Khan, S.; Choudhary, S.; Pandey, A.; Khan, M.K.; Thomas, G. Sunflower oil: Efficient oil source for human consumption. Emerg. Life Sci. Res. 2015, 1, 1–3. [Google Scholar]
  2. Seiler, G.J.; Qi, L.L.; Marek, L.F. Utilization of sunflower crop wild relatives for cultivated sunflower improvement. Crop Sci. 2017, 57, 1083–1101. [Google Scholar] [CrossRef]
  3. Akkaya, M.R. Prediction of fatty acid composition of sunflower seeds by near-infrared reflectance spectroscopy. J. Food Sci. Technol. 2018, 55, 2318–2325. [Google Scholar] [CrossRef]
  4. Flagella, Z.; Rotunno, T.; Tarantino, E.; Di Caterina, R.; De Caro, A. Changes in seed yield and oil fatty acid composition of high oleic sunflower (Helianthus annuus L.) hybrids in relation to the sowing date and the water regime. Eur. J. Agron. 2002, 17, 221–230. [Google Scholar] [CrossRef]
  5. Czumaj, A.; Sledzinski, T. Biological role of unsaturated fatty acid desaturases in health and disease. Nutrients 2020, 12, 356. [Google Scholar] [CrossRef]
  6. Petermann, A.B.; Reyna-Jeldes, M.; Ortega, L.; Coddou, C.; Yevenes, G.E. Roles of the unsaturated fatty acid docosahexaenoic acid in the central nervous system: Molecular and cellular insights. Int. J. Mol. Sci. 2022, 23, 5390. [Google Scholar] [CrossRef]
  7. Bates, P.D.; Stymne, S.; Ohlrogge, J. Biochemical pathways in seed oil synthesis. Curr. Opin. Plant Biol. 2013, 16, 358–364. [Google Scholar] [CrossRef]
  8. Li-Beisson, Y.; Shorrosh, B.; Beisson, F.; Andersson, M.X.; Arondel, V.; Bates, P.D.; Baud, S.; Bird, D.; Debono, A.; Durrett, T.P.; et al. Acyl-lipid metabolism. Arab. Book 2013, 11, e0161. [Google Scholar] [CrossRef]
  9. Xu, C.; Shanklin, J. Triacylglycerol metabolism, function, and accumulation in plant vegetative tissues. Annu. Rev. Plant Biol. 2016, 67, 179–206. [Google Scholar] [CrossRef]
  10. Dehesh, K.; Tai, H.; Edwards, P.; Byrne, J.; Jaworski, J.G. Overexpression of 3-ketoacyl-acyl-carrier protein synthase IIIs in plants reduces the rate of lipid synthesis. Plant Physiol. 2001, 125, 1103–1114. [Google Scholar] [CrossRef]
  11. Aznar-Moreno, J.A.; Venegas-Calerón, M.; Martínez-Force, E.; Garcés, R.; Salas, J.J. Acyl carrier proteins from sunflower (Helianthus annuus L.) seeds and their influence on FatA and FatB acyl-ACP thioesterase activities. Planta 2016, 244, 479–490. [Google Scholar] [CrossRef] [PubMed]
  12. Li, N.; Xu, C.; Li-Beisson, Y.; Philippar, K. Fatty acid and lipid transport in plant cells. Trends Plant Sci. 2016, 21, 145–158. [Google Scholar] [PubMed]
  13. Kroon, J.T.; Wei, W.; Simon, W.J.; Slabas, A.R. Identification and functional expression of a type 2 acyl-CoA:diacylglycerol acyltransferase (DGAT2) in developing castor bean seeds which has high homology to the major triglyceride biosynthetic enzyme of fungi and animals. Phytochem. 2006, 67, 2541–2549. [Google Scholar] [CrossRef] [PubMed]
  14. Kennedy, E.P. Biosynthesis of complex lipids. Fed. Proc. 1961, 20, 934. [Google Scholar]
  15. Ohlrogge, J.; Browse, J. Lipid Biosynthesis. Plant Cell 1995, 7, 957–970. [Google Scholar]
  16. Dahlqvist, A.; Ståhl, U.; Lenman, M.; Banas, A.; Lee, M.; Sandager, L.; Ronne, H.; Stymne, S. Phospholipid:diacylglycerol acyltransferase: An enzyme that catalyzes the acyl-CoA-independent formation of triacylglycerol in yeast and plants. Proc. Natl. Acad. Sci. USA 2000, 97, 6487–6492. [Google Scholar]
  17. Umate, P. Comparative genomics of the lipid-body-membrane proteins oleosin, caleosin and steroleosin in magnoliophyte, lycophyte and bryophyte. Genom. Proteom. Bioinform. 2012, 10, 345–353. [Google Scholar] [CrossRef]
  18. Dar, A.A.; Choudhury, A.R.; Kancharla, P.K.; Arumugam, N. The FAD2 gene in plants: Occurrence, regulation, and role. Front. Plant Sci. 2017, 8, 1789. [Google Scholar]
  19. Shah, F.H.; Cha, T.S. A mesocarp-and species-specific cDNA clone from oil palm encodes for sesquiterpene synthase. Plant Sci. 2000, 154, 153–160. [Google Scholar]
  20. Wang, H.; Guo, J.; Lambert, K.N.; Lin, Y. Developmental control of Arabidopsis seed oil biosynthesis. Planta 2007, 226, 773–783. [Google Scholar] [CrossRef]
  21. Focks, N.; Benning, C. wrinkled1: A novel, low-seed-oil mutant of Arabidopsis with a deficiency in the seed-specific regulation of carbohydrate metabolism. Plant Physiol. 1998, 118, 91–101. [Google Scholar] [PubMed]
  22. Baud, S.; Mendoza, M.S.; To, A.; Harscot, E.; Lepiniec, L.; Dubreucq, B. WRINKLED1 specifies the regulatory action of LEAFY COTYLEDON2 towards fatty acid metabolism during seed maturation in Arabidopsis. Plant J. 2007, 50, 825–838. [Google Scholar] [PubMed]
  23. Marchive, C.; Nikovics, K.; To, A.; Lepiniec, L.; Baud, S. Transcriptional regulation of fatty acid production in higher plants: Molecular bases and biotechnological outcomes. Eur. J. Lipid Sci. Technol. 2014, 116, 1332–1343. [Google Scholar] [CrossRef]
  24. Ridenour, J.B.; Smith, J.E.; Bluhm, B.H. The HAP complex governs fumonisin biosynthesis and maize kernel pathogenesis in fusarium verticillioides. J. Food Protect. 2016, 79, 1498–1507. [Google Scholar]
  25. To, A.; Valon, C.; Savino, G.; Guilleminot, J.; Devic, M.; Giraudat, J.; Parcy, F. A network of local and redundant gene regulation governs Arabidopsis seed maturation. Plant Cell 2006, 18, 1642–1651. [Google Scholar]
  26. Mu, J.; Tan, H.; Zheng, Q.; Fu, F.; Liang, Y.; Zhang, J.; Yang, X.; Wang, T.; Chong, K.; Wang, X.J.; et al. LEAFY COTYLEDON1 is a key regulator of fatty acid biosynthesis in Arabidopsis. Plant Physiol. 2008, 148, 1042–1054. [Google Scholar]
  27. Zhang, M.; Cao, X.; Jia, Q.; Ohlrogge, J. FUSCA3 activates triacylglycerol accumulation in Arabidopsis seedlings and tobacco BY2 cells. Plant J. 2016, 88, 95–107. [Google Scholar]
  28. Lim, A.R.Q.; Kong, Q.; Singh, S.K.; Guo, L.; Yuan, L.; Ma, W. Sunflower WRINKLED1 plays a key role in transcriptional regulation of oil biosynthesis. Int. J. Mol. Sci. 2022, 23, 3054. [Google Scholar] [CrossRef]
  29. Badouin, H.; Gouzy, J.; Grassa, C.J.; Murat, F.; Staton, S.E.; Cottret, L.; Lelandais-Brière, C.; Owens, G.L.; Carrère, S.; Mayjonade, B.; et al. The sunflower genome provides insights into oil metabolism, flowering and Asterid evolution. Nature 2017, 546, 148–152. [Google Scholar]
  30. Premnath, A.; Narayana, M.; Ramakrishnan, C.; Kuppusamy, S.; Chockalingam, V. Mapping quantitative trait loci controlling oil content, oleic acid and linoleic acid content in sunflower (Helianthus annuus L.). Mol. Breed. 2016, 36, 106. [Google Scholar] [CrossRef]
  31. Chernova, A.I.; Gubaev, R.F.; Singh, A.; Sherbina, K.; Goryunova, S.V.; Martynova, E.U.; Goryunov, D.V.; Boldyrev, S.V.; Vanyushkina, A.A.; Anikanov, N.A.; et al. Genotyping and lipid profiling of 601 cultivated sunflower lines reveals novel genetic determinants of oil fatty acid content. BMC Genom. 2021, 22, 505. [Google Scholar] [CrossRef] [PubMed]
  32. Huang, Q.; Xiang, L.; Zhang, L.; Maimaiti, Y.; Luo, W.; Lei, Z. Transcriptome sequencing reveals key genes for sunflower unsaturated fatty acid synthesis. Agronomy 2023, 13, 3. [Google Scholar] [CrossRef]
  33. Kim, D.; Langmead, B.; Salzberg, S.L. HISAT: A fast spliced aligner with low memory requirements. Nat. Methods 2015, 12, 357–360. [Google Scholar] [CrossRef] [PubMed]
  34. Pertea, M.; Pertea, G.M.; Antonescu, C.M.; Chang, T.C.; Mendell, J.T.; Salzberg, S.L. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 2015, 33, 290–295. [Google Scholar] [CrossRef]
  35. Langmead, B.; Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 2012, 9, 357–359. [Google Scholar] [CrossRef]
  36. Love, M.I.; Huber, W.; Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014, 15, 550. [Google Scholar] [CrossRef]
  37. Guo, S.; Zuo, Y.; Zhang, Y.; Wu, C.; Su, W.; Jin, W.; Yu, H.; An, Y.; Li, Q. Large-scale transcriptome comparison of sunflower genes responsive to Verticillium dahliae. BMC Genom. 2017, 18, 42. [Google Scholar] [CrossRef]
  38. Livak, K.J.; Schmittgen, T.D. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods 2001, 25, 402–408. [Google Scholar] [CrossRef]
  39. Liu, Q.; Wu, M.; Zhang, B.; Shrestha, P.; Petrie, J.R.; Green, A.G.; Singh, S.P. Genetic enhancement of palmitic acid accumulation in cotton seed oil through RNAi down-regulation of ghKAS2 encoding β-ketoacyl-ACP synthase II (KASII). Plant Biotechnol. 2017, 15, 132–143. [Google Scholar] [CrossRef]
  40. Baud, S.; Lepiniec, L. Regulation of de novo fatty acid synthesis in maturing oilseeds of Arabidopsis. Plant Physiol. Biochem. 2009, 47, 448–455. [Google Scholar] [CrossRef]
  41. Xu, H.; Li, D.; Hao, Y.; Guo, X.; Lu, J.; Zhang, T. Genome-wide analysis of DGAT gene family in Perilla frutescens and functional characterization of PfDGAT2-2 and PfDGAT3-1 in Arabidopsis. Plant Sci. 2022, 324, 111426. [Google Scholar] [CrossRef] [PubMed]
  42. Turchetto-Zolet, A.C.; Christoff, A.P.; Kulcheski, F.R.; Loss-Morais, G.; Margis, R.; Margis-Pinheiro, M. Diversity and evolution of plant diacylglycerol acyltransferase (DGATs) unveiled by phylogenetic, gene structure and expression analyses. Genet. Mol. Biol. 2016, 39, 524–538. [Google Scholar] [PubMed]
  43. Hernández, M.L.; Whitehead, L.; He, Z.; Gazda, V.; Gilday, A.; Kozhevnikova, E.; Vaistij, F.E.; Larson, T.R.; Graham, I.A. A cytosolic acyltransferase contributes to triacylglycerol synthesis in sucrose-rescued Arabidopsis seed oil catabolism mutants. Plant Physiol. 2012, 160, 215–225. [Google Scholar] [PubMed]
  44. Beaudoin, F.; Lacey, D.J.; Napier, J.A. The biogenesis of the plant seed oil body: Oleosin protein is synthesised by ER-bound ribosomes. Plant Physiol. Biochem. 1999, 37, 481–490. [Google Scholar] [CrossRef]
  45. Frandsen, G.I.; Mundy, J.; Tzen, J.T.C. Oil bodies and their associated proteins, oleosin and caleosin. Physiol. Plant 2001, 112, 301–307. [Google Scholar]
  46. Naested, H.; Frandsen, G.I.; Jauh, G.Y.; Hernandez-Pinzon, I.; Nielsen, H.B.; Murphy, D.J.; Rogers, J.C.; Mundy, D.J. Caleosins: Ca2+-binding proteins associated with lipid bodies. Plant Mol. Biol. 2000, 44, 463–476. [Google Scholar]
  47. Fofana, B.; Cloutier, S.; Duguid, S.; Ching, J.; Rampitsch, C. Gene expression of stearoyl-ACP desaturase and delta12 fatty acid desaturase 2 is modulated during seed development of flax (Linum usitatissimum). Lipids 2006, 41, 705–712. [Google Scholar]
  48. Pérez-vich, B.; Fernández-martínez, J.M.; Grondona, M.; Knapp, S.J.; Berry, S.T. Stearoyl-ACP and oleoyl-PC desaturase genes cosegregate with quantitative trait loci underlying high stearic and high oleic acid mutant phenotypes in sunflower. Theor. Appl. Genet. 2002, 104, 338–349. [Google Scholar]
  49. Stone, S.L.; Kwong, L.W.; Yee, K.M.; Pelletier, J.; Lepiniec, L.; Fischer, R.L.; Goldberg, R.B.; Harada, J.J. LEAFY COTYLEDON2 encodes a B3 domain transcription factor that induces embryo development. Proc. Natl. Acad. Sci. USA 2001, 98, 11806–11811. [Google Scholar] [CrossRef]
  50. Nikolau, B.J.; Ohlrogge, J.B.; Wurtele, E.S. Plant biotin-containing carboxylases. Arch. Biochem. Biophys. 2003, 414, 211–222. [Google Scholar]
  51. Wei, Q.; Li, J.; Zhang, L.; Wu, P.; Chen, Y.; Li, M.; Jiang, H.; Wu, G. Cloning and characterization of a β-ketoacyl-acyl carrier protein synthase II from Jatropha curcas. J. Plant Physiol. 2012, 169, 816–824. [Google Scholar] [PubMed]
  52. Yu, N.; Xiao, W.F.; Zhu, J.; Chen, X.Y.; Peng, C.C. The Jatropha Curcas KASIII gene alters fatty acid composition of seeds in Arabidopsis thaliana. Biol. Plant 2015, 59, 773–782. [Google Scholar]
  53. Wang, Q.; Huang, W.; Jiang, Q.; Lian, J.; Liu, Z. Lower levels of expression of FATA2 gene promote longer siliques with modified seed oil content in Arabidopsis thaliana. Plant Mol. Biol. Rep. 2013, 31, 1368–1375. [Google Scholar]
  54. Moreno-Pérez, A.J.; Venegas-Calerón, M.; Vaistij, F.E.; Salas, J.J.; Larson, T.R.; Garcés, R.; Graham, I.A.; Martínez-Force, E. Reduced expression of FatA thioesterases in Arabidopsis affects the oil content and fatty acid composition of the seeds. Planta 2012, 235, 629–639. [Google Scholar]
  55. Iba, K.; Gibson, S.; Nishiuchi, T.; Fuse, T.; Nishimura, M.; Arondel, V.; Hugly, S.; Somerville, C. A gene encoding a chloroplast omega-3 fatty acid desaturase complements alterations in fatty acid desaturation and chloroplast copy number of the fad7 mutant of Arabidopsis thaliana. J. Biol. Chem. 1993, 268, 24099–24105. [Google Scholar]
  56. Shah, S.; Xin, Z.G.; Browse, J. Overexpression of the FAD3 desaturase gene in a mutant of Arabidopsis. Plant Physiol. 1997, 114, 1533–1539. [Google Scholar]
  57. Abdullah, H.M.; Akbari, P.; Paulose, B.; Schnell, D.; Qi, W.; Park, Y.; Pareek, A.; Dhankher, O.P. Transcriptome profiling of Camelina sativa to identify genes involved in triacylglycerol biosynthesis and accumulation in the developing seeds. Biotechnol. Biofuels 2016, 9, 136. [Google Scholar]
  58. Yang, S.; Miao, L.; He, J.; Zhang, K.; Li, Y.; Gai, J. Dynamic transcriptome changes related to oil accumulation in developing soybean seeds. Int. J. Mol. Sci. 2019, 20, 2202. [Google Scholar] [CrossRef]
  59. Li, S.S.; Wang, L.S.; Shu, Q.Y.; Wu, J.; Chen, L.G.; Shao, S.; Yin, D.D. Fatty acid composition of developing tree peony (Paeonia section Moutan DC.) seeds and transcriptome analysis during seed development. BMC Genom. 2015, 16, 208. [Google Scholar]
  60. Fei, X.; Ma, Y.; Hu, H.; Wei, A. Transcriptome analysis and GC-MS profiling of key genes in fatty acid synthesis of Zanthoxylum bungeanum seeds. Ind. Crops Prod. 2020, 156, 112870. [Google Scholar]
  61. Xie, D.; Dai, Z.; Yang, Z.; Tang, Q.; Deng, C.; Xu, Y.; Wang, J.; Chen, J.; Zhao, D.; Zhang, S.; et al. Combined genome-wide association analysis and transcriptome sequencing to identify candidate genes for flax seed fatty acid metabolism. Plant Sci. 2019, 286, 98–107. [Google Scholar] [CrossRef] [PubMed]
  62. Feng, Y.-Z.; Wang, L.; Fu, J.-M.; Wuyun, T.-N.; Du, H.-Y.; Tan, X.-F.; Zou, F.; Li, F.-D. Transcriptome sequencing discovers genes related to fatty acid biosynthesis in the seeds of Eucommia ulmoides. Genes Genom. 2016, 38, 275–283. [Google Scholar] [CrossRef]
  63. Kazaz, S.; Barthole, G.; Domergue, F.; Ettaki, H.; To, A.; Vasselon, D.; De Vos, D.; Belcram, K.; Lepiniec, L.; Baud, S. Differential activation of partially redundant Δ9 stearoyl-ACP desaturase genes is critical for omega-9 monounsaturated fatty acid biosynthesis during seed development in Arabidopsis. Plant Cell 2020, 32, 3613–3637. [Google Scholar] [CrossRef]
  64. Salas, J.J.; Martínez-Force, E.; Garcés, R. Biochemical characterization of a high-palmitoleic acid Helianthus annuus mutant. Plant Physiol. Biochem. 2004, 42, 373–381. [Google Scholar] [CrossRef]
  65. Pleite, R.; Martínez-Force, E.; Garcés, R. Increase of the stearic acid content in high-oleic sunflower (Helianthus annuus) seeds. J. Agric. Food Chem. 2006, 54, 9383–9388. [Google Scholar]
  66. Perry, H.J.; Harwood, J.L. Changes in the lipid content of developing seeds of Brassica napus. Phytochemistry 1993, 32, 1411–1415. [Google Scholar]
  67. Liu, D.; Ji, H.; Yang, Z. Functional characterization of three novel genes encoding diacylglycerol acyltransferase (DGAT) from oil-rich tubers of Cyperus esculentus. Plant Cell Physiol. 2020, 61, 118–129. [Google Scholar] [CrossRef]
  68. Lu, C.L.; de Noyer, S.B.; Hobbs, D.H.; Kang, J.; Wen, Y.; Krachtus, D.; Hills, M.J. Expression pattern of diacylglycerol acyltransferase-1, an enzyme involved in triacylglycerol biosynthesis, in Arabidopsis thaliana. Plant Mol. Biol. 2003, 52, 31–41. [Google Scholar]
  69. Shockey, J.M.; Gidda, S.K.; Chapital, D.C.; Kuan, J.C.; Dhanoa, P.K.; Bland, J.M.; Rothstein, S.J.; Mullen, R.T.; Dyer, J.M. Tung tree DGAT1 and DGAT2 have nonredundant functions in triacylglycerol biosynthesis and are localized to different subdomains of the endoplasmic reticulum. Plant Cell 2006, 18, 2294–2313. [Google Scholar] [CrossRef]
  70. Zhou, L.; Wu, Q.; Yang, Y.; Li, Q.; Li, R.; Ye, J. Regulation of oil biosynthesis and genetic improvement in plants: Advances and prospects. Genes 2024, 15, 1125. [Google Scholar] [CrossRef]
  71. Wurie, H.R.; Bucketf, L.; Zammit, V.A. Diacylglycerol acyhransferase 2 acts upstream of diacylglycerol acyhransferase 1 and utilizes nascent diglycerides and de novo synthesized fatty acids in HepG2 cells. FEBS J. 2012, 279, 3033–3047. [Google Scholar] [CrossRef] [PubMed]
  72. Lotan, T.; Ohto, M.; Yee, K.M.; West, M.A.; Lo, R.; Kwong, R.W.; Yamagishi, K.; Fischer, R.L.; Goldberg, R.B.; Harada, J.J. Arabidopsis LEAFY COTYLEDON1 is sufficient to induce embryo development in vegetative cells. Cell 1998, 93, 1195–1205. [Google Scholar] [CrossRef] [PubMed]
  73. Xu, Y.; Caldo, K.M.P.; Pal-Nath, D.; Ozga, J.; Lemieux, M.J.; Weselake, R.J.; Chen, G. Properties and biotechnological applications of acyl-coa:diacylglycerol acyltransferase and phospholipid:diacylglycerol acyltransferase from terrestrial plants and microalgae. Lipids 2018, 53, 663–688. [Google Scholar] [CrossRef]
  74. Mueller, S.P.; Unger, M.; Guender, L.; Fekete, A.; Mueller, M.J. Phospholipid:diacylglycerol acyltransferase-mediated triacylglyerol synthesis augments basal thermotolerance. Plant Physiol. 2017, 175, 486–497. [Google Scholar] [CrossRef]
  75. Klińska-Bąchor, S.; Kędzierska, S.; Demski, K.; Banaś, A. Phospholipid:diacylglycerol acyltransferase1-overexpression stimulates lipid turnover, oil production and fitness in cold-grown plants. BMC Plant Biol. 2023, 23, 370. [Google Scholar] [CrossRef]
  76. Zhang, M.; Fan, J.; Taylor, D.C.; Ohlrogge, J.B. DGAT1 and PDAT1 acyltransferases have overlapping functions in Arabidopsis triacylglycerol biosynthesis and are essential for normal pollen and seed development. Plant Cell 2009, 21, 3885–3901. [Google Scholar] [CrossRef]
  77. Banaś, W.; Sanchez Garcia, A.; Banaś, A.; Stymne, S. Activities of acyl-CoA:diacylglycerol acyltransferase (DGAT) and phospholipid:diacylglycerol acyltransferase (PDAT) in microsomal preparations of developing sunflower and safflower seeds. Planta 2013, 237, 1627–1636. [Google Scholar] [CrossRef]
  78. Pan, X.; Siloto, R.M.; Wickramarathna, A.D.; Mietkiewska, E.; Weselake, R.J. Identification of a pair of phospholipid:diacylglycerol acyltransferases from developing flax (Linum usitatissimum L.) seed catalyzing the selective production of trilinolenin. J. Biol. Chem. 2013, 288, 24173–24188. [Google Scholar] [CrossRef]
  79. Zang, X.; Geng, X.; Ma, L.; Wang, N.; Pei, W.; Wu, M.; Zhang, J.; Yu, J. A genome-wide analysis of the phospholipid: Diacylglycerol acyltransferase gene family in Gossypium. BMC Genom. 2019, 20, 402. [Google Scholar] [CrossRef]
  80. Tan, H.; Yang, X.; Zhang, F.; Zheng, X.; Qu, C.; Mu, J.; Fu, F.; Li, J.; Guan, R.; Zhang, H.; et al. Enhanced seed oil production in canola by conditional expression of Brassica napus LEAFY COTYLEDON1 and LEC1-LIKE in developing seeds. Plant Physiol. 2011, 156, 1577–1588. [Google Scholar] [CrossRef]
  81. Tian, R.; Wang, F.; Zheng, Q.; Niza, V.M.A.G.E.; Downie, A.B.; Perry, S.E. Direct and indirect targets of the Arabidopsis seed transcription factor ABSCISIC ACID INSENSITIVE3. Plant J. 2020, 103, 1679–1694. [Google Scholar]
  82. Yang, Z.; Liu, X.; Wang, K.; Li, Z.; Jia, Q.; Zhao, C.; Zhang, M. ABI3 with or without FUS3 highly up-regulates lipid droplet proteins and activates oil accumulation. J. Exp. Bot. 2021, 73, 2077–2092. [Google Scholar]
Figure 1. KEGG pathway enrichment analysis of the unigenes. The X-axis represents the names of the KEGG metabolic pathways, while the Y-axis denotes the count of genes or transcripts annotated to each respective pathway.
Figure 1. KEGG pathway enrichment analysis of the unigenes. The X-axis represents the names of the KEGG metabolic pathways, while the Y-axis denotes the count of genes or transcripts annotated to each respective pathway.
Genes 16 00393 g001
Figure 2. PCA analysis of sunflower transcriptome samples and cluster analysis of DEGs. (A) Principal component analysis (PCA) of all 15 samples was performed based on RNA-seq FPKM. (B) The number of genes in sunflower seeds at different developmental stages: 10, 17, 24, 31, and 38 days after flowering (DAF). (C) Pairwise comparisons of the number of DEGs between samples at each time point during sunflower seed development were performed. A p-value < 0.05 and a fold change ≥ 1 were used as criteria for DEG identification.
Figure 2. PCA analysis of sunflower transcriptome samples and cluster analysis of DEGs. (A) Principal component analysis (PCA) of all 15 samples was performed based on RNA-seq FPKM. (B) The number of genes in sunflower seeds at different developmental stages: 10, 17, 24, 31, and 38 days after flowering (DAF). (C) Pairwise comparisons of the number of DEGs between samples at each time point during sunflower seed development were performed. A p-value < 0.05 and a fold change ≥ 1 were used as criteria for DEG identification.
Genes 16 00393 g002
Figure 3. WGCNA analysis of RNA-seq and physiological trait data. (A) WGCNA was calculated from 15 samples, and the 23,396 DEGs were partitioned into six modules. Columns represent the module eigengenes of the means. For each module, the number of DEGs and the number of KEGG pathways were listed. (B) The expression profiles of the modules were related to the date of sampling (DAF) and the physiological traits in Figure 1. The data in the boxes represent the number of differentially expressed genes.
Figure 3. WGCNA analysis of RNA-seq and physiological trait data. (A) WGCNA was calculated from 15 samples, and the 23,396 DEGs were partitioned into six modules. Columns represent the module eigengenes of the means. For each module, the number of DEGs and the number of KEGG pathways were listed. (B) The expression profiles of the modules were related to the date of sampling (DAF) and the physiological traits in Figure 1. The data in the boxes represent the number of differentially expressed genes.
Genes 16 00393 g003
Figure 4. Fatty acid and TAG biosynthetic pathways in sunflower seeds. The expression levels of potential candidate genes (represented by Log10 FPKM) in sunflower seeds at various developmental stages (10, 17, 24, 31, and 38 DAF) are highlighted through color coding, ranging from blue (low expression) to red (high expression). An asterisk (*) denotes the differentially expressed genes (DEGs) associated with FA synthesis, as identified through WGCNA analysis. Key genes involved in lipid metabolism include ACCase (acetyl-CoA carboxylase), BC (biotin carboxylase), BCCP (biotin carboxyl carrier protein), α-CT (carboxyl transferase subunit α), MACT (malonyl-CoA-acyl carrier protein transacylase), ACP (acid phosphatase), KAS III (3-oxoacyl-ACP synthase III), KAR (3-oxoacyl-ACP reductase), HAD (3-hydroxyacyl-ACP dehydratase), ENR (Enoyl-CoA reductase), KAS II (3-oxoacyl-ACP synthase II), FATB (fatty acyl-ACP thioesterase B), FATA (fatty acyl-ACP thioesterase A), SAD (stearoyl-ACP desaturase), FAD (fatty acid desaturase), LACS (long-chain acyl-CoA synthetase), LPCAT (lysophosphatidylcholine acyltransferase), PLA2 (phospholipase A2), GPAT (glycerol-3-phosphate acyltransferase), LPAAT (1-acyl-sn-glycerol-3-phosphate acyltransferase), PLD (phospholipase D), PAP (phosphatide phosphatase), CPT (CDP-choline: diacylglycerol cholinephosphotransferase), PDCT (phosphatidylcholine: diacylglycerol cholinephosphotransferase), PLC (phospholipase C), DGAT (diacylglycerol acyltransferase), PDAT (phospholipid: diacylglycerol acyltransferase), OLE (oleosin), CLO (caleosin), WRI1 (wrinkled 1), and FUS3 (FUSCA3).
Figure 4. Fatty acid and TAG biosynthetic pathways in sunflower seeds. The expression levels of potential candidate genes (represented by Log10 FPKM) in sunflower seeds at various developmental stages (10, 17, 24, 31, and 38 DAF) are highlighted through color coding, ranging from blue (low expression) to red (high expression). An asterisk (*) denotes the differentially expressed genes (DEGs) associated with FA synthesis, as identified through WGCNA analysis. Key genes involved in lipid metabolism include ACCase (acetyl-CoA carboxylase), BC (biotin carboxylase), BCCP (biotin carboxyl carrier protein), α-CT (carboxyl transferase subunit α), MACT (malonyl-CoA-acyl carrier protein transacylase), ACP (acid phosphatase), KAS III (3-oxoacyl-ACP synthase III), KAR (3-oxoacyl-ACP reductase), HAD (3-hydroxyacyl-ACP dehydratase), ENR (Enoyl-CoA reductase), KAS II (3-oxoacyl-ACP synthase II), FATB (fatty acyl-ACP thioesterase B), FATA (fatty acyl-ACP thioesterase A), SAD (stearoyl-ACP desaturase), FAD (fatty acid desaturase), LACS (long-chain acyl-CoA synthetase), LPCAT (lysophosphatidylcholine acyltransferase), PLA2 (phospholipase A2), GPAT (glycerol-3-phosphate acyltransferase), LPAAT (1-acyl-sn-glycerol-3-phosphate acyltransferase), PLD (phospholipase D), PAP (phosphatide phosphatase), CPT (CDP-choline: diacylglycerol cholinephosphotransferase), PDCT (phosphatidylcholine: diacylglycerol cholinephosphotransferase), PLC (phospholipase C), DGAT (diacylglycerol acyltransferase), PDAT (phospholipid: diacylglycerol acyltransferase), OLE (oleosin), CLO (caleosin), WRI1 (wrinkled 1), and FUS3 (FUSCA3).
Genes 16 00393 g004
Figure 5. DEGs related to FA and TAG biosynthetic pathways identified in developing sunflower seeds. The Y-axis represents reads per kilobase per million mapped reads (FPKM) values and the X-axis represents five developmental stages of sunflower seeds: 10, 17, 24, 31, and 38 days after anthesis (DAF). The error bar line represents the standard deviation (SD) calculated from three biological replicates. (A) Acetyl-CoA carboxylase subunits (BC, BCCP, α-CT); (B) β-Ketoacyl-ACP synthase isoforms (KASI, KASII, KASIII); (C) Acyl-ACP thioesterases (FATA, FATB); (D) Long-chain acyl-CoA synthetases (LACS4, LACS7, LACS9); (E) Triacylglycerol assembly enzymes (GPAT, DGAT2, DGAT3, PDAT); (F) Oil body-associated proteins (OLE, CLO); (G) Fatty acid desaturases (SAD, FAD2, FAD3); (H) Seed development regulators (LEC1, LEC2, ABI3, FUS3, WRI1).
Figure 5. DEGs related to FA and TAG biosynthetic pathways identified in developing sunflower seeds. The Y-axis represents reads per kilobase per million mapped reads (FPKM) values and the X-axis represents five developmental stages of sunflower seeds: 10, 17, 24, 31, and 38 days after anthesis (DAF). The error bar line represents the standard deviation (SD) calculated from three biological replicates. (A) Acetyl-CoA carboxylase subunits (BC, BCCP, α-CT); (B) β-Ketoacyl-ACP synthase isoforms (KASI, KASII, KASIII); (C) Acyl-ACP thioesterases (FATA, FATB); (D) Long-chain acyl-CoA synthetases (LACS4, LACS7, LACS9); (E) Triacylglycerol assembly enzymes (GPAT, DGAT2, DGAT3, PDAT); (F) Oil body-associated proteins (OLE, CLO); (G) Fatty acid desaturases (SAD, FAD2, FAD3); (H) Seed development regulators (LEC1, LEC2, ABI3, FUS3, WRI1).
Genes 16 00393 g005
Figure 6. Fatty acid desaturase genes identified in developing sunflower seeds. (A) Heat map showing the expression levels of desaturase genes. (B) Phylogenetic analysis of desaturase genes in sunflower and Arabidopsis, performed using ClustalX2 for nucleotide sequence alignment and MEGA 7.0 Neighbor Joining (NJ) method for tree construction. (C) Pathway map illustrating the desaturase gene network. The asterisk (*) represents the major DEGs associated with FA synthesis.
Figure 6. Fatty acid desaturase genes identified in developing sunflower seeds. (A) Heat map showing the expression levels of desaturase genes. (B) Phylogenetic analysis of desaturase genes in sunflower and Arabidopsis, performed using ClustalX2 for nucleotide sequence alignment and MEGA 7.0 Neighbor Joining (NJ) method for tree construction. (C) Pathway map illustrating the desaturase gene network. The asterisk (*) represents the major DEGs associated with FA synthesis.
Genes 16 00393 g006
Figure 7. Transcription factors (TFs) involved in lipid synthesis in sunflower. (A) TFs identified in developing sunflower seeds. A total of 44 families were predicted, and the red columns indicate TFs that may be involved in FA and TAG biosynthesis. (B) Regulatory model of seed oil accumulation by TFs. (C) Heat map of differential expression of TF genes with FA and TAG biosynthesis. The asterisk (*) represents the major DEGs associated with TAG synthesis.
Figure 7. Transcription factors (TFs) involved in lipid synthesis in sunflower. (A) TFs identified in developing sunflower seeds. A total of 44 families were predicted, and the red columns indicate TFs that may be involved in FA and TAG biosynthesis. (B) Regulatory model of seed oil accumulation by TFs. (C) Heat map of differential expression of TF genes with FA and TAG biosynthesis. The asterisk (*) represents the major DEGs associated with TAG synthesis.
Genes 16 00393 g007
Figure 8. Expression validation of genes between RT-qPCR and RNA-seq at three stages of seed development in sunflower. Red line and blue volume represent RNA-seq and data, respectively. Sunflower 18S rRNA gene (AF1057577) was used as an internal control. Means with different letters indicate a significant difference (p < 0.05); error bars lines indicate SD (n = 3).
Figure 8. Expression validation of genes between RT-qPCR and RNA-seq at three stages of seed development in sunflower. Red line and blue volume represent RNA-seq and data, respectively. Sunflower 18S rRNA gene (AF1057577) was used as an internal control. Means with different letters indicate a significant difference (p < 0.05); error bars lines indicate SD (n = 3).
Genes 16 00393 g008
Table 1. Fatty acid composition (% of total fatty acids) in mature seeds of ‘T302’ sunflower cultivar.
Table 1. Fatty acid composition (% of total fatty acids) in mature seeds of ‘T302’ sunflower cultivar.
Fatty AcidsFormulaPercentage (%)
lauric acidC12:00.01
myristic acidC14:00.07
pentadecanoic acidC15:00.02
palmitic acidC16:07.46
heptadecanoic acidC17:00.06
stearic acidC18:05.04
arachidonic acidC20:00.36
eicosanoic acidC21:00.01
behenic acidC22:00.03
pantotrizoic acidC23:00.04
lignoceric acidC24:00.29
myristic acrylic acidC14:1N50.07
pentadecenoic acidC15:1N50.06
palmitoleic acidC16:1N70.13
heptadecenoic acidC17:1N70.04
trans-oleic acidC18:1TN90.03
oleic acidC18:1N937.25
linoleic acidC18:2N648.47
linolenic acidC18:3N30.06
eicosenoic acidC20:1N90.15
eicosapentaenoic acidC20:5N30.34
docosapentaenoic acidC22:5N60.01
tetracosenoic acidC24:1N90.01
docosahexaenoic acid (DHA)C22:6N30.01
Total_ SFAs 13.39
Total_ MUFAs 37.73
Total_ PUFAs 48.88
Note: The first double bonds in unsaturated fatty acids, designated n3, n5, n6, n7, and n9, are located at positions 3, 5, 6, 7, and 9, respectively, from the methyl end of the carbon chain. SFAs, saturated fatty acids; MUFAs, monounsaturated fatty acids; PUFAs, polyunsaturated fatty acids.
Table 2. Identification of lipid-related genes in the developing sunflower embryo.
Table 2. Identification of lipid-related genes in the developing sunflower embryo.
EnzymeKEGG AnnotationUg IDGene Name10 DAF17 DAF24 DAF31 DAF38 DAFModel
ACC/α-CTacetyl-coenzyme A carboxylase carboxyl transferase subunit αUg40664LOC11088445852.1738.6522.7719.8913.02blue
Ug65667LOC110909894103.7962.4733.2715.242.2blue
ACC/BCbiotin carboxylase 2, chloroplastic-likeUg80390LOC11092093696.3544.5125.8311.012.84blue
Ug4062LOC110882513149.2776.7749.8928.1411.67blue
ACC/BCCPbiotin carboxyl carrier protein of acetyl-CoA carboxylase, chloroplastic-likeUg5997LOC11091726425.37315.257.366.193.49blue
Ug38366LOC110879644237.86212.6132.7460.939.06blue
Ug71124LOC11091586962.4638.7520.4111.744.26blue
MCATmalonyl CoA-acyl carrier protein transacylase-likeUg1819LOC110869205110.83106.7972.1753.8514.11blue
HAD3-hydroxyacyl-[acyl-carrier-protein] dehydrataseUg9482LOC110928978162.8296.3237.418.290.94blue
ENR/FABIenoyl-[acyl-carrier-protein] reductase [NADH]Ug8057LOC11092715511.859.8510.0452.2143.95turquosio
Ug80148LOC110920857103.9848.7533.0410.111.25blue
Ug74540LOC11091831077.2239.4421.9411.023.84blue
KAR3-oxoacyl-ACP reductaseUg76007LOC11092629594.5157.8144.0129.6722.61blue
Ug45295LOC11088689874.6141.1723.8615.7110.65blue
KASI3-oxoacyl-[acyl-carrier-protein] synthase IUg80194LOC110925958231.82137.5184.155.4614.97blue
Ug16903LOC11093738339.8735.7220.4512.023.97blue
Ug75867LOC1109231771.11.390.991.840.73brown
KASII3-oxoacyl-[acyl-carrier-protein] synthase IIUg33622LOC110877021191.49177105.550.43.36blue
Ug49585LOC11089118910.877.854.834.876.25yellow
KASIII3-oxoacyl-[acyl-carrier-protein] synthase IIIUg21332LOC110941199124.8875.548.5632.635.19blue
Ug7042LOC11091302415.9611.365.543.920.96blue
FAD2delta (12) fatty acid desaturaseUg64287LOC1109043121022.51994.41304.58346.145.73blue
Ug25083LOC1108652840.260.10.0518.8820.01turquosio
Ug1458LOC1108676817.072.20.658.937.59turquosio
Ug55389LOC1108975681.962.170.753.082.2turquosio
Ug18975LOC1109399711.721.850.163.431.66grey
SADstearoyl-[acyl-carrier-protein] 9-desaturaseUg3372LOC1108781601693.36934.07470.8284.213.08blue
Ug48724LOC110890752192.41174.93113.8540.049.64blue
FATAoleoyl-acyl carrier protein thioesteraseUg2269LOC110871748306.52225.67221.38224.42156.45turquosio
FATBpalmitoyl-acyl carrier protein thioesteraseUg34054LOC11087721716.9117.458.787.462.19blue
LACSlong chain acyl-CoA synthetase 1 Ug65810LOC1109100532.133.876.9424.4818.81turquosio
long chain acyl-CoA synthetase 2 Ug43869LOC11088608916.2327.9930.5210.71.57blue
long chain acyl-CoA synthetase 4Ug11892LOC11093024636.0938.0225.7723.438.24blue
Ug58965LOC11089950000.510.393.695.33turquosio
long chain acyl-CoA synthetase 6, peroxisomalUg31054LOC1108725670.020.0700.1240.11turquosio
Ug42142LOC1108852437.335.632.583.862.65yellow
long chain acyl-CoA synthetase 7, peroxisomal Ug10240LOC11092934217.9529.4526.21139.35140.43turquosio
Ug1412LOC1108957472.151.382.514.920.47brown
long chain acyl-CoA synthetase 9, chloroplastic-Ug37053LOC11087880818.5310.994.261.530.28blue
Ug46285LOC11088949884.0372.5834.4128.6113.77blue
GPATglycerol-3-phosphate acyltransferase Ug65669LOC1109098973.955.028.889.991.14brown
Ug64602LOC1109045301.670.840.842.890.69brown
Ug65083LOC1109048383.923.763.244.682.62brown
Ug60504LOC1109003914.926.719.9912.047.39brown
Ug24261LOC1108647765.8610.3410.3413.557.49green
ATS1glycerol-3-phosphate acyltransferase, chloroplasticUg11854LOC1109302241.290.960.834.614.85turquosio
Ug59423LOC1108997756.575.324.4258.2grey
Ug70832LOC1109160841.382.010.631.577.94turquosio
LPAATlysophosphatidic acid acyltransferaseUg55427LOC11089758226.1213.0312.277.041.9blue
Ug38130LOC1108794982.974.313.616.472.69green
Ug43812LOC1108860592.411.741.551.921.55blue
Ug69414LOC11091253410.018.464.5716.533.87turquosio
Ug63749LOC1109039422.683.533.6710.1916.89turquosio
PAPphosphatidate phosphatase PAH1-likeUg3797LOC1108806054.514.413.013.111.43blue
Ug49621LOC1108912131.381.872.285.227.4turquosio
Ug38050LOC1108794502.914.935.2712.339.89turquosio
Ug5507LOC1109145982.623.995.017.524.91turquosio
Ug62420LOC1109088562.383.213.59.0615.91turquosio
PDATphospholipid: diacylglycerol acyltransferase 1Ug44521LOC1108864581.221.451.318.0715.62turquosio
Ug50246LOC1108936893.617.156.8514.424.3turquosio
phospholipid: diacylglycerol acyltransferase 2Ug64137LOC110904203319.61314.75321.83176.0246.18blue
LPCATlysophospholipid acyltransferaseUg32561LOC1108733911.353.281.9310.1812turquosio
Ug6933LOC11092225870.1643.9925.8214.167.09blue
Ug5650LOC11091545114.7416.718.7140.0472.81turquosio
DGATdiacylglycerol O-acyltransferase 1Ug45411LOC1108869780.130.460.494.053.91turquosio
Ug50291LOC1108937275.57.514.458.214.79yellow
diacylglycerol O-acyltransferase 2Ug69246LOC11091244837.4630.4341.7827.585.05blue
Ug63054LOC1109091963.399.0523.0984.76125.24turquosio
diacylglycerol O-acyltransferase 3, cytosolicUg21368LOC11094122512.0819.4327.6169.91155.47turquosio
CPIcycloeucalenol cycloisomeraseUg38774LOC1108798988.717.216.36.93.62blue
CPTcholine/ethanolaminephosphotransferaseUg45630LOC11088913618.5816.514.7627.119.44brown
Ug5123LOC1109126891.752.543.017.182.73turquosio
PLDphospholipase DUg13661LOC11093564822.0428.0319.2869.34119.35turquosio
Ug43122LOC1108856931.61.40.371.31.167yellow
Ug36550LOC11087850323.1728.4522.5156.19113.88yellow
Ug54911LOC110897355 4.352.841.222.123.1yellow
Ug50834LOC1108940462.894.223.616.291.32brown
Ug41664LOC110884979 0.71.191.732.552.77turquosio
Ug52043LOC1108947250.370.760.753.652.12turquosio
Ug60454LOC1109003571.182.251.692.660.88brown
Caleosin (CLO)peroxygenaseUg391LOC1109435491628.642634.684621.254012.811703.27turquosio
Ug80554LOC1109227251.451.32.377.532.97turquosio
Ug80552LOC11092231912.5211.8427.8444.4712.47green
oleosin (OLE)oleosinUg67185LOC1109112642918.24725.8510,221.6111,148.613956.6green
Ug65317LOC110906283681.18686.941068.37480.97340.3green
Ug63945LOC1109040717917.127774.569851.57118.13225.92green
Ug29184LOC11086916610,678.3914,402.4621,354.3710,587.851985.85green
Ug31436LOC1108710771858.281716.852878.291815.632101.36green
Ug71463LOC110919066241.72503.791575.6610.37429.39green
Ug77732LOC1109217834105.235143.997891.143065.981777.43green
Note: Unigenes exhibiting FPKM values exceeding 100 are emphasized using bold formatting.
Table 3. Expression profiling of lipid metabolism-regulating transcription factors during sunflower seed maturation.
Table 3. Expression profiling of lipid metabolism-regulating transcription factors during sunflower seed maturation.
EnzymeKEGG AnnotationUg IDGene Name10 DAF17 DAF24 DAF31 DAF38 DAF
LEC1nuclear transcription factor Y subunit B-6-like (NF-YB)Ug73129LOC11091859878.7039.2420.434.240
LEC2B3 domain-containing transcription factor LEC2-likeUg68124LOC1109097500.02000.020.04
Ug68126LOC11091383400000
FUS3B3 domain-containing transcription factor FUS3-likeUg64049LOC1109060264.235.4310.285.050.16
Ug67251LOC110911305630.63419.49573.11297.368.39
ABI3B3 domain-containing transcription factor ABI3-likeUg66200LOC11091047045.3860.7659.7666.6657.29
Ug39718LOC11088393819.1817.9024.4635.4830.41
WRI1ethylene-responsive transcription factor WRI1Ug63453LOC11090374065.8741.9442.6019.730.58
Note: Unigenes exhibiting FPKM values exceeding 100 are emphasized using bold formatting.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Meng, W.; Zeng, L.; Yang, X.; Chen, D.; Sun, L. Transcriptome Analysis Reveals Key Genes Involved in Fatty Acid and Triacylglycerol Accumulation in Developing Sunflower Seeds. Genes 2025, 16, 393. https://doi.org/10.3390/genes16040393

AMA Style

Meng W, Zeng L, Yang X, Chen D, Sun L. Transcriptome Analysis Reveals Key Genes Involved in Fatty Acid and Triacylglycerol Accumulation in Developing Sunflower Seeds. Genes. 2025; 16(4):393. https://doi.org/10.3390/genes16040393

Chicago/Turabian Style

Meng, Wanqiu, Linglu Zeng, Xiuli Yang, Dawei Chen, and Li Sun. 2025. "Transcriptome Analysis Reveals Key Genes Involved in Fatty Acid and Triacylglycerol Accumulation in Developing Sunflower Seeds" Genes 16, no. 4: 393. https://doi.org/10.3390/genes16040393

APA Style

Meng, W., Zeng, L., Yang, X., Chen, D., & Sun, L. (2025). Transcriptome Analysis Reveals Key Genes Involved in Fatty Acid and Triacylglycerol Accumulation in Developing Sunflower Seeds. Genes, 16(4), 393. https://doi.org/10.3390/genes16040393

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop