Next Article in Journal
Molecular Responses of the NAC Gene Family in Acanthus ebracteatus: Involvement in Abiotic Stress Responses and Biosynthesis of Verproside and Verbascoside
Previous Article in Journal
Development of Updated Models to Characterize Skidding Performance in Romania Based on a Nation-Level Dataset
Previous Article in Special Issue
Integrated Transcriptomic Analysis Reveals Molecular Mechanisms Underlying Albinism in Schima superba Seedlings
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Identification and Expression Profiling of AP2/ERF Transcription Factor Genes in Prunus armeniaca L.

1
Research Institute of Non-Timber Forestry, Chinese Academy of Forestry, Zhengzhou 450003, China
2
College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan 430070, China
3
Department of Horticulture, University of Georgia, Athens, GA 30602, USA
4
Co-Innovation Center for the Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Forests 2025, 16(8), 1353; https://doi.org/10.3390/f16081353
Submission received: 7 July 2025 / Revised: 5 August 2025 / Accepted: 18 August 2025 / Published: 20 August 2025
(This article belongs to the Special Issue Forest Tree Breeding: Genomics and Molecular Biology)

Abstract

The APETALA2/Ethylene Responsive Factor (AP2/ERF transcription factor) family plays pivotal roles in plant growth, stress responses, and metabolic regulation. Here, we identified 118 AP2/ERF family members in the apricot (Prunus armeniaca L.) genome, which were classified into four major subfamilies (AP2, DREB, ERF, and RAV) and Soloists (few unclassified factors), through phylogenetic analysis. The ERF subfamily exhibited the largest expansion (55 members), driven predominantly by 10 tandem and 14 segmental duplication events. Gene structures and conserved motifs exhibited similar patterns within each subfamily. Chromosomal distribution was uneven, with chromosome 1 harboring the highest gene density. PaWRI1 was specifically expressed in apricot kernel and positively correlated with oil accumulation. A total of 47 lipid-related genes were predicted as potential targets of PaWRI1 through correlation analysis, which covers the entire three-stage process of plant oil synthesis. These results advance our understanding of how core AP2/ERF transcription factors modulate oil accumulation pathways in apricot, offering potential targets for metabolic engineering.

1. Introduction

Apricot (Prunus armeniaca L.), an important member of the Rosaceae family [1], is one of the most commercially valuable fruit crops worldwide. Its fruits are not only prized for their unique flavor, but the kernels are also rich in high-value oils (40%–60% oil content) and various bioactive compounds, including oleic acid, linoleic acid, tocopherols, and phenolic substances, which have wide applications in the food, pharmaceutical, and cosmetic industries [2,3,4].
Transcription factors (TFs) are DNA-binding proteins that specifically recognize and bind to cis-acting elements in the promoter regions of target genes, thereby regulating transcriptional activity [5]. The APETALA2/Ethylene Responsive Factor (AP2/ERF) family, a major group of plant TFs, contains a conserved AP2/ERF domain that recognizes and binds to specific cis-elements such as the GCC-box and DRE/CRT motifs in target gene promoters [6,7,8,9]. Based on structural features and phylogenetic relationships, this family can be divided into four major subfamilies: (1) the AP2 subfamily, characterized by dual AP2 domains, primarily regulates plant growth and development; (2) the ERF subfamily, containing a single AP2 domain, is involved in stress responses and hormone signaling; (3) the Related to ABI3/VP1 (RAV) subfamily, which possesses both AP2 and B3 domains, modulates diverse biological processes; and (4) the dehydration-responsive element binding protein (DREB) subfamily, containing only one AP2 domain, regulates genes involved in hormone and stress responses [6,7,8,10]. At the molecular level, the typical AP2 domain consists of 60 amino acids, forming a characteristic three-dimensional structure with three DNA-binding β-sheets and an α-helix mediating protein–protein interactions [7,11,12]. Notably, the distribution of this family varies significantly across organisms: AP2 and ERF subfamilies are plant-specific, whereas RAV members are also found in animals [10,13]. During plant evolution, the AP2/ERF family has undergone significant expansion (e.g., 147 members in Arabidopsis thaliana (L.) Heynh., 163 in rice (Oryza sativa L.)), with gene duplication contributing to functional diversification [14,15,16]. Although AP2/ERF TFs have been extensively studied in model plants such as Arabidopsis and rice, research in Rosaceae Prunus species remains uneven—most studies have focused on peach (Prunus persica (L.) Batsch) [17,18,19].
The AP2/ERF transcription factors play important roles in many biological and physiological processes, including spikelet and floral organ development [6,20], fruit ripening and seed development [21,22], root initiation and development [23], stem growth and development [24], leaf size and development [25], and biotic and abiotic stress [26,27]. In apricot studies, flowers, fruits, and seeds constitute primary research targets owing to their centrality in reproduction and agricultural value. However, most of the related studies were conducted on model plants, including Arabidopsis [28], rice [29], maize (Zea mays L.) [30], and Brassica napus L. [31] fewer studies were report on other plant [21], especially Rosaceae, which underpins global fruit production and drives critical advances in pomology, genomics, and perennial crop breeding. Significantly, the regulatory functions of AP2/ERF transcription factors in apricot fruit maturation and seed development await characterization.
In this study, we performed a genome-wide systematic analysis of the AP2/ERF TF family in apricot. Through bioinformatics approaches, we identified 118 AP2/ERF members and classified them into four major subfamilies: AP2 (20), RAV (5), DREB (37), and ERF (55), and Soloist (1 unclassified member). Comprehensive analyses of gene structure, conserved motifs, chromosomal localization, duplication events, and syntenic relationships were conducted using multiple bioinformatics tools. Notably, cis-element analysis revealed that AP2/ERF members are enriched with hormone-responsive (e.g., ABRE) and stress-related elements. Particularly, PaERF35 and PaDREB5 contain both endosperm-specific and seed-preferential regulatory motifs, suggesting their potential roles in seed development in apricot. To systematically investigate expression patterns, transcriptomic data from apricot flowers, flower buds, leaves, and five kernel (K1–K5) and eight flesh (F1–F8) developmental stages were analyzed. Five AP2/ERF genes that harbor endosperm expression or seed-specific regulation elements were specifically expressed in apricot kernels, indicating their potential functions in kernel development. Through correlation analysis, 47 lipid-related genes were predicted as the potential targets of PaWRI1. This study provides preliminary insights into the potential molecular mechanisms by which the AP2/ERF transcription factor family (with particular emphasis on the WRI1 homolog PaAP2-5) regulates oil biosynthesis in apricot kernels. These findings lay a foundation for further investigations into the molecular mechanism of AP2/ERF genes in regulating apricot oil biosynthesis.

2. Materials and Methods

2.1. Genome-Wide Identification of AP2/ERF Transcription Factors in Apricot

To identify AP2/ERF proteins, we employed a computational pipeline using the AP2 DNA-binding domain HMM profile (PF00847) from InterPro. This profile was used to scan the annotated proteome of P. armeniaca cv. ‘Sungold’ (Genome assembly v1.0, 4 June 2021); accessed via https://www.rosaceae.org/Analysis/10254125 on 8 August 2024 using HMMER software (v3.4) with default parameters.
For comparative analysis, 146 AP2/ERF protein sequences from A. thaliana were retrieved from the TAIR database (https://www.arabidopsis.org/, accessed on 12 August 2024). The sequences served as BLASTP (BLAST-2.12.0+) queries against a local P. armeniaca protein database. Candidate genes identified by HMMER and BLASTP underwent NCBI CDD validation for AP2 domain confirmation, with verified sequences designated as final AP2/ERF family members.
Key physicochemical properties (molecular weight, pI, instability index, aliphatic index, GRAVY) of the identified AP2 TFs were predicted using Expasy ProtParam (https://web.expasy.org/protparam/, accessed on 15 August 2024).

2.2. Phylogenetic and Evolutionary Analysis

Protein sequences were aligned using MUSCLE (v5.1, default parameters) following standard protocols [32,33]. Phylogenetic reconstruction employed two complementary methods: maximum likelihood (TreeBeST module in MEGA7, 1000 bootstraps) and Bayesian inference in IQ-TREE (v2.0.4) [34]. Resultant phylogenies were visualized and annotated via iTOL (https://itol.embl.de/, accessed on 17 August 2025) [35,36]. In total, 115 AP2/ERF protein sequences from peach were retrieved from the JGI Phytozome 14 (https://phytozome-next.jgi.doe.gov/, accessed on 2 August 2025).

2.3. Structural Characterization of AP2/ERF Genes

Gene structure analysis was conducted through a comprehensive examination of exon–intron organization using TBtools -II (v2.100) [37]. Conserved motifs were identified via MEME (https://meme-suite.org/meme/, accessed on 17 August 2025, 20 maximum motifs, width 6–200 aa). Domain architecture was verified using the NCBI Batch CD-Search tool (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi, accessed on 17 August 2025) [37,38], with all structural features visualized through TBtools.

2.4. Chromosomal Localization and Synteny Analysis of AP2/ERF Gene Family

Chromosomal localization of the PaAP2/ERF gene family was visualized using TBtools, with genomic positional information extracted from the annotated genome assembly of P. armeniaca. Gene duplication events and synteny analysis among apricot, Arabidopsis, and peach were analyzed using One StepMCScanX-SuperFast module and visualized in TBtools [33].

2.5. Analysis of Regulatory Elements and Functional Annotation

Promoter regions (2 kb upstream of AP2/ERF genes) were analyzed for cis-regulatory elements using PlantCARE (https://bioinformatics.psb.ugent.be/webtools/plantcare/html/, accessed on 17 August 2025) with standard parameters [39]. Functional annotation was performed through eggNOG-mapper (http://eggnog-mapper.embl.de/, accessed on 17 August 2025), followed by Gene Ontology enrichment analysis using TBtools ORA (Over-Representation Analysis) module [40,41].

2.6. Transcriptional Profiling and Experimental Validation

AP2/ERF expression patterns were analyzed using RNA-seq data (https://www.cncb.ac.cn/, accessed on 17 August 2025, PRJCA001987), with FPKM values calculated across multiple tissues (young leaves, flower buds, flowers) and developmental stages (kernels: K1–K5; flesh: F1–F8). Hisat2-aligned reads (default parameters) were mapped to the ‘Sungold’ reference genome [42]. Gene expression levels were quantified as read counts with htseq-count, followed by FPKM calculation in R according to the formula: FPKM = (total exon fragments)/(mapped reads in millions × exon length in kilobases) [43]. Normalized FPKM values were used to generate heatmaps, with hierarchical clustering performed via the “complete” linkage method.
To validate the RNA-seq expression data, RT-qPCR analysis was performed. Kernel samples were collected from nine-year-old P. armeniaca cv. ‘Shenyangdashuaifu’ trees cultivated at the Yuanyang Long-Term Experimental Base of the Non-Timber Forest Research Institute, Chinese Academy of Forestry (Xinxiang, China). Samples from three healthy trees were harvested at 50 (17 April), 60, 70, 80, and 90 days after flowering (DAF) in spring, immediately frozen in liquid nitrogen for 10 min, and stored at −80 °C. For each tree, 30 kernels were collected from the east, south, west, and north canopy directions. Total RNA extraction and RT-qPCR were conducted using a CFX96 Touch Real-Time PCR Detection System (BIO-RAD, Hercules, CA, USA) [44]. The primer sequences are provided in Table S1, using UBQ as the reference gene. Expression levels were determined via the 2−ΔΔCt method with three biological replicates per gene [45].

2.7. Oil Content Quantification

Kernels used for oil content detection were the same samples used for RT-qPCR. Total oil content was determined using a nuclear magnetic resonance (NMR) analyzer (PQ001-SFC-Solid Fat Content Analyzer Benchtop NMR, Niumag, Suzhou, China) with three biological replicates. The oil calibration was constructed according to the manufacturer’s instruction using pure apricot kernel oil, which was obtained following Soxhlet extraction of 30 g freeze-dried mature kernel samples as previously described [46].

2.8. Statistical Analysis

Statistical analyses were conducted using GraphPad Prism 8:
(1)
Differences in gene expression (RT-qPCR) and oil content across sample groups were assessed using one-way ANOVA.
(2)
Pearson correlation analyses were performed to evaluate the concordance between RNA-seq and RT-qPCR data, and the relationships between gene expression patterns and oil accumulation profiles.

2.9. Targets Prediction of PaWRI1

To identify potential lipid biosynthesis targets of PaWRI1 in apricot kernels, we employed a dual-criterion approach: (1) Cis-element screening: Promoters must contain the AW-box motif CnTnG7[CG] (where n = any nucleotide); (2) Expression correlation: Pearson correlation coefficient between PaWRI1 and target genes ≥ 0.9. Target genes satisfying both criteria were functionally annotated via KEGG pathway analysis. Corresponding Arabidopsis orthologs were identified through BLASTP alignment (E-value < 1 × 10−5). Key candidate genes with putative roles in lipid metabolism are highlighted in Table S2.

3. Results

3.1. Identification of AP2/ERF Family Members in Apricot

Through HMM (Hidden Markov Model) screening and BLASTP analysis, followed by verification using NCBI’s CDD, we identified 118 members of the AP2/ERF gene family in apricot (Table S2). The encoded proteins exhibited considerable diversity in their physicochemical properties: amino acid lengths spanning 167–837 residues, molecular weight between 14.08 and 91.52 kDa, and isoelectric points (pI) from 4.63 to 10.61. Protein stability analysis revealed instability indices of 31.69–82.43, aliphatic indices of 32.45–78.93, and hydropathicity scores (GRAVY) ranging from −1.107 to −0.337 (Table S3).

3.2. Phylogenetic Analysis of the PaAP2/ERF Family

To elucidate the evolutionary relationships among apricot AP2/ERF genes, we performed comparative phylogenetic analysis using 118 apricot, 146 Arabidopsis, and 115 peach AP2/ERF protein sequences through multiple sequence alignment followed by maximum likelihood (ML) tree construction (Figure 1). The ML-based phylogenetic classification revealed that apricot AP2/ERF superfamily members segregate into five distinct groups: four major subfamilies, ERF (55 members), DREB (37), AP2 (20), and RAV (5), along with one unclassified member (Soloist) (Figure 1, Table S4). Notably, the ERF subfamily represents the most abundant group, while RAV constitutes the smallest subset.

3.3. Gene Structure, Conserved Motif, and Domain Analysis of the AP2/ERF Family in Apricot

To investigate the genomic architecture and phylogenetic characteristics of apricot AP2/ERF genes, we analyzed their exon-intron organization (Figure 2A). The analysis revealed considerable structural variation, with exon numbers ranging from 1 to 10 across all members, while DREB subfamily genes exhibited a more conserved structure containing only 1–2 exons. Members of the ERF and RAV subfamily contain 1–3 exons. The AP2 subfamily members contained significantly more exons (ranging from 4 to 10) than other subfamilies, exhibiting the most diverse gene structures among the four subfamilies. Consistent with the phylogenetic analysis, AP2 subfamily members with similar exon numbers exhibit a closer evolutionary relationship, such as PaAP2-10 (6) and PaAP2-14 (4).
Using the MEME suite in TBtools, 20 conserved motifs were identified in AP2/ERF protein sequences, with widths ranging from 6 to 200 bp (Figure 2B). Certain motifs, such as motifs 1 and 5, appeared across all subfamilies. Members within the same subfamily displayed similar motif compositions. For example, all DREB members contained adjacent motif 1 and motif 2, while the RAV subfamily uniquely possessed motif 13. Most AP2 subfamily members harbored motifs 1, 4, 6, and 8. The ERF subfamily showed the most complex motif organization, though all its members shared a conserved arrangement of motifs 1, 2, and 3. In contrast, the Soloist subfamily, represented by a single member, exclusively contained motifs 1 and 18.
NCBI Batch CD Search revealed key conserved domains in the apricot AP2/ERF protein sequences (Figure 2C). While all members possessed AP2 or AP2 superfamily domains, their number and type varied by subfamily. Most DREB and ERF members had a single AP2 domain, with one gene from each subfamily containing an AP2 superfamily domain. The AP2 subfamily primarily featured two domains—either two AP2 domains or one AP2 domain paired with an AP2 superfamily domain. Five AP2 members had only one domain of either type. Despite having the fewest members, the RAV subfamily exhibited the highest domain diversity, with five distinct types. Each RAV member contained one AP2 domain, while the remaining four domains were unique to this subfamily, including the rare Bfil_C_EcoRII_N_B3 and PLN02501 superfamily domains, each appearing only once.

3.4. Chromosomal Localization and Syntenic Relationships of the PaAP2/ERF Family

To visualize the distribution of AP2/ERF genes on apricot chromosomes, the Gene Density Profile in TBtools was employed. The 118 genes were mapped across all eight chromosomes (Figure 3), showing variations in gene counts per chromosome and an uneven distribution within individual chromosomes. Chromosome 1 contained the highest number (24), including 10 ERF, 9 DREB, 4 AP2, and 1 RAV subfamily members. In contrast, chromosome 8 had the fewest genes (7), with six belonging to the ERF subfamily and clustered near the chromosome end.
To investigate the evolutionary dynamics of AP2/ERF transcription factors in P. armeniaca, we employed MCScanX to analyze duplication events, identifying 25 segmental duplication pairs and 17 tandem duplication clusters (Figure 4A, Table S5).
Comparative synteny analysis with A. thaliana and peach revealed 110 and 147 syntenic gene pairs, respectively (Figure 4B, Table S6). The results showed distinct evolutionary patterns: 29 apricot AP2/ERF genes had one-to-one synteny with Arabidopsis homologs, while 31 genes exhibited one-to-multiple relationships, including 17 (one-to-two), 10 (one-to-three), 3 (one-to-four), and 1 (one-to-five). Similarly, synteny analysis with peach identified 50, 30, 11, and 1 apricot genes with one-to-one, one-to-two, one-to-three, and one-to-four relationships, respectively.

3.5. Analysis of Cis-Regulatory Elements and Functional Annotation of the PaAP2/ERF Family

To investigate the potential functions of AP2/ERF genes in apricot, we analyzed the 2 kb promoter regions upstream of each AP2/ERF gene using PlantCARE (Figure 2D). The analysis revealed that AP2/ERF gene promoters are enriched with diverse hormone- and stress-responsive cis-elements, demonstrating their extensive regulatory roles in plant growth and environmental adaptation. The cis-regulatory elements were functionally classified into hormone-responsive elements (including gibberellin and abscisic acid responsive motifs) mediating signal transduction, stress-responsive elements (such as low temperature and drought inducible motifs) involved in stress adaptation, and tissue-specific elements (e.g., seed and endosperm preferential motifs) regulating organ development. Promoter analysis revealed that the ABRE (abscisic acid-responsive element) was widely distributed across the AP2/ERF family, being present in 47 out of 55 ERF subfamily members, 33 out of 37 DREBs, 18 out of 20 AP2s, and all RAVs. In total, 25 and 18 AP2/ERF genes contain endosperm expression and seed-specific regulation elements, respectively. Notably, PaERF35 and PaDREB5 harbor both endosperm-specific expression and seed-specific regulatory cis-elements.
Comprehensive gene ontology (GO) enrichment analysis of AP2/ERF transcription factors in apricot revealed their multifaceted regulatory roles in plant growth and stress adaptation (Table S7). At the molecular level, the AP2/ERF family exhibited significant enrichment in DNA-binding transcription factor activity and sequence-specific DNA binding. Biological process annotation demonstrated their involvement in four major functional categories: developmental processes including post-embryonic organ morphogenesis and root system development; stress responses to abiotic stimuli, drought, and pathogen defense; metabolic regulation of primary metabolism and lipid biosynthesis; and hormone signaling pathways mediated by ethylene and jasmonate (Figure 5). Cellular component analysis revealed their predominant nuclear localization.

3.6. Expression Patterns of AP2/ERF Genes Across Tissues and Their Roles in Apricot Oil Accumulation

Among the tested samples, 89 AP2/ERF genes were expressed (FPKM > 1) in at least one tissue (Figure 6A). Tissue-specific expression was observed for 3 genes in flesh, 2 in flower/flower bud, and 10 each in young leaf and kernel. The majority of these genes exhibited distinct expression patterns across different tissues.
Most of the AP2 subfamily indicated a higher expression level in the kernel than in other samples, including PaAP2-1, PaAP2-5, PaAP2-13, PaAP2-16, and PaAP2-20, which presented higher expression activity at the last two developmental stages of kernel. Among all AP2 genes, PaAP2-5 (WRI1), PaAP2-13, and PaAP2-19 were specifically expressed in the kernel, whereas PaAP2-2 was specifically expressed in the flower or flower bud. Most of the RAV subfamily genes represented very low expression levels, except PaRAV1, the expression level of which is much higher in young leaves than that of any other RAV members’ expression in other samples. Most of the ERF subfamily represented a higher expression level in leaf and flesh than in other samples. Among all members of the AP2/ERF family, PaERF25 exhibits the highest expression level across all detected tissues and developmental stages, with an average FPKM value of 713. PaERF38, PaERF43, PaERF44, and PaERF53 were specifically expressed in young leaves. PaERF39 was specifically expressed in the flower and flower bud. Interestingly, PaERF26, PaERF42, and PaERF51 were specifically expressed in the kernel and had a different expression pattern, i.e., PaERF42 expressed more in the early development stage of the apricot kernel, whereas the other two in the later stage, indicating that they might play different roles during apricot kernel development. Similar to the ERF subfamily, genes in the DREB subfamily also showcased high expression activity in flesh and young leaves. PaDREB8, PaDREB11, and PaDREB36 were specifically expressed in flesh and had a higher expression level in the later, middle, and early develop pmental stages of flesh, respectively, indicating that they might play different roles during apricot flesh development. PaDREB5, PaDREB9, PaDREB20, PaDREB21, PaDREB24, and PaDREB37 were specifically expressed in young leaves, while PaDREB12 and PaDREB32 were specifically expressed in the kernel.
Since WRI1 was confirmed to bind AW-box and play pivotal roles in plant lipid biosynthesis [47], we mainly focus on the further analysis of PaWRI1. In apricot, the oil accumulated mainly from 50 DAF to 90 DAF (mature stage), indicating S-shaped growth (Figure S1). The expression pattern of PaWRI1 was positively correlated with oil accumulation, with a high Pearson correlation coefficient (0.7). To identify potential target genes of PaWRI1 involved in lipid biosynthesis in apricot kernel, we conduct correlation analysis between PaWRI1 and genes involved in lipid metabolism based on FPKMs, and AW-box scanning in the promoter sequences of lipid genes. In total, 47 lipid metabolism-related genes were identified with AW-box and co-expressed with PaWRI1 with a high Pearson correlation coefficient (>0.9), including genes involved in glycolysis, de novo FA synthesis, and TAG assembly (Table S2). To identify other AP2/ERF genes that might be co-expressed with PaWRI1 in the same network during kernel development, we conducted expression correlation analysis based on FPKMs. As a result, 14 AP2/ERF genes were co-expressed with PaWRI1 with a Pearson coefficient > 0.9, among which PaERF26, PaRAV3, PaDREB12, and PaAP2-13 were kernel-specifically expressed (Table S8).
To validate the RNA-seq expression profiles, we performed RT-qPCR analysis on selected genes (PaAP2-5, PaAP2-10, PaAP2-20, PaDREB4, PaDREB6, and PaDREB12) across different developmental stages (Figure 6B–G). The RT-qPCR results showed strong consistency with RNA-seq data, with significant positive correlations (Pearson’s r > 0.83, p < 0.05) between the two datasets. This concordance confirms the reliability of our transcriptomic analysis.

4. Discussion

The AP2/ERF transcription factor family represents one of the most important regulatory gene families in plants, playing crucial roles in various biological processes ranging from development to stress responses [48,49,50,51]. Our comprehensive genome-wide analysis identified 118 AP2/ERF genes in apricot, expanding our understanding of this gene family in Rosaceae species. The identified AP2/ERF proteins exhibited considerable variation in their physicochemical properties (molecular weights: 14.08–91.52 kDa; pI: 4.63–10.61; GRAVY: −1.107 to −0.337), suggesting extensive functional diversification within this family. The molecular diversity likely underlies the functional plasticity of AP2/ERF proteins in regulating various aspects of apricot growth and environmental adaptation.
According to phylogenetic analysis, apricot AP2/ERF transcription factors were classified into four distinct subfamilies (AP2, DREB, ERF, and RAV), with ERF being the largest (55 members) and RAV the smallest (5 members). This classification is consistent with reports in other plant species [52], which confirms the evolutionary conservation of AP2/ERF gene family organization. Compared to Arabidopsis, the total number of AP2 genes and the number of genes in each subfamily are more similar between apricot and peach (Table S4). This result indicates that the evolutionary trajectories of the AP2 gene family are strongly constrained by phylogenetic context, reflecting shared genomic and selective pressures between closely related species.
Notably, structural analysis showed that AP2 subfamily members possessed the most complex gene architectures (4–10 exons), while the other three subfamilies generally contained fewer exons (1–3). This situation was also found in other plants [52]. Research has indicated that evolutionary processes can lead to the progressive loss of introns [53,54]. In light of this theory, it is plausible that the AP2 subfamily gave rise to other subfamilies during evolution in plants. This structural diversity, particularly the alternative splicing potential in AP2/ERF subfamily genes [48], likely contributes to functional specialization among subfamilies. Domain architecture analysis further supported this notion—while most DREB/ERF members contained a single AP2 domain, AP2 subfamily proteins typically possessed two AP2 domains. Intriguingly, the small RAV subfamily exhibited the greatest domain diversity, which may reflect an evolutionary strategy where functional versatility compensates for limited gene numbers [55].
Chromosomal distribution analysis revealed an uneven distribution of AP2/ERF genes across the apricot genome, with chromosome 1 harboring the highest number (24 genes) and chromosome 8 the fewest (7 genes). This non-random distribution pattern suggests potential hotspots for gene family expansion [56,57,58].
The evolutionary expansion of gene families constitutes a fundamental mechanism for environmental adaptation in plants [59,60,61]. Our study reveals that the ERF subfamily within the AP2/ERF transcription factor family of apricot has undergone a distinctive expansion, characterized by frequent segmental and tandem duplication events. Genomic analyses identified 14 segmental duplications (out of 25 total) and 10 tandem duplications (out of 17 total) within the ERF subfamily. Notably, genes forming physical clusters exhibit complex expression patterns. For instance, PaERF1, PaERF2, and PaERF3 display divergent expression profiles, while the tandemly duplicated PaDREB genes show functional redundancy: PaDREB20, PaDREB21, and PaDREB24 were specifically expressed in young leaves, whereas PaDREB22 and PaDREB23 remain transcriptionally silent in all test samples. These results demonstrate that tandem duplication provides both genetic redundancy for developmental robustness and functional diversity for specialized environmental responses.
Comparative synteny analysis elucidates evolutionary connections among species while revealing how genomic architecture shapes biological function [62]. Our analysis identified 110 syntenic AP2/ERF gene pairs between apricot and Arabidopsis, and 147 pairs between apricot and peach (Figure 4B and Table S6), demonstrating conserved evolutionary trajectories of this superfamily across these species. Notably, 29 apricot AP2/ERF genes exhibited one-to-one synteny with Arabidopsis homologs, while 31 genes showed one-to-multiple relationships—including 17 (1:2), 10 (1:3), 3 (1:4), and 1 (1:5) gene pairs (Figure 4B). This asymmetry implies differential expansion rates following species divergence [62], with Arabidopsis undergoing more frequent gene duplications than apricot—a pattern potentially linked to the shorter life cycle of the former. Given that evolution typically preserves ortholog function [39], these syntenic relationships establish a robust framework for functional annotation of apricot AP2/ERF genes based on characterized Arabidopsis orthologs. The increased synteny conservation (including predominant 1:1 orthologs) in the apricot–peach comparison reflects their closer phylogenetic affinity relative to apricot–Arabidopsis pairs.
Cis-regulatory elements in promoters are key modulators of metabolic gene expression [39]. Numerous characterized cis-elements participate in hormonal signaling and stress-responsive gene regulation [63]. Comprehensive cis-element analysis of apricot AP2/ERF promoters demonstrated significant enrichment of diverse regulatory elements, including hormone-responsive motifs (auxin, GA, and ABA), stress adaptation elements, and tissue-specific regulators. Notably, five AP2/ERF genes that harbor endosperm expression or seed-specific regulation elements were specifically expressed in apricot kernels, indicating their potential functions in kernel development. Emerging evidence shows tissue-specific promoters mediate stronger gene expression than the constitutive 35S promoter [30,64]. Recent studies have demonstrated that the seed-specific PsOLE1 promoter from Prunus sibirica L. exhibits stronger transcriptional activity than the 35S constitutive promoter during late seed maturation stages [64]. Selecting optimal promoters to regulate lipid-related genes is critical in oil metabolic engineering. Employing the FUS3 promoter to control AtWRI1 expression represents an effective strategy for enhancing seed oil production by prolonging the accumulation phase during mid-seed development [65]. These findings suggest that future applications utilizing seed-specific promoters to drive expression of the key lipid biosynthesis genes and regulators could enable precise spatial-temporal control of oil accumulation specifically in developing apricot kernels. Importantly, ABA-responsive elements (ABREs) were overrepresented (89%) in DREB subfamily promoters, correlating with their stress-induced expression patterns [66]. Furthermore, auxin- and GA-responsive elements in flesh-specific AP2/ERFs (PaDREB8, PaDREB11, and PaDREB36) exhibited developmental stage-matched expression profiles during fruit development, recapitulating conserved hormonal regulation mechanisms observed in peach [67,68,69]. These findings collectively establish the molecular basis for AP2/ERF-mediated regulation of specialized metabolic and developmental processes in apricot.
The temporospatial characteristics of gene expression reflect their functions in different tissues and different developmental stages of plants. The functional diversification of AP2/ERF genes in apricot is evident from their varied expression patterns and predicted regulatory roles. The RAV subfamily genes represent very low expression levels, except PaRAV1, which had a much higher expression level in young leaves than in other samples. Previous study found that overexpression of RAV1 in Arabidopsis delays flowering, causes rosette leaf development to slow, and induces leaf senescence [70,71]. Thus, we speculate that PaRAV1 may play an important role in apricot leaves. Most of the AP2 subfamily had a higher expression level in the kernel than in other tissues, which is consistent with the important roles of AP2 in seed development and yield of Arabidopsis [22]. Most of the ERF and DREB subfamilies were found to have a higher expression level in leaf and flesh than in other samples. A previous study reported that PsERF1 played important roles in plum (Prunus salicina L.) fruit ripening [21]. Thus, we speculated that those ERFs with high transcriptional activity in flesh might be related to fruit development and the ripening of apricot. By targeting DRE/CRT motifs in promoter regions, DREB proteins modulate the expression of stress-responsive genes to play a pivotal role in ABA-mediated pathways, dehydration tolerance, and chilling stress responses [48]. More stress-related experiments should be performed to illustrate DREBs’ roles in apricot.
WRI1, as a member of the AP2 subfamily, was reported to play a pivotal role in plant oil biosynthesis in direct or indirect ways [72,73]. In apricot, the oil accumulated mainly from 50 DAF (when the kernel coat size reached the maximum) to 90 DAF (mature stage), along with seed maturation and dehydration (Figure S1). The expression of PaWRI1 was specifically activated in the kernel and positively correlated with apricot oil accumulation, indicating its potential role in lipid biosynthesis and kernel development in apricot. PsWRI1 was confirmed to orchestrate the regulation of carbon partitioning for oleic acid accumulation in P. sibirica kernel, based on transcriptome analysis and a transient co-transformation experiment [74]. These authors found that PsWRI1 could specifically up-regulate the promoter activity of stearoyl-ACP desaturase (SAD), phosphoenolpyruvate carboxykinase (PEPCK), and biotin synthase (BS). In this study, we predicted 47 target lipid-related genes of PaWRI1 on a whole-genomic scale, spanning the entire three-stage process of plant oil synthesis. These 47 target lipid-related genes include key genes, such as diacylglycerol acyltransferase (DGAT), Stearoyl-ACP Desaturase (SAD), and oleosin, and confirmed Ababidopsis genes, such as pyruvate kinase beta subunit 1 (PKP-β1), biotin carboxyl carrier protein (BCCP2; encoding a subunit of the ACCase), and BIOTIN ATTACHMENT DOMAIN-CONTAINING (BADC) [75]. Enhancing vegetable oil production by manipulating transcription factors has shown greater efficacy compared to strategies centered on individual enzymes [63,64,65]. This represents a strategic advantage for engineered lipid biosynthesis: once a stable genetic transformation system for apricot is established, we can employ gene editing technology to upregulate PaWRI1 expression, thereby increasing the kernel oil content of apricot. What’s more, four AP2/ERF genes specifically expressed in kernels were co-expressed with PaWRI1, among which PaDREB12 is the homolog of AtABI4. In Arabidopsis, abi4 mutants decreased sensitivity to ABA inhibition of germination and altered seed-specific gene expression [28]. These results indicated their potential synergistic role in the apricot kernel developmental network. Further experiments will be needed to verify our results and establish the WRI1-mediated oil biosynthesis and kernel development network in the apricot kernel.

5. Conclusions

In this study, we identified 118 putative AP2/ERF transcription factors in apricot, which were divided into four distinct subfamilies (20 AP2s, 37 DREBs, 55 ERFs, and 5 RAVs) and one Soloist. The clustered PaAP2/ERFs exhibited comparable gene architectures and conserved motif patterns, reinforcing the phylogenetic associations observed. The PaAP2/ERF genes displayed uneven chromosomal distribution, with expansion driven by both segmental (25 events) and tandem (17 events) duplications. In total, 25 AP2/ERF genes were tissue-specifically expressed in apricot. Based on co-expression analysis and AW-box scanning, 47 lipid-related genes were predicted as the potential targets of PaWRI1, which is pivotal in plant oil biosynthesis regulation. This study delivers the first chromosome-scale genomic annotation of the PaAP2/ERF family, establishing a foundation for elucidating the functional roles of critical PaAP2/ERF members in apricot lipid metabolism.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/f16081353/s1. Table S1: Primers used in this study. Table S2: Correlation between the expression level of PaWRI1 and lipid-related genes. Table S3: Characteristics of apricot (Prunus armeniaca L.) AP2/ERF transcription factor superfamily. Table S4: Summary of the AP2/ERF family in apricot, Arabidopsis thaliana (L.) Heynh., and peach (Prunus persica (L.) Batsch). Table S5: Duplication events of apricot AP2/ERF genes. Table S6: Synteny gene pairs of apricot vs Arabidopsis and peach. Table S7: GO enrichment of AP2/ERF transcription factors. Table S8: Identification of co-expressed AP2/ERF genes with PaWRI1. Figure S1: Oil content of apricot kernels at different developmental stages. K1, K2, K3, K4, and K5 refer to 50, 60, 70, 80, and 90 days after flowering.

Author Contributions

Conceptualization, H.L.; methodology, N.J.; software, Y.H. and X.S.; validation, Y.H.; formal analysis, L.W. and N.J.; investigation, H.L.; resources, H.L.; writing—original draft preparation, Y.H., N.J., and L.W.; writing—review and editing, N.J., L.W., T.W., and H.L.; visualization, D.Z.; project administration, T.W. and D.Z.; funding acquisition, H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the central non-profit research institution of the Chinese Academy of Forestry, grant number CAFYBB2021MA008, and the National Natural Science Foundation of China, grant number 32101562.

Data Availability Statement

Data are contained within the article and Supplementary Materials.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Erdogan-Orhan, I.; Kartal, M. Insights into research on phytochemistry and biological activities of Prunus armeniaca L. (apricot). Food Res. Int. 2011, 44, 1238–1243. [Google Scholar] [CrossRef]
  2. Pawar, K.R.; Nema, P.K. Apricot kernel characterization, oil extraction, and its utilization: A review. Food Sci. Biotechnol. 2023, 32, 249–263. [Google Scholar] [CrossRef]
  3. Lee, H.; Ahn, J.; Kwon, A.; Lee, E.S.; Kwak, J.; Min, Y. Chemical composition and antimicrobial activity of the essential oil of apricot seed. Phytotherapy Res. 2014, 28, 1867–1872. [Google Scholar] [CrossRef]
  4. Kiralan, M.; Özkan, G.; Kucukoner, E.; Ozcelik, M.M. Apricot (Prunus armeniaca L.) Oil. In Fruit Oils: Chemistry and Functionality; Springer: Berlin/Heidelberg, Germany, 2019; pp. 505–519. Available online: https://link.springer.com/chapter/10.1007/978-3-030-12473-1_25 (accessed on 17 August 2025).
  5. Alves, M.S.; Dadalto, S.P.; Gonçalves, A.B.; De Souza, G.B.; Barros, V.A.; Fietto, L.G. Transcription factor functional protein-protein interactions in plant defense responses. Proteomes 2014, 2, 85–106. [Google Scholar] [CrossRef]
  6. Nakano, T.; Suzuki, K.; Fujimura, T.; Shinshi, H. Genome-Wide Analysis of the ERF Gene Family in Arabidopsis and Rice. Plant Physiol. 2006, 140, 411–432. [Google Scholar] [CrossRef] [PubMed]
  7. Licausi, F.; Ohme-Takagi, M.; Perata, P. APETALA2/Ethylene Responsive Factor (AP2/ERF) transcription factors: Mediators of stress responses and developmental programs. New Phytol. 2013, 199, 639–649. [Google Scholar] [CrossRef]
  8. Sakuma, Y.; Liu, Q.; Dubouzet, J.G.; Abe, H.; Shinozaki, K.; Yamaguchi-Shinozaki, K. DNA-binding specificity of the ERF/AP2 domain of Arabidopsis DREBs, transcription factors involved in dehydration-and cold-inducible gene expression. Biochem. Biophys. Res. Commun. 2002, 290, 998–1009. [Google Scholar] [CrossRef] [PubMed]
  9. Ohme-Takagi, M.; Shinshi, H. Ethylene-inducible DNA binding proteins that interact with an ethylene-responsive element. Plant Cell 1995, 7, 173. [Google Scholar] [CrossRef]
  10. Rashid, M.; Guangyuan, H.; Guangxiao, Y.; Hussain, J.; Xu, Y. AP2/ERF transcription factor in rice: Genome-wide canvas and syntenic relationships between monocots and eudicots. Evol. Bioinform. 2012, 8, 321–355. [Google Scholar] [CrossRef]
  11. Allen, M.D.; Yamasaki, K.; Ohme-Takagi, M.; Tateno, M.; Suzuki, M. A novel mode of DNA recognition by a β-sheet revealed by the solution structure of the GCC-box binding domain in complex with DNA. EMBO J. 1998, 17, 5484–5496. [Google Scholar] [CrossRef]
  12. Giri, M.K.; Swain, S.; Gautam, J.K.; Singh, S.; Singh, N.; Bhattacharjee, L.; Nandi, A.K. The Arabidopsis thaliana At4g13040 gene, a unique member of the AP2/EREBP family, is a positive regulator for salicylic acid accumulation and basal defense against bacterial pathogens. J. Plant Physiol. 2014, 171, 860–867. [Google Scholar] [CrossRef]
  13. Magnani, E.; SjölAnder, K.; Hake, S. From endonucleases to transcription factors: Evolution of the AP2 DNA binding domain in plants. Plant Cell 2004, 16, 2265–2277. [Google Scholar] [CrossRef] [PubMed]
  14. Sharoni, A.M.; Nuruzzaman, M.; Satoh, K.; Shimizu, T.; Kondoh, H.; Sasaya, T.; Choi, I.-R.; Omura, T.; Kikuchi, S. Gene structures, classification and expression models of the AP2/EREBP transcription factor family in rice. Plant Cell Physiol. 2010, 52, 344–360. [Google Scholar] [CrossRef]
  15. Liu, Q.; Wang, Z.; Xu, X.; Zhang, H.; Li, C.; Prasad, M. Genome-wide analysis of C2H2 zinc-finger family transcription factors and their responses to abiotic stresses in poplar (Populus trichocarpa). PLoS ONE 2015, 10, e0134753. [Google Scholar] [CrossRef] [PubMed]
  16. Zhuang, J.; Cai, B.; Peng, R.-H.; Zhu, B.; Jin, X.-F.; Xue, Y.; Gao, F.; Fu, X.-Y.; Tian, Y.-S.; Zhao, W.; et al. Genome-wide analysis of the AP2/ERF gene family in Populus trichocarpa. Biochem. Biophys. Res. Commun. 2008, 371, 468–474. [Google Scholar] [CrossRef] [PubMed]
  17. Verde, I.; Jenkins, J.; Dondini, L.; Micali, S.; Pagliarani, G.; Vendramin, E.; Paris, R.; Aramini, V.; Gazza, L.; Rossini, L.; et al. The Peach v2.0 release: High-resolution linkage mapping and deep resequencing improve chromosome-scale assembly and contiguity. BMC Genom. 2017, 18, 1–18. [Google Scholar] [CrossRef]
  18. Gao, Z.; Li, Q.; Li, J.; Chen, Y.; Luo, M.; Li, H.; Wang, J.; Wu, Y.; Duan, S.; Wang, L.; et al. Characterization of the ABA receptor VlPYL1 that regulates anthocyanin accumulation in grape berry skin. Front. Plant Sci. 2018, 9, 592. [Google Scholar] [CrossRef]
  19. Tatsuki, M.; Nakajima, N.; Fujii, H.; Shimada, T.; Nakano, M.; Hayashi, K.-I.; Hayama, H.; Yoshioka, H.; Nakamura, Y. Increased levels of IAA are required for system 2 ethylene synthesis causing fruit softening in peach (Prunus persica L. Batsch). J. Exp. Bot. 2013, 64, 1049–1059. [Google Scholar] [CrossRef]
  20. Chuck, G.; Meeley, R.B.; Hake, S. The control of maize spikelet meristem fate by the APETALA2-like gene indeterminate spikelet1. Genes Dev. 1998, 12, 1145–1154. [Google Scholar] [CrossRef]
  21. El-Sharkawy, I.; Kim, W.S.; El-Kereamy, A.; Jayasankar, S.; Svircev, A.M.; Brown, D.C.W. Isolation and characterization of four ethylene signal transduction elements in plums (Prunus salicina L.). J. Exp. Bot. 2007, 58, 3631–3643. [Google Scholar] [CrossRef]
  22. Jofuku, K.D.; Omidyar, P.K.; Gee, Z.; Okamuro, J.K. Control of seed mass and seed yield by the floral homeotic gene APETALA2. Proc. Natl. Acad. Sci. USA 2005, 102, 3117–3122. [Google Scholar] [CrossRef]
  23. Hirota, A.; Kato, T.; Fukaki, H.; Aida, M.; Tasaka, M. The auxin-regulated AP2/EREBP gene PUCHI is required for morphogenesis in the early lateral root primordium of Arabidopsis. Plant Cell 2007, 19, 2156–2168. [Google Scholar] [CrossRef]
  24. van Raemdonck, D.; Pesquet, E.; Cloquet, S.; Beeckman, H.; Boerjan, W.; Goffner, D.; El Jaziri, M.; Baucher, M. Molecular changes associated with the setting up of secondary growth in aspen. J. Exp. Bot. 2005, 56, 2211–2227. [Google Scholar] [CrossRef]
  25. Shukla, R.K.; Raha, S.; Tripathi, V.; Chattopadhyay, D. Expression of CAP2, an APETALA2-family transcription factor from chickpea, enhances growth and tolerance to dehydration and salt stress in transgenic tobacco. Plant Physiol. 2006, 142, 113–123. [Google Scholar] [CrossRef]
  26. Dietz, K.-J.; Vogel, M.O.; Viehhauser, A. AP2/EREBP transcription factors are part of gene regulatory networks and integrate metabolic, hormonal and environmental signals in stress acclimation and retrograde signalling. Protoplasma 2010, 245, 3–14. [Google Scholar] [CrossRef]
  27. Yamaguchi-Shinozaki, K.; Shinozaki, K. A novel cis-acting element in an Arabidopsis gene is involved in responsiveness to drought, low-temperature, or high-salt stress. Plant Cell 1994, 6, 251–264. [Google Scholar] [PubMed]
  28. Finkelstein, R.R.; Wang, M.L.; Lynch, T.J.; Rao, S.; Goodman, H.M. The Arabidopsis abscisic acid response locus ABI4 encodes an APETALA2 domain protein. Plant Cell 1998, 10, 1043. [Google Scholar] [CrossRef] [PubMed]
  29. Kitomi, Y.; Ito, H.; Hobo, T.; Aya, K.; Kitano, H.; Inukai, Y. The auxin responsive AP2/ERF transcription factor CROWN ROOTLESS5 is involved in crown root initiation in rice through the induction of OsRR1, a type-A response regulator of cytokinin signaling. Plant J. 2011, 67, 472–484. [Google Scholar] [CrossRef]
  30. Wang, G.; Wang, H.; Zhu, J.; Zhang, J.; Zhang, X.; Wang, F.; Tang, Y.; Mei, B.; Xu, Z.; Song, R. An expression analysis of 57 transcription factors derived from ESTs of developing seeds in Maize (Zea mays). Plant Cell Rep. 2010, 29, 545–559. [Google Scholar] [CrossRef]
  31. Boutilier, K.; Offringa, R.; Sharma, V.K.; Kieft, H.; Ouellet, T.; Zhang, L.; Hattori, J.; Liu, C.-M.; van Lammeren, A.A.M.; Miki, B.L.A.; et al. Ectopic Expression of BABY BOOM Triggers a Conversion from Vegetative to Embryonic Growth. Plant Cell 2002, 14, 1737–1749. [Google Scholar] [CrossRef]
  32. Edgar, R.C. Muscle5: High-accuracy alignment ensembles enable unbiased assessments of sequence homology and phylogeny. Nat. Commun. 2022, 13, 6968. [Google Scholar] [CrossRef]
  33. Vilella, A.J.; Severin, J.; Ureta-Vidal, A.; Heng, L.; Durbin, R.; Birney, E. EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates. Genome Res. 2009, 19, 327–335. [Google Scholar] [CrossRef]
  34. Darriba, D.; Posada, D.; Kozlov, A.M.; Stamatakis, A.; Morel, B.; Flouri, T.; Crandall, K. ModelTest-NG: A New and Scalable Tool for the Selection of DNA and Protein Evolutionary Models. Mol. Biol. Evol. 2020, 37, 291–294. [Google Scholar] [CrossRef]
  35. Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; von Haeseler, A.; Lanfear, R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 2020, 37, 1530–1534. [Google Scholar] [CrossRef]
  36. Letunic, I.; Life, P.B. Interactive Tree of Life (iTOL) v5: An online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021, 49, W293–W296. Available online: https://pubmed.ncbi.nlm.nih.gov/33885785/ (accessed on 17 August 2025). [CrossRef]
  37. Chen, C.J.; Chen, H.; Zhang, Y.; Thomas, H.R.; Frank, M.H.; He, Y.H.; Xia, R. TBtools: An integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 2020, 13, 1194–1202. [Google Scholar] [CrossRef]
  38. Marchler-Bauer, A.; Bo, Y.; Han, L.; He, J.; Lanczycki, C.J.; Lu, S.; Chitsaz, F.; Derbyshire, M.K.; Geer, R.C.; Gonzales, N.R.; et al. CDD/SPARCLE: Functional classification of proteins via subfamily domain architectures. Nucleic Acids Res. 2017, 45, D200–D203. [Google Scholar] [CrossRef]
  39. Lescot, M.; Déhais, P.; Thijs, G.; Marchal, K.; Moreau, Y.; Van de Peer, Y.; Rouzé, P.; Rombauts, S. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 2002, 30, 325–327. [Google Scholar] [CrossRef] [PubMed]
  40. Wu, T.; Hu, E.; Xu, S.; Chen, M.; Guo, P.; Dai, Z.; Feng, T.; Zhou, L.; Tang, W.; Zhan, L.; et al. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation 2021, 2, 100141. [Google Scholar] [CrossRef] [PubMed]
  41. Wang, Y.; Tang, H.; DeBarry, J.D.; Tan, X.; Li, J.; Wang, X.; Lee, T.-H.; Jin, H.; Marler, B.; Guo, H.; et al. MCScanX: A toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012, 40, e49. [Google Scholar] [CrossRef] [PubMed]
  42. Kim, D.; Langmead, B.; Salzberg, S.L. HISAT: A fast spliced aligner with low memory requirements. Nat. Methods 2015, 12, 357–360. [Google Scholar] [CrossRef] [PubMed]
  43. Luo, J.; Nvsvrot, T.; Wang, N. Comparative transcriptomic analysis uncovers conserved pathways involved in adventitious root formation in poplar. Physiol. Mol. Biol. Plants 2021, 27, 1903–1918. [Google Scholar] [CrossRef] [PubMed]
  44. Liu, H.; Lu, Y.; Wang, J.; Hu, J.; Wuyun, T. Genome-wide screening of long non-coding RNAs involved in rubber biosynthesis in Eucommia ulmoides. J. Integr. Plant Biol. 2018, 60, 1070–1082. [Google Scholar] [CrossRef]
  45. Livak, K.J.; Schmittgen, T.D. Analysis of relative gene expression data using real-time quantitative PCR and the 2−ΔΔCT method. Methods 2001, 25, 402–408. [Google Scholar] [CrossRef]
  46. Gayas, B.; Kaur, G.; Singh, A. Ultrasound assisted extraction of apricot kernel oil: Effect on physicochemical, morphological characteristics, and fatty acid composition. Acta Aliment. 2020, 49, 23–31. [Google Scholar] [CrossRef]
  47. Maeo, K.; Tokuda, T.; Ayame, A.; Mitsui, N.; Kawai, T.; Tsukagoshi, H.; Ishiguro, S.; Nakamura, K. An AP2-type transcription factor, WRINKLED1, of Arabidopsis thaliana binds to the AW-box sequence conserved among proximal upstream regions of genes involved in fatty acid synthesis. Plant J. 2009, 60, 476–487. [Google Scholar] [CrossRef]
  48. Feng, K.; Hou, X.; Xing, G.; Liu, J.; Duan, A.; Xu, Z.; Li, M.; Zhuang, J.; Xiong, A. Advances in AP2/ERF super-family transcription factors in plant. Crit. Rev. Biotechnol. 2020, 40, 750–776. [Google Scholar] [CrossRef]
  49. Okamuro, J.K.; Caster, B.; Villarroel, R.; Van Montagu, M.; Jofuku, K.D. The AP2 domain of APETALA2 defines a large new family of DNA binding proteins in Arabidopsis. Proc. Natl. Acad. Sci. USA 1997, 94, 7076–7081. [Google Scholar] [CrossRef]
  50. Shigyo, M.; Hasebe, M.; Ito, M. Molecular evolution of the AP2 subfamily. Gene 2006, 366, 256–265. [Google Scholar] [CrossRef]
  51. Kim, S.; Soltis, P.S.; Wall, K.; Soltis, D.E. Phylogeny and Domain Evolution in the APETALA2-like Gene Family. Mol. Biol. Evol. 2006, 23, 107–120. [Google Scholar] [CrossRef]
  52. Han, Y.; Cai, M.; Zhang, S.; Chai, J.; Sun, M.; Wang, Y.; Xie, Q.; Chen, Y.; Wang, H.; Chen, T. Genome-wide identification of AP2/ERF transcription factor family and functional analysis of DcAP2/ERF#96 associated with abiotic stress in Dendrobium catenatum. Int. J. Mol. Sci. 2022, 23, 13603. [Google Scholar] [CrossRef]
  53. Roy, S.W.; Gilbert, W. The evolution of spliceosomal introns: Patterns, puzzles and progress. Nat. Rev. Genet. 2006, 7, 211–221. [Google Scholar] [CrossRef] [PubMed]
  54. Kordiš, D. Extensive intron gain in the ancestor of placental mammals. Biol. Direct 2011, 6, 59. [Google Scholar] [CrossRef] [PubMed]
  55. Binford, G.J.; Bodner, M.R.; Cordes, M.H.; Baldwin, K.L.; Rynerson, M.R.; Burns, S.N.; Zobel-Thropp, P.A. Molecular evolution, functional variation, and proposed nomenclature of the gene family that includes sphingomyelinase D in sicariid spider venoms. Mol. Biol. Evol. 2009, 26, 547–566. [Google Scholar] [CrossRef]
  56. Huang, Y.-L.; Zhang, L.-K.; Zhang, K.; Chen, S.-M.; Hu, J.-B.; Cheng, F. The impact of tandem duplication on gene evolution in Solanaceae species. J. Integr. Agric. 2022, 21, 1004–1014. [Google Scholar] [CrossRef]
  57. Rensing, S.A. Gene duplication as a driver of plant morphogenetic evolution. Curr. Opin. Plant Biol. 2014, 17, 43–48. [Google Scholar] [CrossRef]
  58. Qiao, X.; Li, Q.; Yin, H.; Qi, K.; Li, L.; Wang, R.; Zhang, S.; Paterson, A.H. Gene duplication and evolution in recurring polyploidization–diploidization cycles in plants. Genome Biol. 2019, 20, 38. [Google Scholar] [CrossRef]
  59. Van de Peer, Y.; Mizrachi, E.; Marchal, K. The evolutionary significance of polyploidy. Nat. Rev. Genet. 2017, 18, 411–424. [Google Scholar] [CrossRef]
  60. Riaz, M.W.; Lu, J.; Shah, L.; Yang, L.; Chen, C.; Mei, X.D.; Xue, L.; Manzoor, M.A.; Abdullah, M.; Rehman, S.; et al. Expansion and Molecular Characterization of AP2/ERF Gene Family in Wheat (Triticum aestivum L.). Front. Genet. 2021, 12, 632155. [Google Scholar] [CrossRef]
  61. Lawton-Rauh, A. Evolutionary dynamics of duplicated genes in plants. Mol. Phylogenetics Evol. 2003, 29, 396–409. [Google Scholar] [CrossRef]
  62. Jung, S.; Jiwan, D.; Cho, I.; Lee, T.; Abbott, A.; Sosinski, B.; Main, D. Synteny of Prunus and other model plant species. BMC Genom. 2009, 10, 76. [Google Scholar] [CrossRef]
  63. Nakashima, K.; Ito, Y.; Yamaguchi-Shinozaki, K. Transcriptional regulatory networks in response to abiotic stresses in arabidopsis and grasses. Plant Physiol. 2009, 149, 88–95. [Google Scholar] [CrossRef]
  64. Soares, V.L.F.; Rodrigues, S.M.; de Oliveira, T.M.; de Queiroz, T.O.; Lima, L.S.; Hora-Júnior, B.T.; Gramacho, K.P.; Micheli, F.; Cascardo, J.C.M.; Otoni, W.C.; et al. Unraveling new genes associated with seed development and metabolism in Bixa orellana L. by expressed sequence tag (EST) analysis. Mol. Biol. Rep. 2011, 38, 1329–1340. [Google Scholar] [CrossRef] [PubMed]
  65. Kanai, M.; Mano, S.; Kondo, M.; Hayashi, M.; Nishimura, M. Extension of oil biosynthesis during the mid-phase of seed development enhances oil content in Arabidopsis seeds. Plant Biotechnol. J. 2016, 14, 1241–1250. [Google Scholar] [CrossRef] [PubMed]
  66. Xie, Z.; Nolan, T.M.; Jiang, H.; Yin, Y. AP2/ERF Transcription Factor Regulatory Networks in Hormone and Abiotic Stress Responses in Arabidopsis. Front. Plant Sci. 2019, 10, 228. [Google Scholar] [CrossRef] [PubMed]
  67. Pegoraro, C.; Tadiello, A.; Girardi, C.L.; Chaves, F.C.; Quecini, V.; de Oliveira, A.C.; Trainotti, L.; Rombaldi, C.V. Transcriptional regulatory networks controlling woolliness in peach in response to preharvest gibberellin application and cold storage. BMC Plant Biol. 2015, 15, 279. [Google Scholar] [CrossRef]
  68. Liu, N. Effects of IAA and ABA on the Immature Peach Fruit Development Process. Hortic. Plant J. 2019, 5, 145–154. [Google Scholar] [CrossRef]
  69. Soto, A.; Ruiz, K.B.; Ravaglia, D.; Costa, G.; Torrigiani, P. ABA may promote or delay peach fruit ripening through modulation of ripening- and hormone-related gene expression depending on the developmental stage. Plant Physiol. Biochem. 2013, 64, 11–24. [Google Scholar] [CrossRef]
  70. Hu, Y.X.; Wang, Y.H.; Liu, X.F.; Li, J.Y. Arabidopsis RAV1 is down-regulated by brassinosteroid and may act as a negative regulator during plant development. Cell Res. 2004, 14, 8–15. [Google Scholar] [CrossRef]
  71. Woo, H.R.; Kim, J.H.; Kim, J.; Kim, J.; Lee, U.; Song, I.J.; Kim, J.H.; Lee, H.Y.; Nam, H.G.; Lim, P.O. The RAV1 transcription factor positively regulates leaf senescence in Arabidopsis. J. Exp. Bot. 2010, 61, 3947–3957. [Google Scholar] [CrossRef]
  72. Cernac, A.; Benning, C. WRINKLED1 encodes an AP2/EREB domain protein involved in the control of storage compound biosynthesis in Arabidopsis. Plant J. 2004, 40, 575–585. [Google Scholar] [CrossRef]
  73. Baud, S.; Mendoza, M.S.; To, A.; Harscoët, E.; Lepiniec, L.; Dubreucq, B. WRINKLED1 specifies the regulatory action of LEAFY COTYLEDON2 towards fatty acid metabolism during seed maturation in Arabidopsis. Plant J. 2007, 50, 825–838. [Google Scholar] [CrossRef]
  74. Deng, S.; Mai, Y.; Shui, L.; Niu, J. WRINKLED1 transcription factor orchestrates the regulation of carbon partitioning for C18:1 (oleic acid) accumulation in Siberian apricot kernel. Sci. Rep. 2019, 9, 2693. [Google Scholar] [CrossRef]
  75. Savadi, S.; Lambani, N.; Kashyap, P.L.; Bisht, D.S. Genetic engineering approaches to enhance oil content in oilseed crops. Plant Growth Regul. 2017, 83, 207–222. [Google Scholar] [CrossRef]
Figure 1. The evolutionary relationships among AP2/ERF family members were reconstructed through maximum likelihood (ML) analysis with IQ-TREE software (v2.0.4), with graphical representation generated using iTOL. Color-coded clusters distinguish the five phylogenetic groups, including four major subfamilies and Soloist members.
Figure 1. The evolutionary relationships among AP2/ERF family members were reconstructed through maximum likelihood (ML) analysis with IQ-TREE software (v2.0.4), with graphical representation generated using iTOL. Color-coded clusters distinguish the five phylogenetic groups, including four major subfamilies and Soloist members.
Forests 16 01353 g001
Figure 2. Structural characterization of apricot (Prunus armeniaca L.) AP2/ERF genes. (A) Gene architecture showing exon (yellow boxes), intron (black lines), and UTR (green boxes) organization. Gene labels are color-coded according to phylogenetic classification (four subfamilies plus Soloist). (B) Protein motif composition, with 20 distinct motifs represented by colored segments. (C) Domain architecture, illustrating six conserved domains through color-coded annotations. (D) Predicted cis-element in apricot AP2/ERF genes. The 2.0 kb promoter sequences were subjected to PlantCARE for cis-element analysis, with visualization implemented in TBtools. In the resulting schematic, eighteen distinct color-coded segments along the promoter lines represent eighteen functional categories of cis-regulatory elements.
Figure 2. Structural characterization of apricot (Prunus armeniaca L.) AP2/ERF genes. (A) Gene architecture showing exon (yellow boxes), intron (black lines), and UTR (green boxes) organization. Gene labels are color-coded according to phylogenetic classification (four subfamilies plus Soloist). (B) Protein motif composition, with 20 distinct motifs represented by colored segments. (C) Domain architecture, illustrating six conserved domains through color-coded annotations. (D) Predicted cis-element in apricot AP2/ERF genes. The 2.0 kb promoter sequences were subjected to PlantCARE for cis-element analysis, with visualization implemented in TBtools. In the resulting schematic, eighteen distinct color-coded segments along the promoter lines represent eighteen functional categories of cis-regulatory elements.
Forests 16 01353 g002
Figure 3. The chromosomal distribution of AP2/ERF genes across all eight apricot chromosomes was visualized using TBtools’ Gene Location Visualize (GTF/GFF) and Gene Density Profile. Gene names are indicated on the right side of each chromosome.
Figure 3. The chromosomal distribution of AP2/ERF genes across all eight apricot chromosomes was visualized using TBtools’ Gene Location Visualize (GTF/GFF) and Gene Density Profile. Gene names are indicated on the right side of each chromosome.
Forests 16 01353 g003
Figure 4. Comparative synteny analysis of AP2/ERF transcription factors in apricot. (A) Intra-genomic synteny of AP2/ERF genes. (B) Inter-genomic synteny among apricot, Arabidopsis thaliana (L.) Heynh., and peach (Prunus persica (L.) Batsch). MCScanX was used for duplication detection and TBtools for visualization, with gray and red lines representing segmental duplications and syntenic AP2/ERF pairs, respectively.
Figure 4. Comparative synteny analysis of AP2/ERF transcription factors in apricot. (A) Intra-genomic synteny of AP2/ERF genes. (B) Inter-genomic synteny among apricot, Arabidopsis thaliana (L.) Heynh., and peach (Prunus persica (L.) Batsch). MCScanX was used for duplication detection and TBtools for visualization, with gray and red lines representing segmental duplications and syntenic AP2/ERF pairs, respectively.
Forests 16 01353 g004
Figure 5. Functional annotation of apricot AP2/ERF proteins was performed through Gene Ontology (GO) classification. The eggNOG-mapper tool was employed for gene function annotation, followed by GO enrichment analysis conducted with TBtools. The color-coded bars represent distinct GO categories: biological process (blue), cellular component (orange), and molecular function (green).
Figure 5. Functional annotation of apricot AP2/ERF proteins was performed through Gene Ontology (GO) classification. The eggNOG-mapper tool was employed for gene function annotation, followed by GO enrichment analysis conducted with TBtools. The color-coded bars represent distinct GO categories: biological process (blue), cellular component (orange), and molecular function (green).
Forests 16 01353 g005
Figure 6. Expression profiles of apricot AP2/ERF genes in kernel, fruit, flower, flower bud, and young leaf tissues. (A) The heatmap illustrates Z-score-transformed FPKM values. Cluster analysis was performed based on expression trends and displayed in the circular map, grouping genes with similar patterns. K1–K5: Developing kernels at sequential stages; F1–F8: Fruit flesh samples representing progressive developmental phases; FL: Mature flower; FB: Flower bud. Data visualization was performed using R with normalized FPKM values. Flower, leaf, kernel, and fruit icons adjacent to gene names denote the tissue-specific expression of corresponding genes. (BG) RT-qPCR validation of the expression levels of AP2/ERF genes in the different developmental stages of apricot kernels (Blue bars and left Y-axis). Error bars represent the means ± SE (n = 3). The FPKM values of the corresponding genes are also shown (orange lines and Y-axis on the right).
Figure 6. Expression profiles of apricot AP2/ERF genes in kernel, fruit, flower, flower bud, and young leaf tissues. (A) The heatmap illustrates Z-score-transformed FPKM values. Cluster analysis was performed based on expression trends and displayed in the circular map, grouping genes with similar patterns. K1–K5: Developing kernels at sequential stages; F1–F8: Fruit flesh samples representing progressive developmental phases; FL: Mature flower; FB: Flower bud. Data visualization was performed using R with normalized FPKM values. Flower, leaf, kernel, and fruit icons adjacent to gene names denote the tissue-specific expression of corresponding genes. (BG) RT-qPCR validation of the expression levels of AP2/ERF genes in the different developmental stages of apricot kernels (Blue bars and left Y-axis). Error bars represent the means ± SE (n = 3). The FPKM values of the corresponding genes are also shown (orange lines and Y-axis on the right).
Forests 16 01353 g006
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

He, Y.; Wang, L.; Jiang, N.; Zhang, D.; Shi, X.; Wuyun, T.; Liu, H. Genome-Wide Identification and Expression Profiling of AP2/ERF Transcription Factor Genes in Prunus armeniaca L. Forests 2025, 16, 1353. https://doi.org/10.3390/f16081353

AMA Style

He Y, Wang L, Jiang N, Zhang D, Shi X, Wuyun T, Liu H. Genome-Wide Identification and Expression Profiling of AP2/ERF Transcription Factor Genes in Prunus armeniaca L. Forests. 2025; 16(8):1353. https://doi.org/10.3390/f16081353

Chicago/Turabian Style

He, Yanguang, Lin Wang, Nan Jiang, Donglin Zhang, Xiaodan Shi, Tana Wuyun, and Huimin Liu. 2025. "Genome-Wide Identification and Expression Profiling of AP2/ERF Transcription Factor Genes in Prunus armeniaca L." Forests 16, no. 8: 1353. https://doi.org/10.3390/f16081353

APA Style

He, Y., Wang, L., Jiang, N., Zhang, D., Shi, X., Wuyun, T., & Liu, H. (2025). Genome-Wide Identification and Expression Profiling of AP2/ERF Transcription Factor Genes in Prunus armeniaca L. Forests, 16(8), 1353. https://doi.org/10.3390/f16081353

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop