Next Article in Journal
The Methylation of the CYP11A1 Gene Affects the Expression Pattern in Different-Diameter Granulosa Cells of Qira Black Sheep
Previous Article in Journal
Transcriptome Analysis of Paeonia ostii ‘Fengdan’ Seeds Uncovers Starch and Sucrose Metabolism Conferring High Yield Under Brassinosteroid Treatment
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification of the FLA Gene Family in Soybean and Preliminary Functional Analysis of Its Drought-Responsive Candidate Genes

1
Xinjiang Key Laboratory for Ecological Adaptation and Evolution of Extreme Environment Organisms, College of Life Sciences, Xinjiang Agricultural University, Urumqi 830052, China
2
College of Life Sciences, Xinjiang Agricultural University, Urumqi 830052, China
3
College of Agriculture, Xinjiang Agricultural University, Urumqi 830052, China
*
Author to whom correspondence should be addressed.
Genes 2025, 16(12), 1425; https://doi.org/10.3390/genes16121425
Submission received: 16 November 2025 / Revised: 25 November 2025 / Accepted: 26 November 2025 / Published: 29 November 2025
(This article belongs to the Section Plant Genetics and Genomics)

Abstract

Background/Objectives: Fasciclin-like arabinogalactan proteins (FLAs) are critical components of the plant cell wall, playing vital roles in development and abiotic stress responses. However, a systematic genome-wide analysis of the FLA family in soybean (Glycine max L.), a major legume crop susceptible to drought, is lacking. This study aimed to comprehensively identify GmFLA members and investigate their potential functions in drought tolerance. Methods: We identified GmFLA genes via BLASTP (v2.16.0) and Hidden Markov Model (HMM) searches against the soybean genome. Subsequent analyses encompassed their physicochemical properties, chromosomal distribution, gene structure, phylogenetic relationships, conserved domains, cis-acting elements, and subcellular localization. Drought-responsive candidates were screened using a Gene Expression Omnibus (GEO) dataset, and their expression profiles were validated under drought stress using quantitative real-time PCR (qRT-PCR). Results: A total of 55 GmFLA genes were identified and unevenly distributed across 14 chromosomes. Most genes featured a single-exon structure and contained a conserved Fasciclin domain, with predicted localization primarily to the chloroplast. Phylogenetic analysis grouped them into three distinct subclasses with Arabidopsis homologs, suggesting lineage-specific expansion. Promoter analysis revealed an abundance of stress- and hormone-related cis-elements. Expression analysis identified five candidate genes (GmFLA5, GmFLA15, GmFLA40, GmFLA47, and GmFLA54) that showed tissue-specific expression changes under drought treatment. Conclusions: This study provides the first comprehensive genomic characterization of the GmFLA gene family and identifies candidate GmFLAs with drought-responsive expression patterns. Our findings establish a foundation for future functional research to investigate their potential roles in soybean drought response. Furthermore, these candidates serve as potential targets for further investigation in strategies aimed at improving soybean drought tolerance.

1. Introduction

Plant cells adapt to environmental changes, respond to external stimuli, and maintain homeostasis through intricate intercellular communication and interactions [1]. This process relies on signal transmission across the cell wall. As a dynamic and complex structure, the cell wall not only provides mechanical support and protection but also directly participates in signal transduction, cell communication, and immune defense [2]. Critically, the cell wall is not a static barrier but is actively remodeled under drought stress. This remodeling manifests in various ways: for instance, drought inhibits the biosynthesis of primary cell wall components like pectin and hemicellulose in cotton fibers, suppressing the expression of key biosynthetic genes such as galacturonosyltransferase (GhGAUT) and xylosyltransferase (GhXXT) [3]. Concurrently, studies in Arabidopsis have shown that drought stress alters the expression and activity of xyloglucan-modifying enzymes such as xyloglucan endotransglucosylase/hydrolases (XTHs), which remodel the cellulose–xyloglucan network and influence cell wall mechanical properties and water retention capacity [4]. These changes collectively influence cell wall mechanical properties and water retention capacity. Hydroxyproline-rich glycoproteins (HRGPs) are a widely distributed class of proteins within the cell wall, playing crucial roles in plant growth and development [5] as well as disease resistance [6]. Based on differences in glycosylation levels, the HRGP superfamily can be classified into three categories: arabinogalactan proteins (AGPs), proline-rich proteins (PRPs), and extended proteins (EXTs) [7].
Arabinogalactan proteins (AGPs) are a class of highly glycosylated glycoproteins, with their glycosyl side chains accounting for over 90% of their molecular weight. Most are GPI-anchored to the plasma membrane [8]. AGPs are widely distributed in the cell walls, plasma membranes, and extracellular matrix of higher plants [9], participating in regulating various physiological processes such as somatic embryogenesis, cell proliferation, pollen tube growth, and hormone signaling [10]. Based on protein structural features, AGPs can be further classified into classical AGPs, AG peptides, chimeric AGPs (CAGPs), and non-classical AGPs [11,12]. Among these, fasciclin-like AGPs (FLAs) within chimeric AGPs have garnered significant attention due to their incorporation of 1–2 fasciclin domains with adhesive functions [13]. Notably, FLAs are implicated in cell adhesion, intercellular communication, and abiotic stress responses [4]. Current understanding of FLA genes in drought response among legumes is still developing. While transcriptomic studies in model legumes like Medicago truncatula have identified various drought-responsive genes, the specific mechanisms and functional roles of FLAs in this process require further elucidation.
Fasciclin-like arabinogalactan proteins (FLAs) constitute a well-defined AGP subfamily. Beyond the typical AGP glycosylation region, they contain 1–2 FAS domains [14] and predominantly possess an N-terminal signal peptide and a C-terminal GPI anchor signal. The FLA gene family has been identified in various plant species, with species-specific variations in the number of members. Examples include Arabidopsis thaliana (L.), Oryza sativa L., Triticum aestivum L., Populus trichocarpa Torr. & Gray, American black poplar (Populus deltoides Marshall), Salix suchowensis W.C. Cheng in S.Y. Jin, cotton (Gossypium hirsutum Linn), Chinese cabbage (Brassica rapa ssp. chinensis), and tomato (Solanum lycopersicum L.), with 21, 27, 34, 35, 40, 46, 19, 36, and 24 FLA members, respectively [15,16,17,18,19,20,21,22]. FLAs are increasingly recognized not only as structural components but also as potential contributors to cell wall integrity sensing and stress signaling, suggesting their possible role in connecting cell wall biology with stress physiology. Fasciclin-like arabinogalactan proteins (FLAs) are key glycoproteins regulating cell wall development and cellulose deposition, broadly influencing crop fiber quality and stem mechanical properties [23,24]. Research indicates that cotton GhFLA1 regulates fiber initiation and elongation [25]; Arabidopsis AtFLA11/12 participates in stem biomechanical regulation [26], AtFLA3 influences pollen development [27], while AtFLA1/4/18 mediate root development and ABA signal transduction in stress responses [15,28,29]. Furthermore, this family’s involvement in abiotic stress response is evidenced in other species: in tomato, SlFLA1 and SlFLA3 were significantly induced by drought stress [22], while conversely, in potato, StFLA4 acted as a negative regulator of drought tolerance [30]. Separately, the banana MaFLA27 gene was shown to confer cold tolerance by modulating cell wall remodeling [4].
As a major food and oil crop and model legume, soybean production is frequently constrained by drought stress [31,32]. Although FLA genes have been extensively studied across multiple species, their systematic identification in soybean and functional roles in drought response remain unclear. This study aims to deepen understanding of this gene family in soybean and lay the foundation for future functional studies of soybean FLA genes in drought response. To this end, we conducted a genome-wide identification of the FLA gene family in soybean, analyzing its phylogenetic relationships, gene structures, conserved motifs, and promoter cis-acting elements. By integrating public transcriptomic data with qRT-PCR experiments, we preliminarily screened and validated several drought-response candidate genes. This establishes a theoretical foundation for subsequent investigations into the functional mechanisms of soybean FLA genes in drought tolerance and growth development.

2. Results

2.1. Identification and Nomenclature of the FLA Gene Family

This study employed an integrated strategy to systematically identify members of the FLA gene family in the soybean (G. max) genome. First, homology-based BLASTP alignment was performed using Arabidopsis FLA protein sequences as queries (E-value < 1 × 10−5), combined with a Hidden Markov Model (HMM) search for the conserved Fasciclin domain (PF02469) from the Pfam database (E-value < 1 × 10−5). The candidate sequences from both approaches were merged and subjected to rigorous online validation using the NCBI Conserved Domain Database. This validation confirmed that all 55 non-redundant genes contained a complete, typical Fasciclin domain (domain E-value < 0.01 and coverage > 60%). Based on their physical locations across soybean’s 20 chromosomes, these genes were sequentially named GmFLA1 through GmFLA55.

2.2. Chromosomal Distribution and Gene Localization

Chromosomal distribution analysis revealed (Figure 1) that the 55 GmFLA genes are not evenly distributed across all chromosomes but are concentrated on 14 chromosomes. Among these, distribution on chromosomes 11 and 12 is particularly concentrated. Analysis indicates no significant correlation between the number of genes on each chromosome and its physical size.

2.3. Analysis of Physicochemical Properties of Encoded Proteins

A systematic analysis of the physicochemical properties of GmFLA proteins was conducted (Table 1). The amino acid lengths of family members ranged from 206 (GmFLA13) to 454 (GmFLA39), with most concentrated between 250 and 450 amino acids; their predicted molecular weights ranged from approximately 24 kDa (GmFLA13) to 50 kDa (GmFLA39), with most falling between 25 and 50 kDa. The isoelectric point (pI) distribution was broad, spanning from 4.53 (GmFLA13) to 9.48 (GmFLA42), with most members exhibiting pI values between 5.0 and 9.5. The average instability index is 43.5, though most members have values below 40, suggesting these proteins may exhibit high stability. Lipid indices are generally high (average 94.3, with most distributed between 80 and 105), reflecting a strong hydrophobic tendency across the family. Further hydrophilicity/hydrophobicity analysis revealed that 40% of members exhibited hydrophobic properties, notably GmFLA19 (0.263), GmFLA14 (0.189), and GmFLA13 (0.178); while the remaining 60% exhibited hydrophilicity, such as GmFLA32 (−0.331), GmFLA27 (−0.308), and GmFLA5 (−0.235).
Additionally, subcellular localization predictions indicate that members of this family exhibit diverse potential localizations: 26 members are predicted to localize to chloroplasts, while the remaining members localize to the cell wall, vacuole, and plasma membrane. It should be noted that these predictions are based solely on amino acid sequences and may not fully reflect the complex post-translational modification processes that ultimately determine FLA protein localization. As recently revealed by studies on Arabidopsis FLA proteins, the final localization of FLA proteins may be jointly regulated by GPI anchoring and AG glycosylation. These two mechanisms collectively determine their cell surface localization and subsequent release to the cell wall [33]. Signal peptide prediction results indicate that, except for GmFLA11, GmFLA13, GmFLA20, GmFLA25, and GmFLA42, all other members contain signal peptides. This strongly supports that the vast majority of GmFLAs enter the secretory pathway and function at the cell surface or within the cell wall matrix.

2.4. Structural and Phylogenetic Analysis of the FLA Gene Family in Soybean

Our analysis indicates that the number of exons varies among members of this family, ranging from 1 to 6 (Figure 2). Notably, the majority of members (73%) contain a single exon, suggesting this is a characteristic feature of the family. Most FLA family members lack introns; only 16 members possess introns, and 10 of these have two or more introns. The lengths of these introns vary significantly across members and positions, ranging from tens of base pairs to several kilobases. This suggests that multi-intron structures may be retained by functional selection and could be associated with more complex alternative splicing regulation.
To investigate the evolutionary relationships among the FLA gene families in soybean and Arabidopsis, we constructed a phylogenetic tree using MEGA 11 software (Figure 3). As shown, FLA proteins from these plants were grouped into three subclades based on homology. The smallest subclade contained 12 members (7 from the soybean FLA family), while the largest subclade comprised 45 members (39 from the soybean FLA family, accounting for 87% of the largest subclade). This subclade exhibited numerous soybean-specific branches independent of Arabidopsis. Soybean underwent extensive lineage-specific gene duplication events during evolution, exhibiting significantly higher expansion intensity than Arabidopsis. This provides a molecular basis for the adaptive evolution of legumes. Future research should focus on elucidating the specific regulatory mechanisms of these branched genes in stress responses and other potential biological processes.

2.5. Analysis of the Conserved Domains and Motifs in the Soybean FLA Gene Family

Analysis of conserved domains in soybean FLA proteins using the NCBI CD-Search tool revealed that all family members harbor a typical fasciclin domain (Figure 4), with highly conserved position and boundaries (average coverage > 95%). This domain persists stably across phylogenetic branches, confirming it as the core functional module of the soybean FLA family and suggesting strong functional constraints during evolution. Visual analysis of 10 conserved motifs (Motif 1–10) (Figure 5 and Figure 6) identified using MEME v5.5.2 revealed that Motif 1 is present in all soybean FLA members (100%). Its sequence localization overlaps with the core region of the Fasciclin domain, indicating this motif plays a crucial role in maintaining domain function. Secondary conserved motifs Motif 12 and Motif 3 exhibited frequencies of 92.1% and 85.3%, respectively, primarily distributed in the flanking regions of the Fasciclin domain; a repetition phenomenon occurred where Motif 6, Motif 8, and Motif 9 exhibit duplicate copies (each appearing twice) in 32.7% of members. These repeated units are predominantly located at the N/C-termini of proteins, suggesting potential involvement in regulating protein interactions or subcellular localization.

2.6. Analysis of Cis-Acting Elements in the Soybean FLA Gene Family

Analysis of cis-acting elements in the promoter regions of the 55 GmFLA genes revealed that members of this family commonly harbor multiple elements associated with hormone response, abiotic stress, and growth and development (Figure 7 and Figure 8). Critically, we identified a high prevalence of key drought-responsive motifs, including the abscisic-acid-responsive element (ABRE, 88 total), dehydration-responsive elements (DRE, present in multiple members), and MYB binding sites (50 total), suggesting that GmFLA genes are likely involved in core drought and ABA signaling pathways in soybean.
Among other stress-related elements, defense and stress response elements totaled 37 (with GmFLA38 containing 4), and low-temperature response elements numbered 11 (GmFLA36 and GmFLA55 each contained 2). Hormone response elements were also widely distributed, with methyl jasmonate response elements (70), abscisic acid response elements (ABRE, 88), gibberellin response elements (45), and salicylic acid response elements (30) being the most abundant.
Furthermore, light-response elements (82 in total) and a small number of circadian clock regulatory elements were identified, indicating potential roles in light signal integration. GmFLA15, GmFLA44, and GmFLA47 were the primary carriers of light-response, ABRE, and MeJA-response elements, respectively. Overall, the diverse cis-element profile suggests that GmFLA family genes may participate in complex regulatory networks coordinating light signaling, hormone-mediated growth, and stress responses.

2.7. Preliminary Screening of the FLA Gene Family in Soybean for Drought Response

In the Gene Expression Omnibus (GEO) database, the “Search for Studies at GEO Datasets” tool was used to retrieve records using the keyword “drought stress”. The search was then filtered by species (soybean) and data expression type (“Expression profiling by array”), yielding 10 records. Five datasets (GSE102749, GSE65553, GSE50408, GSE429663, and GSE40604) were selected for subsequent analysis based on the following criteria: compatibility with the GEO2R analytical tool, inclusion of both biological replicates and appropriate control samples, and coverage of different stress durations and tissue types (roots and leaves). These datasets were analyzed using the online tool Analyze with GEO2R within the GEO database. Root and leaf expression data were processed separately to maintain tissue-specific resolution in the analysis of drought responses. Differentially expressed genes (DEGs) were screened based on p < 0.05 and |logFC| ≥ 1, ultimately identifying five genes (Table 2): GmFLA5, GmFLA15, GmFLA40, GmFLA47, and GmFLA54. The GenBank accession numbers and detailed information of these five genes are provided in Table S1 (see Supplementary Materials).

2.8. Expression Patterns of Candidate FLA Genes Under Different Stress Conditions

Gene expression analysis was evaluated via quantitative real-time PCR (qRT-PCR). The expression of soybean FLA genes was analyzed in vegetative-stage seedlings following a 15% PEG6000-induced drought stress. Root, stem, and leaf tissues were sampled at 0, 2, 8, and 12 h post-treatment (n = 3 biological replicates). qRT-PCR revealed distinct temporal expression patterns among several FLA genes, demonstrating their activation during the early drought response (Figure 9).
The qRT-PCR results revealed that five candidate GmFLA genes (GmFLA5, GmFLA15, GmFLA40, GmFLA47, and GmFLA54) exhibited significant tissue-specific and time-dependent expression patterns under drought stress. In root tissues, the consistent downregulation of all five genes coincides with the typical drought-induced inhibition of root growth, potentially reflecting a strategic reallocation of resources from cell wall expansion to the maintenance of essential root functions involved in water uptake. Within stem tissues, the gene expression patterns diverged. The upregulation of GmFLA5, GmFLA40, and GmFLA47 suggests their potential involvement in stem mechanical reinforcement and vascular function maintenance under dehydration stress, while the sustained suppression of GmFLA54 may indicate its specific role is dispensable during stem stress adaptation. In leaves, all genes demonstrated the most rapid response, with strong induction observed at the 2 h stress time point. This rapid induction in leaves aligns with ABA-mediated drought responses, potentially contributing to cell wall modifications that facilitate stomatal regulation and water conservation under dehydration conditions. Although expression levels gradually declined with prolonged stress, they remained significantly elevated above baseline levels at 12 h. Notably, GmFLA54 exhibited a distinctive pattern, showing stable suppression in both roots and stems but strong induction in leaves, highlighting its unique organ-specific regulatory mechanism.

3. Discussion

This study successfully identified 55 FLA genes in the soybean genome, a number significantly higher than that in Arabidopsis (21) and rice (27). This finding aligns with two whole-genome duplication events in soybean history [15,16,17,34], suggesting lineage-specific expansion as the primary driver of GmFLA family evolution. Such expansion likely provided abundant genetic material for soybean adaptation to complex environments, such as drought stress and nodule development. The retention of a substantial number of GmFLA duplicates following polyploidization events may reflect selective pressures to maintain or refine specialized functions pertinent to soybean’s ecological adaptation and agronomic traits. Phylogenetic analysis further revealed that 82% of members clustered within a single major evolutionary branch, forming multiple soybean-specific lineages (Figure 1), strongly supporting the central role of gene duplication in family expansion. The emergence of these soybean-specific lineages suggests potential functional diversification or specialization within the FLA family, possibly contributing to unique aspects of soybean biology. Notably, this branch includes several FLA homologs known to be associated with stress responses, suggesting their functions could potentially be enriched toward environmental adaptation.
All GmFLAs contain a typical Fasciclin domain, with Motif 1 being highly conserved across all members and covering the core region of this domain. This feature aligns closely with the reported functions of FLAs in Arabidopsis and rice, which mediate protein–protein or protein–cell wall interactions [14,26], suggesting that GmFLAs may similarly participate in cell wall construction and regulation. Notably, the conserved fasciclin domain, often associated with cell adhesion, together with the characteristic AGP (arabinogalactan protein) motifs, could potentially enable these GmFLAs to potentially serve as molecular bridges linking the plasma membrane to the cell wall matrix, possibly participating in the perception and transmission of osmotic stress signals under drought conditions. Based on this structural feature, we hypothesize that specific GmFLAs might function as potential components in the cell wall–plasma membrane continuum, potentially modulating mechanical properties of cell walls and contributing to membrane integrity under dehydration stress. More specifically, they could potentially influence cell-wall stiffening by cross-linking wall components or through interactions with the cellulose–hemicellulose network, and might contribute to osmotic adjustment by affecting wall porosity and water-holding capacity.
Subcellular localization predictions suggest diverse localizations for GmFLAs, including chloroplasts (26 members), cell wall, vacuole, and plasma membrane. However, these computational predictions require cautious interpretation, as the actual localization of FLA proteins is known to be determined by complex post-translational mechanisms. Studies in Arabidopsis demonstrate that GPI anchoring and AG glycosylation collectively guide FLAs to the cell surface and subsequently to the cell wall [33]. This established mechanism aligns with our signal peptide analysis, which confirms that most GmFLAs contain signal peptides and are likely secreted to function primarily at the cell surface or within the cell wall matrix [8,9].
Promoter analysis revealed widespread cis-elements for light, ABA, and MeJA responses, with drought-induced MYB binding sites occurring most frequently. This provides a potential transcriptional regulatory basis for GmFLAs’ potential involvement in drought responses. The prevalence of ABA-responsive elements further implies that these GmFLAs might function within the abscisic acid signaling pathway, a central regulator of drought responses. Moreover, the frequent occurrence of MYB binding sites suggests a potential link to reactive oxygen species (ROS) signaling and cell wall integrity surveillance pathways, both of which are activated under drought stress and involve MYB transcription factors. This implies GmFLAs might function at the intersection of ABA signaling, ROS homeostasis, and cell wall dynamics under drought. These regulatory features support a functional hypothesis where drought-responsive GmFLAs may integrate ABA and ROS signaling pathways to coordinate cell wall remodeling with physiological responses to water deficit. Notably, members such as GmFLA38 and GmFLA44 contain multiple stress response elements, suggesting they might act as signal integration nodes. Their regulatory mechanisms could be further validated via yeast one-hybrid assays or ChIP-qPCR [22,28].
Through GEO data screening and qRT-PCR validation, this study identified five GmFLA genes (GmFLA5, GmFLA15, GmFLA40, GmFLA47, GmFLA54) exhibiting tissue-specific expression patterns under drought stress. Under drought stress, the GmFLA gene exhibits highly organ-specific expression, reflecting differentiated adaptive strategies adopted by different tissues in response to water deficit. The consistent downregulation of all genes in roots indicates an active metabolic conservation strategy, where resources are redirected toward critical survival mechanisms by suppressing gene expression associated with cell wall expansion. This root-specific downregulation, contrasting with upregulation in leaves, could reflect distinct tissue priorities: roots may reduce energy-intensive processes like cell wall remodeling to conserve resources, whereas leaves might prioritize cell wall reinforcement to limit water loss and maintain turgor. This coordinated downregulation in roots may represent a strategic reallocation of energy away from expansive growth and towards the reinforcement of existing cell walls, a process potentially mediated by specific FLAs, to enhance soil penetration and water foraging under dry conditions. We hypothesize that this transcriptional reprogramming might fine-tune root cell wall extensibility and mechanical strength, potentially influencing root system architecture and hydraulic conductance under water deficit. Differential expression patterns in stems reveal functional specificity: Upregulated genes likely contribute to maintaining vascular function and structural reinforcement. Conversely, the downregulation of GmFLA15 and GmFLA54 suggests their functions require suppression during stem stress adaptation. The rapid and robust induction of all genes in leaves strongly supports the critical role of FLA proteins in early leaf defense responses, such as stomatal closure and cell wall remodeling. This rapid induction in leaves is consistent with a putative role in cell wall remodeling to limit water loss and maintain tissue turgor, possibly through interactions with pectin and hemicellulose networks. Based on their domain characteristics and expression patterns, we propose that leaf-induced GmFLAs may modulate cell wall porosity and barrier properties, thereby regulating transpirational water loss while maintaining photosynthetic capacity under drought conditions. Particularly noteworthy are GmFLA15 and GmFLA54, which exhibit specific suppression patterns in roots and stems but are induced in leaves, making them promising candidates for studying organ-specific functional differentiation. Although this study, based on bioinformatics screening and partial gene validation, has not yet conducted systematic functional analysis of all 55 GmFLA members, its findings suggest the potential role of this family in drought response and provide candidate targets and hypotheses for subsequent functional studies using overexpression or silencing technologies.
Beyond the mechanistic insights, our findings possess broader biological and practical implications. The identified drought-responsive GmFLA genes, particularly those with tissue-specific induction patterns like GmFLA15 and GmFLA54 in leaves, represent promising candidates for molecular breeding. Their expression profiles could potentially serve as transcriptional markers for marker-assisted selection or biomarker development aimed at engineering drought-tolerant soybean varieties. While the functional conservation of these specific drought-responsive GmFLA genes across diverse soybean cultivars remains to be experimentally determined, our findings provide a theoretical foundation for improving drought tolerance in soybean. Future studies should investigate the variation and functional consistency of these candidate genes across different soybean germplasms. The potential applicability of these findings to other legume species would require identification and characterization of corresponding FLA homologs in those systems.
Future research should prioritize constructing overexpression, RNAi, or CRISPR-Cas9 mutant lines of key genes (e.g., GmFLA15, GmFLA54), combined with analyses of physiological indicators (water loss rate, photosynthetic efficiency) and cell wall components (pectin, hemicellulose), to systematically elucidate their specific mechanisms in drought resistance. Specifically, future work should test the proposed hypotheses by determining whether manipulating these candidate genes exerts effects on root architecture and leaf water retention precisely through modulating cell wall composition and mechanical properties. It is important to note that the PEG-simulated drought used in this study primarily induces osmotic stress and may not fully replicate field conditions; thus, field-based validation remains necessary. Furthermore, this study lacks protein-level validation. Future work should include Western blot analysis to confirm protein expression and glycosylation profiling to assess post-translational modifications, given their importance for AGP function. Glycoproteomics could be employed to investigate these regulatory mechanisms. Ultimately, field trials and cross-species comparisons will help comprehensively reveal the FLA family’s functional diversity in soybean environmental adaptation.

4. Materials and Methods

4.1. Identification of FLA Family Members in Soybean

The soybean genome data Glycine_max_v4.0 and its annotation files were downloaded from the NCBI Genome database. Subsequently, the following operations were performed sequentially using Tbtools (v2.0) software: (1) Extraction of soybean CDS sequences via the Gtf/Gff3 Sequences Extract function; (2) Simplification of sequence identifiers in FASTA files using the Fasta ID Simplify function; (3) Finally, the CDS sequences were translated into protein sequences using the Batch Translate CDS to Protein function.
To comprehensively identify GmFLA members, we employed a dual screening strategy: First, we performed BLASTP homology searches against the soybean proteome using all known Arabidopsis FLA protein sequences (downloaded from Arabidopsis databases) as query sequences, with an E-value threshold set to <1 × 10−5. Second, we performed searches using the Simple HMM Search function in TBtools (E-value < 1 × 10−5) based on the Hidden Markov Model (HMM, PF02469) for the Fasciclin conserved domain from the Pfam database. After merging and deduplicating candidate sequences from both methods, we rigorously validated each candidate online using the NCBI conserved domain database to eliminate false positives and ensure identification accuracy. Sequences that did not contain a complete, typical Fasciclin domain (as defined by the NCBI CDD with a domain E-value < 0.01 and coverage > 60%) were discarded. Only sequences confirmed to contain a complete, typical Fasciclin domain were ultimately designated as non-redundant members of the FLA gene family.

4.2. Chromosomal Localization and Physicochemical Properties Analysis of the Soybean FLA Gene Family

The TBtools Gene Location Visualize from Gtf/Gff function was used to visualize the chromosomal distribution of soybean FLA genes. The Protein Parameter Calc module was employed to calculate physicochemical parameters including molecular weight (Mw), isoelectric point (pI), instability coefficient, and hydrophilicity (GRAVY). Subcellular localization was predicted using WoLF PSORT (https://wolfpsort.hgc.jp/, accessed on 7 December 2024). Signal peptides were analyzed with SignalP 5.0 (https://services.healthtech.dtu.dk/service.php?SignalP-5.0, accessed on 7 December 2024) using a D-value threshold > 0.5. The integrated results were cross-validated to determine protein secretion mechanisms.

4.3. Analysis of Conserved Domains, Motifs, and Gene Structures in the Soybean FLA Gene Family

Soybean FLA protein sequences were submitted to the MEME online tool (https://meme-suite.org/meme/tools/meme, accessed on 9 December 2024) with 10 motifs and maximum length parameters. Motif Display in TBtools visualized the motifs, enabling comparison of motif distribution similarities among family members. Simultaneously, based on genomic GTF/GFF annotation files and CDS sequences, the intron–exon structure diagram was drawn using TBtools Gene Structure View (Advanced) function to systematically compare structural features among different genes.

4.4. Evolutionary Analysis and Cis-Acting Element Analysis of the Soybean FLA Gene Family

Arabidopsis FLA protein sequences were retrieved from the Arabidopsis database and merged with soybean FLA sequences into a single file. A phylogenetic tree was constructed using the Neighbor-Joining method in MEGA 11 software. Key parameters included 1000 bootstrap replicates to assess branch reliability, with other parameters set to default values. The final tree underwent topology optimization, annotation refinement, and publication-grade visualization adjustments via the iTOL online platform (https://itol.embl.de/, accessed on 9 December 2024).
Using the “Gtf/Gff3 Sequences Extract” and “Fasta Extract” functions in Tbtools (v2.0) software, extract promoter sequences spanning 2000 bp upstream of the transcription start site (TSS) for all FLA genes from the soybean reference genome; Submitted the obtained sequences to the PlantCARE database (https://bioinformatics.psb.ugent.be/webtools/plantcare/html/, accessed on 26 July 2025) for cis-acting element scanning and functional annotation; Based on biological function, key regulatory elements for abiotic stress responses (e.g., drought, cold, heat stress) and hormone responses (e.g., abscisic acid, methyl jasmonate) were retained. Visual analysis of the selected element types and their distribution patterns was performed using TBtools.

4.5. Screening of Drought Response Candidate Genes

We retrieved drought-related transcriptomic datasets from the NCBI GEO database by searching with the keyword “drought stress” and applying filters for the species (G. max) and data type (expression profiling by array). The dataset selection criteria required that studies: (1) include both biological replicates (at least two per condition) and suitable control samples (well-watered conditions); (2) provide coverage of key tissues involved in drought response, specifically roots and leaves. All differential expression analyses were performed using the GEO2R platform, which automatically executes background correction and normalization specific to each platform (as described in the GEO2R documentation). Differential expression analysis between drought-treated and control groups was conducted using GEO2R’s default parameters on the normalized expression values, with thresholds set at |logFC| ≥ 1 and an adjusted p-value (adj.p.Val) < 0.05. For probe annotation, we used the latest platform annotation files (GPL) provided by NCBI GEO without further updates. Finally, the resulting list of differentially expressed genes (DEGs) was cross-referenced with our pre-identified soybean FLA gene set to screen for drought-responsive candidates.

4.6. Plant Materials and Treatment Methods

Soybeans used in this study were provided by our laboratory. Soybean seeds were sown in soil and cultivated under controlled conditions (25 °C, 16 h light/8 h dark) until the vegetative growth stage, specifically the growth phase between the first and second trifoliate leaves. Irrigation with a 15% PEG6000 solution was employed to simulate drought stress. This treatment leveraged PEG6000’s high molecular weight to prevent penetration through cell walls, thereby inducing a water deficit state within plants that closely mimics natural drought conditions [35]. This approach facilitated the maintenance of stable and controllable stress conditions.
To dynamically monitor the early transcriptional response to drought stress, root, stem, and leaf samples were randomly collected at 0, 2, 8, and 12 h post-stress application. Each time point comprised three biological replicates, with each replicate derived from pooled tissues of different plants. Samples were immediately flash-frozen in liquid nitrogen and stored at –80 °C for subsequent gene expression analysis.

4.7. RNA Extraction and Quantitative Reverse Transcription Polymerase Chain Reaction (qRT-PCR)

RNA was extracted using the Plant Total RNA Extraction Kit from Fuji Bio Technology Co., Ltd. (Chengdu, China). cDNA synthesis was performed using the All-In-One 5X RT MasterMix with gDNA Removal Kit (Applied Biological Materials Inc., Richmond, BC, Canada) under the following conditions: 37 °C for 15 min, 60 °C for 10 min, and 95 °C for 3 min. Real-time quantitative PCR was performed on the LightCycler 96 system (Roche, Mannheim, Germany). The reaction mixture contained 10 μL TB Green® Premix Ex Taq™ II, 0.8 μL each of forward and reverse primers, 2 μL cDNA template, and 6.4 μL ddH2O. Gene-specific primers were designed using the NCBI Primer-BLAST tool (https://www.ncbi.nlm.nih.gov/tools/primer-blast/, accessed on 20 December 2024) to ensure specificity, and the primer sequences used in this study are listed in Table S2 (see Supplementary Materials). The amplification specificity of each primer pair was validated by the appearance of a single peak in the melting curve analysis. The relative expression levels of target genes were calculated using the 2−ΔΔCt method and normalized against the stable reference gene CYP2 (NC_038248).

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes16121425/s1, Table S1: Information on the five FLA family genes identified from GEO datasets under drought stress in soybean. Table S2: Primer sequences for qRT-PCR analysis of the five FLA genes.

Author Contributions

Conceptualization, J.Z. and Z.Y.; methodology, J.Z. and Z.Y.; software, J.Z., L.Y., J.D., C.W. and Z.Y.; validation, J.Z., L.Y. and Z.Y.; formal analysis, J.Z.; investigation, J.Z., J.D. and L.Y.; resources, C.W. and Z.Y.; data curation, J.Z.; writing—original draft preparation, J.Z.; writing—review and editing, J.Z. and Z.Y.; visualization, J.Z.; supervision, Z.Y.; project administration, Z.Y.; funding acquisition, Z.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Major Science and Technology Special Project of the Xinjiang Uygur Autonomous Region, Grant Number: 2022A02008-1.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data are displayed in the manuscript and Supplementary Materials.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Armingol, E.; Baghdassarian, H.M.; Lewis, N.E. The diversification of methods for studying cell–cell interactions and communication. Nat. Rev. Genet. 2024, 25, 381–400. [Google Scholar] [CrossRef] [PubMed]
  2. Cosgrove, D.J. Structure and growth of plant cell walls. Nat. Rev. Mol. Cell Biol. 2023, 25, 340–358. [Google Scholar] [CrossRef] [PubMed]
  3. Zhu, H.; Hu, W.; Li, Y.; Zou, J.; He, J.; Wang, Y.; Wang, S.; Zhou, Z. Drought shortens cotton fiber length by inhibiting biosynthesis, remodeling and loosening of the primary cell wall. Ind. Crop. Prod. 2023, 200, 116827. [Google Scholar] [CrossRef]
  4. Liu, J.; Chen, C.; Chen, L.; Sharif, R.; Meng, J.; Gulzar, S.; Yi, Z.; Chen, S.; Zhan, H.; Liu, H.; et al. The banana MaFLA27 confers cold tolerance partially through modulating cell wall remodeling. Int. J. Biol. Macromol. 2025, 290, 138748. [Google Scholar] [CrossRef]
  5. Pinski, A.; Betekhtin, A.; Kwasniewska, J.; Chajec, L.; Wolny, E.; Hasterok, R. 3,4-Dehydro-L-proline Induces Programmed Cell Death in the Roots of Brachypodium distachyon. Int. J. Mol. Sci. 2021, 22, 7548. [Google Scholar] [CrossRef]
  6. Dai, J.; Ma, M.; Niu, Q.; Eisert, R.J.; Wang, X.; Das, P.; Lechtreck, K.F.; Dutcher, S.K.; Zhang, R.; Brown, A. Mastigoneme structure reveals insights into the O-linked glycosylation code of native hydroxyproline-rich helices. Cell 2024, 187, 1907–1921.e1916. [Google Scholar] [CrossRef]
  7. Hijazi, M.; Velasquez, S.M.; Jamet, E.; Estevez, J.M.; Albenne, C. An update on post-translational modifications of hydroxyproline-rich glycoproteins: Toward a model highlighting their contribution to plant cell wall architecture. Front. Plant Sci. 2014, 5, 395. [Google Scholar] [CrossRef]
  8. Ma, Y.; Johnson, K. Arabinogalactan proteins–Multifunctional glycoproteins of the plant cell wall. Cell Surf. 2023, 9, 100102. [Google Scholar] [CrossRef]
  9. Lin, S.; Miao, Y.; Huang, H.; Zhang, Y.; Huang, L.; Cao, J. Arabinogalactan Proteins: Focus on the Role in Cellulose Synthesis and Deposition during Plant Cell Wall Biogenesis. Int. J. Mol. Sci. 2022, 23, 6578. [Google Scholar] [CrossRef]
  10. Gaspar, Y.; Johnson, K.L.; McKenna, J.A.; Bacic, A.; Schultz, C.J. The complex structures of arabinogalactan-proteins and the journey towards understanding function. Plant Mol. Biol. 2001, 47, 161–176. [Google Scholar] [CrossRef] [PubMed]
  11. Showalter, A.M.; Keppler, B.; Lichtenberg, J.; Gu, D.; Welch, L.R. A bioinformatics approach to the identification, classification, and analysis of hydroxyproline-rich glycoproteins. Plant Physiol. 2010, 153, 485–513. [Google Scholar] [CrossRef]
  12. Chapman, A.; Blervacq, A.S.; Vasseur, J.; Hilbert, J.L. Arabinogalactan-proteins in Cichorium somatic embryogenesis: Effect of beta-glucosyl Yariv reagent and epitope localisation during embryo development. Planta 2000, 211, 305–314. [Google Scholar] [CrossRef] [PubMed]
  13. Seifert, G.J. The FLA4-FEI Pathway: A Unique and Mysterious Signaling Module Related to Cell Wall Structure and Stress Signaling. Genes 2021, 12, 145. [Google Scholar] [CrossRef] [PubMed]
  14. Zhou, D.; Zou, T.; Zhang, K.; Xiong, P.; Zhou, F.; Chen, H.; Li, G.; Zheng, K.; Han, Y.; Peng, K.; et al. DEAP1 encodes a fasciclin-like arabinogalactan protein required for male fertility in rice. J. Integr. Plant Biol. 2022, 64, 1430–1447. [Google Scholar] [CrossRef] [PubMed]
  15. Johnson, K.L.; Kibble, N.A.; Bacic, A.; Schultz, C.J. A fasciclin-like arabinogalactan-protein (FLA) mutant of Arabidopsis thaliana, fla1, shows defects in shoot regeneration. PLoS ONE 2011, 6, e25154. [Google Scholar] [CrossRef]
  16. Ma, H.; Zhao, J. Genome-wide identification, classification, and expression analysis of the arabinogalactan protein gene family in rice (Oryza sativa L.). J. Exp. Bot 2010, 61, 2647–2668. [Google Scholar] [CrossRef]
  17. Faik, A.; Abouzouhair, J.; Sarhan, F. Putative fasciclin-like arabinogalactan-proteins (FLA) in wheat (Triticum aestivum) and rice (Oryza sativa): Identification and bioinformatic analyses. Mol. Genet. Genom. 2006, 276, 478–494. [Google Scholar] [CrossRef]
  18. Zang, L.; Zheng, T.; Chu, Y.; Ding, C.; Zhang, W.; Huang, Q.; Su, X. Genome-Wide Analysis of the Fasciclin-Like Arabinogalactan Protein Gene Family Reveals Differential Expression Patterns, Localization, and Salt Stress Response in Populus. Front. Plant Sci. 2015, 6, 1140. [Google Scholar] [CrossRef]
  19. Zhang, Y.; Zhou, F.; Wang, H.; Chen, Y.; Yin, T.; Wu, H. Genome-Wide Comparative Analysis of the Fasciclin-like Arabinogalactan Proteins (FLAs) in Salicacea and Identification of Secondary Tissue Development-Related Genes. Int. J. Mol. Sci. 2023, 24, 1481. [Google Scholar] [CrossRef]
  20. Huang, G.Q.; Xu, W.L.; Gong, S.Y.; Li, B.; Wang, X.L.; Xu, D.; Li, X.B. Characterization of 19 novel cotton FLA genes and their expression profiling in fiber development and in response to phytohormones and salt stress. Physiol. Plant 2008, 134, 348–359. [Google Scholar] [CrossRef]
  21. Huang, H.; Miao, Y.; Zhang, Y.; Huang, L.; Cao, J.; Lin, S. Comprehensive Analysis of Arabinogalactan Protein-Encoding Genes Reveals the Involvement of Three BrFLA Genes in Pollen Germination in Brassica rapa. Int. J. Mol. Sci. 2021, 22, 13142. [Google Scholar] [CrossRef] [PubMed]
  22. Yao, K.; Yao, Y.; Ding, Z.; Pan, X.; Zheng, Y.; Huang, Y.; Zhang, Z.; Li, A.; Wang, C.; Li, C.; et al. Characterization of the FLA Gene Family in Tomato (Solanum lycopersicum L.) and the Expression Analysis of SlFLAs in Response to Hormone and Abiotic Stresses. Int. J. Mol. Sci. 2023, 24, 16063. [Google Scholar] [CrossRef]
  23. Wang, H.; Jin, Y.; Wang, C.; Li, B.; Jiang, C.; Sun, Z.; Zhang, Z.; Kong, F.; Zhang, H. Fasciclin-like arabinogalactan proteins, PtFLAs, play important roles in GA-mediated tension wood formation in Populus. Sci. Rep. 2017, 7, 6182. [Google Scholar] [CrossRef] [PubMed]
  24. Zhang, Z.; Xin, W.; Wang, S.; Zhang, X.; Dai, H.; Sun, R.; Frazier, T.; Zhang, B.; Wang, Q. Xylem sap in cotton contains proteins that contribute to environmental stress response and cell wall development. Funct. Integr. Genom. 2015, 15, 17–26. [Google Scholar] [CrossRef]
  25. Huang, G.Q.; Gong, S.Y.; Xu, W.L.; Li, W.; Li, P.; Zhang, C.J.; Li, D.D.; Zheng, Y.; Li, F.G.; Li, X.B. A fasciclin-like arabinogalactan protein, GhFLA1, is involved in fiber initiation and elongation of cotton. Plant Physiol. 2013, 161, 1278–1290. [Google Scholar] [CrossRef]
  26. MacMillan, C.P.; Mansfield, S.D.; Stachurski, Z.H.; Evans, R.; Southerton, S.G. Fasciclin-like arabinogalactan proteins: Specialization for stem biomechanics and cell wall architecture in Arabidopsis and Eucalyptus. Plant J. 2010, 62, 689–703. [Google Scholar] [CrossRef] [PubMed]
  27. Shi, H.; Kim, Y.; Guo, Y.; Stevenson, B.; Zhu, J.K. The Arabidopsis SOS5 locus encodes a putative cell surface adhesion protein and is required for normal cell expansion. Plant Cell 2003, 15, 19–32. [Google Scholar] [CrossRef]
  28. Seifert, G.J.; Xue, H.; Acet, T. The Arabidopsis thaliana FASCICLIN LIKE ARABINOGALACTAN PROTEIN 4 gene acts synergistically with abscisic acid signalling to control root growth. Ann. Bot 2014, 114, 1125–1133. [Google Scholar] [CrossRef]
  29. Allelign Ashagre, H.; Zaltzman, D.; Idan-Molakandov, A.; Romano, H.; Tzfadia, O.; Harpaz-Saad, S. FASCICLIN-LIKE 18 Is a New Player Regulating Root Elongation in Arabidopsis thaliana. Front. Plant Sci. 2021, 12, 645286. [Google Scholar] [CrossRef]
  30. Nie, H.; Lu, S.; Wu, X.; Wang, P.; Li, N.; Ma, Y.; Wu, J.; Ma, Y. Function and Expression Analysis on StFLA4 in Response to Drought Stress and Tuber Germination in Potato. Agronomy 2024, 14, 2988. [Google Scholar] [CrossRef]
  31. Fang, C.; Kong, F. Soybean. Curr. Biol. 2022, 32, R902–R904. [Google Scholar] [CrossRef] [PubMed]
  32. Shaffique, S.; Hussain, S.; Kang, S.M.; Imran, M.; Injamum-Ul-Hoque, M.; Khan, M.A.; Lee, I.J. Phytohormonal modulation of the drought stress in soybean: Outlook, research progress, and cross-talk. Front. Plant Sci. 2023, 14, 1237295. [Google Scholar] [CrossRef] [PubMed]
  33. Ma, Y.; Shafee, T.; Mudiyanselage, A.M.; Ratcliffe, J.; MacMillan, C.P.; Mansfield, S.D.; Bacic, A.; Johnson, K.L. Distinct functions of FASCILIN-LIKE ARABINOGALACTAN PROTEINS relate to domain structure. Plant Physiol. 2023, 192, 119–132. [Google Scholar] [CrossRef] [PubMed]
  34. Yuan, J.; Song, Q. Polyploidy and diploidization in soybean. Mol. Breed 2023, 43, 51. [Google Scholar] [CrossRef]
  35. Wang, X.; Song, S.; Wang, X.; Liu, J.; Dong, S. Transcriptomic and Metabolomic Analysis of Seedling-Stage Soybean Responses to PEG-Simulated Drought Stress. Int. J. Mol. Sci. 2022, 23, 6869. [Google Scholar] [CrossRef]
Figure 1. Chromosomal distribution of the soybean FLA gene family. Chromosome labels are positioned on the left side of the bar chart, gene names are highlighted in red, and the scale bar is located on the left.
Figure 1. Chromosomal distribution of the soybean FLA gene family. Chromosome labels are positioned on the left side of the bar chart, gene names are highlighted in red, and the scale bar is located on the left.
Genes 16 01425 g001
Figure 2. Exon–intron structure of the soybean FLA gene family: Untranslated regions (UTRs) are marked with green bold boxes, exons are indicated by yellow bold boxes, and introns are represented by black lines.
Figure 2. Exon–intron structure of the soybean FLA gene family: Untranslated regions (UTRs) are marked with green bold boxes, exons are indicated by yellow bold boxes, and introns are represented by black lines.
Genes 16 01425 g002
Figure 3. Phylogenetic Tree of the FLA Gene Family in Soybean and Arabidopsis.
Figure 3. Phylogenetic Tree of the FLA Gene Family in Soybean and Arabidopsis.
Genes 16 01425 g003
Figure 4. Conserved Domain of the Soybean FLA Gene Family.
Figure 4. Conserved Domain of the Soybean FLA Gene Family.
Genes 16 01425 g004
Figure 5. Conserved motif of the soybean FLA gene family.
Figure 5. Conserved motif of the soybean FLA gene family.
Genes 16 01425 g005
Figure 6. Amino acid sequences of 10 conserved motifs.
Figure 6. Amino acid sequences of 10 conserved motifs.
Genes 16 01425 g006
Figure 7. Soybean FLA Gene Family Cis-Acting Elements.
Figure 7. Soybean FLA Gene Family Cis-Acting Elements.
Genes 16 01425 g007
Figure 8. Cluster Analysis of Cis-Acting Elements in the Soybean FLA Gene Family.
Figure 8. Cluster Analysis of Cis-Acting Elements in the Soybean FLA Gene Family.
Genes 16 01425 g008
Figure 9. Tissue expression analysis of soybean candidate genes under drought stress: (a) root (b) stem (c) leaf * p < 0.05, ** p < 0.01, ns, Not significant.
Figure 9. Tissue expression analysis of soybean candidate genes under drought stress: (a) root (b) stem (c) leaf * p < 0.05, ** p < 0.01, ns, Not significant.
Genes 16 01425 g009
Table 1. Physicochemical Characterization of the FLA Gene Family in Soybean.
Table 1. Physicochemical Characterization of the FLA Gene Family in Soybean.
Gene NameGene IDAAMW (kDa)pIIIAIGRAVYLocationSP
GmFLA1Glyma.02G30770025627,139.595.0351.8486.170.134vacuYes
GmFLA2Glyma.02G30870040643,557.756.3736.891.16−0.002vacuYes
GmFLA3Glyma.03G17990024826,301.835.5126.8188.910.083chloYes
GmFLA4Glyma.03G18000024425,780.246.8221.6190.820.052chloYes
GmFLA5Glyma.03G20430045350,149.185.9858.2194.97−0.235cytoYes
GmFLA6Glyma.05G16190028129,174.379.1241.4795.590.093chloYes
GmFLA7Glyma.06G30860044749,665.666.3654.3189.69−0.287extrYes
GmFLA8Glyma.08G11940028129,247.459.1444.8297.30.095extrYes
GmFLA9Glyma.08G11950028829,955.419.3444.0197.670.111plasYes
GmFLA10Glyma.08G11960031532,662.678.4745.6398.760.124chloYes
GmFLA11Glyma.08G18370033636,952.175.9272.0489.17−0.035chloNo
GmFLA12Glyma.08G32990041543,692.785.8137.5393.01−0.054plasYes
GmFLA13Glyma.09G04810020621,492.334.5342.4993.20.178golgNo
GmFLA14Glyma.09G11980036139,979.156.5340.5587.59−0.168chloYes
GmFLA15Glyma.09G26770041642,505.25.5948.5197.450.189plasYes
GmFLA16Glyma.10G24580029331,032.88.8243.8699.520.063vacuYes
GmFLA17Glyma.11G14780029130,785.59.0336.8795.880.066chloYes
GmFLA18Glyma.11G14810025927,402.347.9633.0692.320.036chloYes
GmFLA19Glyma.11G14820026127,944.048.6240.07100.150.136extrYes
GmFLA20Glyma.11G14830022724,078.736.533.24109.120.263golgNo
GmFLA21Glyma.11G14840029530,969.588.5144.7996.580.047chloYes
GmFLA22Glyma.11G14850029230,793.469.245.1294.90.02chloYes
GmFLA23Glyma.11G14850128229,640.228.6750.13104.150.083chloYes
GmFLA24Glyma.11G14870029430,890.68.5345.2297.180.097vacuYes
GmFLA25Glyma.11G15588624525,833.587.8530.498.650.027chloNo
GmFLA26Glyma.11G15630429230,731.298.9339.5990.790.041chloYes
GmFLA27Glyma.11G15651329030,551.276.7438.19103.970.163chloYes
GmFLA28Glyma.12G06890029130,717.59.2439.5899.90.089chloYes
GmFLA29Glyma.12G06940026528,023.127.7337.85105.250.182vacuYes
GmFLA30Glyma.12G06950027829,334.826.8452.07100.040.094extrYes
GmFLA31Glyma.12G06960128029,451.918.6950.8298.960.093extrYes
GmFLA32Glyma.12G06970029330,964.667.8148.5396.550.027chlo, vacuYes
GmFLA33Glyma.12G06980028029,437.898.6952.1998.290.085extrYes
GmFLA34Glyma.12G06990027829,347.877.8854.31100.040.094chlo, extrYes
GmFLA35Glyma.12G07000029330,998.687.8148.5395.220.024chlo, vacuYes
GmFLA36Glyma.12G07020029330,936.67.8148.2495.90.019chloYes
GmFLA37Glyma.12G09630045350,393.56.5856.6988.9−0.331vacuYes
GmFLA38Glyma.12G17380024025,483.099.2228.9997.540.051chloYes
GmFLA39Glyma.12G19070045450,214.246.1352.1292.11−0.261vacuYes
GmFLA40Glyma.12G20760025026,345.189.1931.4388.6−0.078extrYes
GmFLA41Glyma.13G22600031432,532.238.7344.5291.050.072chloYes
GmFLA42Glyma.13G22610025326,568.349.4837.2791.82−0.022chloNo
GmFLA43Glyma.13G29350025026,589.388.5631.4693.28−0.109extrYes
GmFLA44Glyma.13G31100045550,328.366.253.191.93−0.264chloYes
GmFLA45Glyma.13G32710024526,280.289.5627.4594.370.052chloYes
GmFLA46Glyma.13G32730028729,843.579.2533.23102.930.208plasYes
GmFLA47Glyma.14G00420044347,882.647.2941.3889.73−0.077plasYes
GmFLA48Glyma.14G00530025627,224.665.0547.1785.040.102plasYes
GmFLA49Glyma.15G08600028329,817.329.4437.12101.060.113chloYes
GmFLA50Glyma.15G08610031332,586.389.2246.891.020.036chloYes
GmFLA51Glyma.15G15540042645,166.655.6845.51105.770.286plasYes
GmFLA52Glyma.18G07660041543,368.385.834.0393.71−0.021plasYes
GmFLA53Glyma.18G22220041642,488.345.8947.0298.370.18plasYes
GmFLA54Glyma.19G18070024926,278.715.5126.2290.920.074plasYes
GmFLA55Glyma.19G20160045550,578.566.2161.0590.68−0.308cytoYes
Table 2. Differentially Expressed Gene Screening Results.
Table 2. Differentially Expressed Gene Screening Results.
DatasetGroupFold Change
GmFLA5GmFLA15GmFLA40GmFLA47GmFLA54
GSE102749water control—5 h water stress (root)0.23930.55640.97660.11590.0855
water control—48 h water stress (root)−0.18−0.0293−0.2870.162−0.927
GSE6553water control—2 h water stress (root)0.5710.50260.7178−0.02410.7653
water control—10 h water stress (root)0.8550.8480.8350.08880.799
GSE49537water control—30 min water stress (root)0.16840.58780.45360.40400.3544
water control—2 h water stress (root)0.09470.48112.19240.53130.2129
water control—5 h water stress (root)0.61821.60811.70331.03371.6471
water control—30 min water stress (leaf)0.32391.3927−0.21750.2456−0.1039
water control—2 h water stress (leaf)1.57162.55992.08711.27210.6380
water control—5 h water stress (leaf)2.72915.05475.45402.47452.5847
GSE29663water control—6 d water stress (leaf)0.21910.06452.40101.27711.0363
GSE40604water control—5 d water stress (leaf)1.5291−0.05310.97841.21800.5417
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, J.; Yang, L.; Dou, J.; Wang, C.; Yao, Z. Identification of the FLA Gene Family in Soybean and Preliminary Functional Analysis of Its Drought-Responsive Candidate Genes. Genes 2025, 16, 1425. https://doi.org/10.3390/genes16121425

AMA Style

Zhang J, Yang L, Dou J, Wang C, Yao Z. Identification of the FLA Gene Family in Soybean and Preliminary Functional Analysis of Its Drought-Responsive Candidate Genes. Genes. 2025; 16(12):1425. https://doi.org/10.3390/genes16121425

Chicago/Turabian Style

Zhang, Jiyue, Lina Yang, Jingxuan Dou, Cong Wang, and Zhengpei Yao. 2025. "Identification of the FLA Gene Family in Soybean and Preliminary Functional Analysis of Its Drought-Responsive Candidate Genes" Genes 16, no. 12: 1425. https://doi.org/10.3390/genes16121425

APA Style

Zhang, J., Yang, L., Dou, J., Wang, C., & Yao, Z. (2025). Identification of the FLA Gene Family in Soybean and Preliminary Functional Analysis of Its Drought-Responsive Candidate Genes. Genes, 16(12), 1425. https://doi.org/10.3390/genes16121425

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop