Next Article in Journal
BRCA2 Deletion Induces Alternative Lengthening of Telomeres in Telomerase Positive Colon Cancer Cells
Previous Article in Journal
Histone Methylation Participates in Gene Expression Control during the Early Development of the Pacific Oyster Crassostrea gigas

Genes 2019, 10(9), 696;

Genome-Wide Identification and Expression Profiles of Late Embryogenesis-Abundant (LEA) Genes during Grain Maturation in Wheat (Triticum aestivum L.)
Key Laboratory of Wheat Biology and Genetic Improvement for Low & Middle Yangtze Valley, Ministry of Agriculture/Lixiahe Agricultural Institute of Jiangsu Province, Yangzhou 225007, China
Yangzhou University, Yangzhou 225009, China
Author to whom correspondence should be addressed.
Received: 21 July 2019 / Accepted: 6 September 2019 / Published: 10 September 2019


Late embryogenesis-abundant (LEA) genes play important roles in plant growth and development, especially the cellular dehydration tolerance during seed maturation. In order to comprehensively understand the roles of LEA family members in wheat, we carried out a series of analyses based on the latest genome sequence of the bread wheat Chinese Spring. 121 Triticum aestivum L. LEA (TaLEA) genes, classified as 8 groups, were identified and characterized. TaLEA genes are distributed in all chromosomes, most of them with a low number of introns (≤3). Expression profiles showed that most TaLEA genes expressed specifically in grains. By qRT-PCR analysis, we confirmed that 12 genes among them showed high expression levels during late stage grain maturation in two spring wheat cultivars, Yangmai16 and Yangmai15. For most genes, the peak of expression appeared earlier in Yangmai16. Statistical analysis indicated that expression level of 8 genes in Yangmai 16 were significantly higher than Yangmai 15 at 25 days after anthesis. Taken together, our results provide more knowledge for future functional analysis and potential utilization of TaLEA genes in wheat breeding.
late embryogenesis-abundant gene; wheat (Triticum aestivum L.); grain maturation; genome-wide identification

1. Introduction

The dehydration process of embryos during grain maturation is the terminal event of seed development, correlating with the acquisition of desiccation tolerance and with the specific induction of highly abundant hydrophilic proteins described as Late Embryogenesis Abundant (LEA) [1,2]. This process indicates the “switch” of seed transformation from development to germination, acting as an adaptability strategy of angiosperm to ensure the survival of seeds during storage or the environmental stress of long-term evolution [3,4]. Seeds are classified as orthodox and recalcitrant based on their storage behavior and desiccation tolerance in the drying process [5]. Orthodox seeds undergo mature dehydration on the maternal plants. Their water content is usually low and can be further dried to 1 to 5% moisture without damage [4,6].
LEA was first isolated from cotyledons of cotton at the late embryonic development stage [7]. In the following 30 years, LEA proteins have been detected in a variety of higher plant development seeds [8,9,10,11,12,13]. The LEA family is usually composed of multiple members in plants. Previous study identified at least 13 and 34 LEA homologous genes in maize [14] and rice, respectively [15]. With the development of genome-wide sequencing technology, the LEA gene family has been further elucidated in many other species, such as Solanum lycopersicum (27 LEA genes) [16], Dendrobium officinale (Orchidaceae) (17) [17], Brassica mapus (108) [18], Amaranth (105) [19], and Solanum Tuberosum (74) [13]. Furthermore, LEA genes are not unique to plants. They have been also identified in fungi, bacteria and animals [20].
LEA proteins have small molecular weights from 10 to 30 kDa. They are enriched in glycine and other hydrophilic amino acids, resulting in high hydrophilicity and thermal stability [21]. LEA proteins consist of a non-periodic linear and facultative α-helix which formed the main hydrophobic interaction between monomers without thermodynamically dominant state [8,22]. According to the similarity, LEA family is generally divided into 8 groups, namely LEA1, LEA2, LEA3, LEA4, LEA5, LEA6, dehydrin and SMP (seed maturation protein) [23]. Recently, ASR proteins (abscisic stress ripening) were classified as LEA7 group [24,25], extending LEA family to 9 groups. The division of family members is not consistent in different species [26]. While the sequence conservation implies their conserved role in desiccation tolerance [27].
Moreover, LEA proteins were reported to be widely involved in plant stress response in Arabidopsis, tobacco, grape, and other species [28]. In monocotyledons such as maize, wheat and rice, LEA expression can be induced by varieties of abiotic stresses to improve plant tolerance [29,30,31], which is beyond the scope of this article.
Wheat, as an important staple crop in the world, is becoming the focus of global food security issues [32,33]. In China, due to changes of climate and cultivation methods, new requirements for early maturing traits and dehydration characteristics of winter wheat have been put forward recently [34,35]. The moisture content of wheat seeds, like orthodox seeds, decline up to 40% quickly after physiological ripening. The speed of grain dehydration affects the desiccation tolerance and is highly related to the harvest time after maturity. Rapid dehydration after maturity is able to avoid the bad meteorological conditions in late stages of seed maturation, reduce the occurrence of pre-harvest sprouting, and shorten the cost of drying after harvest.
Wheat is an allohexaploid species with a large and complex genome consisting of three closely related sub genomes (A, B, and D). The continuous improvement and release of genomic data promoted the identification and analysis of wheat genes genome-widely [36,37,38,39,40,41,42]. The identification and use of the elite gene resource may help us improve the stress tolerance of wheat and solve the problems caused by changing global climate and food security status [42].
In this study, we systematically characterized the LEA genes in wheat based on the latest genome sequences. The genome composition, phylogeny, conserved motifs, chromosome locations, and the expression pattern of TaLEA proteins were systematically analyzed. These results will not only help us to reveal the molecular mechanism of mature dehydration in wheat and other cereal crops, but also provide a theoretical basis for their application to promote crop variety improvement.

2. Materials and Methods

2.1. Identification and Phylogenetic Analysis of Late Embryogenesis-Abundant (LEA) Genes in Wheat Genomes

The nucleotide and protein sequence in the whole-genome of T. aestivum was downloaded from the wheat genome URGI database ( These sequences were first used to construct a local protein database with which to search against known LEA protein sequences collected from A. thaliana (51) [9] and O. sativa (34) [15], through a local protein basic local alignment search (BLASTP) program ( with an E-value cut-off <10−10 and an identity of 50% as the threshold. The Hidden Markov Model (HMM) which was constructed using HMMER 3.0 program was used to search all the encoding protein sequences of wheat to find all predicted LEA family members. A self-blast of these sequences was performed by comparing of HMMER and BLAST hits to remove the redundancy, and no alternative splice variants were considered. The aligned sequences were then used as candidate LEA family sequences. The obtained candidate sequences were subjected to domain annotation of the target sequence using the software PfamScan and Pfam A databases, and the sequences containing the complete LEA-1 (Pfam ID PF03760.14), LEA-2 (PF03168.12), LEA-3 (PF03242.12), LEA-4 (PF02987.15), LEA-5 (PF00477.16), LEA-6 (PF10714.8), Dehydrin (PF00257.18), and SMP (PF04927.11) domains were identified as the final wheat LEA family sequences. The identified wheat LEA proteins were termed according to the subfamily it belongs and the order of gene ID. The biochemical parameters such as theoretical isoelectric point (PI), molecular weight (MW), amino acid number, instability, and hydrophilicity of the obtained proteins were calculated by the ProtParam ( To investigate the evolutionary relationships among these TaLEA genes, multiple sequence alignment was performed using MAFFT (Multiple Alignment using Fast Fourier Transform: with the default parameters. A tree was constructed based on the full-length protein sequences using the Neighbor-Joining (NJ) method with Partial deletion and p-distance model, and a Bootstrap test of 1000 replicates for internal branch reliability. Another two phylogenetic trees for LEA protein family of T. aestivum, O. sativa, and A. thaliana were generated using Maximum-Likehood [ML, by FastTree (an open-source software developed by Morgan N. Price in Adam Arkin’s group at Lawrence Berkeley National Lab, Berkeley, USA) with default parameters] and Maximum-Parsimony (MP, by MEGA7.0 software [43] with the Max No. of Trees to Retain as 100) method to better understand the evolution of LEA genes.

2.2. Structural Characterisation of Wheat LEA

The exon-intron structures of the TaLEA family genes were determined based on alignments of their coding sequences with the corresponding genomic sequences, and a diagram was obtained using GSDS (Gene structure display server: To better understand the similarity and diversity of protein motifs, MEME (Multiple Expectation Maximization for Motif Elicitation: was used to identify the conserved motif structures encoded by each group of TaLEA family genes. The parameters as follow: optimum motif widths of 6–50 residues and a maximum of 15 motifs. The schematic diagram of the amino acid motifs for each TaLEA gene was drawn accordingly.

2.3. Distribution of LEA Genes on Wheat Chromosomes

TaLEA were mapped on wheat chromosomes according to the positional information of the TaLEA genes from the wheat genome annotation datebase, and the chromosome physical location map was displayed using MG2C (Map Gene 2 Chromosome V2:

2.4. Expression Profile Analysis of Wheat LEA Genes

The publicly available wheat RNA-Seq datasets used for generating gene expression levels were downloaded from the Wheat Expression Browser (, then used to analyze the expression profiles of the identified TaLEA genes. These data are from the studies of developmental time-course of common hexaploid wheat variety Azhurnaya and Chinese Spring. The results were summarized according to high expression level tissues as grains, roots, leaves/shoots, and spikes. The original TPM (transcripts per kilobase million) of each gene plus 0.0001 to prevent an expression value of ‘0’. The heat map was drawn in Log10 (tpm + 0.0001)-transformed expression values according to the standardization. Genes highly expressed in the grain were selected for qRT-PCR detection.

2.5. Plant Materials, Growth Conditions, and Sampling

Based on many years production experience and research, Yangmai16 had a faster grain filling speed, earlier maturity compared with Yangmai15 [35]. Therefore, these two spring wheat varieties Yangmai16 and Yangmai15, which were bred by the author’s unit, were planted and cultivated under the same condition. At the flowering stage, two individual florets at the base of the center ears were marked with a marker pen and the plants were tagged with anthesis dates. Wheat grains were sampled at 10, 15, 20, 25, 30, and 35 DAA (days after anthesis) for extraction of total RNA and further determination.

2.6. Quantitative Real-Time Polymerase Chain Reaction Analysis of TaLEA Proteins

In order to identify the expression profile of TaLEA gene in different wheat cultivars, and to explain the relationship between the LEA protein and maturity, as well as the grain production, high expression level LEA genes were collected for quantitative real-time polymerase chain reaction (qRT-PCR) analysis. Total RNA was extracted from the seeds by Trizol reagent (Biotech, Shanghai), and the first strand of cDNA was synthesized using Transcriptor First Strand cDNA Synthesis Kit (Roche, Penzberg, Germany). The wheat gene Actin was used as an internal reference, and the selected 12 TaLEA genes, including TaDehydrin-1, TaDehydrin-3, TaDehydrin-5, TaDehydrin-7, TaLEA1-1, TaLEA1-3, TaLEA4-13, TaLEA5-1, TaLEA5-3, TaLEA5-4, TaSMP-1 and TaSMP-2, were subjected to qRT-PCR using a fluorescence quantitative reagent Ultra SYBR Mixture (Kang Wei Century, Beijing) on a Roch Light-Cycler 480 fluorescence quantitative PCR machine (Roche, Penzberg, Germany); three independent biological replicates. The primer sequences used are listed in Table S1.

2.7. Determination of the Dry Weight and Moisture Content of Grains

Sampled grains were used for measurements of grain dry weight and moisture content. The fresh grains then dried at 70 °C to constant weight were weighed for the fresh weight (WF) and dry weight (WD). Moisture content of grains was calculated by (WFWD)/WF × 100%.
The processes of grain filling were fitted by Logistic’ equation as described by [44]:
f(x) = a/(1 + b × exp(−c × x))
The dehydration progress was fitted by Gompertz equation as described by [45]:
f(x) = a × exp(−exp(−b × (x − c)))

3. Results

3.1. Genome-Wide Identification and Phylogenetic Analysis of Wheat LEA Genes

Based on genome-wide blast, a total of 121 non-redundant LEA homologous containing the complete LEA, Dehydrin, and SMP domains were identified from the hexaploid wheat genome, which were considered as the putative wheat LEA genes. By phylogenetic analysis, these 121 genes were divided into 8 groups (groups LEA1LEA6, Dehydrin, and SMP) (Table 1). The largest group Dehydrin contains 47 members, while the smallest groups LEA3 and LEA6 have only two members. Groups LEA1, LEA2, LEA4, LEA5, and SMP contain 19, 15, 15, 4, and 17 genes, respectively.
The physical and chemical parameters of 121 TaLEA proteins were shown according to the analysis using the ProtParam online tool. The wheat LEA protein has a maximum length of 457 amino acid residues (TaDehydrin-8) and the shortest of 99 amino acid residues (TaLEA3-2), with isoelectric points (pI values) ranged from 4.18 (TaSMP-15) to 10.74 (TaDehydrin-28). The molecular weights range from 10.64 kDa (TaLEA3-2) to 43.89 kDa (TaDehydrin-8) with an average of 20.98 kDa. Only 16 proteins have a molecular weight >30 kDa. Analysis of the grand average of hydropathicity (GRAVY) index indicated that all TaLEA proteins were hydrophilic, with all of the index <0. The lipid index reflects the thermal stability of the protein in Table 1.
The phylogenetic analysis (Figure 1) revealed two major clades of the TaLEA family. The 47 genes of the TaDehydrin group, the LEA5 group, and part of the LEA1 group were clustered in one clade, while the other six groups were clustered in another one. According to the evolutionary relationship, TaSMP is more closely related to LEA6 and LEA2; LEA1 and LEA4 are more closely related. To classify the TaLEA genes into different subfamilies with higher confidence and to better understand their evolutionary relationship, another two phylogenetic trees were constructed with Arabidopsis and Oryza (Figures S1 and S2). It was indicated that the subfamilies classification of LEA genes in wheat is credible.

3.2. Structural Characterization of Wheat LEA

All Gene structure analysis revealed that (Figure 2) wheat LEA genes generally contained few introns, and 24 had no introns. Only two of the 97 intron-containing LEA proteins contained more than one intron. Among the 47 genes of the TaDehydrin group, 46 genes contained only one intron, while the other one (TaDehydrin-21) contained three introns. In the TaLEA1 group, 16 genes had no introns, and the other 3 genes contained only one intron. Among the 15 genes in the TaLEA2 group, 11 were single-intron genes, and the other 4 were intron-free genes. Neither LEA3 nor LEA6 contained introns. There were 14 single-intron genes and 1 double intron gene among the TaLEA4 group. Both of the TaLEA5 and TaSMP groups contained only one intron.
To elucidate the similarity and diversity of protein motifs, the conserved motifs of wheat LEA proteins in each group were analyzed by MEME software. 15 conserved motifs were identified, namely motif1~motif15 (Figure 3). The results indicated that with the exception of the TaLEA3, TaLEA5, and TaLEA6, there were conservative motifs specific to each group (Figure 3). The composition of structural motifs was diverse among different LEA groups, but similar within the same group. Moreover, the motifs encoding LEA domains were relatively conserved, indicating that the functions of TaLEA proteins are intergroup specific. All of the 19 genes in the LEA1 group have motif15, and 12 of them also contain motif13. Among the LEA2 group, 9 members contained 2 motifs and 6 members contained 5 motifs, while each member possessed motif3 and motif4. The TaSMP group contained motif7 and motif10. All LEA4 members had motif13, with 10 of them also having motif11, and 3 of them having motif5 and motif15. There were 27 genes in the Dehydrin group possessing motif1, motif2, motif5, motif6, motif8, and motif9, and all 47 members contained motif1, motif2, and motif5.

3.3. Chromosomal Locations of Wheat LEA Genes

Using MapGene2Chrom software to analyze genomic position data, 117 genes were distributed on 21 wheat chromosomes, while the location of the other 4 genes remained unclear (Figure 4). The analysis from the 7 homologous chromosome groups showed that there were 29 LEA genes on the chromosome groups five, representing the most abundant regions, followed by group six with 23, while the minimum was on chromosome groups two, with only 6 genes. Chromosome groups one and three had 18 and 17 genes, respectively. Chromosome groups four and seven had 12 genes. In total, 37, 40, and 40 TaLEA genes were located on the A, B, and D sub-genome, respectively, which were almost evenly distributed.
According to the chromosomal distribution of the TaLEA gene, there were 43 of the Dehydrin group distributed on homologous chromosome groups three, four, five, and six, except for the four genes were still unclear. Among them, there were 7, 8, 7 on chromosome 6A, 6B, and 6D, and 3, 4, 5 on 5A, 5B, and 5D, respectively, accounting for 72.34% of all Dehydrin members. Such a concentrated distribution deserves further study in the future.

3.4. Expression Profile Analysis of Wheat LEA Genes in Different Tissues

To gain insight about the expression profile of these identified TaLEA genes, the publicly available RNA-Seq data were collected from the Wheat Expression Browser database. The expression analysis was performed on grains, roots, leaves/shoots, and spikes. A heat map was created according to the Log10 (tpm + 0.0001) of 121 TaLEA genes (Figure 5). It was indicated that expression patterns of TaLEA genes in different tissues were quite different. 93 genes highly expressed in grains, and among them there were 35 out of 47 members in the Dehydrin subfamily; 17 out of 19 in LEA1; all members of LEA4, LEA5 and LEA6. Whereas, very low expression levels of these 93 genes were observed in the other three tissues, except for 12 in roots and 8 in spikes were also high. 11 highly expressed genes were only observed in spikes, and among them there were 6 members of LEA2 and two members of the LEA3 subfamily. Almost no gene highly expressed in all four tissues and only several genes had higher expression levels in multiple tissues.

3.5. qRT-PCR Analysis of Wheat LEA Genes in the Grain Maturation of Two Different Cultivers

According to the analysis of gene expression in different tissues, 12 TaLEA genes highly expressed in grains were selected to verify the potential role of TaLEA genes in the maturation of seed. Expression levels at different stages post anthesis of Yangmai16 and Yangmai15 were analyzed by qRT-PCR.
Results showed that the expression of these TaLEA genes have temporal specificity. On the whole, the expression level of TaLEA genes increased from 15 DAA to 35 DAA (Figure 6) in two genotypes. Interestingly, the expression patterns of TaLEA genes in two genotypes presented different characteristics. In Yangmai16, the maximum expression level of 9 genes except TaDehydrin-3, TaLEA5-1 and TaSMP-1 were observed at 30 DAA, whereas there were 8 genes in Yangmai 15 with maximum expression level at 35 DAA. For most genes, the peak of expression appeared earlier in Yangmai16. Statistical analysis indicated that, at 25 DAA, the expression level of 8 genes in Yangmai 16 was significantly higher than Yangmai 15.
According to the determination of phenotypic character of maturation, the increase of grain dry weight in Yangmai16 was faster than that in Yangmai15 (Figure 7A). At 30 DAA, the grain filling of yangmai16 grain was almost completed. Statistical analysis indicated that the average grain filling rate was significantly higher in Yangmai 16 (P = 0.0185). The moisture content curve and the average seed dehydration rate indicated that the dehydration of Yangmai 16 was faster (P = 1.69 × 10−4) and started earlier than Yangmai 15, which could be judged by the appearance and color of the seeds (Figure 7B,C).

4. Discussion

4.1. Identification and Analysis of Wheat LEA Gene Family

In this study, 121 wheat LEA homologous were identified by genomic analysis. Wheat LEA family is bigger than rice (34), corn (32), soybean (36), potato (74), Arabidopsis (51), and canola (108), but less than upland cotton (242). We speculated that in the process of hybridization and chromosome doubling by three species, the number of genes had also doubled and finally formed an allohexaploid species with a huge genome [46,47,48]. It is also possible that most of the wheat LEA genes were retained during subsequent evolution due to genome replication [49]. Functional differentiation of LEA genes during the evolution process could facilitate the regulation of development and adaptation to the environment.
The largest group of the wheat LEA family Dehydrin accounts for about 40% of the total number of family members. However, the largest group was Dehydrin in rice [15] and LEA3 in soybeans [50]. In plum blossom, the largest groups were Dehydrin and LEA2 [10]. LEA2 group in potato had the most members, accounting for 45 of all 74 LEA genes, while the Dehydrin group had only 5 [13]. The difference above indicated that LEA family genes may have evolved independently after the differentiation of these species.
Gene structure analysis revealed that wheat LEA family genes generally lack introns. For instance, there were 24 genes that don’t have introns and 95 genes that have only one intron, while only 2 genes contained more than one intron. The phenomenon was also found in other species [13,50,51]. Perhaps it could be explained by the early intron model of protein-coding gene origin [52]. In general, genes that respond to stress contain fewer introns. Introns can have a deleterious effect on gene expression by delaying transcript production [18]. Moreover, introns can extend the length of the nascent transcript, resulting in an additional expense for transcription [53]. The small number of introns may be a result of adaptation to the environment, which takes less time from transcription to translation, allowing rapid gene expression and functional protein production in response to stresses [53]. DREB [54], AP2/ERF [55], and HD-Zip [42], which play important roles in response to diverse stresses, showing similar phenomenon.
The expression profiles of TaLEA family members in different tissues suggested that many TaLEA genes have obvious spatial specificity. However, they were still mainly enriched in the spikes and grains. Our study could be helpful for further investigation of LEA genes with spatiotemporal expression specificity and evaluation of their roles in desiccation tolerance.

4.2. Wheat Seed Desiccation and the Roles of TaLEA Genes in Acquirement of Desiccation Tolerance

With the prospect of climate changes that are shaping future agricultural practices, the production of highly vigorous seeds have become a key lever to improve yields and a focus of food security and genetic resources [56]. Because of the long growth duration of common wheat varieties, plants must face various biotic and abiotic stresses in fields, especially during the stage of seed maturity [57]. The stage of seed maturity at harvest, which was also associated with the desiccation tolerance, is a major factor influencing seed vigor and grain quality [58]. Nowadays, faster filling and dehydration of wheat seeds achieved more popularity in China because of the important role for optimizing the new cropping systems.
The wheat seeds undergo three development phases: embryo morphogenesis, material accumulation, and mature dehydration. During the last stage, the metabolism is correspondingly weakened, along with the rapid decrease of water content, which then turn on the transition from morphogenesis to germination [59,60,61]. Before and during dehydration, physiological and biochemical changes contribute to the acquisition of desiccation tolerance.
The correlation between LEA protein and desiccation tolerance had been confirmed in many orthodox seeds [62,63]. In this study, the spatial expression pattern of TaLEA indicated that some genes display high expression levels mainly in spikes and grains. Two spring wheat variety Yangmai16 and Yangmai15 were then used for quantitative verification. The results indicated that Yangmai16 had certain advantages in time and quantity of expression at 25 DAA, in which the maximum expression of 8 TaLEA genes appeared earlier than that of Yangmai15. It could be a hint of the inherent genetic differences between two varieties. We speculated that the phenotype of earlier and faster dehydration, which may also possess higher desiccation tolerance theoretically, were correlated with the different expression levels of LEA genes in grains.
The contribution of LEA to maintain seed desiccation tolerance is possibility due to their thermal stability and high hydrophilicity, acting as a dehydration protectant and substitute of water [64]. During the mature dehydration of seeds, the protein of Dehydrins will be present in the cell like a “space filler”, maintaining the dissolved character of cell fluid to avoid damage to cell architecture [21]. Magnetic Resonance Imaging (MRI) has been used for tracking and locating the distribution regular of water during seed maturation. It was shown that under the action of Dehydrins, water moves against potential, and the sufficient water in the surroundings of the seed could maintain the available flowing water content in the cells [65,66]. LEA were also supposed to regulate the expression of other genes by direct binding, which still lacks evidence.
Four questions that need to be considered were raised by Song and Fu in the last century [67]: What are the signals that induce the mature dehydration during seed development? Are these signals from the seed itself (embryo, storage tissue, and seed coat) or the maternal tissues? What are the acceptors in seeds that response to the dehydration signal? And last but not least, what are the subtle effects of LEA proteins in cells and sub-cells leading to desiccation tolerance? Answers to these questions would be valuable for a better understanding of mature dehydration mechanisms and germplasm conservation, as well as the genetic improvement of crops.

Supplementary Materials

The following are available online at, Table S1: Primers for qRT-PCR analysis; Figure S1: Phylogenetic analysis of LEA proteins among wheat (121), Arabidopsis (51) and rice (34) (by ML method); Figure S2: Phylogenetic analysis of LEA proteins among wheat (121), Arabidopsis (51) and rice (34) (by MP method).

Author Contributions

Conceptualization, D.L. and D.G.; methodology, formal analysis, and software, J.S. and G.L.; validation, J.S., J.L. and X.Z.; investigation, D.L., D.Z., C.Z. and H.W.; writing-original draft preparation, D.L. and J.S.; writing—review and editing, D.L. and D.G.; supervision and project administration, D.G.; funding acquisition, D.L. and D.G.


This research was funded by National Natural Science Foundation of China, grant number 31700163; the National Key Research and Development Program of China, grant number 2017YFD0100800, and Natural Science Foundation of Yangzhou, grant number YZ2017100.


We are grateful to Dawei Yan for language improvement of the manuscript.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.


  1. Dure, L.; Crouch, M.; Harada, J.; Ho, T.H.; Mundy, J.; Quatrano, R.; Thomas, T.; Sung, Z.R. Common amino acid sequence domains among the LEA proteins of higher plants. Plant Mol. Biol. 1989, 12, 475–486. [Google Scholar] [CrossRef] [PubMed]
  2. Olveracarrillo, Y.; Reyes, J.L.; Covarrubias, A.A. Late embryogenesis abundant proteins: Versatile players in the plant adaptation to water limiting environments. Plant Signal. Behav. 2011, 6, 586–589. [Google Scholar] [CrossRef] [PubMed]
  3. Finch-Savage, W.E.; Leubner-Metzger, G. Seed dormancy and the control of germination. New Phytol. 2006, 171, 501–523. [Google Scholar] [CrossRef] [PubMed]
  4. Angelovici, R.; Galili, G.; Fernie, A.R.; Fait, A. Seed desiccation: A bridge between maturation and germination. Trends Plant Sci. 2010, 15, 211–218. [Google Scholar] [CrossRef] [PubMed]
  5. Roberts, E.H. Predicting the storage life of seeds. Seed Sci. Technol. 1973, 1, 499–514. [Google Scholar]
  6. Manz, B.; Muller, K.; Kucera, B.; Volke, F.; Leubnermetzger, G. Water uptake and distribution in germinating tobacco seeds investigated in vivo by nuclear magnetic resonance imaging. Plant Physiol. 2005, 138, 1538–1551. [Google Scholar] [CrossRef] [PubMed]
  7. Dure, L.; Greenway, S.; Galau, G.A. Developmental biochemistry of cottonseed embryogenesis and germination: Changing messenger ribonucleic acid populations as shown by in vitro and in vivo protein synthesis. Biochemistry 1981, 20, 4162–4168. [Google Scholar] [CrossRef] [PubMed]
  8. Ingram, J.; Bartels, D. The molecular basis of dehydration tolerance in plants. Annu. Rev. Plant Physiol. Plant Mol. Biol. 1996, 47, 377–403. [Google Scholar] [CrossRef] [PubMed]
  9. Hundertmark, M.; Hincha, D.K. LEA (Late Embryogenesis Abundant) proteins and their encoding genes in Arabidopsis thaliana. BMC Genom. 2008, 9, 118–128. [Google Scholar] [CrossRef]
  10. Du, D.; Zhang, Q.; Cheng, T.; Pan, H.; Yang, W.; Sun, L. Genome-wide identification and analysis of late embryogenesis abundant (LEA) genes in Prunus mume. Mol. Biol. Rep. 2013, 40, 1937–1946. [Google Scholar] [CrossRef]
  11. Lim, J.; Lim, C.W.; Lee, S.C. The pepper Late Embryogenesis Abundant protein, CaDIL1, positively regulates drought tolerance and ABA signaling. Front. Plant Sci. 2018, 9, 1301. [Google Scholar] [CrossRef] [PubMed]
  12. Wang, W.; Gao, T.; Chen, J.; Yang, J.; Huang, H.; Yu, Y. The late embryogenesis abundant gene family in tea plant (Camellia sinensis): Genome-wide characterization and expression analysis in response to cold and dehydration stress. Plant Physiol. Biochem. 2019, 135, 277–286. [Google Scholar] [CrossRef] [PubMed]
  13. Chen, Y.; Li, C.; Zhang, B.; Yi, J.; Yang, Y.; Kong, C.; Lei, C.; Gong, M. The role of the Late Embryogenesis-Abundant (LEA) protein family in development and the abiotic stress response: A comprehensive expression analysis of potato (Solanum Tuberosum). Genes 2019, 10, 148. [Google Scholar] [CrossRef] [PubMed]
  14. Amara, I.; Odena, A.; De Oliveira, E.; Moreno, A.; Masmoudi, K.; Pages, M.; Goday, A. Insights into maize LEA proteins: From proteomics to functional approaches. Plant Cell Physiol. 2012, 53, 312–329. [Google Scholar] [CrossRef] [PubMed]
  15. Wang, X.; Zhu, H.; Jin, G.; Liu, H.; Wu, W.; Zhu, J. Genome-scale identification and analysis of LEA genes in rice (Oryza sativa L.). Plant Sci. 2007, 172, 414–420. [Google Scholar] [CrossRef]
  16. Cao, J.; Li, X. Identification and phylogenetic analysis of late embryogenesis abundant proteins family in tomato (Solanum lycopersicum). Planta 2015, 241, 757–772. [Google Scholar] [CrossRef] [PubMed]
  17. Ling, H.; Zeng, X.; Guo, S. Functional insights into the late embryogenesis abundant (LEA) protein family from Dendrobium officinale (Orchidaceae) using an Escherichia coli system. Sci. Rep. 2016, 6, 39693. [Google Scholar] [CrossRef]
  18. Liang, Y.; Xiong, Z.; Zheng, J.; Xu, D.; Zhu, Z.; Xiang, J.; Gan, J.; Raboanatahiry, N.; Yin, Y.; Li, M. Genome-wide identification, structural analysis and new insights into late embryogenesis abundant (LEA) gene family formation pattern in Brassica napus. Sci. Rep. 2016, 6, 24265. [Google Scholar] [CrossRef]
  19. Bojorquez-Velazquez, E.; Barrera-Pacheco, A.; Espitia-Rangel, E.; Herrera-Estrella, A.; Barba de la Rosa, A.P. Protein analysis reveals differential accumulation of late embryogenesis abundant and storage proteins in seeds of wild and cultivated amaranth species. BMC Plant Biol. 2019, 19, 59. [Google Scholar] [CrossRef]
  20. Hand, S.C.; Menze, M.A.; Toner, M.; Boswell, L.C.; Moore, D. LEA proteins during water stress: Not just for plants anymore. Annu. Rev. Physiol. 2011, 73, 115–134. [Google Scholar] [CrossRef]
  21. Battaglia, M.; Olveracarrillo, Y.; Garciarrubio, A.; Campos, F.; Covarrubias, A.A. The enigmatic LEA proteins and other hydrophilins. Plant Physiol. 2008, 148, 6–24. [Google Scholar] [CrossRef] [PubMed]
  22. Hincha, D.K.; Thalhammer, A. LEA proteins: IDPs with versatile functions in cellular dehydration tolerance. Biochem. Soc. Trans. 2012, 40, 1000–1003. [Google Scholar] [CrossRef] [PubMed]
  23. Hunault, G.; Jaspard, E. LEAPdb: A database for the late embryogenesis abundant proteins. BMC Genom. 2010, 11, 221. [Google Scholar] [CrossRef] [PubMed]
  24. Gonzalez, R.M.; Iusem, N.D. Twenty years of research on Asr (ABA-stress-ripening) genes and proteins. Planta 2014, 239, 941–949. [Google Scholar] [CrossRef] [PubMed]
  25. Wetzler, D.E.; Wightman, F.F.; Bucci, H.A.; Rinaldi, J.; Caramelo, J.J.; Iusem, N.D.; Ricardi, M.M. Conformational plasticity of the intrinsically disordered protein ASR1 modulates its function as a drought stress-responsive gene. PLoS ONE 2018, 13, e0202808. [Google Scholar] [CrossRef]
  26. Tunnacliffe, A.; Wise, M.J. The continuing conundrum of the LEA proteins. Naturwissenschaften 2007, 94, 791–812. [Google Scholar] [CrossRef] [PubMed]
  27. Ren, X.; Zhu, C.; Zeng, G. Current research of basic substances related to desiccation tolerance in seeds. Chin. Bull. Bot. 2001, 18, 183–189. [Google Scholar]
  28. Shinde, S.; Nurul Islam, M.; Ng, C.K. Dehydration stress-induced oscillations in LEA protein transcripts involves abscisic acid in the moss, Physcomitrella patens. New Phytol. 2012, 195, 321–328. [Google Scholar] [CrossRef]
  29. Duan, J.; Cai, W. OsLEA3-2, an abiotic stress induced gene of rice plays a key role in salt and drought tolerance. PLoS ONE 2012, 7, e45117. [Google Scholar] [CrossRef]
  30. Liu, Y.; Wang, L.; Xing, X.; Sun, L.; Pan, J.; Kong, X.; Zhang, M.; Li, D. ZmLEA3, a multifunctional group 3 LEA protein from Maize (Zea mays L.), is involved in biotic and abiotic stresses. Plant Cell Physiol. 2013, 54, 944–959. [Google Scholar] [CrossRef]
  31. Wang, M.; Li, P.; Li, C.; Pan, Y.; Jiang, X.; Zhu, D.; Zhao, Q.; Yu, J. SiLEA14, a novel atypical LEA protein, confers abiotic stress resistance in foxtail millet. BMC Plant Biol. 2014, 14, 290. [Google Scholar] [CrossRef] [PubMed]
  32. Sanchez-Garcia, M.; Alvaro, F.; Peremarti, A.; Trevaskis, B.; Martin-Sanchez, J.A.; Royo, C. Breeding effects on dry matter accumulation and partitioning in Spanish bread wheat during the 20th century. Euphytica 2015, 203, 321–336. [Google Scholar] [CrossRef]
  33. Meena, R.P.; Karnam, V.; Tripathi, S.C.; Jha, A.; Sharma, R.K.; Singh, G.P. Irrigation management strategies in wheat for efficient water use in the regions of depleting water resources. Agric. Water Manag. 2019, 214, 38–46. [Google Scholar] [CrossRef]
  34. Gao, D.; Zhang, X.; Kang, J.; Bie, T.; Zhang, B.; Zhang, X.; Cheng, S. Negative effect of late sowing on wheat production in middle and lower reaches of Yangtze river valley and breeding Strategies. J. Triticeae Crops 2014, 34, 279–283. [Google Scholar] [CrossRef]
  35. Wang, H.; Zhu, D.; Wang, J.; Chen, M.; Zhang, X.; Gao, D. Study on the early maturing characteristics of Yangmai 16 under late planting. J. Triticeae Crops 2016, 36, 1657–1666. [Google Scholar] [CrossRef]
  36. Pfeifer, M.; Kugler, K.G.; Sandve, S.R.; Zhan, B.; Rudi, H.; Hvidsten, T.R.; International Wheat Genome Sequencing Consortium; Mayer, K.F.X.; Olsen, O. Genome interplay in the grain transcriptome of hexaploid bread wheat. Science 2014, 345, 1250091. [Google Scholar] [CrossRef]
  37. Appels, R.; Eversole, K.; Stein, N.; Feuillet, C.; Keller, B.; Rogers, J.; Pozniak, C.J.; Choulet, F.; Distelfeld, A.; Poland, J.; et al. Shifting the limits in wheat research and breeding using a fully annotated reference genome. Science 2018, 361, eaar7191. [Google Scholar] [CrossRef]
  38. Alaux, M.; Rogers, J.; Letellier, T.; Flores, R.; Alfama, F.; Pommier, C.; Mohellibi, N.; Durand, S.; Kimmel, E.; Michotey, C.; et al. Linking the international wheat genome sequencing consortium bread wheat reference genome sequence to wheat genetic and phenomic data. Genome Biol. 2018, 19, 111. [Google Scholar] [CrossRef]
  39. Ramirez-Gonzalez, R.H.; Borrill, P.; Lang, D.; Harrington, S.A.; Brinton, J.; Venturini, L.; Davey, M.; Jacobs, J.; van Ex, F.; Pasha, A.; et al. The transcriptional landscape of polyploid wheat. Science 2018, 361, eaar6089. [Google Scholar] [CrossRef]
  40. Xu, L.; Tang, Y.; Gao, S.; Su, S.; Hong, L.; Wang, W.; Fang, Z.; Li, X.; Ma, J.; Quan, W.; et al. Comprehensive analyses of the annexin gene family in wheat. BMC Genom. 2016, 17, 415. [Google Scholar] [CrossRef]
  41. Qiao, L.; Zhang, X.; Han, X.; Zhang, L.; Li, X.; Zhan, H.; Ma, J.; Luo, P.; Zhang, W.; Cui, L.; et al. A genome-wide analysis of the auxin/indole-3-acetic acid gene family in hexaploid bread wheat (Triticum aestivum L.). Front. Plant Sci. 2015, 6, 770. [Google Scholar] [CrossRef] [PubMed]
  42. Yue, H.; Shu, D.; Wang, M.; Xing, G.; Zhan, H.; Du, X.; Song, W.; Nie, X. Genome-wide identification and expression analysis of the HD-Zip gene family in wheat (Triticum aestivum L.). Genes 2018, 9, 70. [Google Scholar] [CrossRef] [PubMed]
  43. Kumar, S.; Stecher, G.; Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 2016, 33, 1870–1874. [Google Scholar] [CrossRef] [PubMed]
  44. Huidong, M. Logistic equation and its application. J. Jingsu Agric. Coll. 1983, 4, 53–57. [Google Scholar] [CrossRef]
  45. Seber, G.; Wild, C. Nonlinear Regression. Chaper 7: Growth Model; Wiley: New York, NY, USA, 1989; pp. 325–365. [Google Scholar]
  46. Schlueter, J.A.; Dixon, P.; Granger, C.; Grant, D.; Clark, L.; Doyle, J.J.; Shoemaker, R.C. Mining EST databases to resolve evolutionary events in major crop species. Genome 2004, 47, 868–876. [Google Scholar] [CrossRef]
  47. Blanc, G.; Barakat, A.; Guyot, R.; Cooke, R.; Delseny, M. Extensive duplication and reshuffling in the Arabidopsis genome. Plant Cell 2000, 12, 1093–1101. [Google Scholar] [CrossRef] [PubMed]
  48. Wang, X.; Shi, X.; Hao, B.; Ge, S.; Luo, J. Duplication and DNA segmental loss in the rice genome: implications for diploidization. New Phytol. 2005, 165, 937–946. [Google Scholar] [CrossRef]
  49. Blanc, G.; Wolfe, K.H. Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. Plant Cell 2004, 16, 1667–1678. [Google Scholar] [CrossRef]
  50. Li, L.; Xu, H.; Yang, X.; Li, Y.; Hu, Y. Genome-wide identification, classification and expression analysis of LEA gene family in soybean. Sci. Agric. Sin. 2011, 44, 3945–3954. [Google Scholar] [CrossRef]
  51. Li, X.; Cao, J. Late Embryogenesis Abundant (LEA) gene family in maize: Identification, evolution, and expression profiles. Plant Mol. Biol. Report. 2016, 34, 15–28. [Google Scholar] [CrossRef]
  52. Zhaxybayeva, O.; Gogarten, J.P. Spliceosomal introns: New insights into their evolution. Curr. Biol. 2003, 13, R764–R766. [Google Scholar] [CrossRef] [PubMed]
  53. Jeffares, D.C.; Penkett, C.J.; Bahler, J. Rapidly regulated genes are intron poor. Trends Genet. 2008, 24, 375–378. [Google Scholar] [CrossRef] [PubMed]
  54. Tian, W.; Guo, Q.; Li, Z.; Ding, N.; Zhang, S.; Wen, S. Genome-wide identification and expression analysis in the heat stress of the DREB transcription factor family in bread wheat (Triticum aestivum L.). J. Triticeae Crops 2018, 38, 1146–1156. [Google Scholar] [CrossRef]
  55. Shu, Y.; Liu, Y.; Zhang, J.; Song, L.; Guo, C. Genome-wide analysis of the AP2/ERF superfamily genes and their responses to abiotic stress in Medicago truncatula. Front. Plant Sci. 2015, 6, 1247. [Google Scholar] [CrossRef]
  56. Leprince, O.; Pellizzaro, A.; Berriri, S.; Buitink, J. Late seed maturation: Drying without dying. J. Exp. Bot. 2016, 68, 827–841. [Google Scholar] [CrossRef] [PubMed]
  57. Wang, Y.; Xu, H.; Zhu, H.; Tao, Y.; Zhang, G.; Zhang, L.; Zhang, C.; Zhang, Z.; Ma, Z. Classification and expression diversification of wheat dehydrin genes. Plant Sci. 2014, 214, 113–120. [Google Scholar] [CrossRef]
  58. Finch-Savage, W.E.; Bassel, G.W. Seed vigour and crop establishment: Extending performance beyond adaptation. J. Exp. Bot. 2016, 67, 567–591. [Google Scholar] [CrossRef]
  59. Nadaud, I.; Girousse, C.; Debiton, C.; Chambon, C.; Bouzidi, M.F.; Martre, P.; Branlard, G. Proteomic and morphological analysis of early stages of wheat grain development. Proteomics 2010, 10, 2901–2910. [Google Scholar] [CrossRef]
  60. Shewry, P.R.; Mitchell, R.A.C.; Tosi, P.; Wan, Y.; Underwood, C.; Lovegrove, A.; Freeman, J.; Toole, G.A.; Mills, E.N.C.; Ward, J.L. An integrated study of grain development of wheat (cv. Hereward). J. Cereal Sci. 2012, 56, 21–30. [Google Scholar] [CrossRef]
  61. Dong, K.; Zhen, S.; Cheng, Z.; Cao, H.; Ge, P.; Yan, Y. Proteomic analysis reveals key proteins and phosphoproteins upon seed germination of wheat (Triticum aestivum L.). Front. Plant Sci. 2015, 6, 1017. [Google Scholar] [CrossRef]
  62. Xiao, B.; Huang, Y.; Tang, N.; Xiong, L. Over-expression of a LEA gene in rice improves drought resistance under the field conditions. Theor. Appl. Genet. 2007, 115, 35–46. [Google Scholar] [CrossRef]
  63. Xu, D.; Duan, X.; Wang, B.; Hong, B.; Ho, T.; Wu, R. Expression of a late embryogenesis abundant protein gene, HVA1, from barley confers tolerance to water deficit and salt stress in transgenic rice. Plant Physiol. 1996, 110, 249–257. [Google Scholar] [CrossRef] [PubMed]
  64. Olvera-Carrillo, Y.; Campos, F.; Reyes, J.L.; Garciarrubio, A.; Covarrubias, A.A. Functional analysis of the group 4 Late Embryogenesis Abundant proteins reveals their relevance in the adaptive response during water deficit in Arabidopsis. Plant Physiol. 2010, 154, 373–390. [Google Scholar] [CrossRef]
  65. Garnczarska, M.; Zalewski, T.; Kempka, M. Changes in water status and water distribution in maturing lupin seeds studied by MR imaging and NMR spectroscopy. J. Exp. Bot. 2007, 58, 3961–3969. [Google Scholar] [CrossRef] [PubMed]
  66. Garnczarska, M.; Zalewski, T.; Wojtyla, L. A comparative study of water distribution and dehydrin protein localization in maturing pea seeds. J. Plant Physiol. 2008, 165, 1940–1946. [Google Scholar] [CrossRef] [PubMed]
  67. Song, S.; Fu, J. Role of maturation drying in seed development and germination. Chin. Bull. Bot. 1998, 15, 23–32. [Google Scholar]
Figure 1. Phylogenetic analysis of wheat Late Embryogenesis-abundant (LEA) proteins. The LEA1, LEA2, LEA3, LEA4, LEA5, LEA6, Dehydrin, and seed maturation protein (SMP) groups are presented in blue, pink, light green, bright purple, orange, light blue, purple, and red, respectively.
Figure 1. Phylogenetic analysis of wheat Late Embryogenesis-abundant (LEA) proteins. The LEA1, LEA2, LEA3, LEA4, LEA5, LEA6, Dehydrin, and seed maturation protein (SMP) groups are presented in blue, pink, light green, bright purple, orange, light blue, purple, and red, respectively.
Genes 10 00696 g001
Figure 2. Exon-intron structures of TaLEA genes. Exon-intron are indicated by wide color bar and black line, respectively.
Figure 2. Exon-intron structures of TaLEA genes. Exon-intron are indicated by wide color bar and black line, respectively.
Genes 10 00696 g002
Figure 3. Conserved motifs (left) and their distributions (right) of TaLEA genes predicted by the MEME (Multiple Expectation Maximization for Motif Elicitation) online tool. Each motif is represented by a different colored box with corresponding number. The domain of each subfamily is represented by a different colored thick line below.
Figure 3. Conserved motifs (left) and their distributions (right) of TaLEA genes predicted by the MEME (Multiple Expectation Maximization for Motif Elicitation) online tool. Each motif is represented by a different colored box with corresponding number. The domain of each subfamily is represented by a different colored thick line below.
Genes 10 00696 g003
Figure 4. Distribution of TaLEA gene family members on wheat chromosomes. TaLEA1, TaLEA2, TaLEA3, TaLEA4, TaLEA5, TaLEA6, TaDehydrin and TaSMP subfamily are presented in orange, light green, red, purple, black, dark green, pink, and blue.
Figure 4. Distribution of TaLEA gene family members on wheat chromosomes. TaLEA1, TaLEA2, TaLEA3, TaLEA4, TaLEA5, TaLEA6, TaDehydrin and TaSMP subfamily are presented in orange, light green, red, purple, black, dark green, pink, and blue.
Genes 10 00696 g004
Figure 5. Expression profiling of TaLEA genes in different tissues. RNA-seq date downloaded from Wheat Expression Browser was used to analysis expression pattern. The heat map was drawn in Log10 (tpm + 0.0001)-transformed expression values. Color scale on the right represent relative expression levels: red represents high level and blue represent low level.
Figure 5. Expression profiling of TaLEA genes in different tissues. RNA-seq date downloaded from Wheat Expression Browser was used to analysis expression pattern. The heat map was drawn in Log10 (tpm + 0.0001)-transformed expression values. Color scale on the right represent relative expression levels: red represents high level and blue represent low level.
Genes 10 00696 g005
Figure 6. Expression profiles of 12 wheat LEA genes during grain filling and maturation. Purple and light purple columns indicate Yangmai16 and Yangmai 15, respectively. Values represent means ± standard deviation of three replicates. Asterisks reveal the gene significantly higher or lower in Yangmai 16 than in Yangmai 15 by t-test (* p < 0.05, ** p < 0.01).
Figure 6. Expression profiles of 12 wheat LEA genes during grain filling and maturation. Purple and light purple columns indicate Yangmai16 and Yangmai 15, respectively. Values represent means ± standard deviation of three replicates. Asterisks reveal the gene significantly higher or lower in Yangmai 16 than in Yangmai 15 by t-test (* p < 0.05, ** p < 0.01).
Genes 10 00696 g006
Figure 7. Phenotypic character of grain maturation in Yangmai 15 and Yangmai 16. (A), comparison of grain dry weight. (B), comparison of grain moisture content. (C), appearance of grains during maturation progress; Bar = 4 cm.
Figure 7. Phenotypic character of grain maturation in Yangmai 15 and Yangmai 16. (A), comparison of grain dry weight. (B), comparison of grain moisture content. (C), appearance of grains during maturation progress; Bar = 4 cm.
Genes 10 00696 g007
Table 1. Description of Late Embryogenesis Abundant (LEA) genes identified from the wheat genome.
Table 1. Description of Late Embryogenesis Abundant (LEA) genes identified from the wheat genome.
Gene_NameGene_IDAmino Acid NumberMWpIInstability IndexAliphatic IndexGRAVY
Note: MW, Molecular weight (Da); pI, Isoelectric point; GRAVY, Grand average of hydropathicity.

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (
Back to TopTop