Next Article in Journal
Peptide-Lipid Interactions: Experiments and Applications
Previous Article in Journal
Berry Phenolics of Grapevine under Challenging Environments

Int. J. Mol. Sci. 2013, 14(9), 18740-18757; doi:10.3390/ijms140918740

Article
The MAPKKK Gene Family in Gossypium raimondii: Genome-Wide Identification, Classification and Expression Analysis
Zujun Yin , Junjuan Wang , Delong Wang , Weili Fan , Shuai Wang and Wuwei Ye *
State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, Henan, China; E-Mails: zujuny@163.com (Z.Y.); wangjj@cricaas.com.cn (J.W.); wangdl@cricaas.com.cn (D.W.); hai-19@163.com (W.F.); wangshuai_19871201@163.com (S.W.)
*
Author to whom correspondence should be addressed; E-Mail: yew158@163.com; Tel./Fax: +86-372-256-283.
Received: 31 July 2013; in revised form: 22 August 2013 / Accepted: 26 August 2013 /
Published: 11 September 2013

Abstract

: Mitogen-activated protein kinase (MAPK) cascades are conserved signal transduction pathways in all eukaryotic organisms. MAPKKKs (MAPK kinase kinases) operate at the top levels of these cascades. Recently, this family of genes has been systematically investigated in Arabidopsis, rice and maize, but has not yet been characterized in cotton. In this study, we identified 78 putative MAPKKK genes in the genome of the diploid cotton, Gossypium raimondii. They were classified into three subfamilies, of which 12 were ZIK, 22 were MEKK and 44 were Raf. The ZIK and MEKK genes displayed a scattered genomic distribution across 11 of the 13 chromosomes, whereas Raf genes were distributed across the entire genome. Their conserved patterns observed for introns and additional domains were consistent with the evolutionary relationships inferred from the phylogenetic analysis within subfamily. Transcriptome sequencing data were used to investigate their transcript profiles in mature leaves, 0 day and 3 days post-anthesis (DPA) ovules. Sixty MAPKKK genes were expressed, of which 41 were strongly expressed in mature leaves. Twelve MAPKKK genes were more highly expressed in 3-DPA ovules than in 0-DPA ovules. Our results provide a foundation for future evolutionary and functional characterizations of MAPKKK genes in cotton and probably other Gossypium plants.
Keywords:
cotton; ovule; MAPK cascade; MAPKKK; gene family

1. Introduction

Mitogen-activated protein kinase (MAPK) cascades are evolutionarily conserved signal transduction modules that are found in all eukaryotes [1]. In plants, signaling through the MAPK cascade can lead to diverse cellular activities, including cell division and differentiation, responses to abiotic and biotic stresses and programmed cell death [2,3]. The cascades are composed of three protein kinase modules: MAPK kinase kinase (MAPKKKs/MEKKs), MAPK kinase (MAPKKs/MKKs) and MAPK (MAPKs/MPKs) [1]. They are sequentially activated through phosphorylation by their upstream components. Upstream signals activate the MAPKKKs, which in turn activate the MAPKKs. They result in the activation of the specific MAP kinases. Eventually, the activated MAPK phosphorylate various transcription factors and other signaling components that modulate the expression of downstream genes [4]. The phosphorylation of proteins by MAPKs can affect many aspects of their function, including protein stability, cellular localization, DNA binding, protein-protein interactions and the regulation of other post-translational modifications [5,6]. In addition to phosphorylating target molecules, there is evidence that MAPK cascade components can also non-enzymatically regulate transcription [7].

In plants, MAPKKKs contain long N- or C-terminal regions compared to MAPKs and MAPKKs [4]. This family has been clustered into three groups based on the sequence in their kinase catalytic domain: the Raf-like family, the MEKK-like family and the ZIK-like family [8]. In Arabidopsis, the well-studied Raf-like MAPKKK genes include the Constitutive Triple Response1 (CTR1) and Enhanced Disease Resistance1 (EDR1) genes [9,10]. CTR1 was found to inhibit MKK9-MPK3/MPK6 activation during ethylene signaling and probably acts as an unconventional MAPKKK [11,12]. EDR1 encodes a CTR1-like kinase and acts as a negative regulator of disease resistance and ethylene-induced senescence [13]. Two rice Raf-like MAPKKKs, named Accelerated Cell Death and Resistance 1 (ACDR1) and Drought-hypersensitive Mutant 1 (DSM1), have been reported to positively regulate fungal disease and drought resistance, respectively [14,15]. Rice Increased Leaf Angle 1 (ILA1) is a key factor regulating mechanical tissue formation at the leaf lamina joint [16]. Tobacco MAPKKK Nicotiana Protein Kinase 1 (NPK1) and Arabidopsis NPK1-related protein kinase 1 (ANP) are found in the equatorial region of phragmoplasts and are involved in cytoskeletal regulation [17]. The MEKK1 gene, one of the first MAPKKKs to be characterized in Arabidopsis, is involved in pathogen defense and abiotic stress responses [18,19]. It has become evident that the MEKK1-MKK1/2-MPK4 pathway is a central regulator of reactive oxygen species (ROS) metabolism [20]. In tomato, MAPKKKα and MAPKKKɛ act as signaling molecules that positively regulate cell death networks associated with plant immunity [21,22]. Accumulated functional characterization of MAPKKKs has highlighted its diverse function in a developmental, tissue and signal-dependent context.

Recently, MAPKKK gene family has been systematic investigated in Arabidopsis, rice and maize. The Arabidopsis genome contains approximately 80 MAPKKK genes, which include 48 Raf kinases, 21 MEKK kinases and 11 ZIK kinases [8,23]. The rice genome contains 75 MAPKKK genes [24]. Half of them were present in the first three chromosomes. Seventy four MAPKKKs were identified in the sequenced maize genome. The expression profiles of 57 genes were examined in different organs using microarray data [25].

Cotton belongs to the genus Gossypium, which consists of diploid and tetraploid species, and has originated from a common ancestor approximately 5–10 million years ago. In scientific research, cotton serves as an excellent model system for studying polyploidization, cell elongation and cell wall biosynthesis. Little is known, however, about the identification and characterization of the MAPKKK gene family in cotton. Gossypium raimondii is a diploid cotton. Its progenitor is the putative contributor of the D subgenome to the economically important fiber-producing cotton species G. hirsutum and G. barbadense [26]. Recently, the G. raimondii genome was sequenced, which made it possible to identify all the MAPKKK genes in this species for the first time [27]. In this study, 78 MAPKKK genes were identified from the G. raimondii genome. Detailed information on their genomic structures, chromosomal locations and phylogenetic trees is provided. Their transcript profiles in leaves and at the 0 day post-anthesis (DPA) and 3-DPA ovule developmental stages were further investigated using transcriptome sequencing data. Our results provide the basis for future research on their evolutionary mechanisms and the signaling pathways mediated by MAPKKKs in cotton.

2. Results and Discussion

2.1. Genome-Wide Identification of the MAPKKK Family in G. raimondii

MAPK cascades are evolutionarily conserved signaling modules in eukaryotes, such as animals, yeasts and plants. Based on the high degree of sequence conservation, putative orthologs of MAPK cascade members was identified by sequence comparison and signature motif searches [24]. In order to identify MAPKKK genes in G. raimondii, MAPKKK protein sequences from Arabidopsis and rice were used as queries in a BLAST search of the publically available G. raimondii sequence database. After extensive analysis, a total of 78 genes were defined as G. raimondii MAPKKKs (Table 1). During screening of the potential MAPKKKs, the conserved protein domains in their sequences were analyzed using the PROSITE program. All of them contained a serine/threonine protein kinase active site (PS00108), a protein kinase domain (PS50011) and a protein kinase ATP-binding region (PS00107). These characteristic features suggested that they were members of MAPK cascade gene family.

Based on Arabidopsis MAPKKK nomenclature suggestions [28], each gene was named with a two-letter code corresponding to G. raimondii (Gr). GrMAPKKKs were numbered from 1 to 78 according to the BLASTP search output from top to bottom (Table 1). The open reading frame (ORF) lengths of the GrMAPKKK genes ranged from 885 bp (GrMAPKKK2) to 4200 bp (GrMAPKKK14). Their protein sequences contained 294–1399 amino acids (aa), with the majority (84.62%) containing 350–900 aa. The molecular weight (Mw) of these proteins ranged from 33.68 kDa (GrMAPKKK2) to 154.46 kDa (GrMAPKKK14). The theoretical isoelectric point (pI) ranged from 4.96 (GrMAPKKK5) to 9.29 (GrMAPKKK21), with an average of about 6.75. Protein subcellular localization prediction is an essential step for understanding protein function and its pattern of interactions in protein networks [29]. Most of GrMAPKKK proteins were predicted to be located in the nucleus and the cytoplasm (Table 1). Five GrMAPKKK proteins (GrMAPKKK38, GrMAPKKK45, GrMAPKKK48, GrMAPKKK55 and GrMAPKKK61) were predicted to be located in the mitochondria. Two GrMAPKKK proteins (GrMAPKKK29 and GrMAPKKK44) were predicted to be located in the chloroplast. At present, the subcellular localization of most known MAPKKK proteins remains to be experimentally determined. In rice, DSM1 is a Raf-like MAPKKK gene functioning as an early signaling component in regulating responses to drought stress by regulating scavenging of ROS [15]. Its protein is confirmed to be located in nucleus by transient expression analysis. Another Raf-like MAPKKK protein ILA1 is predominantly resident in the nucleus and expressed in the vascular bundles of leaf lamina joints [16]. Detailed information about MAPKKK proteins in mitochondria or chloroplast is less in plant kingdom recently. Takabatake et al. [30] demonstrated that MAPK cascade could transduce the cell death signal to mitochondria for N gene-dependent cell death through operating downstream of heat shock protein 90 (HSP90) in tobacco.

2.2. Phylogenetic Analysis and Genomic Distribution of GrMAPKKKs

MAPKKK genes act at the highest level of the MAPK cascades and form the largest group of components in the MAPK cascade pathway. To investigate the molecular evolution and phylogenetic relationships between different MAPKKK family members, the MAPKKK proteins in Arabidopsis, rice, maize and G. raimondii were aligned by ClustalW and analyzed using MEGA v5. Using full-length protein sequences of the MAPKKKs, phylogenetic trees were constructed with the neighbor-joining (NJ) method and the minimal evolution (ME) method. The two methods produced the same topologies, even at the deep nodes. As shown in Figure 1, the phylogenetic trees indicate that the GrMAPKKK genes were placed into three categories, which were based on their sequence similarity with orthologs in other plants. The three categories were ZIK, MEKK and Raf. The ZIK subfamily consists of the fewest number of MAPKKK genes. Twelve GrMAPKKKs belonged to this group. In Arabidopsis, rice and maize, 11 AtMAPKKKs, 10 OsMAPKKKs and 6 ZmMAPKKKs were grouped in this subfamily [24,28]. The MEKK subfamily consists of 22 GrMAPKKK genes in G. raimondii. The remaining 44 GrMAPKKK genes belong to the Raf group, which is the largest subfamily of MAPKKKs (Figure 1).

The complete genome sequence gave an overview of the chromosomal distribution of GrMAPKKK genes. These important signaling components were mapped on all 13 chromosomes of G. raimondii (Figure 2). Chromosome VI carried 11 divergent GrMAPKKK genes and chromosome X contained 3 GrMAPKKK genes. It is thought that gene families can arise through tandem amplification, resulting in a clustered occurrence. They can also arise through segmental duplication of chromosomal regions, resulting in a scattered occurrence of family members [31]. In the G. raimondii genome, both the ZIK genes and the MEKK genes were found to be located on chromosomes: II, III, IV, V, VI, VII, VIII, XI, XII and XIII. Although there are many ZIK or MEKK paralogs that have high levels of sequence similarity, nevertheless, they showed a scattered genomic distribution across 11 of the 13 chromosomes. Recent studies have shown that the G. raimondii genome has undergone at least two rounds of genome-wide duplication, 2355 syntenic blocks have been identified and 39 triplicated regions [27]. The scattered genomic distribution pattern of the two subfamily genes probably reflects a series of whole genome, chromosomal and large segmental duplication events that typify the G. raimondii genome. Forty-four Raf subfamily members were found on all 13 chromosomes. All of them were also randomly distributed across the genome, apart from GrMAPKKK48, GrMAPKKK51, GrMAPKKK57 and GrMAPKKK61. GrMAPKKK48 and GrMAPKKK57 were found to be physically located near to each other on chromosome VI. GrMAPKKK51 and GrMAPKKK61 formed a cluster and were located on chromosome V. The two clusters consisted of two Raf gene pairs: GrMAPKKK48/GrMAPKKK61 and GrMAPKKK51/GrMAPKKK57. This pattern probably results from translocation and insertion events between the two chromosomes. Twelve GrMAPKKK paralogous gene pairs were found after phylogenic analysis. They were in the same clade of the phylogenetic tree, which suggested that the GrMAPKKK gene family may have undergone multiple duplications during its evolutionary history. It has been reported that 75 OsMAPKKKs are distributed on all 12 chromosomes of rice and half of them are present on the first three chromosomes [24]. In contrast, the 74 ZmMAPKKKs are distributed on all 10 chromosomes of maize without physical accumulation [25]. The scattered distribution of G. raimondii MAPKKK genes suggested that recent duplication events have occurred in this gene family.

2.3. Gene Structural Organization and Domain Analysis of GrMAPKKKs

Analysis of the exon-intron junction patterns can provide additional insights into the evolution of gene families. In order to obtain some insight into the gene structures of GrMAPKKK family genes, their exon/intron organizations were analyzed. The differences in gene structure among GrMAPKKKs were significant. As shown in Figure 3, the GrMAPKKKs had exons varying from one (GrMAPKKK23, GrMAPKKK27, GrMAPKKK28 and GrMAPKKK34) to 23 (GrMAPKKK13 and GrMAPKKK15). A majority of the GrMAPKKKs (83%) had more than 3 exons. ZIK subfamily members can be divided into three groups, according to their exon/intron structures. The GrMAPKKK2 and GrMAPKKK9 gene pairs had 2 exons and the GrMAPKKK1 and GrMAPKKK6 gene pairs had 8 exons. The remaining ZIK genes had 7 exons. The gene structures of the ZIK subfamily members were less divergent than the MEKK and Raf subfamilies. This suggested that the genes in this family had preserved a relatively constant exon-intron composition during the evolution of the G. raimondii genome. Exons are considered to be under a high selection pressure compared to introns. However, some losses or gains of exons were identified during the evolution of the MEKK family genes. For example, there was a small segment at the 3′ terminal of GrMAPKKK29-32, but not in GrMAPKKK23, GrMAPKKK27, GrMAPKKK28 and GrMAPKKK34, although they were within the same phylogenetic group. The Raf subfamily could be divided into six groups. The first five groups had conserved structural patterns, while group VI possessed a complex distribution of exons and introns. The intron numbers for the Raf subfamily genes ranged from two (GrMAPKKK36 and GrMAPKKK42) to 16 (GrMAPKKK59). Despite differences in the length of particular introns, it was clear that the exon structural pattern was well conserved between close paralogs, such as GrMAPKKK46 and GrMAPKKK54. GrMAPKKK35, GrMAPKKK66 and GrMAPKKK74 were in the same phylogenetic group. When their structural patterns were compared, we found that there was a missing exon in the middle of GrMAPKKK66 sequence. This indicated that the exon/intron structures of each gene cluster originated from tandem or segmental duplication events and tended to share a similar structural organization, but with evolutionary tiny difference.

Conserved domains in the GrMAPKKK protein sequences were analyzed using the PROSITE program. The characteristic feature of the ZIK family is the conserved signature, GTPEFMAPELY (Figure 4). The presence of this specific signature in a protein strongly suggests that it is a member of the ZIK family. In addition, a majority of the family members were found to have a N-terminal kinase domain, which was consistent with their orthologs in Arabidopsis, rice and maize. The MEKK family in plants has been shown to be similar to animal MEKKs and yeast MAPKKKs. Members of this family have less conserved protein structures and a shared motif: G(T/S)PX(W/F)MAPEV. Their kinase domain is located either at the N or C-terminal or in the central part of sequences. In contrast to the ZIK family, most of the Raf family proteins have a C-terminal kinase domain and a long N-terminal regulatory domain. GTXX(W/Y)MAPE(L/V) was a less conserved signature in these subfamily members. The great diversity in MAPKKKs may allow them to regulate many specific signaling pathways in plants, despite the relatively limited numbers of MAPKKs and MAPKs.

2.4. In Silico Analysis of Expression of MAPKKKs Based on Transcriptome Sequencing Data

Transcriptome analysis was previously used for identification of protein-coding genes during genome annotation of G. raimondii [27]. In this study, sequenced reads that were mapped on the MAPKKK sequences were converted to RPKM to estimate gene expression levels. The data was downloaded from the NCBI and the search was performed using at least 20 nucleotide long signatures.

Transcript abundance was examined in mature leaves, 0-DPA ovules and 3-DPA ovules. Our analysis revealed that 60 MAPKKK genes (76.92%) were expressed (Figure 5). Some family members exhibited tissue-specific expression, whereas others were more ubiquitously expressed. The expression patterns of some duplicate genes were partially redundant, which suggested that sub-functionalization had occurred during their evolution. Among the ZIK family members, GrMAPKKK1 was the most highly expressed member in 0-DPA ovules. It is an ortholog of At5g41990, which is called AtWNK kinase 8 (WNK8) in Arabidopsis. AtWNK8 kinase can phosphorylate AtRGS1 (regulator of G-protein signaling 1) and causes AtRGS1 endocytosis, which is required for both G-protein-mediated sugar signaling and cell proliferation [32]. Wang et al. [33] showed that the T-DNA knockout AtWNK8 mutation caused early flowering in Arabidopsis. GrMAPKKK3 was the most highly expressed member in mature leaves. It was also expressed in 3-DPA ovules, but not in 0-DPA ovules. GrMAPKKK16 is a member of MEKK subfamily and is an ortholog of ANP1/AtMEKK1 from Arabidopsis. It was most highly expressed in 3-DPA ovules. Previous studies demonstrated that Arabidopsis ANP1 was functionally redundant to ANP2 and ANP3 in the positive regulation of cytokinesis [34]. Two of the three double-mutant combinations displayed defects in cell division and growth. The mutants cause aberrant development of roots, shoots, cotyledons, rosette and cauline leaves, where many cells with incomplete daughter walls are seen [35,36]. In the Raf family, 11 MAPKKK genes (GrMAPKKK38, GrMAPKKK39, GrMAPKKK42, GrMAPKKK52, GrMAPKKK57, GrMAPKKK61, GrMAPKKK64, GrMAPKKK66, GrMAPKKK69, GrMAPKKK71 and GrMAPKKK74) were more highly expressed in 0-DPA ovules than in 3-DPA ovules, whereas 8 MAPKKK genes (GrMAPKKK40, GrMAPKKK45, GrMAPKKK46, GrMAPKKK47, GrMAPKKK54, GrMAPKKK58, GrMAPKKK77 and GrMAPKKK78) were more highly expressed in 3-DPA ovules than in 0-DPA ovules. GrMAPKKK46, GrMAPKKK54 and GrMAPKKK58 had a common ortholog, CTR1 in Arabidopsis. CTR1 was the first discovered component of the ethylene signal transduction pathway. It acts as a negative regulator of ethylene signaling because the ctr1 mutant alleles exhibited a constitutive ethylene response phenotype [37]. In the absence of ethylene, CTR1 interacts with ethylene receptors, thereby actively suppressing the ethylene signal response [37,38]. After the binding of ethylene to the receptors, CTR1 becomes inactivated and the signaling cascade is initiated. Ethylene governs a range of developmental and response processes in plants. After undertaking genomic, genetic, molecular biological and physiological studies, Shi et al. [39] demonstrated that ethylene had a prominent role in promoting cotton fiber cell elongation. Exogenously applied ethylene could promote robust fiber cell expansion, whereas its biosynthetic inhibitor specifically suppressed fiber growth. The higher expression of the three CTR1 homologs: GrMAPKKK46, GrMAPKKK54 and GrMAPKKK58 in no-fiber G. raimondii, suggested that they had a negative effect on early fiber cell development.

3. Experimental Section

3.1. Sequences Data and Database Search

Multiple database searches were performed to identify the MAPKKK genes in G. raimondii. The completed genome sequence of this species was downloaded from the National Centre for Biotechnology Information (NCBI) [40]. Its BioProject accession is PRJNA171262. The protein sequences were obtained from Cotton Genome Project (CGP) and were used to construct a local protein database [41]. It comprised 40,976 sequences. The method used to identify the GrMAPKKK genes was similar to that used for rice and maize [24,25]. The MAPKKK proteins from Arabidopsis and rice were used as query sequences. They were collected from the published literature and downloaded from The Arabidopsis Information Resource (TAIR) and the Rice Genome Annotation Project, respectively [42,43]. The BLAST search was carried out using BLASTP. The aligned parts were inspected and compared manually in order to determine their identity. The G. raimondii proteins that had a 50% identity with the query sequence were selected out. These proteins were aligned with themselves and any redundancy removed. Sequence characterizations of the remaining proteins, including the serine/threonine protein kinase active site, the protein kinase domain and the protein kinases ATP-binding region were checked. Motif scanning was done using the PROSITE program at ExPASy [44].

3.2. pI, Mw and Subcellular Localization Predictions for MAPKKKs

The theoretical pI and Mw of the proteins were calculated by the Compute pI/Mw tool in the ExPASy server [45]. Protein pI was calculated using the pK values of amino acids, as described by Bjellqvist et al. [46], which were defined by examining polypeptide migration between pH 4.5 and 7.3 in an immobilized pH gradient gel environment with 9.2 M and 9.8 M urea at 15 °C or 25 °C. Protein Mw was calculated by adding the average isotopic masses of the amino acids in the protein and the average isotopic mass of one water molecule. Molecular weight values are given in Daltons (Da) and the subcellular localization of each MAPKKK was analyzed using the CELLO v2.5 server [47].

3.3. Chromosomal Locations and Gene Structure Analysis

The cDNA sequences of G. raimondii were obtained from the Cotton Genome Project. To determine the locations of the MAPKKK genes on chromosomes, their sequences were used as query sequences for a BLASTN search against the whole G. raimondii genome. The exon/intron structures for individual MAPKKK genes were determined by aligning the cDNA sequences to their corresponding genomic DNA sequences.

3.4. Phylogenetic Analysis and Gene Duplication of MAPKKK Genes

The full-length protein sequences of the MAPKKKs were multi-aligned using the ClustalW2 program [48]. Phylogenetic trees were constructed by employing the NJ method and the ME method found in the MEGA v5 software suite. For both methods, bootstrap testing of the phylogeny was performed with 1000 replications. Other parameters followed the default parameters. The TREEVIEW program and MEGA v5 were used to display the phylogenetic trees. The following criteria were used to define the gene duplication: (1) the alignment length covered > 80% of the longer gene; (2) the aligned region had an identity > 80% and (3) only one duplication event was counted for tightly linked genes.

3.5. Expression Analyses of the MAPKKK Genes

The expression pattern of the MAPKKK genes was analyzed in three tissue samples: mature leaves, 0-DPA ovules and 3-DPA ovules of G. raimondii. Transcriptome sequencing data for these three samples were obtained from the NCBI Sequence Read Archive (SRA). The accession numbers were: SRX111367, SRX111365 and SRX111366, respectively. Sequenced reads that were mapped on the MAPKKK sequences were converted to RPKM in order to estimate gene expression levels [49]. The formula used was:

RPKM = 10 6 C / ( N L / 10 3 )

where C is the number of reads that were uniquely aligned to the transcript, N is the total number of reads that were uniquely aligned to all the transcripts in a specific sample and L is number of bases in the transcript.

4. Conclusions

MAPK cascades act as important signal transduction modules in eukaryotes for many different cellular activities. A MAPK cascade, in its simplest form, consists of a MAPKKK-MAPKK-MAPK module that is linked in various ways to upstream receptors and downstream targets. As the first level of the phosphorylating cascade, the MAPKKK family has the most members and exhibits the most divergence. So far, 80, 75 and 74 MAPKKK genes in the Arabidopsis, rice and maize genomes have been reported, respectively. In this study, we identified 78 MAPKKK genes in the sequenced genome of diploid cotton G. raimondii. Their phylogenetic relationship, genomic distribution, conserved protein motif and exon/intron organization were characterized. The MAPKKK genes were divided into three major groups. There were 12 MAPKKKs in the ZIK group, 22 MAPKKKs in the MEKK group and 44 MAPKKKs in the Raf group. The genes in each group had a similar motif in the deduced amino acid sequences, which supported their identification as members of ZIK, MEKK and Raf. The ZIK and MEKK genes were located on chromosomes: II, III, IV, V, VI, VII, VIII, XI, XII and XIII. The Raf genes were distributed across all 13 chromosomes. Although there were numerous MAPKKKs paralogs displaying high levels of sequence similarity, few clusters containing closely related genes were found. This pattern probably reflects a series of whole genome, chromosomal and large segmental duplication events for the G. raimondii genome. Based on transcriptome sequencing data, we analyzed expression patterns of MAPKKKs at the fiber initiation stage and in mature leaves. They showed dramatic differences in expression levels in different tissues. Interestingly, GrMAPKKK46, GrMAPKKK54 and GrMAPKKK58 had higher expression levels in 3-DPA ovules than in 0-DPA ovules. Their direct ortholog was CTR1, which is known to be a negative regulator of the ethylene signaling pathway in Arabidopsis. When combined with the prominent role for ethylene in promoting cotton fiber development, this observation might help us to investigate the regulation mechanism for fiber cell elongation in cotton species. Taken together, our results should facilitate the functional annotation of these first signaling components in the MAPK cascade. Additionally, the results should provide an important foundation for the study of very poorly characterized MAPKKKs in non-sequenced tetraploid cotton.

Acknowledgments

This work was supported by grants from the National Natural Science Foundation of China (Grant No. 31201247), and the China Major Projects for Transgenic Breeding (Grant No. 2011ZX005-004).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Nishihama, R.; Banno, H.; Shibata, W.; Hirano, K.; Nakashima, M.; Usami, S.; Machida, Y. Plant homologues of components of MAPK (mitogen-activated protein kinase) signal pathways in yeast and animal cells. Plant Cell Physiol 1995, 36, 749–757. [Google Scholar]
  2. Mishra, N.S.; Tuteja, R.; Tuteja, N. Signaling through MAP kinase networks in plants. Arch. Biochem. Biophys 2006, 452, 55–68. [Google Scholar]
  3. Opdenakker, K.; Remans, T.; Vangronsveld, J.; Cuypers, A. Mitogen-activated protein (MAP) kinases in plant metal stress: Regulation and responses in comparison to other biotic and abiotic stresses. Int. J. Mol. Sci 2012, 13, 7828–7853. [Google Scholar]
  4. Rodriguez, M.C.; Petersen, M.; Mundy, J. Mitogen-activated protein kinase signaling in plants. Annu. Rev. Plant Biol 2010, 61, 621–649. [Google Scholar]
  5. Yang, S.H.; Sharrocks, A.D.; Whitmarsh, A.J. Transcriptional regulation by the MAP kinase signaling cascades. Gene 2003, 320, 3–21. [Google Scholar]
  6. Whitmarsh, A.J. Regulation of gene transcription by mitogen-activated protein kinase signaling pathways. Biochim. Biophys. Acta 2007, 1773, 1285–1298. [Google Scholar]
  7. Rodriguez, J.; Crespo, P. Working without kinase activity: Phosphotransfer-independent functions of extracellular signal-regulated kinases. Sci. Sign. 2011, 4, re3. [Google Scholar]
  8. Taj, G.; Agarwal, P.; Grant, M.; Kumar, A. MAPK machinery in plants: Recognition and response to different stresses through multiple signal transduction pathways. Plant Sign. Behave 2010, 5, 1370–1378. [Google Scholar]
  9. Kieber, J.J.; Rothenberg, M.; Roman, G.; Feldmann, K.A.; Ecker, J.R. CTR1, a negative regulator of the ethylene response pathway in Arabidopsis, encodes a member of the raf family of protein kinases. Cell 1993, 72, 427–441. [Google Scholar]
  10. Frye, C.A.; Tang, D.; Innes, R.W. Negative regulation of defense responses in plants by a conserved MAPKK kinase. Proc. Natl. Acad. Sci. USA 2001, 98, 373–378. [Google Scholar]
  11. Testerink, C.; Larsen, P.B.; van der Does, D.; van Himbergen, J.A.; Munnik, T. Phosphatidic acid binds to and inhibits the activity of Arabidopsis CTR1. J. Exp. Bot 2007, 58, 3905–3914. [Google Scholar]
  12. Yoo, S.D.; Sheen, J. MAPK signaling in plant hormone ethylene signal transduction. Plant Sign. Behav 2008, 3, 848–849. [Google Scholar]
  13. Tang, D.; Christiansen, K.M.; Innes, R.W. Regulation of plant disease resistance, stress responses, cell death, and ethylene signaling in Arabidopsis by the EDR1 protein kinase. Plant Physiol 2005, 138, 1018–1026. [Google Scholar]
  14. Kim, J.A.; Cho, K.; Singh, R.; Jung, Y.H.; Jeong, S.H.; Kim, S.H.; Lee, J.E.; Cho, Y.S.; Agrawal, G.K.; Rakwal, R.; et al. Rice OsACDR1 (Oryza sativa accelerated cell death and resistance 1) is a potential positive regulator of fungal disease resistance. Mol. Cells 2009, 28, 431–439. [Google Scholar]
  15. Ning, J.; Li, X.; Hicks, L.M.; Xiong, L. A Raf-like MAPKKK gene DSM1 mediates drought resistance through reactive oxygen species scavenging in rice. Plant Physiol 2010, 152, 876–890. [Google Scholar]
  16. Ning, J.; Zhang, B.; Wang, N.; Zhou, Y.; Xiong, L. Increased leaf angle1, a Raf-like MAPKKK that interacts with a nuclear protein family, regulates mechanical tissue formation in the Lamina joint of rice. Plant Cell 2011, 23, 4334–4347. [Google Scholar]
  17. Jin, H.; Axtell, M.J.; Dahlbeck, D.; Ekwenna, O.; Zhang, S.; Staskawicz, B.; Baker, B. NPK1, an MEKK1-like mitogen-activated protein kinase kinase kinase, regulates innate immunity and development in plants. Dev. Cell 2002, 3, 291–297. [Google Scholar]
  18. Asai, T.; Tena, G.; Plotnikova, J.; Willmann, M.R.; Chiu, W.L.; Gomez-Gomez, L.; Boller, T.; Ausubel, F.M.; Sheen, J. MAP kinase signalling cascade in Arabidopsis innate immunity. Nature 2002, 415, 977–983. [Google Scholar]
  19. Mizoguchi, T.; Irie, K.; Hirayama, T.; Hayashida, N.; Yamaguchi-Shinozaki, K.; Matsumoto, K.; Shinozaki, K. A gene encoding a mitogen-activated protein kinase kinase kinase is induced simultaneously with genes for a mitogen-activated protein kinase and an S6 ribosomal protein kinase by touch, cold, and water stress in Arabidopsis thaliana. Proc. Natl. Acad. Sci. USA 1996, 93, 765–769. [Google Scholar]
  20. Pitzschke, A.; Djamei, A.; Bitton, F.; Hirt, H. A major role of the MEKK1-MKK1/2-MPK4 pathway in ROS signalling. Mol. Plant 2009, 2, 120–137. [Google Scholar]
  21. Melech-Bonfil, S.; Sessa, G. Tomato MAPKKKɛ is a positive regulator of cell-death signaling networks associated with plant immunity. Plant J 2010, 64, 379–391. [Google Scholar]
  22. Del Pozo, O.; Pedley, K.F.; Martin, G.B. MAPKKKα is a positive regulator of cell death associated with both plant immunity and disease. EMBO J 2004, 23, 3072–3082. [Google Scholar]
  23. Wrzaczek, M.; Hirt, H. Plant MAP kinase pathways: How many and what for? Biol. Cell 2001, 93, 81–87. [Google Scholar]
  24. Rao, K.P.; Richa, T.; Kumar, K.; Raghuram, B.; Sinha, A.K. In silico analysis reveals 75 members of mitogen-activated protein kinase kinase kinase gene family in rice. DNA Res 2010, 17, 139–153. [Google Scholar]
  25. Kong, X.; Lv, W.; Zhang, D.; Jiang, S.; Zhang, S.; Li, D. Genome-wide identification and analysis of expression profiles of maize mitogen-activated protein kinase kinase kinase. PLoS One 2013, 8, e57714. [Google Scholar]
  26. Paterson, A.H.; Wendel, J.F.; Gundlach, H.; et al. Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres. Nature 2012, 492, 423–427. [Google Scholar]
  27. Wang, K.; Wang, Z.; Li, F.; Ye, W.; Wang, J.; Song, G.; Yue, Z.; Cong, L.; Shang, H.; Zhu, S.; et al. The draft genome of a diploid cotton Gossypium raimondii. Nat. Genet 2012, 44, 1098–1103. [Google Scholar]
  28. Ichimura, K. Mitogen-activated protein kinase cascades in plants: A new nomenclature. Trends Plant Sci. 2002, 7, 301–308. [Google Scholar]
  29. Chou, K.C.; Shen, H.B. Recent progress in protein subcellular location prediction. Anal. Biochem 2007, 370, 1–16. [Google Scholar]
  30. Takabatake, R.; Ando, Y.; Seo, S.; Katou, S.; Tsuda, S.; Ohashi, Y.; Mitsuhara, I. MAP kinases function downstream of HSP90 and upstream of mitochondria in TMV resistance gene N-mediated hypersensitive cell death. Plant Cell Physiol 2007, 48, 498–510. [Google Scholar]
  31. Schauser, L.; Wieloch, W.; Stougaard, J. Evolution of NIN-like proteins in Arabidopsis, rice, and Lotus japonicus. J. Mol. Evolut 2005, 60, 229–237. [Google Scholar]
  32. Urano, D.; Phan, N.; Jones, J.C.; Yang, J.; Huang, J.; Grigston, J.; Taylor, J.P.; Jones, A.M. Endocytosis of the seven-transmembrane RGS1 protein activates G-protein-coupled signalling in Arabidopsis. Nat. Cell Biol 2012, 14, 1079–1088. [Google Scholar]
  33. Wang, Y.; Liu, K.; Liao, H.; Zhuang, C.; Ma, H.; Yan, X. The plant WNK gene family and regulation of flowering time in Arabidopsis. Plant Biol 2008, 10, 548–562. [Google Scholar]
  34. Krysan, P.J.; Jester, P.J.; Gottwald, J.R.; Sussman, M.R. An Arabidopsis mitogen-activated protein kinase kinase kinase gene family encodes essential positive regulators of cytokinesis. Plant Cell 2002, 14, 1109–1120. [Google Scholar]
  35. Muller, J.; Beck, M.; Mettbach, U.; Komis, G.; Hause, G.; Menzel, D.; Samaj, J. Arabidopsis MPK6 is involved in cell division plane control during early root development, and localizes to the pre-prophase band, phragmoplast, trans-Golgi network and plasma membrane. Plant J 2010, 61, 234–248. [Google Scholar]
  36. Beck, M.; Komis, G.; Ziemann, A.; Menzel, D.; Samaj, J. Mitogen-activated protein kinase 4 is involved in the regulation of mitotic and cytokinetic microtubule transitions in Arabidopsis thaliana. New Phytol 2011, 189, 1069–1083. [Google Scholar]
  37. Clark, K.L.; Larsen, P.B.; Wang, X.; Chang, C. Association of the Arabidopsis CTR1 Raf-like kinase with the ETR1 and ERS ethylene receptors. Proc. Natl. Acad. Sci. USA 1998, 95, 5401–5406. [Google Scholar]
  38. Huang, Y.; Li, H.; Hutchison, C.E.; Laskey, J.; Kieber, J.J. Biochemical and functional analysis of CTR1, a protein kinase that negatively regulates ethylene signaling in Arabidopsis. Plant J 2003, 33, 221–233. [Google Scholar]
  39. Shi, Y.H.; Zhu, S.W.; Mao, X.Z.; Feng, J.X.; Qin, Y.M.; Zhang, L.; Cheng, J.; Wei, L.P.; Wang, Z.Y.; Zhu, Y.X. Transcriptome profiling, molecular biological, and physiological studies reveal a major role for ethylene in cotton fiber cell elongation. Plant Cell 2006, 18, 651–664. [Google Scholar]
  40. NCBI. Available online: http://www.ncbi.nlm.nih.gov/ (accessed on 29 December 2012).
  41. Cotton Genome Project. Available online: http://cgp.genomics.org.cn/page/species/index.jsp/ (accessed on 29 December 2012).
  42. The Arabidopsis Information Resource. Available online: http://www.arabidopsis.org/ (accessed on 9 January 2013).
  43. Rice Genome Annotation Project. Available online: ftp://ftp.plantbiology.msu.edu/ (accessed on 10 January 2013).
  44. PROSITE program. ExPASy server, Available online: http://www.expasy.ch/prosite/ (accessed on 19 June 2013).
  45. Compute pI/Mw tool. ExPASy server, Available online: http://web.expasy.org/compute_pi/ (accessed on 19 June 2013).
  46. Bjellqvist, B.; Hughes, G.J.; Pasquali, C.; Paquet, N.; Ravier, F.; Sanchez, J.C.; Frutiger, S.; Hochstrasser, D. The focusing positions of polypeptides in immobilized pH gradients can be predicted from their amino acid sequences. Electrophoresis 1993, 14, 1023–1031. [Google Scholar]
  47. CELLO v2.5. Available online: http://cello.life.nctu.edu.tw/ (accessed on 29 June 2013).
  48. ClustalW2. Available online: http://www.ebi.ac.uk/Tools/msa/clustalw2/ (accessed on 29 June 2013).
  49. Mortazavi, A.; Williams, B.A.; McCue, K.; Schaeffer, L.; Wold, B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods 2008, 5, 621–628. [Google Scholar]
Ijms 14 18740f1 1024
Figure 1. Phylogenetic trees of the MAPKKK family genes in Arabidopsis, rice, maize and G. raimondii. The unrooted tree was generated by the MEGA v5 program using the neighbor-joining method. Bootstrap values from 1000 replicates are indicated at each branch. The phylogenetic trees were constructed based on the full-length protein sequences of the MAPKKKs. At, Arabidopsis thaliana; Os, Oryza sativa; GRMZM, gene model IDs from the Maize Genome Sequencing Project.

Click here to enlarge figure

Figure 1. Phylogenetic trees of the MAPKKK family genes in Arabidopsis, rice, maize and G. raimondii. The unrooted tree was generated by the MEGA v5 program using the neighbor-joining method. Bootstrap values from 1000 replicates are indicated at each branch. The phylogenetic trees were constructed based on the full-length protein sequences of the MAPKKKs. At, Arabidopsis thaliana; Os, Oryza sativa; GRMZM, gene model IDs from the Maize Genome Sequencing Project.
Ijms 14 18740f1 1024
Ijms 14 18740f2 1024
Figure 2. Chromosomal locations of MAPKKK genes in the G. raimondii genome. The ZIK, MEKK and Raf subfamily genes are separately indicated with green, orange and blue, respectively. The arrows indicate the rightward or leftward direction of transcription.

Click here to enlarge figure

Figure 2. Chromosomal locations of MAPKKK genes in the G. raimondii genome. The ZIK, MEKK and Raf subfamily genes are separately indicated with green, orange and blue, respectively. The arrows indicate the rightward or leftward direction of transcription.
Ijms 14 18740f2 1024
Ijms 14 18740f3 1024
Figure 3. Intron and exon organization of MAPKKK family genes in G. raimondii. Introns and exons are represented by black lines and colored boxes, respectively. MAPKKK genes have been grouped according to phylogenetic classification.

Click here to enlarge figure

Figure 3. Intron and exon organization of MAPKKK family genes in G. raimondii. Introns and exons are represented by black lines and colored boxes, respectively. MAPKKK genes have been grouped according to phylogenetic classification.
Ijms 14 18740f3 1024
Ijms 14 18740f4 1024
Figure 4. Protein sequence alignment of MAPKKK genes in G. raimondii. Alignment was performed using ClustalW program. The highlighted part shows the conserved signature motif.

Click here to enlarge figure

Figure 4. Protein sequence alignment of MAPKKK genes in G. raimondii. Alignment was performed using ClustalW program. The highlighted part shows the conserved signature motif.
Ijms 14 18740f4 1024
Ijms 14 18740f5 1024
Figure 5. Transcript abundance of MAPKKK genes in mature leaves, 0-DPA ovules and 3-DPA ovules of G. raimondii. The transcript expressions were calculated using RPKM method.

Click here to enlarge figure

Figure 5. Transcript abundance of MAPKKK genes in mature leaves, 0-DPA ovules and 3-DPA ovules of G. raimondii. The transcript expressions were calculated using RPKM method.
Ijms 14 18740f5 1024
Table Table 1. Characteristics of MAPK kinase kinase (MAPKKKs) from G. raimondii.

Click here to display table

Table 1. Characteristics of MAPK kinase kinase (MAPKKKs) from G. raimondii.
Gene nameAccession numberAApIMwPredicted subcellular localization
GrMAPKKK1Cotton_D_gene_100309767055.2479.82Nuclear
GrMAPKKK2Cotton_D_gene_100249422945.3733.68Nuclear
GrMAPKKK3Cotton_D_gene_100213846135.0969.23Nuclear
GrMAPKKK4Cotton_D_gene_100253605926.1967.33Nuclear
GrMAPKKK5Cotton_D_gene_100256896684.9675.59Nuclear
GrMAPKKK6Cotton_D_gene_100168196865.1777.71Nuclear
GrMAPKKK7Cotton_D_gene_100111357344.9983.01Nuclear
GrMAPKKK8Cotton_D_gene_100075387275.4783.62Nuclear
GrMAPKKK9Cotton_D_gene_100069032995.2134.12Nuclear
GrMAPKKK10Cotton_D_gene_100287276435.5672.19Nuclear
GrMAPKKK11Cotton_D_gene_100197416095.2567.84Nuclear
GrMAPKKK12Cotton_D_gene_100363315935.3667.67Nuclear
GrMAPKKK13Cotton_D_gene_1001804013705.96150.94Nuclear
GrMAPKKK14Cotton_D_gene_1003469213995.86154.46Nuclear
GrMAPKKK15Cotton_D_gene_1002966913975.94153.50Nuclear
GrMAPKKK16Cotton_D_gene_100170216625.7972.69Nuclear
GrMAPKKK17Cotton_D_gene_100015558979.2296.45Nuclear
GrMAPKKK18Cotton_D_gene_100022305775.4164.07Nuclear
GrMAPKKK19Cotton_D_gene_100034106436.0670.78Nuclear
GrMAPKKK20Cotton_D_gene_100197516619.2071.56Nuclear
GrMAPKKK21Cotton_D_gene_100305108969.2996.69Nuclear
GrMAPKKK22Cotton_D_gene_100329837118.9978.75Nuclear
GrMAPKKK23Cotton_D_gene_100086023384.9737.35Cytoplasmic
GrMAPKKK24Cotton_D_gene_100380466609.0571.32Nuclear
GrMAPKKK25Cotton_D_gene_100393215765.6363.66Nuclear
GrMAPKKK26Cotton_D_gene_100404377428.6881.55Nuclear
GrMAPKKK27Cotton_D_gene_100248964374.6548.84Cytoplasmic
GrMAPKKK28Cotton_D_gene_100303143365.7836.99Cytoplasmic
GrMAPKKK29Cotton_D_gene_100003054667.8552.12Chloroplast
GrMAPKKK30Cotton_D_gene_100253304435.5149.41Nuclear
GrMAPKKK31Cotton_D_gene_100069724955.6355.42Cytoplasmic
GrMAPKKK32Cotton_D_gene_100338564385.4948.15Nuclear
GrMAPKKK33Cotton_D_gene_100303286138.4767.53Nuclear
GrMAPKKK34Cotton_D_gene_100312214465.5849.38Cytoplasmic
GrMAPKKK35Cotton_D_gene_100401259865.25108.05Nuclear
GrMAPKKK36Cotton_D_gene_100351213719.1442.42Nuclear
GrMAPKKK37Cotton_D_gene_100278963747.9241.92Nuclear
GrMAPKKK38Cotton_D_gene_100378784608.6351.88Mitochondrial
GrMAPKKK39Cotton_D_gene_100194473817.9242.44Nuclear
GrMAPKKK40Cotton_D_gene_100305703995.4244.40Nuclear
GrMAPKKK41Cotton_D_gene_100327603777.9442.04Nuclear
GrMAPKKK42Cotton_D_gene_100312003748.9242.57Nuclear
GrMAPKKK43Cotton_D_gene_100380724207.8946.91Cytoplasmic
GrMAPKKK44Cotton_D_gene_100127744158.1146.35Chloroplast
GrMAPKKK45Cotton_D_gene_100148864278.6248.02Mitochondrial
GrMAPKKK46Cotton_D_gene_100281688496.1394.22Nuclear
GrMAPKKK47Cotton_D_gene_100118656095.8568.46Cytoplasmic
GrMAPKKK48Cotton_D_gene_100003795409.0361.20Mitochondrial
GrMAPKKK49Cotton_D_gene_100102623518.0939.70Cytoplasmic
GrMAPKKK50Cotton_D_gene_100193943546.5739.98Cytoplasmic
GrMAPKKK51Cotton_D_gene_100025873538.9239.42Cytoplasmic
GrMAPKKK52Cotton_D_gene_100354933918.3843.63Cytoplasmic
GrMAPKKK53Cotton_D_gene_100255985635.9163.78Cytoplasmic
GrMAPKKK54Cotton_D_gene_100130518576.1894.66Nuclear
GrMAPKKK55Cotton_D_gene_100335684028.4244.93Mitochondrial
GrMAPKKK56Cotton_D_gene_100359823918.3543.59Nuclear
GrMAPKKK57Cotton_D_gene_100355563558.8239.84Cytoplasmic
GrMAPKKK58Cotton_D_gene_100039208606.4995.59Nuclear
GrMAPKKK60Cotton_D_gene_100058665755.7165.08Cytoplasmic
GrMAPKKK61Cotton_D_gene_100230834779.1554.17Mitochondrial
GrMAPKKK62Cotton_D_gene_100181835526.0962.28Cytoplasmic
GrMAPKKK63Cotton_D_gene_100316293918.6543.44Nuclear
GrMAPKKK64Cotton_D_gene_100191013525.9839.79Cytoplasmic
GrMAPKKK65Cotton_D_gene_100030243817.0542.54Cytoplasmic
GrMAPKKK66Cotton_D_gene_100150949045.45100.46Cytoplasmic
GrMAPKKK67Cotton_D_gene_100212667826.8786.35Nuclear
GrMAPKKK68Cotton_D_gene_100344485766.2665.42Nuclear
GrMAPKKK69Cotton_D_gene_100052803838.1942.40Cytoplasmic
GrMAPKKK70Cotton_D_gene_1002828011375.56126.21Nuclear
GrMAPKKK71Cotton_D_gene_100157018485.8894.81Nuclear
GrMAPKKK72Cotton_D_gene_100124017656.5484.93Nuclear
GrMAPKKK73Cotton_D_gene_1001603111075.46122.84Cytoplasmic
GrMAPKKK74Cotton_D_gene_100372388545.1894.24Cytoplasmic
GrMAPKKK75Cotton_D_gene_100057977438.1782.52Nuclear
GrMAPKKK76Cotton_D_gene_1000443413605.18147.72Nuclear
GrMAPKKK77Cotton_D_gene_100117208796.8097.86Nuclear
GrMAPKKK78Cotton_D_gene_100339545356.4360.56Cytoplasmic

AA: Amino acid; pI: The theoretical isoelectric point of proteins; Mw: The theoretical molecular weight of proteins.

Int. J. Mol. Sci. EISSN 1422-0067 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert