Next Article in Journal
A Chemist’s Perspective on the Role of Phosphorus at the Origins of Life
Next Article in Special Issue
Reporting Key Features in Cold-Adapted Bacteria
Previous Article in Journal
Silicate-Promoted Phosphorylation of Glycerol in Non-Aqueous Solvents: A Prebiotically Plausible Route to Organophosphates
Previous Article in Special Issue
DNA Protection Protein, a Novel Mechanism of Radiation Tolerance: Lessons from Tardigrades
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Tracing the Repertoire of Promiscuous Enzymes along the Metabolic Pathways in Archaeal Organisms

by
Mario Alberto Martínez-Núñez
1,*,
Zuemy Rodríguez-Escamilla
2,
Katya Rodríguez-Vázquez
3 and
Ernesto Pérez-Rueda
4,5
1
Laboratorio de Estudios Ecogenómicos, Facultad de Ciencias, Unidad Académica de Ciencias y Tecnología de la UNAM en Yucatán, Universidad Nacional Autónoma de México, Carretera Sierra Papacal-Chuburna Km. 5, C.P. 97302, Mérida, Yucatán, Mexico
2
Departamento de Microbiología, Instituto de Biotecnología, Universidad Nacional, Autónoma de México, C.P. 62210, Cuernavaca, Morelos, Mexico
3
Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Ciudad Universitaria, C.P. 04510, Ciudad de México, Mexico
4
Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, C.P. 62210, Cuernavaca, Morelos, Mexico
5
Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Unidad Académica Yucatán, Carretera Sierra Papacal-Chuburna Km. 5, C.P. 97302, Mérida, Yucatán, Mexico
*
Author to whom correspondence should be addressed.
Submission received: 12 April 2017 / Revised: 9 July 2017 / Accepted: 10 July 2017 / Published: 13 July 2017
(This article belongs to the Special Issue Extremophiles and the Origin of Life)

Abstract

:
The metabolic pathways that carry out the biochemical transformations sustaining life depend on the efficiency of their associated enzymes. In recent years, it has become clear that promiscuous enzymes have played an important role in the function and evolution of metabolism. In this work we analyze the repertoire of promiscuous enzymes in 89 non-redundant genomes of the Archaea cellular domain. Promiscuous enzymes are defined as those proteins with two or more different Enzyme Commission (E.C.) numbers, according the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. From this analysis, it was found that the fraction of promiscuous enzymes is lower in Archaea than in Bacteria. A greater diversity of superfamily domains is associated with promiscuous enzymes compared to specialized enzymes, both in Archaea and Bacteria, and there is an enrichment of substrate promiscuity rather than catalytic promiscuity in the archaeal enzymes. Finally, the presence of promiscuous enzymes in the metabolic pathways was found to be heterogeneously distributed at the domain level and in the phyla that make up the Archaea. These analyses increase our understanding of promiscuous enzymes and provide additional clues to the evolution of metabolism in Archaea.

1. Introduction

The adaptation of organisms to environment changes depends on the cellular adequacy to abiotic and biotic variables, such as temperature, nutrients availability, salinity, or the interaction with others organisms, which are in constant changes. In this context, diverse cellular components such as transporters, regulatory proteins and enzymes converge to contend and adapt against these changes. The role that enzymes play in these processes is fundamental, since they achieve the biochemical transformations of substrates into useful products, providing the cell with matter and energy necessary to grow. Enzymes have usually been described by catalyzing only one reaction on specific substrates, thus they are commonly considered as “specialists”; however, some enzymes may also be multifunctional [1,2]. Multifunctional enzymes have been defined as proteins playing multiple physiological roles in the cell, and can be classified as moonlighting and promiscuous. Moonlighting enzymes are usually composed by a structural domain that performs the catalytic activity, and a non-catalytic domain, associated with regulation or protein–protein interactions, among others activities; in counterpart, promiscuous enzymes are only composed of catalytic domains performing several biochemical functions [3,4,5,6]. Promiscuous enzymes can be further classified into substrate promiscuous enzymes, i.e., enzymes with relaxed or broad substrate specificity; and catalytic promiscuous enzymes, i.e., enzymes catalyzing distinctly different chemical transformations with different transition states [7,8]. In this context, diverse mechanisms that allow enzymatic promiscuity, such as conformational diversity, alternative substrates positions, and different protonation states, among others have been described [9]. Together, these functional and evolutionary processes associated with enzymes shape the evolution of metabolic systems [10,11].
From a genomic perspective, it has been reported that around 10% of the total enzymatic repertoire in bacterial and archaeal organisms corresponds to promiscuous enzymes [12]. In this regard, taking into account the lower number of genes on archaeal genomes, whose genome sizes range from ~0.5 Mb in the case of Nanoarchaeum equitans [13] to ~5.5 Mb in Methanosarcina barkeri [14], it has been proposed that archaeal enzymes show a greater degree of multifunctionality [2]. In order to have a more detailed perspective of the multifunctionality of archaeal enzymes, we evaluated in a systematic and genomic way the content and the structural domains in the enzymatic repertoire of archaeal genomes, and in particular, those involving the promiscuous enzymes, as well as the distribution of promiscuous enzymes on metabolic pathways. To achieve this analysis, the content of promiscuous enzymes and their structural domains was accomplished in 89 non-redundant archaeal genomes. Based on statistical evaluations, it was identified that the distribution of promiscuous enzymes is lower in Archaea than in Bacteria. Another outstanding result of our analysis is the greater structural diversity that promiscuous enzymes exhibit in comparison to specialized enzymes, both in Archaea and Bacteria. We also found that the presence of promiscuous enzymes in the metabolic pathways was heterogeneous, both at the domain level, as in phyla associated with Archaea. Finally, we found that the substrate promiscuity is greater than catalytic promiscuity in the archaeal enzymatic repertoire.

2. Material and Methods

2.1. Genomes and Proteomes Analyzed

89 archaeal and 705 bacterial non-redundant genomes analyzed in this work were retrieved from the National Center for Biotechnology Information (NCBI) ftp server public database section genomes (ftp://ncbi.nlm.nih.gov). These genomes were defined as non-redundant, to exclude any bias associated with overrepresentation of species or strains, as has previously been reported [12]. In brief, through the concatenation of 21 conserved proteins across diverse sequenced genomes, a single data set for phylogenetic analysis was constructed [15]. By eliminating genomes located more closely together on a phylogenetic tree, 794 genomes phylogenetically distant from each other were obtained (Table S1). We considered only genes with open reading frames (ORFs) that encode predicted protein sequences.

2.2. Identification of Enzymes

For each protein sequence, the annotation of an Enzyme Commission number (E.C. number) using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database was associated [16]. Then, for each enzyme associated with an E.C. number, the assignments of structural domains at sequence level were determined with the Superfamily database [17]. Our inclusion criterion was quite strict because we were only interested in enzymatic sequences with identified metabolic contexts and structural assignments. Finally, promiscuous enzymes were identified in this set of filtered data as those protein sequences containing two or more different E.C. numbers (Table S2). All processes and data management were conducted using ad hoc Perl scripts.

Accuracy of the Promiscuous Enzymes

In order to evaluate our operational definition of promiscuous enzymes, the enzyme promiscuity prediction server was used [18]. In this regard, a representative dataset of 400 out of 51,572 promiscuous enzymes was selected by using the formula (N × Z2 × p × q)/((NE2) + (Z2 × p × q)), with a margin of error of 4.9% and a confidence level of 95% (N = population size, Z = confidence level, E = sample error desired; p and q are the probability to be promiscuous or not, and were assumed to be 0.5). In addition, 400 out 19,411 promiscuous enzymes (positive dataset), and 400 of 48,846 non-promiscuous enzymes (negative control) provided by Carbonell and Faulon were also evaluated. Based on these comparisons, we calculated the: (1) true positives (TP): promiscuous enzymes with more than two different E.C. numbers, and structural domains (SUPFAM) (our dataset) and identified by the Promis server, and Carbonell and Faulon’s positive set identified by the Promis server; (2) false positives (FP): proteins identified as promiscuous enzymes with the Promis server from the Carbonell and Faulon’s negative dataset; (3) false negatives (FN): promiscuous enzymes (our dataset) and Carbonell and Faulon’s positive dataset, identified as negative with the Promis server; (4) sensitivity, Sn = TP/(TP + FN), is the fraction of promiscuous enzymes identified with the Promis server; (5) positive predictive value, PPV = TP/(TP + FP), is the fraction of the promiscuous enzymes inferred; (6) Accuracy, Ac = (Sn + PPV)/2, is the PPV and Sn average.

2.3. Statistical Analysis

Non-parametric Wilcoxon rank sum statistical tests were used to evaluate promiscuous enzymes distributions in each archaeal and bacterial domain. Statistical significance was set at p ≤ 0.05. The implementation of these tests was carried out using the package stats of R programming language for statistical analysis [19]. Data managements were conducted with ad hoc Perl scripts.

3. Results and Discussion

3.1. Archaea Exhibit a Lower Promiscuous Enzyme Fraction than Bacterial Genomes

In order to compare the total repertoire of known enzymatic proteins in the genomes of the Archaea, the content of these proteins was exhaustively evaluated. From this analysis, we observed that the 89 genomes, with an average size of 2291 ORFs, contain an average of 487 enzymes. To compare the enzymatic content found in the archaeal genomes, we analyzed 705 bacterial genomes; where, an average of 757 enzymes and an average genome size of 3265 ORFs was observed, i.e., the average content of known enzymes in archaeal genomes is 1.5 times lower than that observed in bacterial genomes. When these values were normalized to obtain the percentage of known enzymes with respect to all the genes of each genome, an average of 22% was found for Archaea, while an average value of 25% was found for Bacteria (Figure 1). The difference between the percentages of enzyme content was found as statistically significant (Wilcoxon test, p-value = 4.52 × 10−6), showing a higher proportion of known enzymes in Bacteria, with around one-quarter of their genes encoded for enzymes than in Archaea, the latter with one-fifth of the genes encoded for enzymes.
To evaluate whether there is an increase in the number of promiscuous enzymes in the repertoire of known total enzymes previously described, and if this increase might compensate the low proportion of enzymes in Archaea in comparison to Bacteria, the content of promiscuous enzymes was also analyzed. In operational terms, promiscuous enzymes were defined as those enzymes that having two or more different EC numbers, according to KEGG database. In this regard, we found an accuracy of 63% in our identification of promiscuous enzymes in our data according to the evaluation done with the Promis Server [18], suggesting that our definition of promiscuous enzymes is, in general, informative enough. We obtained the ratio between the number of promiscuous enzymes among the total enzymes for each genome. In total, an average value of 8.31% of promiscuous enzymes was found in archaeal genomes against the 8.76% found in bacterial genomes. Although the average contents of promiscuous enzymes show similar values in Archaea and Bacteria, there is a statistically significant difference (Wilcoxon test, p-value = 0.03), with a greater proportion of promiscuous enzymes in the bacterial genomes (Figure 1), suggesting that promiscuous enzymes do not compensate the apparent deficit of enzymes observed in Archaea. Therefore, the abundance of enzymes that carry out only one reaction or recognize only one substrate, i.e., specialists, in Archaea could maintain high metabolic fluxes, as has been suggested in Escherichia coli [1], necessary to contend against extreme environments, in which the available carbon sources need to be maximized. However, the role of promiscuous enzymes must be also considered as an important source of variability associated with Archaeal metabolism.

3.2. Structural Diversity in Promiscuous Enzymes is Higher than Specialist Enzymes

To gain insights into the diversity of promiscuous enzymes identified in Archaea, the structural domains associated with the enzymatic repertoire was evaluated using the protein domains assignments of Superfamily database [17]. From this evaluation, we identified an average number of 60 structural domains in archaeal promiscuous enzymes per genome, decreasing to 48 when non-redundant domains were considered. To obtain the fraction of different superfamily domains, we calculate the ratio between unique structural domains against the total structural domains per genome, an average proportion of 82% of unique structural domains in promiscuous enzymes was found. In order to compare the number and percentage of domains present in the promiscuous enzymes, the same estimates were performed for the structural domains present in the specialist enzymes. The average number of all structural domains was 641, while the average number of different structural domains was 266 in specialist enzymes. The average proportion of unique structural domains in specialist enzymes was 42%. This last value presents a statistically significant difference (Wilcoxon test, p-value = 2.2 × 10−16) with the fraction of unique structural domains in promiscuous enzymes, observing a greater proportion of different domains in the promiscuous enzymes than in specialists, 82% and 42%, respectively. This significant difference is observed not only in the promiscuous and specialists enzymes of archaeal genomes, but also in the enzymes of bacterial genomes where the fraction of unique domains is greater in promiscuous (73%) than in specialist enzymes (36%) (Wilcoxon test, p-value = 2.2 × 10−16). In both cases, Archaea and Bacteria, the fraction of non-redundant domains in promiscuous enzymes is twice that in specialist enzymes, showing a greater structural diversity in the enzymes that perform more than one biochemical reaction. Therefore, the biochemical plasticity of the promiscuous enzymes, both in the reactions and in the recognition of substrates, could be as a consequence of their greater structural diversity. In this regard, the expansion of metabolic networks in the evolution of prokaryotes has been dominated by broadening the reaction repertoire [20] or substrate repertoire, rather than by an increase in the number of enzymes encoded in the prokaryotic genomes [12].

3.3. Functional Structural Diversity in Promiscuous Enzymes Exhibits a Non-Homogeneous Distribution

To determine the structural diversity associated with promiscuous enzymes, the set of 260 non-redundant structural domains identified in the promiscuous enzymes of all archaeal genomes was evaluated. To this end, archaea was analyzed in terms of their main phyla, finding that Euryarchaeota contain 93% of these domains, followed by Crenarchaeota with 57.6%, and finally Nanoarchaeota with 4.2% (Figure 2). Eleven domains were identified as common to the three phyla, while 121 domains were common to Euryarchaeota and Crenarchaeota. It is interesting to note that all the structural domains identified in Nanoarchaeota were identified as common to all archaea (Figure 2); however, this should be taken with caution because of the low number of Nanoarchaeota genomes represented in our study.
In this regard, the NAD(P)-binding Rossmann-fold (Superfamily ID: 51735) and P-loop containing nucleoside triphosphate hydrolases (Superfamily ID: 52540) domains considered as ancient folds [21], were identified in all three phyla as the most abundant structural domains. The NAD(P)-binding Rossmann-fold domain has also been identified as the largest structural family, and associated with almost all different proteins in the metabolic pathways of small molecules of Escherichia coli [22]. In accordance with the diversity of functions associated with P-loop and NAD(P)-binding Rossmann-fold, they have been also associated with the functional metabolic category referred to as “General” functions, according to the classification of the Superfamily database, i.e., associated with multiple functions or interactions with proteins, ions, lipids, or small molecules. The most represented functional categories associated with the most abundant domains corresponds to “Metabolism”, such as Ribokinase and PLP-dependent transferases; “Information”, with domains such as Glutamyl-tRNA (Gln) amidotransferase and DNA topoisomerase; and “Others” with the domain GatB-YqeY motif (Figure 2). In contrast, the less abundant domains corresponds to the Aldehyde ferredoxin oxidoreductase and alpha/beta-Hydrolases, associated with the “Metabolism” functional category. Only one domain with functional category of “Regulation” was found in the GAF domain. This domain has been associated with ligand-binding, and signal transduction [23] and has been found in archaeal regulators, such as the Bat transcriptional regulator of the Halobacterium sp. NRC-1 responsible for coordinate regulation of bacteriorhodopsin production [24]. The DPP6 N-terminal (Superfamily ID: 82171), and Periplasmic binding protein-like II (Superfamily ID: 53850) domains identified as less abundant were associated with “Intracellular processes”, such as cell motility/division, and intracellular transport, among others. Finally, the exclusive domains associated with Euryarchaeota represents the 42% of all counted domains, such as the Phosphoglycerate mutase-like or Purine and uridine phosphorylases domains, which belong to the functional metabolic category of “Metabolism”; meanwhile, Crenarchaeota correspond to 6.9%, with domains like the Trypsin-like serine proteases, and S-adenosylmethionine decarboxylase. Finally, 12 structural domains were only identified in promiscuous enzymes (Table 1), with a non-homogeneous distribution between the three phyla. This structural diversity identified in promiscuous enzymes might provide functional diversity in archaeal metabolism, increasing the participation of these enzymes in a wide number of metabolic pathways, such as the phosphomannomutase alpha-phosphoglucomutase (EC:5.4.2.8, 5.4.2.2) of Alicyclobacillus acidocaldarius, that acts in the pentose phosphate pathway and fructose and mannose metabolisms; the 3-hexulose-6-phosphate synthase (EC:4.1.2.43, 5.3.1.27) of Methanosaeta thermophila that is present in the pentose phosphate pathway and methane metabolisms; or the 3-isopropylmalate dehydratase (EC:4.2.1.33, 4.2.1.35) of Methanohalobium evestigatum that is present in valine, leucine and isoleucine biosynthesis and C5-branched dibasic acid metabolisms. In addition, promiscuous enzymes could act in the same metabolic pathway, but catalyzing different steps, such as the phosphomethylpyrimidine kinase (EC:2.7.1.49, 2.7.4.7, 2.5.1.3) of Thermococcus sibiricus, that is involved in three different metabolic steps in thiamine metabolism; the ornithine acetyltransferase (EC:2.3.1.35, 2.3.1.1) of Methanococcoides burtonii that catalyzes two different activities in the cyclic version of arginine biosynthesis; and the polyprenyl synthetase (EC:2.5.1.1, 2.5.1.10, 2.5.1.29) of Pyrobaculum aerophilum that catalyzes three different steps in terpenoid backbone biosynthesis.

3.4. The Distribution of Promiscuous Enzymes between Functional Metabolic Categories is Variable

To determine the distribution of the promiscuous enzymes in the metabolism of archaeal genomes, each enzyme was mapped into the metabolic pathways defined by the KEGG database. At first, all enzymes (promiscuous and specialist) were associated with their metabolic pathways, and clustered into functional categories. Based on this data, the Carbohydrate Metabolism, Energy Metabolism and Metabolism of Cofactors and Vitamins were the three greater enrichment categories with promiscuous enzymes (Figure 3), while the Amino Acid Metabolism, Genetic Information Processing and Nucleotide Metabolism were the greater enrichment categories with specialist enzymes (Figure 3).
These findings suggest that the metabolic functions associated with the conservation and distribution of genetic information have evolved to recruit specialist enzymes to constrain the biochemical reactions towards the use of one substrate or one biochemical transformation, ensuring fidelity of information when being transmitted to the offspring, and forming part of the core of metabolic reactions shared by all cellular organisms [25]. In contrast, the metabolism of Carbohydrates has evolved for recruitment of enzymes able to use a great variety of substrates or performs more than one biochemical reaction allowing them to explore and use different carbon sources, such as the inositol monophosphatase from Archaeoglobus fulgidus and Methanococcus jannaschii which exhibit Inositol monophosphatase (IMPase), fructose-1,6-bisphosphatase and Nicotinamide adenine dinucleotide phosphate (NADP(H)) phosphatase activities [26,27], the phosphoglucose isomerase from Pyrobaculum aerophilum which catalyzes the isomerization of both glucose-6-phosphate and mannose-6-phosphate to fructose-6-phosphate [28], or the glucose dehydrogenase from Sulfolobus solfataricus with a broad range of sugar substrates, including d-glucose, d-galactose, d-xylose, and l-arabinose [29]. Regarding the latter, the promiscuity for the metabolism of glucose and galactose enables S. solfataricus to grow on both carbon sources and is a remarkable example of the unusual features and versatility of central carbohydrate metabolism in hyperthermophilic archaea [30].
In order to perform an analysis of the phylogenetic distribution of the archaeal promiscuous enzymes in the metabolism, we analyzed the enzymatic repertoire of Crenarchaeota and Euryarchaeota phyla. From this analysis we found that Carbohydrate and Energy Metabolisms are enriched in Crenarchaeota (Figure 4); where organisms of the order Sulfolobales, Desulforococcales, Thermoproteales and Acidilobales, mainly described as thermophilic, hyperthermophilic, and acidophilic are included. These organisms have different metabolic modes, ranging from chemoorganotrophs to chemolithotrophs [31], reinforcing the notion that the enriched metabolism found and previously described, correlates with the energy obtaining in Crenarchaeota, such as S. solfataricus, which grows aerobically and chemoorganotrophically obtained energy from organic sources such as sugars [32]. In counterpart, in Euryarchaeota, the metabolic categories related to Carbohydrates and Energy metabolisms are less enriched with promiscuous enzymes. Organisms included in this phylum, can only use a few simple substrates, most of them compounds such as H2/CO2, methanoate, methanol or methylamine, although any particular species is only capable of using two or three of them [33,34]. Or, they can use light as a source of energy as the organisms belonging to the class Haloarchaea, which have a pigment called bacteriorhodopsin that reacts with light producing a gradient of protons along the membrane that allows the synthesis of ATP [35]. Therefore, Euryarchaeota exhibit a lower metabolic plasticity in the use of substrates to obtain energy, probably due to a smaller enrichment of promiscuous enzymes in the metabolic categories of Carbohydrates and Energy. Finally, it seems that in the Euryarchaeota organisms, the metabolic processes associated with genetic information and the processes of protein synthesis have been favored to have a greater number of promiscuous enzymes (Figure 4).

3.5. Promiscuous Enzymes Exhibit a High Enrichment of Substrate Promiscuity rather than Catalytic Promiscuity

In recent years it has been seen that promiscuous enzymes are capable of carrying out more than one reaction or recognizing more than one substrate. In order to ascertain the extent of the types of promiscuity, both substrate and catalysis, promiscuous enzymes were classified based on their first and last digits of their E.C. numbers. Catalysis of the same reaction over a wide range of substrates is called substrate promiscuity, while when an enzyme catalyzes a different reaction for which it has been specialized with a different transition state is termed catalytic promiscuity [7,36]. In operational terms, enzymes exhibiting catalytic promiscuity are those that differ in the first digit from their E.C. numbers; while enzymes exhibiting substrate promiscuity are those that differ in the last digit from their associated E.C. numbers. The average number of enzymes with substrate promiscuity identified in Archaea is 29.7, while the average number of enzymes with catalytic promiscuity is 11.5, presenting a statistically significant difference (Wilcoxon test, p-value = 2.2 × 10−16). To carry out the comparison of promiscuity classes in the enzymatic repertoire of archaeal genomes, the ratio between enzymes with catalytic or substrate promiscuity among the total promiscuous enzymes present in each genome was calculated. From this normalization, an enrichment of enzymes with substrate promiscuity in comparison to enzymes with catalytic promiscuity was observed, with an average value of 70% and 29%, respectively, i.e., 2.5 times more of substrate promiscuity than catalytic promiscuity (Wilcoxon test, p-value = 2.2 × 10−16). In summary, almost three quarters of the promiscuous enzymes exhibit substrate promiscuity, while almost one-third show catalytic promiscuity, suggesting that it is more difficult to evolve towards a new catalytic mechanism than a new substrate-binding site [22].

3.6. Substrate Promiscuity is Represented to a Greater Extent in the Functional Metabolic Categories

To determine the distribution of the type of promiscuity in the metabolism of archaeal genomes, each promiscuous enzyme was mapped into the metabolic pathways defined by KEGG database. At first, all promiscuous enzymes (substrate or catalytic promiscuity) were associated with their metabolic pathways, and then were clustered into functional metabolic categories. Based on this data, we found that enzymes with substrate promiscuity are present to a greater extent in all functional metabolic categories; in contrast, enzymes with catalytic promiscuity are less represented in the metabolic pathways (Figure 5). This finding suggests that in the evolution of the promiscuous enzymes it is easier to modify the region of proteins that bind and recognize the substrate than the catalytic region that performs the biochemical transformations, as previously suggested [22].
For instance, in the Carbohydrate Metabolism category, the metabolic plasticity that provides the promiscuity of the substrate enables the exploration of a wide variety of compounds that can be used to generate energy. It is interesting to note that Nucleotide metabolism has a minor variation of catalytic functions, i.e., the number of enzymes with catalytic promiscuity is smaller, as was previously observed [22], in which was found a greater conservation of chemistry of the specialist enzymes involved in the nucleotide metabolism.
In this regard, to determine the distribution of the type of promiscuity in the metabolic categories in relation to phylogenetic taxonomy, all organisms were grouped into the main phyla Euryarchaeota and Crenarchaeota, and the average fraction of promiscuous enzymes was calculated. From this data, the content of enzymes with substrate promiscuity is not different in Euryarchaeota and Crenarchaeota, which have average values of 70% and 69%, respectively (Wilcoxon test, p-value = 0.39). Similarly, the fraction of enzymes with catalytic promiscuity in Euryarchaeota and Crenarchaeota is not different, with average values 28.38% and 28.57%, respectively (Wilcoxon test, p-value = 0.84). When the types of promiscuity are mapped on their respective metabolic pathways, and grouped into their functional metabolic categories, it was observed that the enzymes with substrate promiscuity belonging to Crenarchaeota are better represented in the Carbohydrate and Energy Metabolism categories (Table 2). In contrast, enzymes with catalytic promiscuity belonging to the Euryarchaeota are better represented in the Metabolism of Cofactors and Vitamins and Genetic Information Processing categories (Table 2). In summary, these findings suggest that although promiscuous enzymes are widely distributed in Euryarchaeota and Crenarchaeota phyla, there is a slight preference to some metabolic pathways.
Finally, an exhaustive comparison on their promiscuous enzymes between all archaeal organisms (see Supplementary Materials Figure S1), evidenced the existence of diverse metabolic pathways where these proteins are overrepresented and associated with archaeal lineages; such as methane, pentose phosphate, C5-branched dibasic acid, valine/leucine/isoleucine biosynthesis, porphyrin and chloropyll metabolisms associated with Euryarchaeota. In contrast, oxidative phosphorylation and nitrogen metabolisms are enriched with promiscuous enzymes in Crenarchaeota. In addition, diverse metabolic pathways enriched with promiscuous enzymes are not associated with any specific phylum, such as synthesis and degradation of ketone bodies, lysine degradation, fatty acid, benzoate degradation, and tryptophan among others, identified in genomes of Euryarchaeota (Haloarcula marismortui, Haloferax volcanii, Natrialba magadii, Haloterrigena turkmenica), and Crenarchaeota (Sulfolobus). Therefore, the presence of promiscuous enzymes in Archaea correlates more preferentially with the metabolic pathways than life-styles, such as it was observed with Sulfolobus, where sulfur metabolism is almost absent of promiscuous enzymes, suggesting the prevalence of specific enzymes.

4. Conclusions

Through the analysis of 89 archaeal genomes, we gained insights into how many promiscuous enzymes there are, what the existing diversity of structural domains is, and how promiscuous enzymes are distributed in metabolic pathways of Archaea organisms. Although our criterion for the identification of enzymes was stringent, because it considers the association of each protein sequence to structural (Superfamily) domain and the annotation of E.C. numbers, the resulting dataset analyzed allowed us to obtain general trends that reflect the metabolic context of archaeal organisms. In this regard, we found that in spite of having a smaller fraction of promiscuous enzymes in the archaeal genomes, its structural diversity is greater than that of the specialized enzymes, which may contribute to participate in more than one metabolic pathway, or in more than one metabolic step within the same pathway. Another interesting finding of our analysis is an enrichment of substrate promiscuity rather than catalytic promiscuity in archaeal promiscuous enzymes, which suggests that in the evolution of the promiscuous enzymes it is easier to modify the region of proteins that bind and recognize the substrate than the catalytic region that performs the biochemical transformations. We also found that exists a heterogeneous distribution of promiscuous enzymes in archaeal genomes, favoring the enrichment of promiscuous enzymes in metabolic pathways associated with the generation of energy in Crenarchaeota organism, whereas in Euryarchaeota organisms there is a greater enrichment of promiscuous enzymes in metabolic pathways associated with genetic information and protein synthesis. Existence of promiscuous enzymes in metabolic pathways may be an adaptation to survival in its extreme environment allowing it to efficiently scavenge for energy substrates [37]. Our observations can be considered valid for all archaeal species and their metabolic pathways, and the results reported here increase our understanding in the evolution of the metabolism of archaeal organisms and in particular the role of promiscuous enzymes.

Supplementary Materials

The following are available online at https://www.mdpi.com/2075-1729/7/3/30/s1.

Acknowledgements

We thank to Joaquín Morales, Sandra Sauza and Israel Sanchez for computational supporting. The authors kindly thank Pablo Carbonnell and Jean-Loup Faulon for the positive and negative datasets of promiscuous enzymes. MAM-N was supported by a grant (IA205417) from PAPIIT-DGAPA, UNAM; EP-R was supported by a grant (IN-201117) from PAPIIT-DGAPA, UNAM; ZR-E was supported by a grant (2015-02-887) from Fronteras de la Ciencia, CONACyT.

Author Contributions

M.A.M.-N. conceived and designed the experiments; M.A.M.-N. performed the experiments; M.A.M.-N., Z.R.-E., K.R.-V. and E.P.-R. analyzed the data; M.A.M.-N., Z.R.-E., K.R.-V. and E.P.-R. wrote the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Nam, H.; Lewis, N.E.; Lerman, J.A.; Lee, D.H.; Chang, R.L.; Kim, D.; Palsson, B.O. Network context and selection in the evolution to enzyme specificity. Science 2012, 337, 1101–1104. [Google Scholar] [CrossRef] [PubMed]
  2. Jia, B.; Cheong, G.W.; Zhang, S. Multifunctional enzymes in archaea: Promiscuity and moonlight. Extremophiles 2013, 17, 193–203. [Google Scholar] [CrossRef] [PubMed]
  3. Sengupta, S.; Ghosh, S.; Nagaraja, V. Moonlighting function of glutamate racemase from mycobacterium tuberculosis: Racemization and DNA gyrase inhibition are two independent activities of the enzyme. Microbiology 2008, 154, 2796–2803. [Google Scholar] [CrossRef] [PubMed]
  4. Cheng, X.Y.; Huang, W.J.; Hu, S.C.; Zhang, H.L.; Wang, H.; Zhang, J.X.; Lin, H.H.; Chen, Y.Z.; Zou, Q.; Ji, Z.L. A global characterization and identification of multifunctional enzymes. PLoS ONE 2012, 7, e38979. [Google Scholar] [CrossRef] [PubMed]
  5. Huberts, D.H.; van der Klei, I.J. Moonlighting proteins: An intriguing mode of multitasking. Biochim. Biophys. Acta 2010, 1803, 520–525. [Google Scholar] [CrossRef] [PubMed]
  6. Hult, K.; Berglund, P. Enzyme promiscuity: Mechanism and applications. Trends Biotechnol. 2007, 25, 231–238. [Google Scholar] [CrossRef] [PubMed]
  7. Gupta, R. Recent advances in enzyme promiscuity. Sustain. Chem. Process. 2016, 4, 2. [Google Scholar] [CrossRef]
  8. Khersonsky, O.; Roodveldt, C.; Tawfik, D.S. Enzyme promiscuity: Evolutionary and mechanistic aspects. Curr. Opin. Chem. Biol. 2006, 10, 498–508. [Google Scholar] [CrossRef] [PubMed]
  9. Khersonsky, O.; Tawfik, D.S. The histidine 115-histidine 134 dyad mediates the lactonase activity of mammalian serum paraoxonases. J. Biol. Chem. 2006, 281, 7649–7656. [Google Scholar] [CrossRef] [PubMed]
  10. Suen, S.; Lu, H.H.; Yeang, C.H. Evolution of domain architectures and catalytic functions of enzymes in metabolic systems. Genome Biol. Evol. 2012, 4, 976–993. [Google Scholar] [CrossRef] [PubMed]
  11. Wagner, A. Metabolic networks and their evolution. Adv. Exp. Med. Biol. 2012, 751, 29–52. [Google Scholar] [PubMed]
  12. Martinez-Nunez, M.A.; Poot-Hernandez, A.C.; Rodriguez-Vazquez, K.; Perez-Rueda, E. Increments and duplication events of enzymes and transcription factors influence metabolic and regulatory diversity in prokaryotes. PLoS ONE 2013, 8, e69707. [Google Scholar] [CrossRef] [PubMed]
  13. Waters, E.; Hohn, M.J.; Ahel, I.; Graham, D.E.; Adams, M.D.; Barnstead, M.; Beeson, K.Y.; Bibbs, L.; Bolanos, R.; Keller, M.; et al. The genome of nanoarchaeum equitans: Insights into early archaeal evolution and derived parasitism. Proc. Natl. Acad. Sci. USA 2003, 100, 12984–12988. [Google Scholar] [CrossRef] [PubMed]
  14. Maeder, D.L.; Anderson, I.; Brettin, T.S.; Bruce, D.C.; Gilna, P.; Han, C.S.; Lapidus, A.; Metcalf, W.W.; Saunders, E.; Tapia, R.; et al. The methanosarcina barkeri genome: Comparative analysis with methanosarcina acetivorans and methanosarcina mazei reveals extensive rearrangement within methanosarcinal genomes. J. Bacteriol. 2006, 188, 7922–7931. [Google Scholar] [CrossRef] [PubMed]
  15. Brown, J.R.; Douady, C.J.; Italia, M.J.; Marshall, W.E.; Stanhope, M.J. Universal trees based on large combined protein sequence data sets. Nat. Genet. 2001, 28, 281–285. [Google Scholar] [CrossRef] [PubMed]
  16. Okuda, S.; Yamada, T.; Hamajima, M.; Itoh, M.; Katayama, T.; Bork, P.; Goto, S.; Kanehisa, M. KEGG atlas mapping for global analysis of metabolic pathways. Nucleic Acids Res. 2008, 36, W423–W426. [Google Scholar] [CrossRef] [PubMed]
  17. Wilson, D.; Pethica, R.; Zhou, Y.; Talbot, C.; Vogel, C.; Madera, M.; Chothia, C.; Gough, J. Superfamily--sophisticated comparative genomics, data mining, visualization and phylogeny. Nucleic Acids Res. 2009, 37, D380–D386. [Google Scholar] [CrossRef] [PubMed]
  18. Carbonell, P.; Faulon, J.L. Molecular signatures-based prediction of enzyme promiscuity. Bioinformatics 2010, 26, 2012–2019. [Google Scholar] [CrossRef] [PubMed]
  19. R-programming. Development core team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2011. [Google Scholar]
  20. Freilich, S.; Spriggs, R.V.; George, R.A.; Al-Lazikani, B.; Swindells, M.; Thornton, J.M. The complement of enzymatic sets in different species. J. Mol. Biol. 2005, 349, 745–763. [Google Scholar] [CrossRef] [PubMed]
  21. Caetano-Anolles, G.; Yafremava, L.S.; Gee, H.; Caetano-Anolles, D.; Kim, H.S.; Mittenthal, J.E. The origin and evolution of modern metabolism. Int. J. Biochem. Cell Biol. 2009, 41, 285–297. [Google Scholar] [CrossRef] [PubMed]
  22. Teichmann, S.A.; Rison, S.C.; Thornton, J.M.; Riley, M.; Gough, J.; Chothia, C. The evolution and structural anatomy of the small molecule metabolic pathways in Escherichia coli. J. Mol. Biol. 2001, 311, 693–708. [Google Scholar] [PubMed]
  23. Galperin, M.Y. Structural classification of bacterial response regulators: Diversity of output domains and domain combinations. J. Bacteriol. 2006, 188, 4169–4182. [Google Scholar] [CrossRef] [PubMed]
  24. Berquist, B.R.; Soneja, J.; DasSarma, S. Comparative genomic survey of information transfer systems in two diverse extremely halophilic archaea, Halobacterium sp. Strain NRC-1 and Haloarcula marismortui. In Adaptation to Life at High Salt Concentrations in Archaea, Bacteria, and Eukarya; Gunde-Cimerman, N., Oren, A., Plemenitas, A., Eds.; Springer: Dordrecht, The Netherlands, 2005; pp. 148–182. [Google Scholar]
  25. Peregrin-Alvarez, J.M.; Tsoka, S.; Ouzounis, C.A. The phylogenetic extent of metabolic enzymes and pathways. Genome Res. 2003, 13, 422–427. [Google Scholar] [CrossRef] [PubMed]
  26. Fukuda, C.; Kawai, S.; Murata, K. Nadp (h) phosphatase activities of archaeal inositol monophosphatase and eubacterial 3′-phosphoadenosine 5′-phosphate phosphatase. Appl. Environ. Microbiol. 2007, 73, 5447–5452. [Google Scholar] [CrossRef] [PubMed]
  27. Stieglitz, K.A.; Johnson, K.A.; Yang, H.; Roberts, M.F.; Seaton, B.A.; Head, J.F.; Stec, B. Crystal structure of a dual activity impase/fbpase (af2372) from Archaeoglobus fulgidus. The story of a mobile loop. J. Biol. Chem. 2002, 277, 22863–22874. [Google Scholar] [CrossRef] [PubMed]
  28. Hansen, T.; Urbanke, C.; Schonheit, P. Bifunctional phosphoglucose/phosphomannose isomerase from the hyperthermophilic archaeon Pyrobaculum aerophilum. Extremophiles 2004, 8, 507–512. [Google Scholar] [CrossRef] [PubMed]
  29. Angelov, A.; Futterer, O.; Valerius, O.; Braus, G.H.; Liebl, W. Properties of the recombinant glucose/galactose dehydrogenase from the extreme thermoacidophile, Picrophilus torridus. FEBS J. 2005, 272, 1054–1062. [Google Scholar] [CrossRef] [PubMed]
  30. Lamble, H.J.; Theodossis, A.; Milburn, C.C.; Taylor, G.L.; Bull, S.D.; Hough, D.W.; Danson, M.J. Promiscuity in the part-phosphorylative entner-doudoroff pathway of the archaeon Sulfolobus solfataricus. FEBS Lett. 2005, 579, 6865–6869. [Google Scholar] [CrossRef] [PubMed]
  31. Schonheit, P.; Schafer, T. Metabolism of hyperthermophiles. World J. Microbiol. Biotechnol. 1995, 11, 26–57. [Google Scholar] [CrossRef] [PubMed]
  32. Nunn, C.E.; Johnsen, U.; Schonheit, P.; Fuhrer, T.; Sauer, U.; Hough, D.W.; Danson, M.J. Metabolism of pentose sugars in the hyperthermophilic archaea Sulfolobus solfataricus and Sulfolobus acidocaldarius. J. Biol. Chem. 2010, 285, 33701–33709. [Google Scholar] [CrossRef] [PubMed]
  33. Blaut, M. Metabolism of methanogens. Antonie Van Leeuwenhoek 1994, 66, 187–208. [Google Scholar] [CrossRef] [PubMed]
  34. Deppenmeier, U. The unique biochemistry of methanogenesis. Prog. Nucleic Acid Res. Mol. Biol. 2002, 71, 223–283. [Google Scholar] [PubMed]
  35. Papke, R.T.; Douady, C.J.; Doolittle, W.F.; Rodriguez-Valera, F. Diversity of bacteriorhodopsins in different hypersaline waters from a single spanish saltern. Environ. Microbiol. 2003, 5, 1039–1045. [Google Scholar] [CrossRef] [PubMed]
  36. Arora, B.; Mukherjee, J.; Gupta, M.N. Enzyme promiscuity: Using the dark side of enzyme specificity in white biotechnology. Sustain. Chem. Proc. 2014, 2, 25. [Google Scholar] [CrossRef]
  37. Lamble, H.J.; Heyer, N.I.; Bull, S.D.; Hough, D.W.; Danson, M.J. Metabolic pathway promiscuity in the archaeon Sulfolobus solfataricus revealed by studies on glucose dehydrogenase and 2-keto-3-deoxygluconate aldolase. J. Biol. Chem. 2003, 278, 34066–34072. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Proportion of enzymes in prokaryotic organisms. The proportion of total and promiscuous enzymes in bacterial and archaeal genomes is shown.
Figure 1. Proportion of enzymes in prokaryotic organisms. The proportion of total and promiscuous enzymes in bacterial and archaeal genomes is shown.
Life 07 00030 g001
Figure 2. Structural domains associated with promiscuous enzymes in Archaea. Venn diagram shows the intersections of Superfamily domains corresponding to Euryarchaeota (242), Crenarchaeota (150) and Nanoarchaeota (11). Seventeen most abundant domains and twelve less abundant domains are showed by their Superfamily ID, name, functional category and phyla. E: Euryarchaeota; C: Crenarchaeota; N: Nanoarchaeota. Abbreviations used in this table: CoaB: 4′-Phosphopantothenoylcysteine synthetase; PRTase: Phosphoribosyl transferase; Nudix: Nudix stands for Nucleoside Diphosphate linked to X; ACT-like: The ACT domain is named after three of the proteins that contain it Aspartate kinase, Chorismate mutase, and TyrA; NAD(P): Nicotinamide adenine dinucleotide phosphate; PLP: Pyridoxal phosphate; Glu-tRNAGln: Glutamyl-tRNA (Gln) amidotransferase; GAF domains-like: cGMP-specific phosphodiesterases, adenylyl cyclases and FhlA; DPP6: Dipeptidyl peptidase.
Figure 2. Structural domains associated with promiscuous enzymes in Archaea. Venn diagram shows the intersections of Superfamily domains corresponding to Euryarchaeota (242), Crenarchaeota (150) and Nanoarchaeota (11). Seventeen most abundant domains and twelve less abundant domains are showed by their Superfamily ID, name, functional category and phyla. E: Euryarchaeota; C: Crenarchaeota; N: Nanoarchaeota. Abbreviations used in this table: CoaB: 4′-Phosphopantothenoylcysteine synthetase; PRTase: Phosphoribosyl transferase; Nudix: Nudix stands for Nucleoside Diphosphate linked to X; ACT-like: The ACT domain is named after three of the proteins that contain it Aspartate kinase, Chorismate mutase, and TyrA; NAD(P): Nicotinamide adenine dinucleotide phosphate; PLP: Pyridoxal phosphate; Glu-tRNAGln: Glutamyl-tRNA (Gln) amidotransferase; GAF domains-like: cGMP-specific phosphodiesterases, adenylyl cyclases and FhlA; DPP6: Dipeptidyl peptidase.
Life 07 00030 g002
Figure 3. Distribution of promiscuous and specialist enzymes of archaeal genomes into metabolic categories of Kyoto Encyclopedia of Genes and Genomes (KEGG). The proportion of the two types of enzymes, promiscuous and specialist, into metabolic categories of KEGG.
Figure 3. Distribution of promiscuous and specialist enzymes of archaeal genomes into metabolic categories of Kyoto Encyclopedia of Genes and Genomes (KEGG). The proportion of the two types of enzymes, promiscuous and specialist, into metabolic categories of KEGG.
Life 07 00030 g003
Figure 4. Distribution of promiscuous enzymes of archaeal genomes by phylum into metabolic categories of KEGG. Archaeal genomes were separated by phylum and their proportion of promiscuous enzymes is shown into metabolic categories of KEGG.
Figure 4. Distribution of promiscuous enzymes of archaeal genomes by phylum into metabolic categories of KEGG. Archaeal genomes were separated by phylum and their proportion of promiscuous enzymes is shown into metabolic categories of KEGG.
Life 07 00030 g004
Figure 5. Distribution of enzymes by their type of promiscuity. Proportion of substrate or catalytic promiscuity of archaeal enzymes into metabolic categories of KEGG.
Figure 5. Distribution of enzymes by their type of promiscuity. Proportion of substrate or catalytic promiscuity of archaeal enzymes into metabolic categories of KEGG.
Life 07 00030 g005
Table 1. Unique Superfamily structural domains found in archaeal promiscuous enzymes. In brackets is indicated the total of genomes where the domain was identified.
Table 1. Unique Superfamily structural domains found in archaeal promiscuous enzymes. In brackets is indicated the total of genomes where the domain was identified.
SuperfamilySupfam IDFunctional AnnotationPhylum
Sulfolobus fructose-1,6-bisphosphatase-like superfamily (66)111249Metabolism
 
Euryarchaeota
Crenarchaeota
Glu-tRNAGln amidotransferase C subunit superfamily (56)141000Information
Translation
 
Euryarchaeota
Crenarchaeota
Nanoarchaeota
Siroheme synthase middle domains-like superfamily (51)75615Metabolism
Euryarchaeota
Crenarchaeota
Heme-dependent peroxidases superfamily (18)48113Metabolism
Redox
Euryarchaeota
Nitrous oxide reductase, N-terminal domain superfamily (4)50974Metabolism
Redox
Euryarchaeota
Crenarchaeota
Oxidoreductase molybdopterin-binding domain superfamily (3)56524Metabolism
E- transfer
Euryarchaeota
Tropomyosin superfamily (2)57997Intra-Cellular processes
Cell motility
Euryarchaeota
DNA breaking-rejoining enzymes superfamily (2)56349Information
DNA replication/repair
Euryarchaeota
Crenarchaeota
N-terminal domain of MutM-like DNA repair proteins superfamily (2)81624Information
DNA replication/repair
Euryarchaeota
(Phosphotyrosine protein) phosphatases II superfamily (2)52799Regulation
Kinases/phosphatases
Crenarchaeota
6-hydroxymethyl-7,8-dihydropterin pyrophosphokinase, HPPK (7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase) superfamily (1)55083Metabolism
Coenzyme metabolism and transport
Crenarchaeota
Eukaryotic DNA topoisomerase I, N-terminal DNA-binding fragment superfamily (1)56741Information
DNA replication/repair
Crenarchaeota
Table 2. Distribution of promiscuous enzymes into metabolic categories of KEGG.
Table 2. Distribution of promiscuous enzymes into metabolic categories of KEGG.
Metabolic Categories (KEGG)Substrate Promiscuity (%)Catalytic Promiscuity (%)
EuryarchaeotaCrenarchaeotaEuryarchaeotaCrenarchaeota
Carbohydrate metabolism33.2136.1232.2634.79
Amino acid metabolism21.0614.6910.369.94
Energy metabolism11.8718.8110.0419.59
Metabolism of cofactors and vitamins10.947.3425.9617.83
Genetic information processing7.045.96.942.63
Nucleotide metabolism4.622.888.869.35
Metabolism of terpenoids and polyketides3.533.360.210
Lipid metabolism2.233.8400
Xenobiotics biodegradation and metabolism1.982.471.380
Environmental information processing1.332.060.960
Biosynthesis of other secondary metabolites1.611.230.213.5
Metabolism of other amino acids0.41.232.022.33
Cellular Processes0.1200.740

Share and Cite

MDPI and ACS Style

Martínez-Núñez, M.A.; Rodríguez-Escamilla, Z.; Rodríguez-Vázquez, K.; Pérez-Rueda, E. Tracing the Repertoire of Promiscuous Enzymes along the Metabolic Pathways in Archaeal Organisms. Life 2017, 7, 30. https://doi.org/10.3390/life7030030

AMA Style

Martínez-Núñez MA, Rodríguez-Escamilla Z, Rodríguez-Vázquez K, Pérez-Rueda E. Tracing the Repertoire of Promiscuous Enzymes along the Metabolic Pathways in Archaeal Organisms. Life. 2017; 7(3):30. https://doi.org/10.3390/life7030030

Chicago/Turabian Style

Martínez-Núñez, Mario Alberto, Zuemy Rodríguez-Escamilla, Katya Rodríguez-Vázquez, and Ernesto Pérez-Rueda. 2017. "Tracing the Repertoire of Promiscuous Enzymes along the Metabolic Pathways in Archaeal Organisms" Life 7, no. 3: 30. https://doi.org/10.3390/life7030030

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop