You are currently viewing a new version of our website. To view the old version click .
Molecules
  • Review
  • Open Access

24 July 2020

Bioinformatics of Metalloproteins and Metalloproteomes

and
1
Shenzhen Key Laboratory of Marine Bioresources and Ecology, College of Life Sciences and Oceanography, Shenzhen University, Shenzhen 518055, China
2
Shenzhen-Hong Kong Institute of Brain Science-Shenzhen Fundamental Research Institutions, Shenzhen 518055, China
3
Shenzhen Bay Laboratory, Shenzhen 518055, China
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Application of Computational Methods and Biomolecular Structural Modeling to the Investigation of Metalloproteins

Abstract

Trace metals are inorganic elements that are required for all organisms in very low quantities. They serve as cofactors and activators of metalloproteins involved in a variety of key cellular processes. While substantial effort has been made in experimental characterization of metalloproteins and their functions, the application of bioinformatics in the research of metalloproteins and metalloproteomes is still limited. In the last few years, computational prediction and comparative genomics of metalloprotein genes have arisen, which provide significant insights into their distribution, function, and evolution in nature. This review aims to offer an overview of recent advances in bioinformatic analysis of metalloproteins, mainly focusing on metalloprotein prediction and the use of different metals across the tree of life. We describe current computational approaches for the identification of metalloprotein genes and metal-binding sites/patterns in proteins, and then introduce a set of related databases. Furthermore, we discuss the latest research progress in comparative genomics of several important metals in both prokaryotes and eukaryotes, which demonstrates divergent and dynamic evolutionary patterns of different metalloprotein families and metalloproteomes. Overall, bioinformatic studies of metalloproteins provide a foundation for systematic understanding of trace metal utilization in all three domains of life.

1. Introduction

Biological trace metals refer to those metals which are only needed in small quantities but are essential for normal development, survival, and reproduction of all living organisms [1,2]. These micronutrients include iron (Fe), zinc (Zn), copper (Cu), manganese (Mn), molybdenum (Mo), tungsten (W), nickel (Ni), cobalt (Co), chromium (Cr), vanadium (V), and several other elements, which play vital roles in a wide variety of biological and chemical events. Some metals act as critical cofactors or as important structural components for different enzymes, whereas others can accept or donate electrons in various redox reactions, or regulate biological processes by facilitating the binding of molecules to corresponding receptors [3,4]. The major metalloid, selenium (Se), is also involved in several pivotal cellular processes in both prokaryotes and eukaryotes [5]. Deficiency and excess of these micronutrients may cause abnormalities in development and metabolism or even death [6].
Among all metals, Fe and Zn are indispensable for all or almost all living organisms [7,8]. The utilization of other trace metals is highly diverse and scattered. Cells require metal ions as cofactors for the assembly and activation of metalloproteins. Most metals are directly incorporated into their binding sites on these proteins, whereas a small number of metals must form metal-containing cofactors or complexes before their insertion into target proteins, such as molybdopterin (or named Mo cofactor, Moco) for Mo and cobalamin (vitamin B12) for Co [9,10]. Metalloproteins comprise a numerous and diverse group within the proteomes of organisms, which not only catalyze a remarkably wide range of important reactions but also play important structural and regulatory functions [11]. It has been suggested that approximately one third of all proteins require metals for their biological roles and almost half of all enzymes must associate with one or more metal ions [12,13]. The number of metalloprotein families varies greatly depending upon which metal they use. For instance, Zn is estimated to be used by several hundred protein families, whereas less than ten protein families belong to Ni-dependent metalloenzymes [14,15]. In addition, Se exerts its biological functions mainly as the 21st amino acid selenocysteine (Sec, encoded by the codon TGA) in selenoproteins [16]. The presence of metalloproteins necessitates a tight regulation of metal metabolism and homeostasis to maintain the appropriate metal concentration in the cell while avoiding toxicity, which includes transport, delivery, storage, detoxification, and efflux machineries [17,18].
In recent years, due to the rapid development of high-throughput sequencing technologies, the availability of genomic sequences for a huge number of organisms has opened the door for systematic analysis of the occurrence and evolutionary trends of metal utilization in nature. The main achievements of these studies lie in in silico identification and comparative genomics of metalloproteins across the tree of life. A variety of bioinformatic algorithms have been developed for the detection of metalloprotein genes. Moreover, large-scale analysis of the metalloproteome (the complete set of metalloproteins) in different organisms may help us to better understand the utilization and function of trace metals in biology. This review focuses on current computational approaches used for metalloprotein gene prediction as well as relevant tools and resources. We will also introduce recent advances in comparative genomics involving several essential metals to present an integrated and holistic picture of the distribution and evolution of metalloproteins and metalloproteomes in all domains of life.

3. Comparative Genomics of Metalloproteins and Metalloproteomes

Comparative genomics provides a powerful tool for investigating evolutionary changes of genes, pathways and other characteristics along various lineages [65]. Based on comparative genomic approaches, we may better understand the use of metalloproteins and metal-dependent processes in living organisms. However, identification and quantification of complete metalloproteomes for most metals is currently impossible [66,67]. Even so, analysis of the majority of them in genomic databases can still greatly improve our understanding of the utilization and function of metals and their variations across species during evolution. In addition, analysis of genes involved in metal uptake, homeostasis, and metal-containing cofactor biosynthesis may assist in identification of metal utilization trait (i.e., the ability to use certain metal) [67,68,69]. A general procedure for comparative genomics of metal utilization is shown in Figure 1. In the following sections, we mainly focus on metalloproteins and discuss recent progress on comparative genomic analyses of metalloproteins for several important metals.
Figure 1. A general diagram for comparative genomic analysis of metal utilization.

3.1. Zinc and Iron

Zn and Fe are the two most commonly used trace metals in all organisms. A great number of proteins have been characterized or predicted to use one of the two metals (see references [25,26,30] and several web resources described above for a compiled list of Zn- or Fe-binding protein families). However, because of the widespread and complex use of the two metals, comparative analyses of the occurrence and evolutionary trends of their utilization are still very challenging to handle [69].
Zn is known to contribute to numerous biological processes in living systems, which is a key component present in hundreds of structural proteins, enzymes, transcription factors, and ribosomal proteins [14]. Previously, Zn proteomes (including both Zn-dependent proteins and some other proteins involved in Zn transport and homeostasis) have been predicted in a limited number of prokaryotic and eukaryotic organisms based on Zn-binding domains and patterns extracted from various databases [25,28,29,70]. In general, the number of Zn-binding proteins are positively correlated with the proteome size of an organism. Eukaryotes had a higher proportion (8%~10%) of Zn-binding proteins than prokaryotes (5%~6%). The majority of prokaryotic Zn proteins perform enzymatic catalysis (especially hydrolases) while the eukaryotic Zn proteome is mainly involved in both catalysis and transcription regulation of gene expression, suggesting that Zn-binding transcription factors have evolved to regulate more complex and diverse processes in higher organisms [28,29]. Another study analyzed two major groups of Zn proteins (Zn finger-containing proteins and Zn hydrolytic enzymes) in more than 800 organisms, which revealed that there is a correlation in their changes during evolution related to environmental change [71]. In recent years, comparative genomic approaches have been frequently used to investigate the distribution and diversification of certain Zn-dependent protein families, particularly Zn finger-containing transcription factors such as PRDM, Zic, and several other C2H2-Zn finger protein families, which provides a basis for further research on the origin, function, and evolutionary features of these proteins [72,73,74,75]. With an explosion in genomic resources and the rapidly expanding number of bioinformatic tools in the past decade, a more comprehensive analysis of Zn-dependent proteomes in all kingdoms of life is urgently needed.
Fe is the most abundant transition metal in cells and has a fundamental role in many metabolic processes, such as oxygen transport, electron transfer, nucleic acid synthesis, growth, and many important redox reactions [7]. Besides Fe ions, many proteins may use Fe in the form of heme or Fe-S clusters [7,11]. Due to the complexity and diversity of Fe utilization, it is very hard to identify the complete Fe-dependent proteomes; however, several studies have been performed aiming at the understanding of different groups of Fe-dependent proteins in various organisms. An early bioinformatic study investigated the occurrence of putative non-heme Fe-binding proteins in a small number of prokaryotes and eukaryotes, which demonstrated that extant organisms have inherited the majority of Fe proteome from the last universal common ancestor [26]. Compared to Zn proteome, the Fe proteome constituted a higher fraction of the proteome in archaea (7.1% on average) than in bacteria (3.9%) and in eukaryotes (1.1%). Another computational study compared the distribution of Fe-S proteins in more than 400 prokaryotic organisms with different life styles and found a strong relationship between environmental dioxygen levels and the usage of different Fe-S clusters [31]. Very recently, the complete human Fe proteome was systematically analyzed based on different types of Fe-containing cofactors [30]. About 2% of human genes encode Fe proteins (35%, 48%, and 17% for individual Fe ions, heme, and Fe-S clusters, respectively). Interestingly, genes encoding Fe proteins (especially Fe-S proteins) appeared to be more commonly related to pathologies than all other human genes, suggesting specific features of the physiological role of Fe. In addition, comparative genomic analyses were carried out for investigating Fe metabolism and homeostasis mechanisms in different organisms, such as cytosolic Fe-S cluster assembly machinery [76], heme biosynthesis and uptake machinery [77], and several other protein families involved in Fe transport and storage [78], which provide detailed insights into the composition and evolution of Fe metabolic network.

3.2. Copper

Cu is an important activator for several key enzymes participating in fundamental biological processes such as respiration, photosynthesis, and oxidative stress responses. A number of Cu-dependent proteins (cuproproteins) have been characterized in both prokaryotes and eukaryotes. The currently known cuproprotein families are shown in Table 3 (proteins involved in Cu transport and homeostasis are not included).
Table 3. List of known metalloprotein families for several metals in prokaryotes and eukaryotes.
Comparative genomic studies have been previously carried out to analyze intrinsic features of different cuproproteins or cuproproteomes (the whole set of cuproproteins) in various organisms [27,79,80,81,82]. Two early studies combined known Cu-binding domains and Cu-binding patterns to explore the occurrence of Cu proteins (including both cuproproteins and some other proteins involved in Cu transport and homeostasis) in a limited number of sequenced genomes [27,79]. The proportion of Cu-binding proteins was small when compared to that of Zn or non-heme Fe proteins. Eukaryotes have expanded the Cu proteome inherited from the last common ancestor of all organisms by evolving new Cu domains and reusing old domains for novel functions.
Some other studies provide more detailed information about cuproproteins in the three domains of life [67,80,81]. Cytochrome c oxidase subunits I (COX I) and II (COX II) are the most widely distributed cuproproteins in prokaryotes (Figure 2A). Multicopper oxidase (MCO), Cu-Zn superoxide dismutase (Cu-Zn SOD) and plastocyanin families were also found in many prokaryotes, whereas the occurrence of tyrosinase, nitrosocyanin, Cu amine oxidase, and particulate methane monooxygenase seems to be quite limited. Except for cuproproteins that were exclusively present in individual kingdoms (e.g., azurin in bacteria and rusticyanin in archaea), significant difference in the distribution of most cuproproteins was observed between archaea and bacteria (Figure 2A). On the other hand, only half of prokaryotic cuproprotein families could be found in eukaryotic organisms, and the latter have the capacity to evolve new cuproproteins such as galactose oxidase, hemocyanin, and plantacyanin. MCO, COX I, COX II, and Cu-Zn SOD were the most abundant cuproprotein families in eukaryotes, while the distribution of some cuproproteins appeared to be phylum-specific, e.g., hemocyanin in arthropods and plantacyanin in land plants. A recent comparative analysis of the presence of hemocyanin in different myriapod species suggests that these proteins have divergent evolutionary patterns in different myriapod taxa [82]. Further analysis of prokaryotic cuproproteomes revealed that larger cuproproteomes were mainly present in Alphaproteobacteria, Betaproteobacteria and Euryarchaeota/Halobacteriales. The largest bacterial and archaeal cuproproteomes reported to date were detected in several Sinorhizobium species (S. medicae and S. meliloti, 22 cuproprotein genes) and Haloarcula marismortui (25 cuproprotein genes), respectively [81]. In eukaryotes, land plants possessed the largest cuproproteomes, especially Oryza sativa containing 78 cuproprotein genes). It is interesting that larger cuproproteomes were mainly found in organisms living in oxygen-rich environments, which is consistent with the idea that proteins evolved to use Cu following the oxygenation of the Earth [80,81,82]. Because previous studies relied only on a limited number of organisms, future research is needed to update the distribution and evolution of cuproproteins/cuproproteomes using a much wider range of sequenced genomes belonging to different clades.
Figure 2. Distribution of metalloprotein families in the three domains of life. (A) Distribution of cuproproteins in Cu-utilizing organisms; (B) Distribution of molybdoproteins in Mo-utilizing organisms; (C) Distribution of tungstoproteins in W-utilizing organisms; (D) Distribution of Ni-dependent proteins in Ni-utilizing organisms; (E) Distribution of B12-dependent proteins in Co-utilizing organisms. COX I, cytochrome c oxidase subunit I; COX II, cytochrome c oxidase subunit II; MCO, multicopper oxidase; Cu-Zn SOD, Cu-Zn superoxide dismutase; Ndh2, NADH dehydrogenase 2; NiR, nitrite reductase; N2OR, nitrous oxide reductase; CuAO, Cu amine oxidase; pMMO, particulate methane monooxygenase; PHM, peptidylglycine α-hydroxylating monooxygenase; DBM, dopamine β-monooxygenase; GAO, galactose oxidase; DMSOR, dimethylsulfoxide reductase; XO, xanthine oxidase; SO, sulfite oxidase; AOR, aldehyde:ferredoxin oxidoreductase; CODH, carbon monoxide dehydrogenase; SodN, Ni-containing superoxide dismutase; CODH/ACS, acetyl-coenzyme A synthase/decarbonylase; MCR, methyl-coenzyme M reductase; MetH, methionine synthase; RNR II, B12-dependent ribonucleotide reductase class II; MCM, methylmalonyl-CoA mutase; ICM, isobutyryl-CoA mutase; ECM, ethylmalonyl-CoA mutase; EAL, ethanolamine ammonia lyase; DDH/GDH, diol/glycerol dehydratase; 5,6-LAM, D-lysine 5,6-aminomutase; GM, glutamate mutase; PceA/CprA, B12-dependent reductive dehalogenase; MtrA, methyltetrahydromethanopterin:coenzyme M methyltransferase subunit A. Data used to generate this figure are available in the supplementary information of [81,83].

3.3. Molybdenum and Tungsten

Mo is required for the activity of a number of molybdoproteins that catalyze diverse reactions in the metabolism of carbon, nitrogen, and sulfur compounds [9]. With the exception of Fe-Mo-containing nitrogenase, Mo needs to be bound to a specific pyranopterin moiety to form Moco, an active compound at the active site of all molybdoproteins [84]. Some prokaryotes (such as hyperthermophilic archaea) use W to replace Mo, which is bound to the same pyranopterin to form tungstoproteins [85]. A list of known molybdoprotein and tungstoprotein families is shown in Table 3. Each family may contain a variety of enzymes [86,87]. It has been suggested that MOSC (Moco sulfurase C-terminal domain)-containing proteins are new members of the sulfite oxidase (SO) family due to similar structures for Mo-binding domains [86]; however, the lack of significant sequence similarity between them may challenge such an alternative classification approach [88].
Several comparative genomic studies have been conducted to explore the distribution and evolution of Mo utilization trait and molybdoproteins in all domains of life, which give preliminary indications of how this transition element is used by different organisms [81,89,90]. Very recently, the occurrence of all known molybdoprotein families in nearly 6000 sequenced prokaryotes and eukaryotes was analyzed, which presents a much more comprehensive view of the evolutionary trajectories of molybdoproteins in nature [83]. Dimethylsulfoxide reductase (DMSOR) is the most widespread molybdoprotein family in both archaea and bacteria, which was present in more than 90% Mo-utilizing organisms (Figure 2B). MOSC-containing protein, xanthine oxidase (XO), and SO families were also widespread in the majority of Mo-utilizing bacteria; however, most sequenced archaea do not have MOSC-containing protein and XO families. Several new domain fusions were detected for different members of DMSOR, SO, and XO in prokaryotes, providing valuable information for the inference of protein interactions and functions. The Fe-Mo-containing nitrogenase was only needed by a small number of bacteria and methanogenic archaea. On the other hand, MOSC-containing protein (or named mARC), SO, and XO are the three eukaryotic molybdeoprotein families, all of which were detected in almost all organisms that use Mo, indicating that they are all critical for maintaining the function of Mo in this kingdom. With regard to molybdoproteomes, many organisms in Actinobacteria and several subclasses of Proteobacteria were molybdoprotein-rich organisms (>20 molybdoprotein genes). To date, the largest molybdoproteome in bacteria was found in Gordonibacter pamelaeae 7-10-1-b (73 molybdoprotein genes, mostly belonging to the DMSOR family) [83]. In contrast, very few molybdoprotein-rich organisms were observed in archaea and eukaryotes. Further examination of the relationship between environmental factors and molybdoproteins revealed that the majority of molybdoprotein families and large molybdoproteomes are more frequently present in aerobic organisms, implying that oxygen has played a crucial role in the evolution of molybdoprotein genes [83].
Although it is still very difficult to distinguish between Mo and W utilization in different members of molybdoproteins due to quite similar physical-chemical and functional properties [91], it is worth mentioning that several attempts have been made to identify tungstoproteins from molybdoprotein families based on recent advances on tungstoproteins [83,92]. The currently known tungstoproteins include nearly all enzymes of the aldehyde:ferredoxin oxidoreductase (AOR) family and certain enzymes of the DMSOR family, including formate dehydrogenase and acetylene hydratase from strictly anaerobic bacteria and formylmethanofuran dehydrogenase from methanogenic archaea [92,93,94]. Preliminary analysis of tungstoproteins in prokaryotes revealed that AOR could be detected in the majority of W-utilizing prokaryotes while W-containing DMSOR proteins were present in most W-utilizing bacteria and a small number of archaea (mainly methanogens) (Figure 2C) [83]. These exploratory studies may provide the first global view of W utilization in prokaryotes.

3.4. Nickel and Cobalt

Ni is an essential cofactor for several enzymes that play critical roles in energy and nitrogen metabolism [15,95]. Some other Ni-containing proteins, such as glyoxalase I and acireductone dioxygenase, are not strictly Ni-dependent proteins which may bind alternative metals in different or even same organisms [95]. Co is mainly used as a key component of cobalamin (or called vitamin B12), which encompasses a group of closely related corrinoid compounds found in enzymes that mediate methyl transfer reactions, isomerase rearrangements, dehalogenation, and some other processes [96,97,98]. Moreover, Co is also detected in several non-corrin Co-containing enzymes in certain organisms, which may use other metals (such as Zn and Fe) to replace Co in many other organisms [99]. In this review, we only discuss strictly Ni-dependent and B12-binding protein families which are shown in Table 3.
To our knowledge, only few comparative genomic studies have been conducted on Ni- or Co-dependent metalloproteins in a wide range of organisms from the three domains of life [81,100,101]. As prokaryotes use similar import systems for Ni and Co uptake [102,103], the utilization of the two trace metals could be highly correlated, which was supported by the observation that most prokaryotic organisms use both metals [81]. In bacteria, urease and methionine synthase (MetH) were the most frequently used Ni- and Co-dependent protein families, respectively (Figure 2D,E). However, they seemed to be rare or even absent in archaea, in which Ni-Fe hydrogenase and B12-dependent ribonucleotide reductase class II (RNR II) were the most commonly used Ni and Co enzymes. Except for a small number of organisms (such as deltaproteobacteria and Methanosarcina species), most prokaryotes possessed no more than 5 Ni- and/or Co-dependent metalloprotein genes. The largest Ni-dependent proteome was previously reported in Deltaproteobacterium MLMS-1 (16 Ni-binding protein genes, half were Ni-Fe hydrogenases) and the largest B12-dependent proteome in Dehalococcoides sp. CBDB1 (35 B12-dependent protein genes, 32 were reductive dehalogenase CprA proteins) [81,100]. Another recent study analyzed the distribution of vitamin B12 production pathway and a variety of B12-dependent enzymes in over 11,000 bacterial species, which provides important information on B12 utilization and its evolution in a much wider prokaryotic range [104]. Approximately 86% of the examined bacteria contained B12-dependent enzyme families, most of which lacked the ability to synthesize B12 and have to obtain this cofactor from exogenous sources. Proteobacteria and Bacteroidetes appeared to have larger numbers of B12-dependent enzymes than others.
In contrast to prokaryotes, the utilization of Ni and Co is quite restricted in eukaryotes, and very few organisms utilize both metals [81]. Only one Ni-dependent enzyme (urease) and three B12-dependent enzymes (methylmalonyl-CoA mutase, RNR II, and MetH) have been reported in this kingdom (Figure 2D,E). Urease and MetH were present in all Ni- and Co-utilizing eukaryotes, respectively. Analysis of Ni- and Co-dependent metalloproteomes did not reveal organisms that contained many of these proteins. Interestingly, compared to the majority of unicellular organisms that lack B12-binding proteins, Dictyostelium discoideum and several Phytophthora species contained all the three known eukaryotic B12-dependent enzymes, implying a more important role of B12 cofactor in these organisms [81]. In the future, it is necessary to perform more comprehensive surveys on the two metals using newly generated genomic resources.

3.5. Selenium

Se is a metalloid trace element, which is essential for normal physiological functions in humans, animals, and many other organisms [5,105]. It mainly occurs in the form of Sec, which is a key component of selenoproteins involved in numerous enzymatic reactions, such as redox homeostasis, thyroid hormone metabolism, anti-inflammatory actions, and reproduction [106,107]. The mechanism of Sec biosynthesis and its incorporation into proteins has been elucidated in both prokaryotes and eukaryotes [21,22]. So far, a significant number of selenoproteins have been reported in various organisms from bacteria to mammals, many of which were identified using reliable bioinformatic approaches [50,51,52,57,108,109]. Table 4 lists the majority of known and putative selenoproteins. Although the functions of most selenoproteins are not known and could only be inferred by sequence homology, it is very likely that most of them play important roles in antioxidation and detoxification [106].
Table 4. List of known selenoprotein families.
Previously, several computational and comparative genomic approaches have been carried out to investigate the distribution and evolution of Se metabolic pathways and selenoproteins in a large number of prokaryotic organisms and selected environmental samples [23,81,110,111,112,113,114,115,116], which provide detailed information on how this element is selectively used by proteins and organisms from different kingdoms. An early work analyzed the Sec biosynthetic pathway and known selenoproteins in several hundred bacterial and archaeal genomes, and found that only one-fourth of the examined organisms have selenoprotein genes. Most selenoprotein-rich organisms belong to Deltaproteobacteria and Clostridia [81]. Recently, a much more extensive evaluation has been conducted on Se metabolism and selenoproteins in bacteria by analyzing more than 5200 genomes, which generated the largest map of Se utilization in this kingdom [114]. More than 60 selenoprotein families/subfamilies could be detected in bacteria. Formate dehydrogenase alpha subunit and selenophosphate synthetase were the most widespread bacterial selenoprotein families (Figure 3). A new selenoprotein-rich phylum Synergistetes and additional selenoprotein-rich organisms have also been identified. The largest bacterial selenoproteome was found in Syntrophobacter fumaroxidans, a syntrophic propionate-oxidizing deltaproteobacterium containing 39 selenoprotein genes [81,114]. Although both aerobic and anaerobic organisms could use Sec, the fact that most selenoprotein-rich organisms (78.3%) are obligate or facultative anaerobic suggests a somewhat stronger correlation between evolution of selenoprotein genes and low oxygen level [114].
Figure 3. Distribution of the top 20 selenoproteins in Sec-utilizing bacteria. Data used to generate this figure can be found in [114].
In archaea, selenoprotein genes were only detected in a small number of organisms belonging to three phyla: Methanococcales, Methanopyrales, and Lokiarchaeota [81,116,117,118]. Compared to bacteria which contain a variety of known or predicted selenoprotein families, only nine selenoprotein families have been discovered in archaea, most of which are involved in methanogenesis [117]. The archaeal selenoproteomes show a relatively narrow size distribution (7~12 selenoproteins). Lokiarchaeota, a novel archaeal phylum and the closest archaeal relative to eukaryotes, was reported to have the largest archaeal selenoproteome (at least 12 selenoprotein genes) [118]. Further analysis of Lokiarchaeota selenoprotein genes suggests that this archaeon may serve as an intermediate form between the typical archaeal and eukaryotic Sec biosynthesis systems, providing new clues for the origin and evolution of the Sec utilization trait.
More efforts have been made to explore the distribution and evolution of selenoproteins in eukaryotes [81,119,120,121,122,123,124]. Several early comparative studies demonstrated that many selenoprotein families, such as glutathione peroxidases (GPXs), thioredoxin reductases (TXNRDs), and selenophosphate synthase 2 (SEPHS2) are shared between single-cell eukaryotes (such as green algae and many protists) and vertebrates, implying that the majority of eukaryotic selenoproteins originated from the ancestors of current eukaryotes and have been preserved throughout evolution [81,119,120,121]. However, massive and independent selenoprotein gene loss events (either loss of selenoprotein genes or replacement of Sec with Cys residue) were observed in different lineages such as fungi, land plants, nematodes, and some other organisms [119,120]. The size of eukaryotic selenoproteomes varies greatly between species. With the exception of mammals, aquatic organisms (such as algae and fish) generally have larger selenoproteomes than terrestrial ones (such as insects and nematodes). Although parallel loss of Sec utilization was observed in different groups of algae [122], the largest eukaryotic selenoproteome was described in the harmful pelagophyte alga Aureococcus anophagefferens (containing at least 59 selenoprotein genes) [125]. In animals, amphioxus was found to have the most abundant and diverse selenoproteins (containing 40 selenoprotein genes) [121]. Further investigation of selenoproteins in sequenced vertebrates defined the ancestral vertebrate (28 selenoproteins) and mammalian (25 selenoproteins) selenoproteomes, and reconstructed their evolutionary history [120]. For example, mammalian TXNRD1 and TXNRD3 were found to have evolved from an ancestral glutaredoxin-domain-containing enzyme, and selenoprotein V and GPX6 appeared at the root of placental mammals by duplications of selenoprotein W and GPX3, respectively. By evaluating the potential forces for selenoprotein gain or loss and for substitutions between Sec and Cys residues in different vertebrate clades, it was proposed that the strength of natural selection on selenoprotein genes is distinct between land vertebrates and teleost fishes, suggesting that Se availability has shaped the evolution of vertebrates [124]. In addition, selenoprotein P (SELENOP), the only human selenoprotein with multiple Sec residues, has been suggested to function as a genetic marker of Se utilization in animals (i.e., its number of Sec residues correlates with the selenoproteome size) [126]. A recent study showed that SELENOP genes are present across metazoan lineages with highly variable numbers of Sec-TGA codons, ranging from a single Sec residue in certain insects to up to 132 in bivalve mollusks, implying a highly dynamic evolutionary process of this selenoprotein [127]. Very recently, it was also reported that Sec could be encoded by several early-branching fungal phyla, which provides new insights into the evolution of Sec utilization in fungi [128].
Theoretically, comparative genomic approaches could be applied to study the metabolism of all trace metals and to identify the corresponding metalloproteins. However, due to limited knowledge about metal-binding sites and related properties for several other metals such as Mn, Cr, and V, metalloproteins that are strictly dependent on them remain poorly defined. For example, Mn is known to serve as a substitute for some other metals (e.g., Zn and Mg) in the active sites of numerous enzymes, resulting in the difficulty to distinguish Mn-dependent proteins from other metalloproteins [129]. Only a few proteins have been reported to bind Cr or V in certain organisms, including Cr-containing oligopeptide chromodulin and V-containing vanabins and haloperoxidases [130,131]; however, it is unclear whether these proteins are strictly dependent on the corresponding metal in other organisms. Therefore, comparative analysis of these metalloproteins seems to be a hard task and needs to be solved in the future.

4. Conclusions

Bioinformatics offers a powerful tool for studying metalloproteins, metalloproteomes, and their evolutionary trends in nature. Many of these studies have been devoted to the identification of metalloproteins based on known metal-binding motifs, patterns, and profiles. To date, it is still very difficult to identify complete metalloproteomes for almost all metals. Nevertheless, recent progress on bioinformatic research of metalloproteins, especially comparative genomics of several essential trace metals, has provided significant insights into the general principles of metal utilization and evolution across the three domains of life. In this review, we highlight recent studies that used various computational strategies and methods to predict metalloproteins and to investigate the distribution and evolution of metalloproteins/metalloproteomes in a wide range of organisms. In the future, with the dramatic increase in the number of sequenced genomes and improved computational techniques for identifying more metalloprotein genes, bioinformatics and comparative genomic approaches will play an even more important role in better understanding of metal utilization and function in biology.

Author Contributions

Y.Z. and J.Z. drafted the manuscript and revised it. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China (grant number 31771407), Guangdong Basic and Applied Basic Research Foundation (grant number 2019A1515011938), Science and Technology Innovation Committee of Shenzhen Municipality (grant number JCYJ20180305124023495), and Shenzhen-Hong Kong Institute of Brain Science-Shenzhen Fundamental Research Institutions (grant number 2019SHIBS0003).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zoroddu, M.A.; Aaseth, J.; Crisponi, G.; Medici, S.; Peana, M.; Nurchi, V.M. The essential metals for humans: A brief overview. J. Inorg. Biochem. 2019, 195, 120–129. [Google Scholar] [CrossRef] [PubMed]
  2. Freeland-Graves, J.H.; Sanjeevi, N.; Lee, J.J. Global perspectives on trace element requirements. J. Trace Elem. Med. Biol. 2015, 31, 135–141. [Google Scholar] [CrossRef]
  3. Mertz, W. Review of the scientific basis for establishing the essentiality of trace elements. Biol. Trace Elem. Res. 1998, 66, 185–191. [Google Scholar] [CrossRef] [PubMed]
  4. Nordberg, M.; Nordberg, G.F. Trace element research-historical and future aspects. J. Trace Elem. Med. Biol. 2016, 38, 46–52. [Google Scholar] [CrossRef] [PubMed]
  5. Roman, M.; Jitaru, P.; Barbante, C. Selenium biochemistry and its role for human health. Metallomics 2014, 6, 25–54. [Google Scholar] [CrossRef]
  6. Van Gossum, A.; Neve, J. Trace element deficiency and toxicity. Curr. Opin. Clin. Nutr. Metab. Care 1998, 1, 499–507. [Google Scholar] [CrossRef]
  7. Sánchez, M.; Sabio, L.; Gálvez, N.; Capdevila, M.; Dominguez-Vera, J.M. Iron chemistry at the service of life. IUBMB Life 2017, 69, 382–388. [Google Scholar] [CrossRef]
  8. King, J.C. Zinc: An essential but elusive nutrient. Am. J. Clin. Nutr. 2011, 94, 679Ss–684Ss. [Google Scholar] [CrossRef]
  9. Magalon, A.; Mendel, R.R. Biosynthesis and insertion of the molybdenum cofactor. EcoSal Plus 2015. [Google Scholar] [CrossRef]
  10. Giedyk, M.; Goliszewska, K.; Gryko, D. Vitamin B12 catalysed reactions. Chem. Soc. Rev. 2015, 44, 3391–3404. [Google Scholar] [CrossRef]
  11. Degtyarenko, K. Bioinorganic motifs: Towards functional classification of metalloproteins. Bioinformatics 2000, 16, 851–864. [Google Scholar] [CrossRef] [PubMed]
  12. Waldron, K.J.; Rutherford, J.C.; Ford, D.; Robinson, N.J. Metalloproteins and metal sensing. Nature 2009, 460, 823–830. [Google Scholar] [CrossRef]
  13. Andreini, C.; Bertini, I.; Cavallaro, G.; Holliday, G.L.; Thornton, J.M. Metal ions in biological catalysis: From enzyme databases to general principles. J. Biol. Inorg. Chem. 2008, 13, 1205–1218. [Google Scholar] [CrossRef] [PubMed]
  14. Maret, W. Zinc and the zinc proteome. Met. Ions Life Sci. 2013, 12, 479–501. [Google Scholar] [PubMed]
  15. Alfano, M.; Cavazza, C. Structure, function, and biosynthesis of nickel-dependent enzymes. Protein Sci. 2020, 29, 1071–1089. [Google Scholar] [CrossRef]
  16. Zoidis, E.; Seremelis, I.; Kontopoulos, N.; Danezis, G.P. Selenium-dependent antioxidant enzymes: Actions and properties of selenoproteins. Antioxidants (Basel) 2018, 7, 66. [Google Scholar] [CrossRef]
  17. Chandrangsu, P.; Rensing, C.; Helmann, J.D. Metal homeostasis and resistance in bacteria. Nat. Rev. Microbiol. 2017, 15, 338–350. [Google Scholar] [CrossRef]
  18. Martinez-Finley, E.J.; Chakraborty, S.; Fretham, S.J.; Aschner, M. Cellular transport and homeostasis of essential and nonessential metals. Metallomics 2012, 4, 593–605. [Google Scholar] [CrossRef]
  19. Zhang, Y.; Gladyshev, V.N. Comparative genomics of trace element dependence in biology. J. Biol. Chem. 2011, 286, 23623–23629. [Google Scholar] [CrossRef]
  20. Sukdeo, N.; Clugston, S.L.; Daub, E.; Honek, J.F. Distinct classes of glyoxalase I: Metal specificity of the Yersinia pestis, Pseudomonas aeruginosa and Neisseria meningitidis enzymes. Biochem. J. 2004, 384, 111–117. [Google Scholar] [CrossRef]
  21. Bulteau, A.L.; Chavatte, L. Update on selenoprotein biosynthesis. Antioxid. Redox Signal. 2015, 23, 775–794. [Google Scholar] [CrossRef]
  22. Gonzalez-Flores, J.N.; Shetty, S.P.; Dubey, A.; Copeland, P.R. The molecular biology of selenocysteine. Biomol. Concepts 2013, 4, 349–365. [Google Scholar] [CrossRef]
  23. Zhang, Y.; Romero, H.; Salinas, G.; Gladyshev, V.N. Dynamic evolution of selenocysteine utilization in bacteria: A balance between selenoprotein loss and evolution of selenocysteine from redox active cysteine residues. Genome Biol. 2006, 7, R94. [Google Scholar] [CrossRef]
  24. Mariotti, M.; Guigó, R. Selenoprofiles: Profile-based scanning of eukaryotic genome sequences for selenoprotein genes. Bioinformatics 2010, 26, 2656–2663. [Google Scholar] [CrossRef] [PubMed]
  25. Andreini, C.; Banci, L.; Bertini, I.; Rosato, A. Zinc through the three domains of life. J. Proteome Res. 2006, 5, 3173–3178. [Google Scholar] [CrossRef] [PubMed]
  26. Andreini, C.; Banci, L.; Bertini, I.; Elmi, S.; Rosato, A. Non-heme iron through the three domains of life. Proteins 2007, 67, 317–324. [Google Scholar] [CrossRef]
  27. Andreini, C.; Banci, L.; Bertini, I.; Rosato, A. Occurrence of copper proteins through the three domains of life: A bioinformatic approach. J. Proteome Res. 2008, 7, 209–216. [Google Scholar] [CrossRef]
  28. Andreini, C.; Bertini, I.; Rosato, A. Metalloproteomes: A bioinformatic approach. Acc. Chem. Res. 2009, 42, 1471–1479. [Google Scholar] [CrossRef]
  29. Andreini, C.; Bertini, I. A bioinformatics view of zinc enzymes. J. Inorg. Biochem. 2012, 111, 150–156. [Google Scholar] [CrossRef]
  30. Andreini, C.; Putignano, V.; Rosato, A.; Banci, L. The human iron-proteome. Metallomics 2018, 10, 1223–1231. [Google Scholar] [CrossRef]
  31. Andreini, C.; Rosato, A.; Banci, L. The relationship between environmental dioxygen and iron-sulfur proteins explored at the genome level. PLoS ONE 2017, 12, e0171279. [Google Scholar] [CrossRef]
  32. Andreini, C.; Bertini, I.; Cavallaro, G.; Decaria, L.; Rosato, A. A simple protocol for the comparative analysis of the structure and occurrence of biochemical pathways across superkingdoms. J. Chem. Inf. Model. 2011, 51, 730–738. [Google Scholar] [CrossRef]
  33. Passerini, A.; Andreini, C.; Menchetti, S.; Rosato, A.; Frasconi, P. Predicting zinc binding at the proteome level. BMC Bioinform. 2007, 8, 39. [Google Scholar] [CrossRef] [PubMed]
  34. Shu, N.; Zhou, T.; Hovmöller, S. Prediction of zinc-binding sites in proteins from sequence. Bioinformatics 2008, 24, 775–782. [Google Scholar] [CrossRef]
  35. Zhao, W.; Xu, M.; Liang, Z.; Ding, B.; Niu, L.; Liu, H.; Teng, M. Structure-based de novo prediction of zinc-binding sites in proteins of unknown function. Bioinformatics 2011, 27, 1262–1268. [Google Scholar] [CrossRef] [PubMed]
  36. Zheng, C.; Wang, M.; Takemoto, K.; Akutsu, T.; Zhang, Z.; Song, J. An integrative computational framework based on a two-step random forest algorithm improves prediction of zinc-binding sites in proteins. PLoS ONE 2012, 7, e49716. [Google Scholar] [CrossRef] [PubMed][Green Version]
  37. Chen, Z.; Wang, Y.; Zhai, Y.F.; Song, J.; Zhang, Z. ZincExplorer: An accurate hybrid method to improve the prediction of zinc-binding sites from protein sequences. Mol. Biosyst. 2013, 9, 2213–2222. [Google Scholar] [CrossRef]
  38. Srivastava, A.; Kumar, M. Prediction of zinc binding sites in proteins using sequence derived information. J. Biomol. Struct. Dyn. 2018, 36, 4413–4423. [Google Scholar] [CrossRef]
  39. Ajitha, M.; Sundar, K.; Arul Mugilan, S.; Arumugam, S. Development of METAL-ACTIVE SITE and ZINCCLUSTER tool to predict active site pockets. Proteins 2018, 86, 322–331. [Google Scholar] [CrossRef]
  40. Yan, R.; Wang, X.; Tian, Y.; Xu, J.; Xu, X.; Lin, J. Prediction of zinc-binding sites using multiple sequence profiles and machine learning methods. Mol. Omics 2019, 15, 205–215. [Google Scholar] [CrossRef]
  41. Liu, R.; Hu, J. HemeBIND: A novel method for heme binding residue prediction by combining structural and sequence information. BMC Bioinform. 2011, 12, 207. [Google Scholar] [CrossRef] [PubMed]
  42. Liou, Y.F.; Charoenkwan, P.; Srinivasulu, Y.; Vasylenko, T.; Lai, S.C.; Lee, H.C.; Chen, Y.H.; Huang, H.L.; Ho, S.Y. SCMHBP: Prediction and analysis of heme binding proteins using propensity scores of dipeptides. BMC Bioinform. 2014, 15 (Suppl. 16), S4. [Google Scholar] [CrossRef]
  43. Estellon, J.; Ollagnier de Choudens, S.; Smadja, M.; Fontecave, M.; Vandenbrouck, Y. An integrative computational model for large-scale identification of metalloproteins in microbial genomes: A focus on iron-sulfur cluster proteins. Metallomics 2014, 6, 1913–1930. [Google Scholar] [CrossRef] [PubMed]
  44. Valasatava, Y.; Rosato, A.; Banci, L.; Andreini, C. MetalPredator: A web server to predict iron-sulfur cluster binding proteomes. Bioinformatics 2016, 32, 2850–2852. [Google Scholar] [CrossRef]
  45. Sodhi, J.S.; Bryson, K.; McGuffin, L.J.; Ward, J.J.; Wernisch, L.; Jones, D.T. Predicting metal-binding site residues in low-resolution structural models. J. Mol. Biol. 2004, 342, 307–320. [Google Scholar] [CrossRef]
  46. Brylinski, M.; Skolnick, J. FINDSITE-metal: Integrating evolutionary information and machine learning for structure-based metal-binding site prediction at the proteome level. Proteins 2011, 79, 735–751. [Google Scholar] [CrossRef] [PubMed]
  47. Levy, R.; Edelman, M.; Sobolev, V. Prediction of 3D metal binding sites from translated gene sequences based on remote-homology templates. Proteins 2009, 76, 365–374. [Google Scholar] [CrossRef] [PubMed]
  48. Passerini, A.; Lippi, M.; Frasconi, P. MetalDetector v2.0: Predicting the geometry of metal binding sites from protein sequence. Nucleic Acids Res. 2011, 39, W288–W292. [Google Scholar] [CrossRef]
  49. Lin, Y.F.; Cheng, C.W.; Shih, C.S.; Hwang, J.K.; Yu, C.S.; Lu, C.H. MIB: Metal ion-binding site prediction and docking server. J. Chem. Inf. Model. 2016, 56, 2287–2291. [Google Scholar] [CrossRef]
  50. Mariotti, M. SECISearch3 and seblastian: In-silico tools to predict SECIS elements and selenoproteins. Methods Mol. Biol. 2018, 1661, 3–16. [Google Scholar]
  51. Jiang, L.; Liu, Q. SelGenAmic: An algorithm for selenoprotein gene assembly. Methods Mol. Biol. 2018, 1661, 29–39. [Google Scholar] [PubMed]
  52. Zhang, Y.; Gladyshev, V.N. An algorithm for identification of bacterial selenocysteine insertion sequence elements and selenoprotein genes. Bioinformatics 2005, 21, 2580–2589. [Google Scholar] [CrossRef] [PubMed]
  53. Haberal, İ.; Oğul, H. Prediction of protein metal binding sites using deep neural networks. Mol. Inform. 2019, 38, e1800169. [Google Scholar] [CrossRef] [PubMed]
  54. Cvetkovic, A.; Menon, A.L.; Thorgersen, M.P.; Scott, J.W.; Poole, F.L., 2nd; Jenney, F.E., Jr.; Lancaster, W.A.; Praissman, J.L.; Shanmukh, S.; Vaccaro, B.J.; et al. Microbial metalloproteomes are largely uncharacterized. Nature 2010, 466, 779–782. [Google Scholar] [CrossRef]
  55. Azia, A.; Levy, R.; Unger, R.; Edelman, M.; Sobolev, V. Genome-wide computational determination of the human metalloproteome. Proteins 2015, 83, 931–939. [Google Scholar] [CrossRef]
  56. Santesmasses, D.; Mariotti, M.; Gladyshev, V.N. Bioinformatics of selenoproteins. Antioxid. Redox Signal. 2020, in press. [Google Scholar] [CrossRef]
  57. Kryukov, G.V.; Gladyshev, V.N. The prokaryotic selenoproteome. EMBO Rep. 2004, 5, 538–543. [Google Scholar] [CrossRef]
  58. Castagnetto, J.M.; Hennessy, S.W.; Roberts, V.A.; Getzoff, E.D.; Tainer, J.A.; Pique, M.E. MDB: The metalloprotein database and browser at the scripps research institute. Nucleic Acids Res. 2002, 30, 379–382. [Google Scholar] [CrossRef]
  59. Andreini, C.; Bertini, I.; Cavallaro, G.; Holliday, G.L.; Thornton, J.M. Metal-MACiE: A database of metals involved in biological catalysis. Bioinformatics 2009, 25, 2088–2089. [Google Scholar] [CrossRef]
  60. Zhang, Y.; Gladyshev, V.N. dbTEU: A protein database of trace element utilization. Bioinformatics 2010, 26, 700–702. [Google Scholar] [CrossRef]
  61. Harding, M.M.; Hsin, K.Y. Mespeus—A database of metal interactions with proteins. Methods Mol. Biol. 2014, 1091, 333–342. [Google Scholar] [PubMed]
  62. Putignano, V.; Rosato, A.; Banci, L.; Andreini, C. MetalPDB in 2018: A database of metal sites in biological macromolecular structures. Nucleic Acids Res. 2018, 46, D459–D464. [Google Scholar] [CrossRef] [PubMed]
  63. Romagné, F.; Santesmasses, D.; White, L.; Sarangi, G.K.; Mariotti, M.; Hübler, R.; Weihmann, A.; Parra, G.; Gladyshev, V.N.; Guigó, R.; et al. SelenoDB 2.0: Annotation of selenoprotein genes in animals and their genetic diversity in humans. Nucleic Acids Res. 2014, 42, D437–D443. [Google Scholar] [CrossRef] [PubMed]
  64. Ireland, S.M.; Martin, A.C.R. ZincBind-the database of zinc binding sites. Database (Oxford) 2019, 2019, baz006. [Google Scholar] [CrossRef]
  65. Miller, W.; Makova, K.D.; Nekrutenko, A.; Hardison, R.C. Comparative genomics. Annu. Rev. Genomics Hum. Genet. 2004, 5, 15–56. [Google Scholar] [CrossRef]
  66. Maret, W. Metalloproteomics, metalloproteomes, and the annotation of metalloproteins. Metallomics 2010, 2, 117–125. [Google Scholar] [CrossRef]
  67. Zhang, Y.; Gladyshev, V.N. Comparative genomics of trace elements: Emerging dynamic view of trace element utilization and function. Chem. Rev. 2009, 109, 4828–4861. [Google Scholar] [CrossRef]
  68. Tzou, W.S.; Chu, Y.; Lin, T.Y.; Hu, C.H.; Pai, T.W.; Liu, H.F.; Lin, H.J.; Cases, I.; Rojas, A.; Sanchez, M.; et al. Molecular evolution of multiple-level control of heme biosynthesis pathway in animal kingdom. PLoS ONE 2014, 9, e86718. [Google Scholar] [CrossRef][Green Version]
  69. Zhang, Y.; Ying, H.; Xu, Y. Comparative genomics and metagenomics of the metallomes. Metallomics 2019, 11, 1026–1043. [Google Scholar] [CrossRef]
  70. Bertini, I.; Decaria, L.; Rosato, A. The annotation of full zinc proteomes. J. Biol. Inorg. Chem. 2010, 15, 1071–1078. [Google Scholar] [CrossRef]
  71. Decaria, L.; Bertini, I.; Williams, R.J. Zinc proteomes, phylogenetics and evolution. Metallomics 2010, 2, 706–709. [Google Scholar] [CrossRef] [PubMed]
  72. Aruga, J.; Hatayama, M. Comparative genomics of the Zic family genes. Adv. Exp. Med. Biol. 2018, 1046, 3–26. [Google Scholar] [PubMed]
  73. Vervoort, M.; Meulemeester, D.; Béhague, J.; Kerner, P. Evolution of Prdm genes in animals: Insights from comparative genomics. Mol. Biol. Evol. 2016, 33, 679–696. [Google Scholar] [CrossRef] [PubMed]
  74. Najafabadi, H.S.; Garton, M.; Weirauch, M.T.; Mnaimneh, S.; Yang, A.; Kim, P.M.; Hughes, T.R. Non-base-contacting residues enable kaleidoscopic evolution of metazoan C2H2 zinc finger DNA binding. Genome Biol. 2017, 18, 167. [Google Scholar] [CrossRef] [PubMed]
  75. Salih, H.; Odongo, M.R.; Gong, W.; He, S.; Du, X. Genome-wide analysis of cotton C2H2-zinc finger transcription factor family and their expression analysis during fiber development. BMC Plant. Biol. 2019, 19, 400. [Google Scholar] [CrossRef] [PubMed]
  76. Tsaousis, A.D.; Gentekaki, E.; Eme, L.; Gaston, D.; Roger, A.J. Evolution of the cytosolic iron-sulfur cluster assembly machinery in Blastocystis species and other microbial eukaryotes. Eukaryot. Cell 2014, 13, 143–153. [Google Scholar] [CrossRef]
  77. Cavallaro, G.; Decaria, L.; Rosato, A. Genome-based analysis of heme biosynthesis and uptake in prokaryotic systems. J. Proteome Res. 2008, 7, 4946–4954. [Google Scholar] [CrossRef]
  78. Hayrapetyan, H.; Siezen, R.; Abee, T.; Nierop Groot, M. Comparative genomics of iron-transporting systems in bacillus cereus strains and impact of iron sources on growth and biofilm formation. Front. Microbiol. 2016, 7, 842. [Google Scholar] [CrossRef]
  79. Decaria, L.; Bertini, I.; Williams, R.J. Copper proteomes, phylogenetics and evolution. Metallomics 2011, 3, 56–60. [Google Scholar] [CrossRef]
  80. Ridge, P.G.; Zhang, Y.; Gladyshev, V.N. Comparative genomic analyses of copper transporters and cuproproteomes reveal evolutionary dynamics of copper utilization and its link to oxygen. PLoS ONE 2008, 3, e1378. [Google Scholar] [CrossRef]
  81. Zhang, Y.; Gladyshev, V.N. General trends in trace element utilization revealed by comparative genomic analyses of Co, Cu, Mo, Ni, and Se. J. Biol. Chem. 2010, 285, 3393–3405. [Google Scholar] [CrossRef] [PubMed]
  82. Scherbaum, S.; Hellmann, N.; Fernández, R.; Pick, C.; Burmester, T. Diversity, evolution, and function of myriapod hemocyanins. BMC Evol. Biol. 2018, 18, 107. [Google Scholar] [CrossRef] [PubMed]
  83. Peng, T.; Xu, Y.; Zhang, Y. Comparative genomics of molybdenum utilization in prokaryotes and eukaryotes. BMC Genomics 2018, 19, 691. [Google Scholar] [CrossRef] [PubMed]
  84. Mendel, R.R. The molybdenum cofactor. J. Biol. Chem. 2013, 288, 13165–13172. [Google Scholar] [CrossRef] [PubMed]
  85. Moura, J.J.; Bernhardt, P.V.; Maia, L.B.; Gonzalez, P.J. Molybdenum and tungsten enzymes: From biology to chemistry and back. J. Biol. Inorg. Chem. 2015, 20, 181–182. [Google Scholar] [CrossRef] [PubMed]
  86. Hille, R.; Hall, J.; Basu, P. The mononuclear molybdenum enzymes. Chem. Rev. 2014, 114, 3963–4038. [Google Scholar] [CrossRef] [PubMed]
  87. Leimkühler, S.; Iobbi-Nivol, C. Bacterial molybdoenzymes: Old enzymes for new purposes. FEMS Microbiol. Rev. 2016, 40, 1–18. [Google Scholar] [CrossRef]
  88. Gladyshev, V.N.; Zhang, Y. Abundance, ubiquity and evolution of molybdoenzymes. In Molybdenum and Tungsten Enzymes: Biochemistry, 1st ed.; Hille, R., Schulzke, C., Kirk, M.L., Eds.; The Royal Society of Chemistry: Cambridge, UK, 2016; pp. 81–99. [Google Scholar]
  89. Llamas, A.; Tejada-Jiménez, M.; Fernández, E.; Galván, A. Molybdenum metabolism in the alga Chlamydomonas stands at the crossroad of those in Arabidopsis and humans. Metallomics 2011, 3, 578–590. [Google Scholar] [CrossRef]
  90. Zhang, Y.; Rump, S.; Gladyshev, V.N. Comparative genomics and evolution of molybdenum utilization. Coord. Chem. Rev. 2011, 255, 1206–1217. [Google Scholar] [CrossRef]
  91. Bevers, L.; Hagedoorn, P.; Hagen, W. The bioinorganic chemistry of tungsten. Coord. Chem. Rev. 2009, 253, 269–290. [Google Scholar] [CrossRef]
  92. Pushie, M.J.; Cotelesage, J.J.; George, G.N. Molybdenum and tungsten oxygen transferases--and functional diversity within a common active site motif. Metallomics 2014, 6, 15–24. [Google Scholar] [CrossRef] [PubMed]
  93. Niks, D.; Hille, R. Molybdenum- and tungsten-containing formate dehydrogenases and formylmethanofuran dehydrogenases: Structure, mechanism, and cofactor insertion. Protein Sci. 2019, 28, 111–122. [Google Scholar] [CrossRef] [PubMed]
  94. Boll, M.; Einsle, O.; Ermler, U.; Kroneck, P.M.; Ullmann, G.M. Structure and function of the unusual tungsten enzymes acetylene hydratase and class II benzoyl-coenzyme a reductase. J. Mol. Microbiol. Biotechnol. 2016, 26, 119–137. [Google Scholar] [CrossRef]
  95. Boer, J.L.; Mulrooney, S.B.; Hausinger, R.P. Nickel-dependent metalloenzymes. Arch. Biochem. Biophys. 2014, 544, 142–152. [Google Scholar] [CrossRef]
  96. Kräutler, B. Vitamin B12: Chemistry and biochemistry. Biochem. Soc. Trans. 2005, 33, 806–810. [Google Scholar] [CrossRef]
  97. Smith, A.D.; Warren, M.J.; Refsum, H. Vitamin B12. Adv. Food Nutr. Res. 2018, 83, 215–279. [Google Scholar]
  98. Takano, H.; Mise, K.; Hagiwara, K.; Hirata, N.; Watanabe, S.; Toriyabe, M.; Shiratori-Takano, H.; Ueda, K. Role and function of LitR, an adenosyl B12-bound light-sensitive regulator of bacillus megaterium QM B1551, in regulation of carotenoid production. J. Bacteriol. 2015, 197, 2301–2315. [Google Scholar] [CrossRef]
  99. Kobayashi, M.; Shimizu, S. Cobalt proteins. Eur. J. Biochem. 1999, 261, 1–9. [Google Scholar] [CrossRef] [PubMed]
  100. Zhang, Y.; Rodionov, D.A.; Gelfand, M.S.; Gladyshev, V.N. Comparative genomic analyses of nickel, cobalt and vitamin B12 utilization. BMC Genomics 2009, 10, 78. [Google Scholar] [CrossRef]
  101. Zheng, K.; Ngo, P.D.; Owens, V.L.; Yang, X.P.; Mansoorabadi, S.O. The biosynthetic pathway of coenzyme F430 in methanogenic and methanotrophic archaea. Science 2016, 354, 339–342. [Google Scholar] [CrossRef]
  102. Mulrooney, S.B.; Hausinger, R.P. Nickel uptake and utilization by microorganisms. FEMS Microbiol. Rev. 2003, 27, 239–261. [Google Scholar] [CrossRef]
  103. Eitinger, T.; Suhr, J.; Moore, L.; Smith, J.A. Secondary transporters for nickel and cobalt ions: Theme and variations. BioMetals 2005, 18, 399–405. [Google Scholar] [CrossRef] [PubMed]
  104. Shelton, A.N.; Seth, E.C.; Mok, K.C.; Han, A.W.; Jackson, S.N.; Haft, D.R.; Taga, M.E. Uneven distribution of cobamide biosynthesis and dependence in bacteria predicted by comparative genomics. ISME J. 2019, 13, 789–804. [Google Scholar] [CrossRef] [PubMed]
  105. Ying, H.; Zhang, Y. Systems biology of selenium and complex disease. Biol. Trace Elem. Res. 2019, 192, 38–50. [Google Scholar] [CrossRef]
  106. Steinbrenner, H.; Speckmann, B.; Klotz, L.O. Selenoproteins: Antioxidant selenoenzymes and beyond. Arch. Biochem. Biophys. 2016, 595, 113–119. [Google Scholar] [CrossRef]
  107. Hariharan, S.; Dharmaraj, S. Selenium and selenoproteins: It’s role in regulation of inflammation. Inflammopharmacology 2020, 28, 667–695. [Google Scholar] [CrossRef]
  108. Li, M.; Huang, Y.; Xiao, Y. A method for identification of selenoprotein genes in archaeal genomes. Genom. Proteom. Bioinform. 2009, 7, 62–70. [Google Scholar] [CrossRef]
  109. Santesmasses, D.; Mariotti, M.; Guigó, R. Selenoprofiles: A computational pipeline for annotation of selenoproteins. Methods Mol. Biol. 2018, 1661, 17–28. [Google Scholar]
  110. Castellano, S.; Lobanov, A.V.; Chapple, C.; Novoselov, S.V.; Albrecht, M.; Hua, D.; Lescure, A.; Lengauer, T.; Krol, A.; Gladyshev, V.N.; et al. Diversity and functional plasticity of eukaryotic selenoproteins: Identification and characterization of the SelJ family. Proc. Natl. Acad. Sci. USA 2005, 102, 16188–16193. [Google Scholar] [CrossRef]
  111. Zhang, Y.; Gladyshev, V.N. Trends in selenium utilization in marine microbial world revealed through the analysis of the global ocean sampling (GOS) project. PLoS Genet. 2008, 4, e1000095. [Google Scholar] [CrossRef]
  112. Lin, J.; Peng, T.; Jiang, L.; Ni, J.Z.; Liu, Q.; Chen, L.; Zhang, Y. Comparative genomics reveals new candidate genes involved in selenium metabolism in prokaryotes. Genome Biol. Evol. 2015, 7, 664–676. [Google Scholar] [CrossRef]
  113. Cravedi, P.; Mori, G.; Fischer, F.; Percudani, R. Evolution of the selenoproteome in helicobacter pylori and epsilonproteobacteria. Genome Biol. Evol. 2015, 7, 2692–2704. [Google Scholar] [PubMed]
  114. Peng, T.; Lin, J.; Xu, Y.Z.; Zhang, Y. Comparative genomics reveals new evolutionary and ecological patterns of selenium utilization in bacteria. ISME J. 2016, 10, 2048–2059. [Google Scholar] [CrossRef]
  115. Miller, W.G.; Yee, E.; Lopes, B.S.; Chapman, M.H.; Huynh, S.; Bono, J.L.; Parker, C.T.; Strachan, N.J.C.; Forbes, K.J. Comparative genomic analysis identifies a campylobacter clade deficient in selenium metabolism. Genome Biol. Evol. 2017, 9, 1843–1858. [Google Scholar] [CrossRef] [PubMed]
  116. Santesmasses, D.; Mariotti, M.; Guigó, R. Computational identification of the selenocysteine tRNA (tRNASec) in genomes. PLoS Comput. Biol. 2017, 13, e1005383. [Google Scholar] [CrossRef] [PubMed]
  117. Rother, M.; Quitzke, V. Selenoprotein synthesis and regulation in Archaea. Biochim. Biophys. Acta Gen. Subj. 2018, 1862, 2451–2462. [Google Scholar] [CrossRef] [PubMed]
  118. Mariotti, M.; Lobanov, A.V.; Manta, B.; Santesmasses, D.; Bofill, A.; Guigó, R.; Gabaldón, T.; Gladyshev, V.N. Lokiarchaeota marks the transition between the archaeal and eukaryotic selenocysteine encoding systems. Mol. Biol. Evol. 2016, 33, 2441–2453. [Google Scholar] [CrossRef] [PubMed]
  119. Lobanov, A.V.; Fomenko, D.E.; Zhang, Y.; Sengupta, A.; Hatfield, D.L.; Gladyshev, V.N. Evolutionary dynamics of eukaryotic selenoproteomes: Large selenoproteomes may associate with aquatic life and small with terrestrial life. Genome Biol. 2007, 8, R198. [Google Scholar] [CrossRef]
  120. Mariotti, M.; Ridge, P.G.; Zhang, Y.; Lobanov, A.V.; Pringle, T.H.; Guigo, R.; Hatfield, D.L.; Gladyshev, V.N. Composition and evolution of the vertebrate and mammalian selenoproteomes. PLoS ONE 2012, 7, e33066. [Google Scholar] [CrossRef]
  121. Jiang, L.; Ni, J.; Liu, Q. Evolution of selenoproteins in the metazoan. BMC Genomics 2012, 13, 446. [Google Scholar] [CrossRef]
  122. Liang, H.; Wei, T.; Xu, Y.; Li, L.; Kumar Sahu, S.; Wang, H.; Li, H.; Fu, X.; Zhang, G.; Melkonian, M.; et al. Phylogenomics provides new insights into gains and losses of selenoproteins among archaeplastida. Int. J. Mol. Sci. 2019, 20, 3020. [Google Scholar] [CrossRef]
  123. Mariotti, M.; Santesmasses, D.; Capella-Gutierrez, S.; Mateo, A.; Arnan, C.; Johnson, R.; D’Aniello, S.; Yim, S.H.; Gladyshev, V.N.; Serras, F.; et al. Evolution of selenophosphate synthetases: Emergence and relocation of function through independent duplications and recurrent subfunctionalization. Genome Res. 2015, 25, 1256–1267. [Google Scholar] [CrossRef] [PubMed]
  124. Sarangi, G.K.; Romagné, F.; Castellano, S. Distinct patterns of selection in selenium-dependent genes between land and aquatic vertebrates. Mol. Biol. Evol. 2018, 35, 1744–1756. [Google Scholar] [CrossRef] [PubMed]
  125. Gobler, C.J.; Berry, D.L.; Dyhrman, S.T.; Wilhelm, S.W.; Salamov, A.; Lobanov, A.V.; Zhang, Y.; Collier, J.L.; Wurch, L.L.; Kustka, A.B.; et al. Niche of harmful alga Aureococcus anophagefferens revealed through ecogenomics. Proc. Natl. Acad. Sci. USA 2011, 108, 4352–4357. [Google Scholar] [CrossRef]
  126. Lobanov, A.V.; Hatfield, D.L.; Gladyshev, V.N. Reduced reliance on the trace element selenium during evolution of mammals. Genome Biol. 2008, 9, R62. [Google Scholar] [CrossRef]
  127. Baclaocos, J.; Santesmasses, D.; Mariotti, M.; Bierła, K.; Vetick, M.B.; Lynch, S.; McAllen, R.; Mackrill, J.J.; Loughran, G.; Guigó, R.; et al. Processive recoding and metazoan evolution of selenoprotein P: Up to 132 UGAs in molluscs. J. Mol. Biol. 2019, 431, 4381–4407. [Google Scholar] [CrossRef]
  128. Mariotti, M.; Salinas, G.; Gabaldón, T.; Gladyshev, V.N. Utilization of selenocysteine in early-branching fungal phyla. Nat. Microbiol. 2019, 4, 759–765. [Google Scholar] [CrossRef]
  129. Khrustalev, V.V.; Barkovsky, E.V.; Khrustaleva, T.A. Magnesium and manganese binding sites on proteins have the same predominant motif of secondary structure. J. Theor. Biol. 2016, 395, 174–185. [Google Scholar] [CrossRef]
  130. Vincent, J.B. The biochemistry of chromium. J. Nutr. 2000, 130, 715–718. [Google Scholar] [CrossRef]
  131. Ueki, T.; Adachi, T.; Kawano, S.; Aoshima, M.; Yamaguchi, N.; Kanamori, K.; Michibata, H. Vanadium-binding proteins (vanabins) from a vanadium-rich ascidian Ascidia sydneiensis samea. Biochim. Biophys. Acta 2003, 1626, 43–50. [Google Scholar] [CrossRef]

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.