Functional Annotations of Paralogs: A Blessing and a Curse
AbstractGene duplication followed by mutation is a classic mechanism of neofunctionalization, producing gene families with functional diversity. In some cases, a single point mutation is sufficient to change the substrate specificity and/or the chemistry performed by an enzyme, making it difficult to accurately separate enzymes with identical functions from homologs with different functions. Because sequence similarity is often used as a basis for assigning functional annotations to genes, non-isofunctional gene families pose a great challenge for genome annotation pipelines. Here we describe how integrating evolutionary and functional information such as genome context, phylogeny, metabolic reconstruction and signature motifs may be required to correctly annotate multifunctional families. These integrative analyses can also lead to the discovery of novel gene functions, as hints from specific subgroups can guide the functional characterization of other members of the family. We demonstrate how careful manual curation processes using comparative genomics can disambiguate subgroups within large multifunctional families and discover their functions. We present the COG0720 protein family as a case study. We also discuss strategies to automate this process to improve the accuracy of genome functional annotation pipelines. View Full-Text
- Supplementary File 1:
ZIP-Document (ZIP, 509 KB)
Share & Cite This Article
Zallot, R.; Harrison, K.J.; Kolaczkowski, B.; de Crécy-Lagard, V. Functional Annotations of Paralogs: A Blessing and a Curse. Life 2016, 6, 39.
Zallot R, Harrison KJ, Kolaczkowski B, de Crécy-Lagard V. Functional Annotations of Paralogs: A Blessing and a Curse. Life. 2016; 6(3):39.Chicago/Turabian Style
Zallot, Rémi; Harrison, Katherine J.; Kolaczkowski, Bryan; de Crécy-Lagard, Valérie. 2016. "Functional Annotations of Paralogs: A Blessing and a Curse." Life 6, no. 3: 39.
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.