Next Article in Journal
Regulatory Roles of Non-Coding RNAs in Colorectal Cancer
Next Article in Special Issue
An Overview of Predictors for Intrinsically Disordered Proteins over 2010–2014
Previous Article in Journal
Curcumol Inhibits Growth and Induces Apoptosis of Colorectal Cancer LoVo Cell Line via IGF-1R and p38 MAPK Pathway
Previous Article in Special Issue
In Silico Analysis of Correlations between Protein Disorder and Post-Translational Modifications in Algae
Article Menu
Issue 8 (August) cover image

Export Article

Open AccessArticle
Int. J. Mol. Sci. 2015, 16(8), 19868-19885; doi:10.3390/ijms160819868

Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets

1
School of Computer Science, University College Dublin, Belfield, Dublin 4, Ireland
2
Adaptive and Complex Systems Laboratory, University College Dublin, Belfield, Dublin 4, Ireland
*
Author to whom correspondence should be addressed.
Academic Editors: Lukasz Kurgan and Vladimir N. Uversky
Received: 1 June 2015 / Revised: 28 July 2015 / Accepted: 29 July 2015 / Published: 21 August 2015
View Full-Text   |   Download PDF [503 KB, uploaded 21 August 2015]   |  

Abstract

Intrinsically-disordered regions lack a well-defined 3D structure, but play key roles in determining the function of many proteins. Although predictors of disorder have been shown to achieve relatively high rates of correct classification of these segments, improvements over the the years have been slow, and accurate methods are needed that are capable of accommodating the ever-increasing amount of structurally-determined protein sequences to try to boost predictive performances. In this paper, we propose a predictor for short disordered regions based on bidirectional recurrent neural networks and tested by rigorous five-fold cross-validation on a large, non-redundant dataset collected from MobiDB, a new comprehensive source of protein disorder annotations. The system exploits sequence and structural information in the forms of frequency profiles, predicted secondary structure and solvent accessibility and direct disorder annotations from homologous protein structures (templates) deposited in the Protein Data Bank. The contributions of sequence, structure and homology information result in large improvements in predictive accuracy. Additionally, the large scale of the training set leads to low false positive rates, making our systems a robust and efficient way to address high-throughput disorder prediction. View Full-Text
Keywords: protein disorder; BRNN; MobiDB; PDB protein disorder; BRNN; MobiDB; PDB
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. (CC BY 4.0).

Scifeed alert for new publications

Never miss any articles matching your research from any publisher
  • Get alerts for new papers matching your research
  • Find out the new papers from selected authors
  • Updated daily for 49'000+ journals and 6000+ publishers
  • Define your Scifeed now

SciFeed Share & Cite This Article

MDPI and ACS Style

Volpato, V.; Alshomrani, B.; Pollastri, G. Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets. Int. J. Mol. Sci. 2015, 16, 19868-19885.

Show more citation formats Show less citations formats

Related Articles

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Int. J. Mol. Sci. EISSN 1422-0067 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top