Next Article in Journal
Regulatory Roles of Non-Coding RNAs in Colorectal Cancer
Next Article in Special Issue
An Overview of Predictors for Intrinsically Disordered Proteins over 2010–2014
Previous Article in Journal
Curcumol Inhibits Growth and Induces Apoptosis of Colorectal Cancer LoVo Cell Line via IGF-1R and p38 MAPK Pathway
Previous Article in Special Issue
In Silico Analysis of Correlations between Protein Disorder and Post-Translational Modifications in Algae
Article

Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets

1
School of Computer Science, University College Dublin, Belfield, Dublin 4, Ireland
2
Adaptive and Complex Systems Laboratory, University College Dublin, Belfield, Dublin 4, Ireland
*
Author to whom correspondence should be addressed.
Academic Editors: Lukasz Kurgan and Vladimir N. Uversky
Int. J. Mol. Sci. 2015, 16(8), 19868-19885; https://doi.org/10.3390/ijms160819868
Received: 1 June 2015 / Revised: 28 July 2015 / Accepted: 29 July 2015 / Published: 21 August 2015
Intrinsically-disordered regions lack a well-defined 3D structure, but play key roles in determining the function of many proteins. Although predictors of disorder have been shown to achieve relatively high rates of correct classification of these segments, improvements over the the years have been slow, and accurate methods are needed that are capable of accommodating the ever-increasing amount of structurally-determined protein sequences to try to boost predictive performances. In this paper, we propose a predictor for short disordered regions based on bidirectional recurrent neural networks and tested by rigorous five-fold cross-validation on a large, non-redundant dataset collected from MobiDB, a new comprehensive source of protein disorder annotations. The system exploits sequence and structural information in the forms of frequency profiles, predicted secondary structure and solvent accessibility and direct disorder annotations from homologous protein structures (templates) deposited in the Protein Data Bank. The contributions of sequence, structure and homology information result in large improvements in predictive accuracy. Additionally, the large scale of the training set leads to low false positive rates, making our systems a robust and efficient way to address high-throughput disorder prediction. View Full-Text
Keywords: protein disorder; BRNN; MobiDB; PDB protein disorder; BRNN; MobiDB; PDB
Show Figures

Figure 1

MDPI and ACS Style

Volpato, V.; Alshomrani, B.; Pollastri, G. Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets. Int. J. Mol. Sci. 2015, 16, 19868-19885. https://doi.org/10.3390/ijms160819868

AMA Style

Volpato V, Alshomrani B, Pollastri G. Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets. International Journal of Molecular Sciences. 2015; 16(8):19868-19885. https://doi.org/10.3390/ijms160819868

Chicago/Turabian Style

Volpato, Viola, Badr Alshomrani, and Gianluca Pollastri. 2015. "Accurate Ab Initio and Template-Based Prediction of Short Intrinsically-Disordered Regions by Bidirectional Recurrent Neural Networks Trained on Large-Scale Datasets" International Journal of Molecular Sciences 16, no. 8: 19868-19885. https://doi.org/10.3390/ijms160819868

Find Other Styles

Article Access Map by Country/Region

1
Only visits after 24 November 2015 are recorded.
Back to TopTop