Next Article in Journal / Special Issue
Integrated Computational Approach for Virtual Hit Identification against Ebola Viral Proteins VP35 and VP40
Previous Article in Journal
The Control of Calcium Metabolism in Zebrafish (Danio rerio)
Previous Article in Special Issue
Effects of Mutations on Structure–Function Relationships of Matrix Metalloproteinase-1
Article Menu
Issue 11 (November) cover image

Export Article

Open AccessArticle
Int. J. Mol. Sci. 2016, 17(11), 1788;

Predicting Protein–Protein Interaction Sites Using Sequence Descriptors and Site Propensity of Neighboring Amino Acids

Institute of Biomedical Informatics, National Yang-Ming University, Taipei 112, Taiwan
Office of Information Management, National Yang-Ming University Hospital, Yilan 260, Taiwan
Author to whom correspondence should be addressed.
Academic Editor: Christo Z. Christov
Received: 8 September 2016 / Revised: 14 October 2016 / Accepted: 18 October 2016 / Published: 26 October 2016
(This article belongs to the Collection Proteins and Protein-Ligand Interactions)
Full-Text   |   PDF [513 KB, uploaded 26 October 2016]   |  


Information about the interface sites of Protein–Protein Interactions (PPIs) is useful for many biological research works. However, despite the advancement of experimental techniques, the identification of PPI sites still remains as a challenging task. Using a statistical learning technique, we proposed a computational tool for predicting PPI interaction sites. As an alternative to similar approaches requiring structural information, the proposed method takes all of the input from protein sequences. In addition to typical sequence features, our method takes into consideration that interaction sites are not randomly distributed over the protein sequence. We characterized this positional preference using protein complexes with known structures, proposed a numerical index to estimate the propensity and then incorporated the index into a learning system. The resulting predictor, without using structural information, yields an area under the ROC curve (AUC) of 0.675, recall of 0.597, precision of 0.311 and accuracy of 0.583 on a ten-fold cross-validation experiment. This performance is comparable to the previous approach in which structural information was used. Upon introducing the B-factor data to our predictor, we demonstrated that the AUC can be further improved to 0.750. The tool is accessible at View Full-Text
Keywords: Protein–Protein Interaction; intrinsically-disorder protein; machine learning algorithms Protein–Protein Interaction; intrinsically-disorder protein; machine learning algorithms

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. (CC BY 4.0).

Share & Cite This Article

MDPI and ACS Style

Kuo, T.-H.; Li, K.-B. Predicting Protein–Protein Interaction Sites Using Sequence Descriptors and Site Propensity of Neighboring Amino Acids. Int. J. Mol. Sci. 2016, 17, 1788.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics



[Return to top]
Int. J. Mol. Sci. EISSN 1422-0067 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top