Int. J. Mol. Sci. 2011, 12(12), 8347-8361; doi:10.3390/ijms12128347

Prediction of Lysine Ubiquitylation with Ensemble Classifier and Feature Selection

1 College of Life Science, Northeast Normal University, 5268 Renmin Street, Changchun 130024, China
2 College of Computer Science, Northeast Normal University, 2555 Jingyue Street, Changchun 130117, China
* Authors to whom correspondence should be addressed.
Received: 27 July 2011; in revised form: 14 November 2011 / Accepted: 15 November 2011 / Published: 28 November 2011
(This article belongs to the Section Biochemistry, Molecular Biology and Biophysics)
Abstract: Ubiquitylation is an important post-translational modification. Correctly identifying protein lysine ubiquitylation sites is fundamental to understanding the molecular mechanism of lysine ubiquitylation in biological systems. This paper develops a novel computational method, based on an ensemble approach, to identify lysine ubiquitylation sites. In the proposed method, 468 ubiquitylation sites from 323 proteins retrieved from the Swiss-Prot database were encoded into feature vectors using four kinds of protein sequence information. An effective feature selection method was then applied to extract informative feature subsets. Different feature subsets, obtained by setting different starting points in the search procedure, were used to train multiple random forest classifiers, which were then aggregated into a consensus classifier by majority voting. Evaluated by jackknife tests on the training dataset and independent tests on the test dataset, the proposed predictor reached accuracies of 76.82% and 79.16%, respectively, indicating that it is a useful tool for predicting lysine ubiquitylation sites. Furthermore, site-specific feature analysis showed that ubiquitylation is closely correlated with the features of the surrounding sites, in addition to features derived from the lysine site itself. The feature selection method is available upon request.
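
The voting and evaluation scheme described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: a toy nearest-centroid learner stands in for the random forest classifiers, the feature subsets are hand-picked rather than produced by the paper's feature selection method, and the data values and all function names are invented for illustration. It shows only the general idea of training one base classifier per feature subset, combining predictions by majority vote, and estimating accuracy with a jackknife (leave-one-out) test.

```python
from collections import Counter


def fit_centroids(X, y, subset):
    """Toy stand-in base learner (NOT a random forest):
    compute the per-class mean over the selected feature indices."""
    sums, counts = {}, Counter(y)
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(subset))
        for k, j in enumerate(subset):
            acc[k] += row[j]
    return {label: [v / counts[label] for v in acc] for label, acc in sums.items()}


def predict_centroid(centroids, row, subset):
    """Assign the label of the closest class centroid (squared Euclidean distance)."""
    def dist(centroid):
        return sum((row[j] - centroid[k]) ** 2 for k, j in enumerate(subset))
    return min(sorted(centroids), key=lambda label: dist(centroids[label]))


def majority_vote(votes):
    """Return the most frequent label; ties go to the smallest label."""
    counts = Counter(votes)
    best = max(counts.values())
    return min(label for label, n in counts.items() if n == best)


def ensemble_predict(X_train, y_train, row, subsets):
    """Train one base learner per feature subset and combine them by majority vote."""
    votes = []
    for subset in subsets:
        model = fit_centroids(X_train, y_train, subset)
        votes.append(predict_centroid(model, row, subset))
    return majority_vote(votes)


def jackknife_accuracy(X, y, subsets):
    """Leave-one-out test: hold out each sample, train on the rest, predict it."""
    correct = 0
    for i in range(len(X)):
        X_train = X[:i] + X[i + 1:]
        y_train = y[:i] + y[i + 1:]
        if ensemble_predict(X_train, y_train, X[i], subsets) == y[i]:
            correct += 1
    return correct / len(X)


# Invented toy data: 1 = ubiquitylated site, 0 = non-ubiquitylated site.
X = [
    [0.10, 0.20, 0.90],
    [0.20, 0.10, 0.80],
    [0.15, 0.25, 0.85],
    [0.90, 0.80, 0.10],
    [0.80, 0.90, 0.20],
    [0.85, 0.75, 0.15],
]
y = [0, 0, 0, 1, 1, 1]

# Hypothetical feature subsets, one per base classifier.
subsets = [[0, 1], [1, 2], [0, 2]]

accuracy = jackknife_accuracy(X, y, subsets)
print(f"jackknife accuracy: {accuracy:.2%}")
```

Using an odd number of base classifiers, as here, avoids tied votes in a two-class problem; the tie-breaking rule in `majority_vote` is an arbitrary choice made only to keep the sketch deterministic.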
Keywords: ubiquitylation; ensemble classifier; support vector machine; lysine ubiquitylation sites

Cite This Article

MDPI and ACS Style

Zhao, X.; Li, X.; Ma, Z.; Yin, M. Prediction of Lysine Ubiquitylation with Ensemble Classifier and Feature Selection. Int. J. Mol. Sci. 2011, 12, 8347-8361.

AMA Style

Zhao X, Li X, Ma Z, Yin M. Prediction of Lysine Ubiquitylation with Ensemble Classifier and Feature Selection. International Journal of Molecular Sciences. 2011; 12(12):8347-8361.

Chicago/Turabian Style

Zhao, Xiaowei; Li, Xiangtao; Ma, Zhiqiang; Yin, Minghao. 2011. "Prediction of Lysine Ubiquitylation with Ensemble Classifier and Feature Selection." Int. J. Mol. Sci. 12, no. 12: 8347-8361.

Int. J. Mol. Sci. EISSN 1422-0067. Published by MDPI AG, Basel, Switzerland.