Next Article in Journal / Special Issue
Binding Ligand Prediction for Proteins Using Partial Matching of Local Surface Patches
Previous Article in Journal
Profile and Functional Properties of Seed Proteins from Six Pea (Pisum sativum) Genotypes
Previous Article in Special Issue
Screening and Initial Binding Assessment of Fumonisin B1 Aptamers
Article Menu

Export Article

Open AccessArticle
Int. J. Mol. Sci. 2010, 11(12), 4991-5008; doi:10.3390/ijms11124991

Computational Prediction of O-linked Glycosylation Sites that Preferentially Map on Intrinsically Disordered Regions of Extracellular Proteins

1
College of Information Science and Engineering, Ritsumeikan University/Noji-higashi 1-1-1, Kusatsu, Shiga 525-8577, Japan
2
College of Life Sciences, Ritsumeikan University/Noji-higashi 1-1-1, Kusatsu, Shiga 525-8577, Japan
3
Center for Information Biology & DNA Data Bank of Japan, National Institute of Genetics/Yata 1111, Mishima, Shizuoka 411-8540, Japan
4
Department of Bioinformatics, Maebashi Institute of Technology/Kamisadori 460-1, Maebashi, Gunma 371-0816, Japan
*
Author to whom correspondence should be addressed.
Received: 28 October 2010 / Revised: 17 November 2010 / Accepted: 30 November 2010 / Published: 3 December 2010
(This article belongs to the Special Issue Advances in Molecular Recognition)
View Full-Text   |   Download PDF [458 KB, uploaded 19 June 2014]   |  

Abstract

O-glycosylation of mammalian proteins is one of the important posttranslational modifications. We applied a support vector machine (SVM) to predict whether Ser or Thr is glycosylated, in order to elucidate the O-glycosylation mechanism. O-glycosylated sites were often found clustered along the sequence, whereas other sites were located sporadically. Therefore, we developed two types of SVMs for predicting clustered and isolated sites separately. We found that the amino acid composition was effective for predicting the clustered type, whereas the site-specific algorithm was effective for the isolated type. The highest prediction accuracy for the clustered type was 74%, while that for the isolated type was 79%. The existence frequency of amino acids around the O-glycosylation sites was different in the two types: namely, Pro, Val and Ala had high existence probabilities at each specific position relative to a glycosylation site, especially for the isolated type. Independent component analyses for the amino acid sequences around O-glycosylation sites showed the position-specific existences of the identified amino acids as independent components. The O-glycosylation sites were preferentially located within intrinsically disordered regions of extracellular proteins: particularly, more than 90% of the clustered O-GalNAc glycosylation sites were observed in intrinsically disordered regions. This feature could be the key for understanding the non-conservation property of O-glycosylation, and its role in functional diversity and structural stability.
Keywords: protein O-glycosylation; mucin-type; posttranslational modification; support vector machine; clustered and isolated glycosylation sites; intrinsically disordered; extracellular protein; non-conservation property protein O-glycosylation; mucin-type; posttranslational modification; support vector machine; clustered and isolated glycosylation sites; intrinsically disordered; extracellular protein; non-conservation property
Figures

This is an open access article distributed under the Creative Commons Attribution License (CC BY 3.0).

Supplementary material

Scifeed alert for new publications

Never miss any articles matching your research from any publisher
  • Get alerts for new papers matching your research
  • Find out the new papers from selected authors
  • Updated daily for 49'000+ journals and 6000+ publishers
  • Define your Scifeed now

SciFeed Share & Cite This Article

MDPI and ACS Style

Nishikawa, I.; Nakajima, Y.; Ito, M.; Fukuchi, S.; Homma, K.; Nishikawa, K. Computational Prediction of O-linked Glycosylation Sites that Preferentially Map on Intrinsically Disordered Regions of Extracellular Proteins. Int. J. Mol. Sci. 2010, 11, 4991-5008.

Show more citation formats Show less citations formats

Related Articles

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Int. J. Mol. Sci. EISSN 1422-0067 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top