Next Article in Journal
Backstepping Adaptive Neural Network Control for Electric Braking Systems of Aircrafts
Previous Article in Journal
Multimodal Dynamic Journey-Planning
Open AccessArticle

Exploiting Sparse Statistics for a Sequence-Based Prediction of the Effect of Mutations

Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
Algorithms 2019, 12(10), 214;
Received: 23 September 2019 / Revised: 6 October 2019 / Accepted: 12 October 2019 / Published: 14 October 2019
Recent work showed that there is a significant difference between the statistics of amino acid triplets and quadruplets in sequences of folded proteins and randomly generated sequences. These statistics were used to assign a score to each sequence and make a prediction whether a sequence is likely to fold. The present paper extends the statistics to higher multiplets and suggests a way to handle the treatment of multiplets that were not found in the set of folded proteins. In particular, foldability predictions were done along the line of the previous work using pentuplet statistics and a way was found to combine the quadruplet and pentuplets statistics to improve the foldability predictions. A different, simpler, score was defined for hextuplets and heptuplets and were used to predict the direction of stability change of a protein upon mutation. With the best score combination the accuracy of the prediction was 73.4%. View Full-Text
Keywords: protein sequence; foldability; mutation; residue n-tuple protein sequence; foldability; mutation; residue n-tuple
Show Figures

Figure 1

MDPI and ACS Style

Mezei, M. Exploiting Sparse Statistics for a Sequence-Based Prediction of the Effect of Mutations. Algorithms 2019, 12, 214.

Show more citation formats Show less citations formats
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

Back to TopTop