Next Article in Journal
A Splice Intervention Therapy for Autosomal Recessive Juvenile Parkinson’s Disease Arising from Parkin Mutations
Next Article in Special Issue
Ligand-Receptor Interactions and Machine Learning in GCGR and GLP-1R Drug Discovery
Previous Article in Journal
Structural Diversity and Dynamics of Human Three-Finger Proteins Acting on Nicotinic Acetylcholine Receptors
Article

SPOTONE: Hot Spots on Protein Complexes with Extremely Randomized Trees via Sequence-Only Features

1
CNC—Center for Neuroscience and Cell Biology, University of Coimbra, 3004-504 Coimbra, Portugal
2
Department of Life Sciences, Center for Neuroscience and Cell Biology, Coimbra University, 3000-456 Coimbra, Portugal
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2020, 21(19), 7281; https://doi.org/10.3390/ijms21197281
Received: 10 August 2020 / Revised: 26 September 2020 / Accepted: 30 September 2020 / Published: 1 October 2020
(This article belongs to the Special Issue Artificial Intelligence and Machine Learning in Drug Development)
Protein Hot-Spots (HS) are experimentally determined amino acids, key to small ligand binding and tend to be structural landmarks on protein–protein interactions. As such, they were extensively approached by structure-based Machine Learning (ML) prediction methods. However, the availability of a much larger array of protein sequences in comparison to determined tree-dimensional structures indicates that a sequence-based HS predictor has the potential to be more useful for the scientific community. Herein, we present SPOTONE, a new ML predictor able to accurately classify protein HS via sequence-only features. This algorithm shows accuracy, AUROC, precision, recall and F1-score of 0.82, 0.83, 0.91, 0.82 and 0.85, respectively, on an independent testing set. The algorithm is deployed within a free-to-use webserver, only requiring the user to submit a FASTA file with one or more protein sequences. View Full-Text
Keywords: big-data; hot-spots; machine learning; protein–protein complexes; structural biology big-data; hot-spots; machine learning; protein–protein complexes; structural biology
Show Figures

Figure 1

MDPI and ACS Style

Preto, A.J.; Moreira, I.S. SPOTONE: Hot Spots on Protein Complexes with Extremely Randomized Trees via Sequence-Only Features. Int. J. Mol. Sci. 2020, 21, 7281. https://doi.org/10.3390/ijms21197281

AMA Style

Preto AJ, Moreira IS. SPOTONE: Hot Spots on Protein Complexes with Extremely Randomized Trees via Sequence-Only Features. International Journal of Molecular Sciences. 2020; 21(19):7281. https://doi.org/10.3390/ijms21197281

Chicago/Turabian Style

Preto, A. J., and Irina S. Moreira 2020. "SPOTONE: Hot Spots on Protein Complexes with Extremely Randomized Trees via Sequence-Only Features" International Journal of Molecular Sciences 21, no. 19: 7281. https://doi.org/10.3390/ijms21197281

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Back to TopTop