Next Article in Journal
Cytogenetically Elusive Sex Chromosomes in Scincoidean Lizards
Previous Article in Journal
Genome-Wide Identification and Transcriptional Expression Analysis of Annexin Genes in Capsicum annuum and Characterization of CaAnn9 in Salt Tolerance
Previous Article in Special Issue
Evolution as a Guide to Designing xeno Amino Acid Alphabets
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Editorial

Protein Structure Analysis and Prediction with Statistical Scoring Functions

Department of Biosciences, University of Salzburg, 5020 Salzburg, Austria
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2021, 22(16), 8665; https://doi.org/10.3390/ijms22168665
Submission received: 28 July 2021 / Accepted: 4 August 2021 / Published: 12 August 2021
The PDB database provides more than 150,000 entries for biological macromolecular structures. The vast majority of the entries comprise proteins. Thus, we can resort to a large dataset that encodes information about sequence–structure–function relationships. Many bioinformatics approaches take advantage of this information and utilize it for a wealth of basic biological, biochemical, and biophysical problems. A well-established key approach is the statistical analysis of experimentally resolved structures for the subsequent derivation of statistical scoring functions (SSFs, also referred to as statistical energy functions, knowledge-based potentials, or mean force potentials). Such SSFs are employed in numerous bioinformatics methods, e.g., for the assessment of experimentally determined structures or in the prediction of 3D protein structures, protein–protein interactions, protein–ligand interactions, protein stability, etc.
The original purpose of what is now termed “statistical scoring function” was the description of molecular interactions. Different authors established energy-like functions depending on the distance between two atoms, dihedral angles, density of neighboring atoms (contacts), or orientation and distance. Meanwhile, SSFs are viewed in more general ways by certain authors, which is also reflected in distinct contributions to this Special Issue.
Georgoulia and Bjelic investigated the prediction of protein–protein binding interactions in dimeric coiled coils. For this purpose, they applied Rosetta, a versatile set of computational methods for molecular modeling and analysis developed at David Baker’s lab. The heart of Rosetta are molecular scoring functions that combine physics-based terms such as Van der Waals interactions with statistics-based terms such as a distance-dependent atom pair potential.
Laimer and Lackner contributed a method named MHCII3D that concentrates on the structure-based prediction of peptides that bind to MHC II molecules. Biologically, such an interaction is part of natural antigen recognition and triggers immune responses. This process plays also a key role in diseases such as allergies and cancer. MHCII3D employs comparative modeling and a distance-dependent SSF for the prediction of potential binding peptides. In contrast to competitor methods, it does not employ machine learning and is therefore not constrained by learning data from wet lab binding experiments.
Poot Velez et al. describe a new way of representing protein structure and apply it to the prediction of protein–protein interactions (PPIs). They map the atomic three-dimensional structures of proteins participating in protein–protein interactions to compact representations based on 26 residue cluster classes (RCCs). Variants of combining the individual RCCs of binding partners are then analyzed with respect to their ability to correctly classify PPIs with machine learning methods.
Alshehri et al. present the first large-scale characterization of intrinsically disordered proteins in the proteome of the genus Camelus. Here, SSFs are used (in addition to other methods) to predict disordered proteins/protein regions in Camelus species and Homo sapiens. Those proteins are then functionally annotated and systematically compared with respect to their degree of disorder and their biological roles. The authors highlight several functional categories that are overrepresented among Camelus disordered proteins when compared to the human proteome.
The applicability of SSFs derived from known protein structures is naturally challenged by so-called xenoproteins, which are built from amino acids not part of life’s standard alphabet. Mayer-Bacon et al. review empirical work on the incorporation of non-canonical amino acids into proteins and juxtapose these findings with more theoretical considerations about the properties of amino acids from an evolutionary perspective. The review stimulates rethinking of many of the assumptions underlying current prediction models.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Lackner, P.; Wiederstein, M. Protein Structure Analysis and Prediction with Statistical Scoring Functions. Int. J. Mol. Sci. 2021, 22, 8665. https://doi.org/10.3390/ijms22168665

AMA Style

Lackner P, Wiederstein M. Protein Structure Analysis and Prediction with Statistical Scoring Functions. International Journal of Molecular Sciences. 2021; 22(16):8665. https://doi.org/10.3390/ijms22168665

Chicago/Turabian Style

Lackner, Peter, and Markus Wiederstein. 2021. "Protein Structure Analysis and Prediction with Statistical Scoring Functions" International Journal of Molecular Sciences 22, no. 16: 8665. https://doi.org/10.3390/ijms22168665

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop