Next Article in Journal
Assessment of Agrimonia eupatoria L. and Lipophosphonoxin (DR-6180) Combination for Wound Repair: Bridging the Gap Between Phytomedicine and Organic Chemistry
Previous Article in Journal
Effects of Early Stress Exposure on Anxiety-like Behavior and MORC1 Expression in Rats
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Structural Proteomics Exploration of Synphilin-1 and Alpha-Synuclein Interaction in Pathogenesis of Parkinson’s Disease

1
Department of Applied Sciences, Indian Institute of Information Technology Allahabad, Prayagraj 211015, India
2
Department of Biotechnology, Nagaland University, Kohima 797004, India
3
Department of Medicine, Section of Rheumatology and Gwen Knapp Center for Lupus and Immunology Research, University of Chicago, Chicago, IL 60637, USA
4
Faculty of Engineering & Technology, United University Prayagraj, Prayagraj 211012, India
*
Authors to whom correspondence should be addressed.
Biomolecules 2024, 14(12), 1588; https://doi.org/10.3390/biom14121588
Submission received: 24 October 2024 / Revised: 14 November 2024 / Accepted: 15 November 2024 / Published: 12 December 2024
(This article belongs to the Section Bioinformatics and Systems Biology)

Abstract

:
Pathological significance of interaction of Synphilin-1 with mutated alpha-synuclein is well known to have serious consequences in causing the formation of inclusion bodies that are linked to Parkinson’s disease (PD). Information extracted so far pointed out that specific mutations, A53T, A30P, and E46K, in alpha-synuclein promote such interactions. However, a detailed structural study of this interaction is pending due to the unavailability of the complete structures of the large protein Synphilin-1 of chain length 919 residues and the mutated alpha-synuclein having all the reported specific mutations so far. In this study, a semi-automatic pipeline-based meta-predictor, AlphaLarge, is created to predict high-fidelity structures of large proteins like Synphilin-1 given the limitations of the existing protocols. AlphaLarge recruits a novel augmented AlphaFold model that uses a divide and conquer based strategy on the foundation of a self-sourced template dataset to choose the best structure model through their standard validations. The structure models were re-validated by a Protein Mediated Interaction Analysis (PMIA) formalism that uses the existing structurally relevant information of these proteins. For the training dataset, the new method, AlphaLarge, performed reasonably better than AlphaFold. Also, the new residue- and domain-based structural details of interactions of resultant best structure models of Synphilin-1 and both wild and mutated alpha-synuclein are extracted using PMIA. This result paves the way for better screening of target specific drugs to control the progression of PD, in particular, and research on any kind of pathophysiology involving large proteins of unknown structures, in general.

Graphical Abstract

1. Introduction

It is well known that the substantially below normal level of dopamine is linked to cognitive impairment as well as abnormal motor activities causing tremors and rigidity [1]. Consequently, the pathological manifestation of Parkinson’s disease (PD) mainly happens due to the progressive degradation of the dopaminergic neurons that is mainly represented through the manifestation of Lewy Bodies (LB), located in the brains of PD patients as fibrillar aggregates [2]. The complexity of the pathology related to PD is yet to be fully explored. However, current research indicates that there are three main triggering points to cause LB aggregates within brain. Among them is the report on the mutation in the SNCA gene that codes for the protein, alpha-synuclein (a-Syn), also located in the presynaptic neurons [3]. It is known that overexpression of a-Syn has a toxic effect and causes neuronal death through their aggregation [4]. The second one is the presence of another protein, Synphilin-1 (Syn-1), within LB aggregates that appears to be another suspect in causing LB aggregates [5]. Overexpression of Syn-1 is also reported to render a similar toxic effect to cause LB aggregates like the mutated a-Syn [6]. The third interesting event is the interaction of these two proteins, mutated a-Syn and Syn-1, both of which are found within LB aggregates [7]. In this regard, the role and function of Syn-1 in the pathogenesis of PD need to be studied to its logical conclusion. The experimental structure prediction method, NMR, provides partial information for structure of Synphilin-1 [8]. Elucidating the complete structure of Synphilin-1 will enhance understanding of its role in Parkinson’s disease. Although Alphafold2 [9] has predicted the structure of Synphilin-1 as deposited in the UniProt database, there appears to be sufficient scope for improvement in the available predicted structure of the target protein. Also, the first part of our study shows that the Alphafold2-modeled structure fails to show high binding affinity with the most appropriate ligands.
Among the many causes responsible for the pathological manifestation of PD, we have focused our study to examine the effect of the simultaneous occurrence of a specific combination of mutations, A53T, A30P, and E46K in a-Syn, and its effect in the interaction of Syn-1 with mutated a-Syn leading to the formation of Lewy Bodies (LB)—an important cause of PD. As discussed above, the primary concern of this work is to explore the detailed structural aspects of interaction of Syn-1 with mutated a-Syn to find the specific binding sites and, subsequently, to help in finding preventive therapeutics through design of target specific drug. In this regard, the main bottlenecks in this study are found to be the unavailability of the experimentally determined structures of the large protein, Syn-1, which is of size 919 residues and the a-Syn having all the reported mutations, e.g., A53T, A30P, and E46K [10,11,12]. For any structural studies of a protein–protein interaction, the availability of highly authentic structures is considered to be a good starting point, whereas, acceptability of the predicted structures by earlier structure predictor models was questionable before the advent of the Google Colab [13] designed AlphaFold, which showed remarkably good performance leading to further use of structural models predicted by it. Alphafold2 almost exactly solves one of the major problems i.e., protein three-dimensional structure prediction of whole human proteome, and, thus, gained more prominence than other structure–prediction methods [14]. However, it is observed that, like other earlier predictors, AlphaFold2 also suffers from some limitations, especially in predicting structures of large proteins [15,16,17,18]. As reported by Lee et al., 2022 [19], the major limitation of deep learning-based methods like AlphaFold and RoseTTAFold is in predicting the intricate multi-domain large protein structure, primarily the domain interacting region. They also pointed out that the performance of AlphaFold is highly dependent on the quality and diversity of the training data. Therefore, unavailability of a significant number of high-quality structural data for large size protein structure in the training dataset may lead to the inaccuracies in the predicted structure. Additionally, the prediction accuracy of AlphaFold2 can be compromised for proteins with shallow-depth multiple sequence alignments (MSA), i.e., those with fewer homologous sequences or low sequence similarity [9]. Therefore, in this work, a semi-automatic pipeline is proposed to design a meta-predictor, AlphaLarge, that is basically an augmented version of AlphaFold. AlphaLarge works on a divide and conquer strategy in which the structures of overlapping small fragments of target sequence are first predicted by AlphaFold2 for their reuse as templates within the Homology Model [20] application to substantially enhance prediction accuracy in comparison to the direct application of AlphaFold2.
Studies on protein–protein and protein–ligand interactions are always of considerable interest because of their potential in revealing pathophysiological mechanism of a disease process [21,22,23,24]. In this work, information on these interactions are extracted to fulfill two different objectives. First off, the already known information about the interaction of a ligand or protein with the target protein is utilized to select the best structure models from the output generated by the semiautomatic meta-predictor, AlphaLarge. The second objective is to study the interaction of Syn-1 with mutated a-Syn to know about the overall binding characteristics, including the binding site residues of a-Syn for controlling this interaction-mediated toxicity leading to PD. Therefore, a Protein Mediated Interaction Analysis (PMIA) Paradigm is introduced in this work to consistently bring out the crucial binding characteristics of such interactions. Under the framework of PMIA, an interaction is characterized by its binding energy, binding site residues participating in this interaction (for protein–protein interaction), or binding site residues of the target protein bonding with the ligands (for protein–ligand interaction) and, the bonds, formation of which causes this interaction. Focusing on the main objective of study on the structural details of the Syn-1 and a-Syn interaction, this work emphasized the development of an analytical formalism for this study combining the potential of AlphaLarge with PMIA. The new and specific binding site information extracted through this work is found to be consistent with the experimental reports with the potential to design better therapeutic strategy against the progress of PD.

2. Materials and Methods

This work supports building an analytical formalism to study the specific structural details of the interaction between Syn-1 and mutated a-syn. Following this workflow, this section is divided into three sub-sections. The first sub-section deals with the formalism developed to analyze the specific structural details of the binding sites of Syn-1 and a-Syn participating in their interaction considering the availability of their respective structures. The features, e.g., binding energy, participating residues or atoms, and the interaction bonds are studied following specific rules under the framework PMIA as described in the Introduction. The second subsection deals with the building of the semi-automatic pipeline to create the meta-predictor, AlphaLarge to get highly accurate structures required for this study.

2.1. Protein Mediated Interaction Analysis (PMIA) Formalism

Once considerably accurate structure models of the target protein (TP) and its interacting partner (IP) (another protein or a ligand) are available, the following rule is used to extract specific information on their binding for two purposes. The first one is to select the best structure model output by the AlphaLarge and the second one is to extract specific structural details of binding for the interaction between Syn-1 and mutated a-syn.
Rule 1: Calculate the binding energy of interaction. In this work, it is calculated using the module Maestro of the application Schrodinger. Select the structure model that exhibits the least binding energy of interaction.
Rule 2: Re-validate the structure selected through rule 1 by first finding out the binding site residues or domains along with specific interaction information of the target protein. These interacting residues or domains should correspond well with those already reported. Otherwise, the selected structure model is discarded for any further study. The parameters utilized in PMIA formalism are depicted in Figure 1.

2.2. Building of Semi-Automatic Pipeline for Designing the Meta-Predictor AlphaLarge

2.2.1. Collection of Data for Sample Proteins and Ligands

Since this work aims to get the structure of the protein, Syn-1, a single chain protein of sequence length 919, the range of the sequence length for the protein samples collected in this work is also kept similar within the limits of 900 to 1100. To calculate a statistically acceptable sample size, S, the following analytical formula of Charan et al. (2013) [25] is utilized as shown in Equation (1):
S = Z 2 × p 1 p d 2 = 14.31
where, p is the fraction of protein following the above-mentioned criteria within the population and is obtained as p = n/N = 0.0094 for the population N = 72213 and, number of proteins following the above-mentioned criteria, n = 679, standard normal variate Z = 1.96 within a level of significance, d = 0.05.
In this work, after rounding off, the sample size was considered as S = 14 and utilized further for this work. Table 1 shows these sample proteins and their corresponding known ligands as collected from PDB.

2.2.2. Working Principle of AlphaLarge

The semiautomatic pipeline-based meta-predictor, AlphaLarge, designed in this work to get structures of single-chain proteins, is based on the steps as given below. It should be noted that prior prediction of domain from the protein sequence by current standard methods (e.g., ThreaDom, Pfam) [49,50] is avoided in this work for the paradox that domains are also a large part of the structure only and their occasional incorrect prediction [51] may have a cascading effect in cumulative error accumulation.
Step 1: If the protein size is below 400, find the structure by the Direct Application of AlphaFold2 (referred as DAA pipeline), otherwise go to Step 2. Naturally, structure models generated using the DAA pipeline are referred as DAA_models.
Step 2: If the protein size is more than 400, then divide it into overlapping segments to restrict the size of each of the fragments as 400, keeping the length of the overlapping sections of consecutive sequence fragments as 100. It also corresponds well with Rost’s criteria [52], which says that the probability of finding the homologue for a protein decreases with the increase in the protein’s sequence length. Therefore, depending on the size of the protein, the last fragment size is compensated accordingly. For a hypothetical example, for a protein of size 885, the residue count for each of the fragments will be as follows:
First fragment: 1 to 400
Second fragment: 301 to 700
Third fragment (i.e., the last fragment): 601 to 885
For details of the algorithm, the MATLAB code for fragmentation is given in the Supplementary MATLAB Code. The details on the residue counts for each of the fragments of all the training proteins and Syn-1 are given in Supplementary Table S2. Also, it is found that the small changes (with a change in the fragment size between 300 and 400 and overlapping size between 100 and 200) made in the fragmentation algorithm do not alter the structures of the predicted models used in this work.
Step 3: Predict the structures for all the fragments using AlphaFold2 Colab Notebook to make them qualify to be templates for the target protein. These fragments with their predicted structures are referred to as derived templates in this work. Figure 2 demonstrates the divide and conquer strategy used in the semiautomatic pipeline-based meta-predictor, AlphaLarge.
Step 4: The alignment of each one of the derived templates with the target protein was performed using Clustal Omega [53] of PyMod 3.0 plugin nested in PyMOL application.
Step 5: The structure of the target protein was modeled by homology modeling based plugin Modeller available in PyMOL application. Steps 2 to 5 are referred as the Derived Template Source (DTS) pipeline and the structure models generated using it is referred as DTS_models.
Step 6: Model protein structures are validated using standard validation parameters, ERRAT [54], Ramachandran score [55], G-factor [55], SRI [56], PAL–RMSD [57], and QMEAN-score [58]. For training sample proteins of known experimentally determined structures, GDT–TS [59], and TM-Score [60] are also computed.
Step 7: Model protein structures are re-validated using their binding characteristics with their corresponding ligands and matching them with the published reports following PMIA as described in Section 2.1. In this step, the following applications are used for their specific purposes:
Schrodinger–Maestro Release 2017: For the study of protein–ligand docking [61] and to calculate the binding energy (i.e., Glide’s XP docking scoring function) the following [62] are used:
PatchDock: For the study of protein–protein interaction [63]
PRODIGY: For the study of the binding energy of the interacting complex [64]
BIOVIA Discovery studio visualizer 4.5: For visualization of docking results [65]
MATLAB version R2022a: For computing proximity of binding site centre of experimental structure with that of DAA and DTS
Step 8: The best model structure is selected following the criteria depicted in rule 2 of PMIA formalism using steps 6 and 7.
A flowchart depicting the working principle of AlphaLarge is shown in Figure 3. The working principle of AlphaLarge is validated using the sample proteins given in Table 1. Subsequently, it is applied to get the structures of the proteins of the present case study, e.g., Syn-1 and a-Syn. Finally, the interaction between Syn-1 and a-Syn is studied under PMIA formalism to extract the structural details of their binding leading to their aggregates.
To get the structures of Syn-1 using AlphaLarge, first the sequence of Syn-1 was retrieved from UniProt database employing accession no. Q9Y6H5. Subsequently, this sequence is fed into AlphaLarge to get the best structure model of Syn-1. For the wildtype a-Syn, the experimentally obtained human micelle-bound protein with PDB ID 1XQ8 was used. In case of the mutated version of a-Syn, first, its sequence was retrieved from the UniProt using accession no. P37840. In the next step, this sequence is manually altered through the mutations, A30P, A53T, and E46K. Finally, the structure of a-Syn is obtained through the direct application of AlphaFold2.

3. Results

Considering the objective of this work to extract the structural mechanism of interaction between Syn-1 and a-Syn leading to toxic LB aggregate formation, the result presented in Section 3.1 shows the efficiency of the semiautomatic pipeline-based meta-predictor, AlphaLarge, to predict authentic structure models of the training protein samples for which the structures are known and therefore, validation is easier. After the potential of AlphaLarge is established, the result presented in Section 3.2 shows the standard validations of the structure models generated by AlphaLarge for both Syn-1 and mutated a-Syn with specific 3 mutations, which are not available currently by any experimental means. Finally, the result of the structural and functional analysis of interaction between Syn-1 and a-Syn is provided in the result Section 3.3.

3.1. Results of AlphaLarge Application on the Training Samples

The quality of DAA and DTS modeled structures under the AlphaLarge protocol was compared with their experimentally known structure employing standard structure validation parameters as given in step 6 of Section 2.2.2 to select the best structure model. The comparison of DTS with DAA was performed on the basis of the following two criteria for a specific validation parameter:
Win: W D T S and W D A A refers to situations where the validation parameter is found to be better for DTS and DAA structure models, respectively.
Tie: Tie refers to the situation where the absolute difference between the validation scores for DTS and DAA models are less than a predefined value, say for δ, which is fixed for a specific validation parameter as the 0.01% of its best standard value.
Table 2 clearly depicts the percentage of wins, losses, and tie cases for structure generated by DAA and DTS methods. It was found that DTS structures outperformed DAA structures in terms of standard validation parameters, Ramachandran score, G-factor, PAL_RMSD, and QMEAN whereas, DAA structures won in terms of ERRAT, GDT–TS, and TM-score. Also, as shown in Table 2 in the case of Ramachandran score, DTS structures completely win over DAA structures, which clearly indicate that the stereochemical factors of the modeled structures using derived templates are reasonably good. If the all atom RMSD (PAL_RMSD) score is taken into account, DTS structures clearly perform better than DAA structures. The PAL_RMSD score is a more reliable parameter for structure validation through comparison with experimentally obtained structure than usual RMSD score. Hence, both methods, DTS and DAA, perform competitively with each other with respect to standard validation parameters.
The best structure models are selected using the winning or tie performance of the sub-pipelines, DTS and DAA. Also, the results of PMIA formalism as shown in the latter section in Table 3 describing the corresponding bonds resolved as well as the binding energy, respectively, are used for the purpose of re-validating the structure models as well as to break the tie for final selection of a model.
Re-Validation of the DTS and DAA Structure Models Using PMIA Formalism
To re-validate the findings obtained through standard validation protocols, the interaction-based PMIA formalism was also applied for the structure models obtained through DAA and DTS pipelines. It is also useful to break the tie cases as mentioned above. In this regard, the result of the application of PMIA for the study of protein–ligand interactions for each of the sample proteins is presented in Table 3. It shows the performances of DTS and DAA modeled structures in comparison to that reported from the experiments.
While Table 3 shows that the performance of DTS is slightly better than DAA in resolving the total number of interactions, in individual cases, it appears to be quite different. It demonstrates the competitive performance of DTS and DAA sub-pipelines for individual proteins and the need of building the semiautomatic pipeline-based meta-predictor, AlphaLarge, based on the competitive performance of DTS and DAA. This finding is again found to correspond well with the comparative profile of binding energies of known interactions, BIDTS and BIDAA, obtained for individual proteins using DTS and DAA approaches, respectively, as shown in Table 3 (details of the residues are given in Supplementary Table S1).
As shown in Table 3, the data for the binding energies of known interactions, BIExperimental, BIDTS, and BIDAA, obtained for individual sample proteins using experimental, DTS, and DAA models, respectively, also shows that the performance of DTS is reasonably better than DAA in terms of two parameters, “proximity of binding site centre of experimental structure with that of DAA and DTS models” and “proximity of binding energy of experimental structure with that of DAA and DTS models” (please see supplementary Table S1 for residue and domain details of binding sites). It also indicates the competitive performance of DTS and DAA for individual proteins and the need of building the semiautomatic pipeline-based meta-predictor, AlphaLarge, based on the competitive performance of DTS and DAA.

3.2. Standard Validation of Syn-1 Model Structures

After testing the DTS and DAA pipelines of AlphaLarge, it was pertinent to examine the potential of these methods for the pathologically important protein, synphilin-1, as a case study for which no experimental or predicted structure is available. Naturally, among the standard validation parameters used in the training phase of developing AlphaLarge, whichever was found dependent on the original structure was discarded for their utilization in this case study. Table 4 summarizes the result of validation of Synphilin-1 structures obtained from DAA and DTS using ERRAT, Ramachandran score, G-factor, QMEAN Score, and % coil amount. For non-bonded atomic interactions, the comprehensive quality factor is determined by ERRAT. The result shows that DTS predicted structure is better than that predicted by DAA.
As reported in Table 4, the Ramachandran plots generated for Synphilin-1 in Figure 4 also showed that the stereochemical property of DTS predicted structure is much better than that predicted by DAA. The detailed analysis of Table 4 reveals that most of the residues of DTS structure (82.1%) are within the highly allowed (i.e., core) region in comparison to the DAA structure (32.2%). Therefore, it is quite natural that an abundance of DTS residues in other regions (i.e., disallowed, generously allowed, and allowed) will be less than the DAA structure. This observation is found to be consistent with the Ramachandran plot shown in Figure 4. Similarly, DTS structure shows G factor value (−0.15) within an allowed limit of −0.5 to 0 whereas, for DAA (−2.55), the value is much further away from this limit. Regarding QMEAN Score, the DTS value (−5.97) is much closer to its expected score (i.e., 0) in comparison to DAA (−21.82). The same trend follows for %coil and ERRAT scores where DTS structure substantially outperformed DAA structure.

3.2.1. Result of PMIA Formalism Based Re-Validation of Syn-1 Structure Using Binding Energy

Alpha-synuclein, an intrinsically disordered protein, undergoes structural transitions under specific conditions, including a notable shift from an alpha-helix to a beta-sheet conformation. In its interaction with lipid membranes (PDB ID: 1XQ8) and proteins like calmodulin (PDB ID: 2M55), alpha-synuclein adopts a stable alpha-helical structure. Therefore, for the study of this interaction, the experimentally obtained structure of a-Syn (wildtype Human micelle-bound alpha-helical conformation of full-length alpha-synuclein with PDB ID 1XQ8) was used. Finally, the analysis of binding energy of Synphilin-1 structures predicted from DTS and DAA for their binding with a-Syn is presented in Table 5 that are computed using Prodigy as given in step 7 of Section 2.2.2. It showed that the binding energy of interaction with DTS structure model is less than that with DAA, indicating marginally better interaction and stability of binding complex made with DTS structure. Together with the results produced by Table 4 and Figure 4, it shows that the structure predicted through the DTS pipeline is more acceptable than that predicted by DAA.

3.2.2. Result of PMIA Formalism Based Re-Validation of Syn-1 Structure Using Residue Level Interaction

Earlier study based on NMR spectroscopy revealed that the coiled–coil domain (349–557) was responsible for the interaction with the N-terminal region of alpha-synuclein (1–65) [8]. In this regard, to check the correspondence of this result with the Syn-1 and a-Syn, the BIOVIA Discovery studio visualizer 4.5 is utilized as described in step 7 of Section 2.2.2. In this regard, L529, T533, and R536 residues of DAA_synphilin1 coiled–coil domain are found to be involved in interaction with K32, E35, and V40 residues of alpha-synuclein, respectively, as shown in Figure 5, panel C. Furthermore, in the same panel, R536, V537, and K551 residues of DTS_synphilin1 coiled–coil domain interacted with M1, F4, and K12 residues of alpha-synuclein. Direct evidence of involvement of F4 and K12 residues of Synphilin-1 interacting with alpha-synuclein was established previously by NMR study [8]. Therefore, from the above observations, it can be concluded that the DTS_synphilin1 structure is more biologically acceptable compared to DAA_synphilin1.

3.3. Result of Structural Details of Interaction Between Best Syn-1 and a-Syn (Both Mutated and Wild Type)

From the result given in the preceding section, it is evident that PMIA formalism helps to find the best structure model of Syn-1 (i.e., the DTS model obtained using AlphaLarge meta-predictor) through the analysis of both binding energy and binding site residues. Furthermore, PMIA formalism is also applied to study the interaction of best Syn-1 structure model with a-Syn (both mutated and wild type) whereas, all these structures were obtained using the steps described in Section 2.2.2. In this second study, PMIA formalism was used both for re-validation of the DAA structure of mutated a-Syn by checking the correspondence with its reported binding characteristics and also to explore new binding information to help in developing new therapeutics. In this regard, Table 6 shows the comparative results of binding energies for the interactions of best Syn-1 structure with both wild and mutated types of a-Syn along with the PDB–RMSD score.
Also, the superimposed structures of wild and mutated a-Syn are shown in panel C of Figure 6 along with the structures of wild and mutated a-Syn in panels A and B, respectively. Visually, although the basic structural characteristics of wild and mutated a-Syn appear not to be very different, the superimposition of them made by PDB application does not appear to represent the same.
The visual representations of the structures of a-Syn, mutated a-Syn, and their alignment are shown in Figure 7 and the results of standard validation for both wild and mutated a-Syn presented in Table 7 show a slight increase in structural disorder in mutated a-Syn in comparison to its wild type.
The number of interactions resolved with the details of the participating residues for the interactions of Syn-1 with a-Syn (wild type) and Syn-1 with a-Syn (mutated) are shown in Table 8. It clearly shows stronger binding for interactions between Syn-1 and mutated a-Syn through a reasonably greater amount of bond formations by the corresponding participating pairs of residues of these proteins in comparison to that between Syn-1 and a-Syn (wild type). It also corresponds well with the binding energy profile of these two interactions as shown in Table 6. Table 8 also shows that several C-terminal residues of a-Syn (mutated) were found to be involved in altered interaction with Syn-1 (shown in bold) in the resulted complex which tallies with the previous FRET based interaction study [66].
The interactions described in Table 8 are also demonstrated in Figure 7 as shown below.
From the result of Table 8 and known information about domain regions of Syn-1, a-Syn (wild type), and a-Syn (mutated), it is evident that the central coiled coil and C-terminal domain regions of Syn-1 are taking part in interactions with a-Syn (residue no. 1 to 18) following the published result [8]. However, in case of an interaction of Syn-1 with mutated a-Syn, the interaction is mostly at the C-terminal region of Syn-1 as already reported [66] with few interactive residues found in the N terminal region.

4. Discussion

Extensive studies have been performed on mutations in the genes like SNCA, PRKN1, PRKN2, PINK1, DJ1, and LRRK2 leading to pathophysiological manifestation of Parkinson’s disease (PD) [67,68,69,70,71,72,73]. These mutated genes are known to negatively impact both autophagy and proteosome activities responsible for degradation of misfolded protein aggregates, like Lewy Bodies (LB) [74]. Among these mutations, SNCA-triggered mutations in a-syn, A30P, A53T, and E46K are considered as very important [10,11,12] for its implication in the interaction with Syn-1 leading to formation of these toxic aggregates [3]. Currently, there is no way to stop the progress of this aggregate-formation causing irreversible neuronal damage gradually with time. Under this circumstance, this work focused on the study on the interactions of Syn-1 with mutated a-Syn to extract all the structural details of these individual proteins as well as their bindings that may help in designing target specific therapeutics in future.
To accomplish the aim of this work in unearthing the structural details of interaction of Syn-1 with mutated a-Syn, the information is gradually unfolded through the series of steps as described in the Material and methods and the Result sections that are broadly followed from the logical foundation of Kumar et al. [75]. Consequently, the first attempt in this work was to find highly accurate structures of Syn-1 and mutated a-Syn with specific mutations, A30P, A53T, and E46K, using the semiautomatic pipeline-based meta-predictor, AlphaLarge, as described in Section 2.2.2. The PMIA formalism described in this work basically mines the experimentally valid information relevant to the target structures in question from existing publications and utilizes them to strengthen the choice of the structural models thereafter. There are two sub-pipelines within AlphaLarge named DTS (divide and conquer based strategy giving derived template sources) and DAA (direct application of AlphaFold2). The biological basis of the divide and conquer strategy can be drawn from the existing concept of protein structural folds [76,77]. The fragments are intentionally made to overlap to help the homology application Modeller in assigning the coordinates better by cross-checking at the overlapping zones. Also, each fragment being smaller in size, is found to be better predictable by AlphaFold. It is further confirmed by the observation that the overall MSA-depth found from AlphaFold application for the smaller fragments is deeper than that obtained for the undivided large sequence hinting about the reason for better accuracy under DTS pipeline following [14]. However, this work avoided the pre-detection of the domains since domains are a large structural part; the error in their predictions [51] may have a cascading effect to accumulate a greater amount of error afterwards. In this regard, the potential of AlphaLarge is evident from the standard validation and PMIA formalism-based validation of the output structure models of the training sample proteins as shown in Table 2. All these proteins are of a size similar to the target Syn-1. The re-validation of these structures under PMIA formalism through checking the already-reported interactions in terms of proximity of binding sites and the binding energies of these interactions are shown in Table 3, which clearly demonstrates their potential in selecting a better structure model for a specific protein. It is well accepted [78,79] that not only binding energy, but also the geometric parameters related to the binding sites of protein mediated interactions and bonds are the most important interaction parameters. This is also quite evident since interaction happens through the surface via a lock-and-key type of arrangement whereas, an interaction site that is completely different from the known one may exhibit nearly the same binding energy. This is true for the formation of bonds also that may be similar at different binding sites. That is why the proximity of the binding sites of the model structures outputted by both AlphaLarge and AlphaFold with the ground truth (i.e., the known binding site of the original protein) have been provided in Table 3. Since, for a fixed spatial arrangement and chemical environment of interacting residues (atoms in details), the bond information cannot be altered, in our work, for all protein mediated interactions, the bond information can be considered as embedded within all such interacting residues. For this reason, no separate information on bonds is provided in the manuscript. For details on the binding site residues, please refer to the Supplementary Table S1 given in the updated supplementary file.
In line with this strategy, to fulfill the main objective to explore the structural aspects of interactions of Syn-1 with a-Syn (mutated), first, the structure of Syn-1 is obtained through its standard validation-based selection from the structure models outputted both by the DTS and DAA sub-pipelines of AlphaLarge as clarified in Table 4 and Figure 4. Continuing with the earlier strategy, the best structure model (DTS) is re-confirmed as the best one for the further study by the application of PMIA based formalism using the results of its interaction with a-syn (wild type) by checking the binding energy as shown in Table 5 and the number of interactions resolved as provided in result Section 3.2.2, the first column of the Table 8, and Figure 5.
Finally, the interaction of Syn-1 with mutated a-Syn is studied following the methodology described in Section 2.2.2. For this purpose, first, the specific three mutations, A30P, A53T, and E46K, reported to promote the formation of aggregates of Syn-1 and a-syn complex, are manually made in the sequence of wild type a-Syn. Finally, the structure of mutated a-Syn is obtained through the DAA pipeline of AlphaLarge. At this step the question was about the authenticity of the structure of the mutated a-Syn. To resolve it, the same earlier validations, both standard and PMIA formalism are applied. The result of standard validation, as given in Figure 6 and Table 7, shows a slight increase in structural disorder in mutated a-Syn compared to its wild counterpart. At this stage, PMIA formalism helps establish the authenticity of the mutated a-Syn structure and explores the structural basis of interactions between Syn-1 and mutated a-Syn.
Since mutated a-Syn is already known to promote formation of higher amount of aggregates of Syn-1 and a-Syn complex in comparison to that made by a-Syn (wild type), it was intriguing to see whether it is reflected from the binding energy profiles of protein complexes made by the interactions of Syn-1 with a-Syn and mutated a-Syn, respectively. In this regard, the result of binding energy profile as shown in Table 6 clearly shows reasonably stronger binding of Syn-1 with mutated a-Syn in comparison to that with a-Syn (wild type). Furthermore, it can also be expected that for stronger interactions of Syn-1 with mutated a-Syn, the number of residue pairs of both the proteins participating in this interaction should be more comparable to the interaction of Syn-1 with a-Syn (wild type). Along with the visual demonstration of this interaction in Figure 7, it is also intriguing to observe that the same is also satisfied by the results shown in Table 8. The result of Table 8 is found to give new insight into the detailed residue–residue pair wise interactions between syn-1 and mutated a-Syn. In this regard, several C-terminal residues of a-Syn (mutated) were found to be involved in altered interaction with Syn-1 (shown in bold in Table 8) in the resulted complex which tallies with the previous FRET based interaction study [65]. Therefore, in this work, the PMIA formalism backed by AlphaLarge appears to pave the way for the new therapeutics to stop the progression of PD by offering the new structural insight of the causal interaction of Syn-1 with mutated a-Syn.

5. Conclusions

Popular therapeutics mainly rely on protein level control targeting the causal proteins and their interactions leading to the particular pathophysiological process. However, to fulfill this goal, there must be the availability of the structures of these proteins where experimental methods to get them are too slow and the prediction methods are too error-prone until the launch of AlphaFold2 by Google Colab that has already gained popularity for its prediction accuracy. Consequently, this work focused on study of interactions of the proteins Syn-1 and a-Syn with specific mutations leading to their aggregates referred to as Lewy Bodies (LB) that are toxic enough to cause irreversible neuronal damage causing PD. In line with this objective, a semiautomatic pipeline-based meta-predictor, AlphaLarge, compensates for the limitation of AlphaFold2 in predicting the structure of large proteins like Syn-1. The interaction of Syn-1 with a-Syn is studied under PMIA formalism, which not only re-validated the structures of these proteins, but also revealed important structural details of their interaction in terms of the specific residue–residue pairs participating in this interaction. This crucial information is expected to offer better therapeutic strategy in future to control the progression of PD.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biom14121588/s1, Supplementary Table S1 Supplementary Table S2, and MATLAB Code.

Author Contributions

Conceptualization, T.L.; methodology, T.L.; software, M.K.P.; validation, A.T. and R.M.; formal analysis, A.T. and R.M.; investigation, A.T.; resources, A.T., M.M. and M.K.P.; data curation, A.T. and R.M.; writing—original draft preparation, A.T., T.L. and R.M.; writing—review and editing, T.L. and M.M.; visualization, A.T.; supervision, T.L.; project administration, T.L.; funding acquisition, T.L. All authors have read and agreed to the published version of the manuscript.

Funding

The work is supported by the CPDA fund of Indian Institute of Information Technology to TL.

Data Availability Statement

All the data used in this work are available from the authors and can be shared on such demand.

Acknowledgments

The authors gratefully acknowledge the infrastructures provided by the Indian Institute of Information Technology Allahabad, India to carry out this work.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Kouli, A.; Torsney, K.M.; Kuan, W.L. Parkinson’s Disease: Etiology, Neuropathology, and Pathogenesis. In Parkinson’s Disease: Pathogenesis and Clinical Aspects; Stoker, T.B., Ed.; Codon Publications: Singapore, 2018. [Google Scholar]
  2. Xu, L.; Pu, J. Alpha-synuclein in Parkinson’s disease: From pathogenetic dysfunction to potential clinical application. Parkinsons Dis. 2016, 2016, 1720621. [Google Scholar] [CrossRef] [PubMed]
  3. Gomez-Benito, M.; Granado, N.; Garcia-Sanz, P.; Michel, A.; Dumoulin, M.; Moratalla, R. Modeling Parkinson’s disease with the alpha-synuclein protein. Front. Pharmacol. 2020, 11, 356. [Google Scholar] [CrossRef] [PubMed]
  4. Tagliafierro, L.; Chiba-Falek, O. Up-regulation of SNCA gene expression: Implications to synucleinopathies. Neurogenetics 2016, 17, 145–157. [Google Scholar] [CrossRef] [PubMed]
  5. Ribeiro, C.S.; Carneiro, K.; Ross, C.A.; Menezes, J.R.L.; Engelender, S. Synphilin-1 is developmentally localized to synaptic terminals, and its association with synaptic vesicles is modulated by alpha-synuclein. J. Biol. Chem. 2002, 277, 23927–23933. [Google Scholar] [CrossRef] [PubMed]
  6. Richfield, E.K.; Thiruchelvam, M.J.; Cory-Slechta, D.A.; Wuertzer, C.; Gainetdinov, R.R.; Caron, M.G.; Di Monte, D.A.; Federoff, H.J. Behavioral and neurochemical effects of wild-type and mutated human alpha-synuclein in transgenic mice. Exp. Neurol. 2002, 175, 35–48. [Google Scholar] [CrossRef]
  7. Engelender, S.; Wanner, T.; Kleiderlein, J.J.; Wakabayashi, K.; Tsuji, S.; Takahashi, H.; Ashworth, R.; Margolis, R.L.; Ross, C.A. Organization of the human Synphilin-1 gene, a candidate for Parkinson’s disease. Mamm. Genome 2000, 11, 763–766. [Google Scholar] [CrossRef]
  8. Xie, Y.Y.; Zhou, C.J.; Zhou, Z.R.; Hong, J.; Che, M.X.; Fu, Q.S.; Song, A.X.; Lin, D.H.; Hu, H.Y. Interaction with synphilin-1 promotes inclusion formation of α-synuclein: Mechanistic insights and pathological implication. FASEB J. 2010, 24, 196–205. [Google Scholar] [CrossRef]
  9. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef]
  10. Polymeropoulos, M.H.; Lavedan, C.; Leroy, E.; Ide, S.E.; Dehejia, A.; Dutra, A.; Pike, B.; Root, H.; Rubenstein, J.; Boyer, R.; et al. Mutation in the alpha-synuclein gene identified in families with Parkinson’s disease. Science 1997, 276, 2045–2047. [Google Scholar] [CrossRef]
  11. Krüger, R.; Kuhn, W.; Müller, T.; Woitalla, D.; Graeber, M.; Kösel, S.; Przuntek, H.; Epplen, J.T.; Schols, L.; Riess, O. AlaSOPro mutation in the gene encoding α-synuclein in Parkinson’s disease. Nat. Genet. 1998, 18, 106–108. [Google Scholar] [CrossRef]
  12. Zarranz, J.J.; Alegre, J.; Gómez-Esteban, J.C.; Lezcano, E.; Ros, R.; Ampuero, I.; Vidal, L.; Hoenicka, J.; Rodriguez, O.; Atarés, B.; et al. The new mutation, E46K, of α-synuclein causes parkinson and Lewy body dementia. Ann. Neurol. 2004, 55, 164–173. [Google Scholar] [CrossRef] [PubMed]
  13. Bisong, E. Building Machine Learning and Deep. Learning Models on Google Cloud Platform; Apress: Berkeley, CA, USA, 2019; pp. 59–64. [Google Scholar]
  14. Tunyasuvunakool, K.; Adler, J.; Wu, Z.; Green, T.; Zielinski, M.; Žídek, A.; Bridgland, A.; Cowie, A.; Meyer, C.; Laydon, A.; et al. Highly accurate protein structure prediction for the human proteome. Nature 2021, 596, 590–596. [Google Scholar] [CrossRef] [PubMed]
  15. Abbas, U.L.; Chen, J.; Shao, Q. Assessing fairness of AlphaFold2 prediction of protein 3D structures. In Proceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics, Houston, TX, USA, 3–6 September 2023; ACM: New York, NY, USA, 2023. [Google Scholar]
  16. MarcuŞtefan-Bogdan, T.; Sabin, T. An Overview of Alphafold’s Breakthrough. Front. Artif. Intell. 2022, 5, 875587. [Google Scholar]
  17. Meng, Q.; Guo, F.; Tang, J. Improved structure-related prediction for insufficient homologous proteins using MSA enhancement and pre-trained language model. Brief. Bioinform. 2023, 24, bbad217. [Google Scholar] [CrossRef]
  18. McDonald, E.F.; Jones, T.; Plate, L.; Meiler, J.; Gulsevin, A. Benchmarking AlphaFold2 on peptide structure prediction. Structure 2023, 31, 111–119.e2. [Google Scholar] [CrossRef]
  19. Lee, C.; Su, B.H.; Tseng, Y.J. Comparative Studies of AlphaFold, RoseTTAFold and Modeller: A Case Study Involving the Use of G-Protein-Coupled Receptors. Brief. Bioinform. 2022, 23, bbac308. [Google Scholar] [CrossRef]
  20. Krieger, E.; Nabuurs, S.D.; Vriend, G. Structural Bioinformatics; Bourne, P.E., Weissig, H., Eds.; Wiley-Liss: Hoboken, NJ, USA, 2012; pp. 507–520. [Google Scholar]
  21. Thompson, T.B.; Chaggar, P.; Kuhl, E.; Goriely, A. Alain Goriely, and Alzheimer’s Disease Neuroimaging Initiative. Protein-protein interactions in neurodegenerative diseases: A conspiracy theory. PLoS Comput. Biol. 2020, 16, e1008267. [Google Scholar] [CrossRef]
  22. Poluri, K.M.; Gulati, K.; Tripathi, D.K.; Nagar, N. Protein–protein interactions in neurodegenerative diseases. In Protein-Protein Interactions; Springer Nature: Singapore, 2023; pp. 101–169. [Google Scholar]
  23. Balint, D.; Brito, I.L. Human–gut bacterial protein–protein interactions: Understudied but impactful to human health. Trends Microbiol. 2024, 32, 325–332. [Google Scholar] [CrossRef]
  24. Mondal, R.; Mandal, M.; Lahiri, T. A divide and conquer approach (DACA) to predict high fidelity structure of large multidomain protein BRWD1. bioRxiv 2023. [Google Scholar] [CrossRef]
  25. Charan, J.; Biswas, T. How to calculate sample size for different study designs in medical research? Indian J. Psychol. Med. 2013, 35, 121–126. [Google Scholar] [CrossRef]
  26. National Center for Biotechnology Information. PubChem Compound Summary for CID 4459709, D-hexose 6-phosphate. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/4459709 (accessed on 26 September 2024).
  27. National Center for Biotechnology Information. PubChem Compound Summary for CID 135509118. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/135509118 (accessed on 26 September 2024).
  28. Resch, M.; Dobbek, H.; Meyer, O. Structural and functional reconstruction in situ of the [CuSMoO2] active site of carbon monoxide dehydrogenase from the carbon monoxide oxidizing eubacterium Oligotrophacarboxidovorans. JBIC J. Biol. Inorg. Chem. 2005, 10, 518–528. [Google Scholar] [CrossRef] [PubMed]
  29. Gourlay, C.; Nielsen, D.J.; Evans, D.J.; White, J.M.; Young, C.G. Young, Models for aerobic carbon monoxide dehydrogenase: Synthesis, characterization and reactivity of paramagnetic MoVO(μ-S)CuI complexes. Chem. Sci. 2018, 9, 876–888. [Google Scholar] [CrossRef] [PubMed]
  30. National Center for Biotechnology Information. PubChem Compound Summary for CID 5288690, 5’-O-(L-Leucylsulfamoyl)adenosine. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/5_-O-_L-Leucylsulfamoyl_adenosine (accessed on 26 September 2024).
  31. Kim, S.; Yoon, I.; Son, J.; Park, J.; Kim, K.; Lee, J.H.; Park, S.Y.; Kang, B.S.; Han, J.M.; Hwang, K.Y.; et al. Leucine-sensing mechanism of leucyl-tRNA synthetase 1 for mTORC1 activation. Cell Rep. 2021, 35, 109031. [Google Scholar] [CrossRef] [PubMed]
  32. Walker, E.H.; Pacold, M.E.; Perisic, O.; Stephens, L.; Hawkins, P.T.; Wymann, M.P.; Williams, R.L. Structural determinants of phosphoinositide 3-kinase inhibition by wortmannin, LY294002, quercetin, myricetin, and staurosporine. Mol. Cell. 2000, 6, 909–919. [Google Scholar] [CrossRef] [PubMed]
  33. National Center for Biotechnology Information. PubChem Compound Summary for CID 312145, Wortmannin. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/Wortmannin (accessed on 26 September 2024).
  34. National Center for Biotechnology Information. PubChem Compound Summary for CID 24139, N-ACETYL-beta-D-GLUCOSAMINE. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/N-ACETYL-beta-D-GLUCOSAMINE (accessed on 26 September 2024).
  35. National Center for Biotechnology Information. PubChem Compound Summary for CID 164513459. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/164513459 (accessed on 26 September 2024).
  36. Winkelmann, D.A.; Forgacs, E.; Miller, M.T.; Stock, A.M. Structural basis for drug-induced allosteric changes to human β-cardiac myosin motor activity. Nat. Commun. 2015, 6, 7974. [Google Scholar] [CrossRef]
  37. National Center for Biotechnology Information. PubChem Compound Summary for CID 11689883, Omecamtivmecarbil. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/11689883 (accessed on 26 September 2024).
  38. National Center for Biotechnology Information. PubChem Compound Summary for CID 65482, Sinefungin. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/Sinefungin (accessed on 26 September 2024).
  39. National Center for Biotechnology Information. PubChem Compound Summary for CID 86208110, 3-Amino-N-{(1r)-2-(Hydroxyamino)-2-Oxo-1-[4-(1h-Pyrazol-1-Yl)phenyl]ethyl}benzamide. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/86208110 (accessed on 26 September 2024).
  40. Mistry, S.N.; Drinkwater, N.; Ruggeri, C.; Sivaraman, K.K.; Loganathan, S.; Fletcher, S.; Drag, M.; Paiardini, A.; Avery, V.M.; Scammells, P.J.; et al. Two-pronged attack: Dual inhibition of Plasmodium falciparum M1 and M17 metalloaminopeptidases by a novel series of hydroxamic acid-based inhibitors. J. Med. Chem. 2014, 57, 9168–9183. [Google Scholar] [CrossRef]
  41. National Center for Biotechnology Information. PubChem Compound Summary for CID 91532, Phosphomethylphosphonic Acid Adenylate Ester. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/Phosphomethylphosphonic-acid-adenylate-ester (accessed on 26 September 2024).
  42. Inoue, M.; Sakuta, N.; Watanabe, S.; Zhang, Y.; Yoshikaie, K.; Tanaka, Y.; Ushioda, R.; Kato, Y.; Takagi, J.; Tsukazaki, T.; et al. Structural Basis of Sarco/Endoplasmic Reticulum Ca2+-ATPase 2b Regulation via Transmembrane Helix Interplay. Cell Rep. 2019, 27, 1221–1230. [Google Scholar] [CrossRef]
  43. National Center for Biotechnology Information. PubChem Compound Summary for CID 8117, Diethylene Glycol. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/Diethylene-glycol (accessed on 26 September 2024).
  44. Jiang, K.; Zhang, Y.; Chen, Z.; Wu, D.; Cai, J.; Gao, X. Structural and Functional Insights into the C-terminal Fragment of Insecticidal Vip3A Toxin of Bacillus thuringiensis. Toxins 2020, 12, 438. [Google Scholar] [CrossRef]
  45. National Center for Biotechnology Information. PubChem Compound Summary for CID 155804422. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/155804422 (accessed on 26 September 2024).
  46. Gao, Y.; Hu, Y.; Liu, Q.; Li, X.; Li, X.; Kim, C.Y.; James, T.D.; Li, J.; Chen, X.; Guo, Y. Two-Dimensional Design Strategy to Construct Smart Fluorescent Probes for the Precise Tracking of Senescence. Angew. Chem. Int. Ed. Engl. 2021, 60, 10756–10765. [Google Scholar] [CrossRef]
  47. National Center for Biotechnology Information. PubChem Compound Summary for CID 6022, Adenosine-5′-diphosphate. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/Adenosine-5_-diphosphate (accessed on 26 September 2024).
  48. National Center for Biotechnology Information. PubChem Compound Summary for CID 644100, 2-(4-((2,5-Difluorobenzyl)oxy)phenoxy)-5-ethoxyaniline. 2024. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/sea0400 (accessed on 26 September 2024).
  49. Xue, Z.; Xu, D.; Wang, Y.; Zhang, Y. ThreaDom: Extracting protein domain boundary information from multiple threading alignments. Bioinformatics 2013, 29, 247–256. [Google Scholar] [CrossRef]
  50. Finn, R.D.; Coggill, P.; Eberhardt, R.Y.; Eddy, S.R.; Mistry, J.; Mitchell, A.L.; Potter, S.C.; Punta, M.; Qureshi, M.; Sangrador-Vegas, A.; et al. The Pfam protein families database: Towards a more sustainable future. Nucleic Acids Res. 2016, 44, D279–D285. [Google Scholar] [CrossRef] [PubMed]
  51. Jiang, Y.; Wang, D.; Xu, D. DeepDom: Predicting protein domain boundary from sequence alone using stacked bidirectional LSTM. Pac. Symp. Biocomput. 2019, 24, 66–75. [Google Scholar] [PubMed]
  52. Rost, B.; Schneider, R.; Sander, C. Protein fold recognition by prediction-based threading. J. Mol. Biol. 1997, 270, 471–480. [Google Scholar] [CrossRef] [PubMed]
  53. Sievers, F.; Wilm, A.; Dineen, D.; Gibson, T.J.; Karplus, K.; Li, W.; Lopez, R.; McWilliam, H.; Remmert, M.; Söding, J.; et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 2011, 7, 539. [Google Scholar] [CrossRef] [PubMed]
  54. Colovos, C.; Yeates, T.O. Verification of protein structures: Patterns of nonbonded atomic interactions. Protein Sci. 1993, 2, 1511–1519. [Google Scholar] [CrossRef] [PubMed]
  55. Laskowski, R.A.; MacArthur, M.W.; Moss, D.S.; Thornton, J.M. PROCHECK: A program to check the stereochemical quality of protein structures. J. Appl. Crystallogr. 1993, 26, 283–291. [Google Scholar] [CrossRef]
  56. Pal, M.K.; Lahiri, T.; Tanwar, G.; Kumar, R. An improved protein structure evaluation using a semi-empirically derived structure property. BMC Struct. Biol. 2018, 18, 16. [Google Scholar] [CrossRef]
  57. Tripathi, A.; Mondal, R.; Lahiri, T.; Chaurasiya, D.; Pal, M.K. TemPred: A novel protein template search engine to improve protein structure prediction. IEEE/ACM Trans. Comput. Biol. Bioinform. 2023, 20, 2112–2121. [Google Scholar] [CrossRef]
  58. Benkert, P.; Biasini, M.; Schwede, T. Toward the estimation of the absolute quality of individual protein structure models. Bioinformatics 2011, 27, 343–350. [Google Scholar] [CrossRef]
  59. Zemla, A. LGA: A method for finding 3D similarities in protein structures. Nucleic Acids Res. 2003, 31, 3370–3374. [Google Scholar] [CrossRef]
  60. Zhang, Y.; Skolnick, J. Scoring function for automated assessment of protein structure template quality. Proteins 2004, 57, 702–710. [Google Scholar] [CrossRef] [PubMed]
  61. Schrödinger Release 2017.0: Maestro; Schrödinger LLC: New York, NY, USA, 2017.
  62. Friesner, R.A.; Murphy, R.B.; Repasky, M.P.; Frye, L.L.; Greenwood, J.R.; Halgren, T.A.; Sanschagrin, P.C.; Mainz, D.T. Extra precision Glide: Docking and scoring incorporating a model of hydrophobic enclosure for protein-ligand complexes. J. Med. Chem. 2006, 49, 6177–6196. [Google Scholar] [CrossRef]
  63. Schneidman-Duhovny, D.; Inbar, Y.; Nussinov, R.; Wolfson, H.J. PatchDock and SymmDock: Servers for rigid and symmetric docking. Nucleic Acids Res. 2005, 33, 363–367. [Google Scholar] [CrossRef] [PubMed]
  64. Xue, L.C.; Rodrigues, J.P.; Kastritis, P.L.; Bonvin, A.M.; Vangone, A. PRODIGY: A web server for predicting the binding affinity of protein–protein complexes. Bioinformatics 2016, 32, 3676–3678. [Google Scholar] [CrossRef] [PubMed]
  65. BIOVIA, Dassault Systèmes, Discovery Studio, 4.5; Dassault Systèmes: San Diego, CA, USA, 2021.
  66. Kawamata, H.; McLean, P.J.; Sharma, N.; Hyman, B.T. Interaction of alpha-synuclein and synphilin-1: Effect of Parkinson’s disease-associated mutations. J. Neurochem. 2001, 77, 929–934. [Google Scholar] [CrossRef] [PubMed]
  67. Camargos, S.T.; Dornas, L.O.; Momeni, P.; Lees, A.; Hardy, J.; Singleton, A.; Cardoso, F. Familial Parkinsonism and early onset Parkinson’s disease in a Brazilian movement disorders clinic: Phenotypic characterization and frequency of SNCA, PRKN, PINK1, and LRRK2 mutations. Mov. Disord. 2009, 24, 662–666. [Google Scholar] [CrossRef]
  68. Hattori, N.; Kitada, T.; Matsumine, H.; Asakawa, S.; Yamamura, Y.; Yoshino, H.; Kobayashi, T.; Yokochi, M.; Wang, M.; Yoritaka, A.; et al. Molecular genetic analysis of a novel Parkin gene in Japanese families with autosomal recessive juvenile parkinsonism: Evidence for variable homozygous deletions in the Parkin gene in affected individuals. Ann. Neurol. 1998, 44, 935–941. [Google Scholar] [CrossRef]
  69. Bras, J.; Guerreiro, R.; Ribeiro, M.; Morgadinho, A.; Januario, C.; Dias, M.; Calado, A.; Semedo, C.; Oliveira, C.; Hardy, J.; et al. Analysis of Parkinson disease patients from Portugal for mutations in SNCA, PRKN, PINK1 and LRRK2. BMC Neurol. 2008, 8, 1. [Google Scholar] [CrossRef]
  70. Golbe, L.I.; Di Iorio, G.; Bonavita, V.; Miller, D.C.; Duvoisin, R.C. A large kindred with autosomal dominant Parkinson’s disease. Ann. Neurol. 1990, 27, 276–282. [Google Scholar] [CrossRef]
  71. Nuytemans, K.; Theuns, J.; Cruts, M.; Van Broeckhoven, C. Genetic etiology of Parkinson disease associated with mutations in the SNCA, PARK2, PINK1, PARK7, and LRRK2 genes: A mutation update. Hum. Mutat. 2010, 31, 763–780. [Google Scholar] [CrossRef]
  72. Oczkowska, A.; Kozubski, W.; Lianeri, M.; Dorszewska, J. Mutations in PRKN and SNCA Genes Important for the Progress of Parkinson’s Disease. Curr. Genom. 2014, 14, 502–517. [Google Scholar] [CrossRef] [PubMed]
  73. Ganguly, U.; Chakrabarti, S.S.; Kaur, U.; Mukherjee, A.; Chakrabarti, S. Alpha-synuclein, proteotoxicity and Parkinson’s disease: Search for neuroprotective therapy. Curr. Neuropharmacol. 2018, 16, 1086–1097. [Google Scholar] [CrossRef] [PubMed]
  74. Glick, D.; Barth, S.; Macleod, K.F. Autophagy: Cellular and molecular mechanisms. J. Pathol. 2010, 221, 3–12. [Google Scholar] [CrossRef] [PubMed]
  75. Kumar, R.; Mondal, R.; Lahiri, T.; Pal, M.K. Application of sequence semantic and integrated cellular geography approach to study alternative biogenesis of exonic circular RNA. BMC Bioinform. 2023, 24, 148. [Google Scholar] [CrossRef] [PubMed]
  76. Yang, J.; Yan, R.; Roy, A.; Xu, D.; Poisson, J.; Zhang, Y. The I-TASSER Suite: Protein structure and function prediction. Nat. Methods 2015, 12, 7–8. [Google Scholar] [CrossRef]
  77. Wu, S.; Zhang, Y. LOMETS: A local meta-threading-server for protein structure prediction. Nucleic Acids Res. 2007, 35, 3375–3382. [Google Scholar] [CrossRef]
  78. Jones, S.; Thornton, J.M. Principles of protein-protein interactions. Proc. Natl. Acad. Sci. USA 1996, 93, 13–20. [Google Scholar] [CrossRef]
  79. Qin, K.; Dong, C.; Wu, G.; Lambert, N.A. Inactive-state preassembly of G(q)-coupled receptors and G(q) heterotrimers. Nat. Chem. Biol. 2011, 7, 740–747. [Google Scholar] [CrossRef]
Figure 1. Working parameters (participating residues, bonds, and binding energies) used in PMIA.
Figure 1. Working parameters (participating residues, bonds, and binding energies) used in PMIA.
Biomolecules 14 01588 g001
Figure 2. Creation of overlapping fragments A, B, and C following the divide and conquer strategy.
Figure 2. Creation of overlapping fragments A, B, and C following the divide and conquer strategy.
Biomolecules 14 01588 g002
Figure 3. Flowchart depicting the working principle of AlphaLarge, where a, b and c shows the structure of fragments 1, 2 and 3 respectively.
Figure 3. Flowchart depicting the working principle of AlphaLarge, where a, b and c shows the structure of fragments 1, 2 and 3 respectively.
Biomolecules 14 01588 g003
Figure 4. Quality assessment of Syn-1 structure predicted by DAA and DTS method from Ramachandran plots obtained from the Swiss-PdbViewer.
Figure 4. Quality assessment of Syn-1 structure predicted by DAA and DTS method from Ramachandran plots obtained from the Swiss-PdbViewer.
Biomolecules 14 01588 g004
Figure 5. (A) Surface filling model, (B) ribbon representation of DTS and DAA structures of Syn-1 (cyan overall with deep blue highlighting central coiled–coil region), and (C) zoomed version of (B) showing interactions with a-syn (wild type) (grey ribbon).
Figure 5. (A) Surface filling model, (B) ribbon representation of DTS and DAA structures of Syn-1 (cyan overall with deep blue highlighting central coiled–coil region), and (C) zoomed version of (B) showing interactions with a-syn (wild type) (grey ribbon).
Biomolecules 14 01588 g005aBiomolecules 14 01588 g005b
Figure 6. Panels (AC) show wild, mutated, and superimposed structures of wild (PDB ID: 1XQ8), and mutated a-Syn, respectively.
Figure 6. Panels (AC) show wild, mutated, and superimposed structures of wild (PDB ID: 1XQ8), and mutated a-Syn, respectively.
Biomolecules 14 01588 g006
Figure 7. Syn-1 and a-Syn (mutated) complex structure: (A) Surface filling model of the AlphaLarge predicted structure of Syn-1 (cyan overall with deep blue highlighting central coiled coil region) and ribbon representation of mutated a-syn (N-terminal 1–65 in grey ribbon), (B) ribbon representation of the complex and (C) zoomed version of (B) showing interactions with mutated a-syn through C-terminal (green ribbon).
Figure 7. Syn-1 and a-Syn (mutated) complex structure: (A) Surface filling model of the AlphaLarge predicted structure of Syn-1 (cyan overall with deep blue highlighting central coiled coil region) and ribbon representation of mutated a-syn (N-terminal 1–65 in grey ribbon), (B) ribbon representation of the complex and (C) zoomed version of (B) showing interactions with mutated a-syn through C-terminal (green ribbon).
Biomolecules 14 01588 g007
Table 1. Training proteins and their corresponding ligands.
Table 1. Training proteins and their corresponding ligands.
Proteins (PDB ID)Ligands (Chemical ID)
Mutant monomer of recombinant human hexokinase Type I complexed with Glucose, Glucose-6-Phosphate, and ADP (1CZA)alpha-D-glucose 6-phosphate(G6P)
[26]
Crystal structure of the aldehyde oxidoreductase from desulfovibriodesulfuricansatcc 27774(1DGJ)Pterin cytosine dinucleotide(MCN) [27,28,29]
Crystal structure of human leucyl-tRNA synthetase, Leu-AMS-bound form(6KIE)5′-O-(L-leucylsulfamoyl)adenosine(LSS)
[30,31]
Structure determinants of phosphoinositide 3-kinase inhibition by wortmannin, LY294002, quercetin, myricetin and staurosporine(1E7U)Wortmannin (KWT)
[32,33]
X-ray crystal structure of human ceruloplasmin at 3.0 angstroms(1KCW)2-acetamido-2-deoxy-beta-D-glucopyranose(NAG)
[34]
Crystal structure of endoplasmic reticulum aminopeptidase 2 (erap2) complex with a highly selective and potent small molecule(7SH0)(2S)-N-hydroxy-3-(4-methoxyphenyl)-2-[4-({[5-(pyridin-2-yl)thiophene-2-sulfonyl]amino}methyl)-1H-1,2,3-triazol-1-yl]propenamide (GIY) [35]
OmecamtivMercarbil binding site on the Human Beta-Cardiac Myosin Motor Domain(4PA0)methyl 4-(2-fluoro-3-{[(6-methylpyridin-3-yl)carbamoyl]amino}benzyl)piperazine-1-carboxylate(2OW)
[36,37]
Structure of human DNMT1 (601-1600) in complex with Sinefungin(3SWR)SINEFUNGIN(SFG)
[38]
Structure of the m1 alanylaminopeptidase from malaria complexed with a hydroxamic acid-based inhibitor(4R5X)3-amino-N-{(1R)-2-(hydroxyamino)-2-oxo-1-[4-(1H-pyrazol-1-yl)phenyl]ethyl}benzamide(R5X)[39,40]
Structure of Ca2+ ATPase(5ZTF)Phosphomethylphosphonic acid adenylate ester(ACP) [41,42]
Structure of C-terminal fragment of Vip3A toxin(6VLS)DI(HYDROXYETHYL)ETHER(PEG)
[43,44]
E.coli beta-galactosidase (E537Q) in complex with fluorescent probe KSA02(7BRS)8-[2-[(E)-2-[4-[(2S,3R,4S,5R,6R)-6-(hydroxymethyl)-3,4,5-tris(oxidanyl)oxan-2-yl]oxyphenyl]ethenyl]-3,3-dimethyl-indol-1-ium-1-yl]octanoic acid(F4X) [45,46]
Crystal structure of human MTR4(6IEG)ADENOSINE-5′-DIPHOSPHATE(ADP)
[47]
Structure of human sodium-calcium exchanger NCX1(8JP0)2-{4-[(2,5-difluorophenyl)methoxy]phenoxy}-5-ethoxyaniline(EKY)
[48]
Table 2. A profile of WDAA, WDTS, and Tie percentage against various structure validation parameters.
Table 2. A profile of WDAA, WDTS, and Tie percentage against various structure validation parameters.
Standard Structure Validation Metrics W D T S W D A A Tie
Ramachandran Score10000
G-factor9370
GDT–TS73657
PAL_RMSD432829
QMEAN50437
SRI434314
ERRAT793 0
TM-Score74350
Table 3. The comparative binding profiles of interactions obtained for individual proteins using experimental structure, DTS and DAA models, respectively. Better performance of DTS over DAA is highlighted by bold font.
Table 3. The comparative binding profiles of interactions obtained for individual proteins using experimental structure, DTS and DAA models, respectively. Better performance of DTS over DAA is highlighted by bold font.
ProteinsLigandsInteractions Details
Proximity of Binding Site of Experimental Structure in Å with That ofBinding Energy in kJ/mol
DAADTSBIExperimentalBIDAABIDTS
1CZAG6P33.327.8−25.9408−28.0328−27.6144
1DGJMCN29.228.6−59.8312−10.8784−24.2672
6KIELSS13.09.0−19.2464−23.4304−12.9704
1E7UKWT28.924.3−23.4304−15.4808−15.4808
1KCWNAG11.616.5−17.9912−22.1752−22.5936
7SH0GIY11.86.2−33.0536−27.196−26.7776
4PA02OW11.915.9−30.1248−23.4304−25.9408
3SWRSFG11.218.5−40.1664−22.1752−18.4096
4R5XR5X22.936.5−22.5936−17.1544−16.3176
5ZTFACP11.67.7−30.5432−17.5728−26.7776
6VLSPEG31.310.1−1.6736−0.4184−6.6944
7BRSF4X21.44.7−22.5936−25.5224−20.0832
6IEGADP13.922.6−21.3384−23.8488−15.0624
8JP0EKY16.810.5−32.6352−22.5936−10.46
Table 4. Comparison of Syn-1 model structures predicted through DTS and DAA pipelines using the validation parameters, ERRAT, Ramachandran score, G-factor, QMEAN Score, and % coil amount present in the structures.
Table 4. Comparison of Syn-1 model structures predicted through DTS and DAA pipelines using the validation parameters, ERRAT, Ramachandran score, G-factor, QMEAN Score, and % coil amount present in the structures.
ModelsRamachandran Score in % for Different RegionsG-FactorQMEAN Score% CoilERRAT
CoreAllowedGenerously AllowedDisallowed
DTS82.114.92.10.9−0.15−5.9765.672.48
DAA32.21718.332.5−2.55−21.8272.760.27
Table 5. Binding energy (BI) for interaction of a-syn with Syn-1 (both DTS and DAA models).
Table 5. Binding energy (BI) for interaction of a-syn with Syn-1 (both DTS and DAA models).
Structure Model of a-SynStructure Models of Syn-1BI in kJ/mol
Wild type (PDB ID: 1XQ8)DTS−58.576
DAA−56.0656
Table 6. BI for interaction of Syn-1 best model (DTS) with both wild and mutated a-Syn along with the Backbone PDB–RMSD between wild and mutated a-Syn in Å (R).
Table 6. BI for interaction of Syn-1 best model (DTS) with both wild and mutated a-Syn along with the Backbone PDB–RMSD between wild and mutated a-Syn in Å (R).
Structure Model of Syn-1Structure Models of a-SynBI in kJ/molPDB–RMSD (R) in Å with 11% Identity
DTSWild type (PDB ID: 1XQ8)−58.5763.35
Mutated a-Syn−110.0392
Table 7. Comparison of structures of wild and mutated a-Syn using the validation parameters, ERRAT, Ramachandran score, G-factor, QMEAN Score, and % coil amount present in the structures.
Table 7. Comparison of structures of wild and mutated a-Syn using the validation parameters, ERRAT, Ramachandran score, G-factor, QMEAN Score, and % coil amount present in the structures.
ModelsRamachandran Score in % for Different RegionsG-FactorQMEAN Score% CoilERRAT
CoreAllowedGenerously AllowedDisallowed
Wild Type (PDB ID: 1XQ8)77.4137.81.7−0.05−5.592573.01
Mutated
a-Syn
67.514.911.4 6.1−1.95−11.2534.20
Table 8. A comparative profile of residue level interactions of Syn-1 (A) with a-syn (wild type) (B) and Syn-1 (A) with a-syn (mutated) (C) (the reported interactions [8,66] are shown in bold).
Table 8. A comparative profile of residue level interactions of Syn-1 (A) with a-syn (wild type) (B) and Syn-1 (A) with a-syn (mutated) (C) (the reported interactions [8,66] are shown in bold).
Interaction of Syn-1 with a-Syn (Wild Type)Interaction of Syn-1 with a-syn (Mutated)
A: LEU547—B: LEU8
A: LYS551—B: MET1
A: LYS626—B: VAL3
A: ILE619—B: VAL3
A: LEU623—B: VAL3
A: VAL537—B: LYS12
A: VAL816—B: ALA18
A: ARG536—B: PHE4
A: GLU627—B: MET1
A: GLU630—B: MET1
A:PRO699—C: ALA11
A: ARG700—C: ALA11
A: ALA696—C: ALA18
A: ARG899—C: LYS58
A: PRO907—C: GLY73
A: GLY52—C: VAL118
A: ARG700—C: GLY7
A: ARG899—C: THR54
A: THR901—C: VAL55
A: SER19—C: TYR133
A: ARG34—C: TYR125
A: ARG34—C: GLU126
A: CYS51—C: ASN122
A: TYR158—C: THR81
A: LYS160—C: LYS80
A: SER200—C:ASP98
A: ARG700—C: GLY7
A: ARG899—C: THR54
A: THR901—C: VAL55
A: SER21—C: GLU130
A: ARG33—C: TYR125
A: TRP49—C: ALA124
A: GLN307—C: TYR133
A: SER912—C: THR75
A: GLN164—C: THR92
A: LEU165—C: THR92
A: GLN164—C: GLY93
A: ASP36—C: SER129
A: SER21—C: GLN134
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tripathi, A.; Mondal, R.; Mandal, M.; Lahiri, T.; Pal, M.K. A Structural Proteomics Exploration of Synphilin-1 and Alpha-Synuclein Interaction in Pathogenesis of Parkinson’s Disease. Biomolecules 2024, 14, 1588. https://doi.org/10.3390/biom14121588

AMA Style

Tripathi A, Mondal R, Mandal M, Lahiri T, Pal MK. A Structural Proteomics Exploration of Synphilin-1 and Alpha-Synuclein Interaction in Pathogenesis of Parkinson’s Disease. Biomolecules. 2024; 14(12):1588. https://doi.org/10.3390/biom14121588

Chicago/Turabian Style

Tripathi, Asmita, Rajkrishna Mondal, Malay Mandal, Tapobrata Lahiri, and Manoj Kumar Pal. 2024. "A Structural Proteomics Exploration of Synphilin-1 and Alpha-Synuclein Interaction in Pathogenesis of Parkinson’s Disease" Biomolecules 14, no. 12: 1588. https://doi.org/10.3390/biom14121588

APA Style

Tripathi, A., Mondal, R., Mandal, M., Lahiri, T., & Pal, M. K. (2024). A Structural Proteomics Exploration of Synphilin-1 and Alpha-Synuclein Interaction in Pathogenesis of Parkinson’s Disease. Biomolecules, 14(12), 1588. https://doi.org/10.3390/biom14121588

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop