- freely available
Int. J. Mol. Sci. 2013, 14(9), 17256-17278; doi:10.3390/ijms140917256
Published: 22 August 2013
Abstract: We use molecular dynamics simulations of a full atomistic Gō model to explore the impact of selected DE-loop mutations (D59P and W60C) on the folding space of protein human β2-microglobulin (Hβ2m), the causing agent of dialysis-related amyloidosis, a conformational disorder characterized by the deposition of insoluble amyloid fibrils in the osteoarticular system. Our simulations replicate the effect of mutations on the thermal stability that is observed in experiments in vitro. Furthermore, they predict the population of a partially folded state, with 60% of native internal free energy, which is akin to a molten globule. In the intermediate state, the solvent accessible surface area increases up to 40 times relative to the native state in 38% of the hydrophobic core residues, indicating that the identified species has aggregation potential. The intermediate state preserves the disulfide bond established between residue Cys25 and residue Cys80, which helps maintain the integrity of the core region, and is characterized by having two unstructured termini. The movements of the termini dominate the essential modes of the intermediate state, and exhibit the largest displacements in the D59P mutant, which is the most aggregation prone variant. PROPKA predictions of pKa suggest that the population of the intermediate state may be enhanced at acidic pH explaining the larger amyloidogenic potential observed in vitro at low pH for the WT protein and mutant forms.
Human β2-microglobulin (Hβ2m) is the non-covalently bound light chain of the human class I major histocompatibility complex (MHC-I), where it chaperones the complex assembly for antigen presentation. The wild-type (WT) Hβ2m comprises 99 residues arranged into a typical immunoglobulin (Ig) fold. It exhibits a sandwich-like structure formed by two sheets of antiparallel β-strands. One of the sheets comprises strands A-B-E-D with the second sheet being formed by strands C-F-G. The native structure is stabilized by a disulfide bond established between residue Cys25 (located on strand B) and residue Cys80 (located on strand F) . Such disulfide bond has been deemed critical for β2m fibrillogenesis at neutral pH [2,3]. Another key structural feature of Hβ2m is the existence of a peptidyl-prolyl bond on the BC-loop (between His31 and Pro32), which adopts a thermodynamically unfavorable cis isomerization in the native state (Figure 1). Docking of Hβ2m onto the β3 domain of the MHC involves the four-stranded beta-sheet and a surface area of 595 Å . In particular, the four aromatic residues Phe56, Trp60, Phe62, and Tyr63 and the aliphatic Leu65, which are shielded from the solvent in the quaternary structure of the MHC, become highly solvent-exposed upon dissociation of Hβ2m from the heavy chain.
During the continuous turnover of the MHC molecules, Hβ2m is shed from the cell membrane and discarded to the serum being catabolized in the kidneys. In individuals with kidney impairment undergoing long-term (>10 years) haemodialysis, the levels of Hβ2m increase from 0.3–30 mg/mL to 30–50 mg/mL due to the inability of the dialysis membrane to effectively remove the protein . Clearance failure leads to the onset of a pathological condition termed dialysis-related amyloidosis (DRA). In DRA, Hβ2m undergoes extracellular accumulation and slow deposition in the osteoarticular system (tendons, synovia, and bones) where it eventually aggregates into amyloid fibrils. The final outcome of DRA is tissue erosion and destruction.
While a concentration increase of Hβ2m appears to be a necessary condition for the occurrence of amyloid formation, it is certainly not sufficient to trigger the amyloid pathway. Indeed, experiments in vitro have shown that amyloid fibrils will not form in physiological conditions (of pH and temperature) even when the concentration of Hβ2m largely exceeds that observed in DRA patients [5,6]. Although it is widely recognized that WT Hβ2m is unable to efficiently nucleate fibrillogenesis in vitro under physiological conditions, it is also known that a multitude of environmental factors (e.g., acidic pH , agitation , addition of Cu2+ in the presence of urea , co-solvents ), can trigger the formation of β2m amyloid fibrils in laboratory experiments.
Seminal studies by Chiti and co-workers provided important insights into the in vitro folding mechanism of Hβ2m in physiological conditions (pH 7.4). Two intermediate states were identified in the folding pathway of Hβ2m: an ensemble of partially folded conformers (termed I1) with substantial elements of non-random structure that forms from the denatured sate, and a low-populated intermediate state I2, forming (directly or indirectly) from I1 and presenting a more consolidated protein core [11,12]. Subsequent investigations by Radford and co-workers, which explored the folding mechanism of β2m with atomic detail, showed that I2 is indeed a highly native-like slow-folding intermediate which is also aggregation-competent at neutral pH and physiological temperature . Despite being native-like, this intermediate exhibits a non-native (i.e., trans) isomerization of the peptidyl-prolyl bond between His31 and Pro32, and for this reason, it has been termed IT . It is known that specific variants of Hβ2m [13,14], including a cleaved form found in ex vivo amyloid fibrils (i.e., ΔN6) [15,16] of patients with DRA, may induce a higher population (up to 90%) of the intermediate IT and efficiently promote β2m fibril nucleation and/or elongation in a physiological (or near-physiological) pH in a mechanism akin to conformational conversion in prion .
Interestingly, the cleavage of the first six N-terminal residues in ΔN6 not only favors the isomerization of the peptidyl-prolyl bond characteristic of IT but also enhances the local dynamics of both the BC- and DE-loops in a pH-dependent manner suggesting these two regions are important for fibril assembly . Indeed, the local dynamics of the region spanning the D-strand and DE-loop, which establishes contacts with the MHC-I heavy chain, has been widely implicated in β2m fibrillogenesis . The DE-loop region is a key structural element of the protein that has been shown to be a major determinant of the kinetics of fibril formation being both involved in the nucleation and elongation phases of fibril assembly at acidic pH 2.5 . This segment of the polypeptide chain accommodates several aromatic bulky residues that become solvent exposed in the monomeric (free) form of β2m thus being able to act as sticky patches in intermolecular association. Indeed, strand D has been acknowledged to play a major role in edge-edge strand association and fibril assembly [9,18,19]. In this region, residue Trp60 is of particular interest. This bulky amino acid, located at the apex of the DE-loop, is highly conserved amongst vertebrates and plays a critical role in assisting the association of the β2m with the MHC-I . These observations have motivated a series of in vitro experiments based on mutations of Trp60 and other nearby residues in order to establish their relevance for biological association of β2m [20–24].
Here, we use discrete molecular dynamics simulations of a full atomistic Gō model, structural clustering and principal component analysis to assess the impact of two DE-loop mutations (D59P and W60C) on the thermodynamics and folding pathways of Hβ2m. While native-centric models have obvious limitations (e.g., they ignore water-mediated effects and cannot be used to explore regions of the free energy landscape dominated by non-native interactions), they are extraordinarily efficient from a computational standpoint, allowing the characterization of biomolecular processes occurring over long timescales such as protein folding. Furthermore, the full atomistic representation adopted in the present work takes into account an essential effect of protein folding which is that of side-chain packing, thus more accurately capturing the effect of single-point mutations .
This article is organized as follows. In the next section, we present and discuss the results. In the subsequent section, we provide a brief summary of the model and methods. Finally, we draw the conclusions.
2. Results and Discussion
2.1. Proteins Studied
In this work, we investigate the equilibrium folding of the (wild-type) WT Hβ2m (PDB ID: 1LDS) (Figure 1) and two single-point mutants D59P (PDB ID: 3DHM) and W60C (PDB ID: 3DHJ) obtained by replacing Asp59 by Pro59 and Trp60 by Cys60, respectively. D59P is the more aggregation prone variant studied here, being able to efficiently nucleate fibrillogenesis in vitro at physiological pH (7.4) . The W60C mutant, on the other hand, displays a decreased amyloidogenic propensity relatively to the WT form at physiological pH. The amyloidogenic behavior of all variants increases at acidic pH with that of W60C becoming quantitatively comparable to the WT protein .
The X-ray structure of the WT Hβ2m used in this work displays a straight D-strand with no β-bulge and an outward orientation of the AB-loop  which constitutes the most important structural deviations with regard to the conformation adopted by Hβ2m in solution and also in the MHC-I . As pointed out in Ref. , regular beta-strands are dangerous because they are in the right conformation to interact with other beta-strands they encounter. Thus, this rare conformer of Hβ2m is potentially more aggregation prone than other common conformations. Despite being considered a rare conformer of Hβ2m, the X-ray structure has been widely used to gain insight into the mechanism of amyloid formation by β2m. The X-ray structures of the two mutational variants, W60C and D59P, share with the WT Hβ2m X-ray structure its peculiar structural features.
In the current Gō model, the total number of native contacts exhibited by the three variants is 1072 (WT), 1091 (W60C) and 1063 (D59P). It is important to evaluate the impact of each single-point mutation on the number of native contacts established by the DE-loop (which accommodates the mutation), by the (adjacent) BC-loop and also by the terminal regions. The latter are widely acknowledged to have a protective role against aggregation by maintaining the hydrophobic balance that stabilizes the native state . Results are summarized in Table 1.
2.2. The Loop Mutations Decrease Thermal Stability
The melting (or transition) temperature Tm is considered a measure of the protein thermal stability. Here, as well as in experiments in vitro, we evaluate the melting Tm as the temperature at which the heat capacity Cv peaks (i.e., attains its maximum value). We compute Cv from the mean squared fluctuations in energy at each temperature considered in the RE-DMD simulations, in accordance with the definition Cv = (<E2> − <E>2)/T2. The results for Tm (Figure 2a)—1.280 (WT) > 1.251(W60C) > 1.226 (D59P)—follow the same qualitative trend as those obtained in experiments in vitro, 60.1 °C (WT) > 59.8 °C (W60C) > 52.0 °C (D59P) , and highlight a clearly more pronounced loss of thermal stability for the D59P mutant. The more pronounced loss of thermal stability exhibited by the D59P variant might be related with its larger propensity to aggregate. Indeed, it forms amyloid fibrils more quickly and with a higher yield than both the WT and W60C mutant .
The incipient shoulder in the Cv curve of the W60C mutant at high temperature (i.e., above Tm) indicates a weak pre-folding collapse transition. In the D59P mutant, it is possible to see an incipient second peak in the heat capacity curve (at T ~ 1.26) indicating that the pre-folding collapse is a stronger transition in this case. The less sharp folding transition of the D59P mutant is clearly visible in the curve showing the energy variation with temperature (Figure 2b).
2.3. The Loop Mutations Trigger the Population of a Partially Folded Intermediate State
The analysis of the free energy (FE) profiles (Figure 2c) and FE surfaces (Figure 3) allows for evaluation of the impact of mutations on the folding space of Hβ2m. The FE profile for the WT form (Figure 2c) reveals the existence of a free energy barrier between the denatured state and a partially folded intermediate state located at E~−400. The free energy barrier is relatively small which indicates that the pre-transition state intermediate forms relatively fast from the denatured state. We therefore hypothesize that it represents the intermediate species I1 originally reported by Chiti et al. [11,12] or a conformational excursion of the latter that may facilitate its conversion into IT. The analysis of the free energy surfaces reveals that the intermediate state is also present in the folding pathways of the mutants sharing the same gross characteristics across the three species: E ~ −420, Rg ~ 20 Å and RMSD ~ 15 Å (Figure 3). Interestingly, the TSE of the D59P mutant is slightly stabilized relative to that of the WT Hβ2m (Figure 2c) which may facilitate the conformational conversions between the intermediate and the native states. By comparing the free energy profiles of the mutants with that of the WT Hβ2m, we observe that the single point mutations thermodynamically stabilize the intermediate state and, most importantly, significantly destabilize the denatured state. Indeed, this effect is so striking that if we had restricted ourselves to the analysis of the free energy profiles of the mutants (Figure 2c) we would have concluded that the free energy minimum located at E ~ −400 is not an intermediate state, but a highly native-like denatured state instead (in fact, a compaction and structural consolidation of the denatured ensemble upon loop mutations was reported by Serrano on the Spc-SH3 folding domain ). The analysis of the free energy surfaces (in particular the free energy projected onto E and RMSD) shows a protrusion of the intermediate state’s basin towards higher energy and RMSD indicating the presence of the denatured state (which, nevertheless, is significantly destabilized at the considered temperature).
The mutations have a minor effect at the level of the native basin. The projection of the FE onto E and RMSD shows that the native basin of the WT protein is broader than that of the mutants (Figure 3a–c). Structural clustering (with a cutoff radius of 5–9 Å) of the native ensembles extracted from equilibrated DMD trajectories at fixed temperature (within the transition region) reveal that the WT and, to a lesser extent, the D59P mutant both populate a native-like species exhibiting a partially detached N-terminus (Figure S1). The native-like species represents less than 1% of the equilibrium ensemble in both cases.
The partial detachment of the N-terminus might reflect a smaller number of native interactions in the termini in the WT and D59P relative to that of W60C (Table 1). On the other hand, in W60C, there is an increment in the number of native contacts (up to 11.4%) established by the first four β-strands relative to the WT form (Table 1), which may contribute to its narrower native basin.
2.4. The Intermediate State Has a Molten Globule-Like Nature
In order to isolate and structurally characterize the intermediate state populated by each variant, we performed extensive structural clustering (with a cutoff radius of 14–17 Å) over ensembles of conformations collected from equilibrated DMD simulations at fixed temperature within the transition region. We found that the intermediate exhibits the disulfide bond between residues Cys25 and Cys80 (Figure S2), a structural feature which has been proposed to be a pre-requisite for β2m fibrillogenesis at neutral pH . The most likely dihedral angle for the His31-Pro32 bond in the intermediate state is ~−90° for the mutants and −45° for the WT. The intermediate’s main structural trait is the existence of unstructured termini detached from the region comprising residues 22–83 (i.e., strands B-F and connecting loops). Given its extension, we term such a region a core region. The RMSD of the core region (<3 Å) calculated with regard to the equivalent region in the native state indicates a high degree of native-like character (Figure S2). This is perhaps associated with the preservation of the disulfide bridge in the intermediate state that secures the integrity of the core region by acting like a staple. The intermediate’s (mean) energy, E, represents ~60% of the native energy and its (mean) radius of gyration, Rg, is 35% (WT), up to 40% (W60C, D59P) larger than that of the native state (Figure 4a). The (mean) root-mean-square deviation, RMSD, measured with reference to the native structure ranges from 15 Å (WT) to 16.6 Å (W60C, D59P). These structural traits are consistent with those of a molten globule (MG), i.e., a compact state with fluctuating tertiary structure . The intermediate state populates 9.4% (WT), 19.9% (W60C), and 23.2% (D59P) of the WT, W60C, and D59P equilibrium ensembles, respectively, at Tm.
The comparison between the amounts of native interactions established by each β-strand in the intermediate state with those established in the corresponding native state provides an indication of the loss of native structure (Figure 4b). Apart from losing secondary structure in the terminal strands, the intermediate is also less structurally consolidated in strands B and F. The E-strand establishes the higher number of native contacts thus playing an important role in the preservation of the core region. The loss of native content can be observed with more detail in the probability maps that report the ratios between the total number of native contacts between each pair of residues (computed at the atomic level) in the intermediate and in the native state (Figure S3). The analysis of the probability maps shows that the most ‘promiscuous’ residues (i.e., those involved in a higher number of intramolecular interactions) are located in the E-strand. The similar patterns of native contacts exhibited by the intermediate state of each variant further support the idea that the intermediate state is common to the three protein variants studied here.
2.5. The Intermediate State Exhibits Aggregation-Prone Traits
Since the exposure of aggregation-prone hydrophobic patches has been pointed out as a hallmark of protein aggregation , we evaluated the solvent accessible surface area (SASA) per residue (Figure 4c) in order to gauge the aggregation potential of the intermediate state. There are similarities in the SASA (per residue) of the intermediate states of all variants, of note, SASA increases from ~3 up to ~40 times with regard to the native state in 38% of the hydrophobic core residues (i.e., Leu7, Val9, Leu23, Val27, Phe30, Leu39, Val82, and Trp95). Apart from the terminal regions (N-terminus and strand A plus C-terminus and strand G), the increase in SASA is significant for the region comprising B-strand and the BC-loop. These observations thus suggest that the identified intermediate state has a high aggregation potential.
2.6. Large Displacements of the Termini Dominate the Intermediate State’s Dynamical Repertoire
Essential dynamics (also known as Principal Component Analysis, PCA) has been traditionally used to identify collective motions in biomolecules. It is a useful method to distinguish the dominant (i.e., essential) motions from the “noise” associated with lower-amplitude motions. High-amplitude loop motions, for example, have been frequently considered to be detrimental for the function of several enzymes. Here, we use PCA to identify the dominant motions of the native and intermediate states in the three variants of Hβ2m.
We started by calculating the Cα atoms’ variance-covariance matrix for the ensembles of conformations representative of native states that were obtained with the clustering analysis (Figure 5a–c). The native ensembles of the mutants are similar and they exhibit predominantly small-amplitude correlated motions. In the WT, although the pattern of anti-correlated motions is more scattered, they involve a higher-amplitude displacement of the corresponding Cα atoms (Figure 5a). These higher-amplitude motions correspond to the displacement in opposite directions of the N-terminus and the BC-, DE-, and FG-loop regions. The AB-loop displays a similar pattern of anti-correlated motion mostly with strands A, B, D and E and both loops DE and EF. In addition there is a clear positive correlation between the movements of the N-terminus and the AB-loop. Furthermore, the N-terminus moves in tandem with the EF-loop, and the AB-loop does it so with the region enclosing both G-strand and FG-loop. This pattern of positively-correlated motions involving the N-terminal regions of the protein is particularly evident for both the WT and, to a lesser extent, for the D59P variants being consistent with the population of the native-like species with a semi-detached N-terminus/strand A previously mentioned.
We evaluated the total mean square fluctuations (MSF) of the native state corresponding to the trace of the native covariance matrix. The total MSF of the native state is higher for both the WT and D59P variants (496 and 524 Å2, respectively, against 438 Å2 for the W60C mutant), which is in agreement with the wider native state basin identified in their FE surfaces. In the WT, the first five eigenvectors or principal components (PCs), which are associated with larger amplitude motions, account for ~54% of the total MSF, while in the mutational variants they account for ~40%
In order to gain insight into the type of motion associated with the first five PCs, we evaluated the contributions of each Cα atom in these PCs to the total RMSF (Figure 6a). We find that these principal modes represent mostly the movement of both the N-terminus and AB-loop suggesting that the transition between the native-state and the native-like state with a semi-detached N-terminus occurs mostly along these five principal modes. In the D59P variant along with the motion of the N-terminus and AB-loop (PCs 1–3), there is also a high RMSF associated with the C-terminus (PCs 3 and 4). Considering only those five eigenvectors, the DE-loop RMSF is only distinctively higher for the W60C variant along PCs 3 and 4. The enhanced dynamics of the native WT and the local nature of the correlated motions which involve well-defined regions of the protein (visible through low sparseness displayed by the covariance matrices) are indicative of highly-directional fluctuations of the native state that might concur for its propensity for in vitro native oligomerization at neutral pH .
The covariance matrices of the intermediate state’s ensembles of the three variants are quite similar (Figure 5d–f). The N-terminal region encompassing the N-terminus plus the AB-loop moves in an anti-correlated manner with the BC and DE-loop regions (including corresponding strands) and also with the C-terminal region (starting from strand F). In other words, the protein core (plus the C-terminal segment) and the N-terminal region move along opposite directions. The intermediate state populated by the WT is associated with the largest total MSF (9.4 × 104 Å2 against 1.2 × 104, 1.1 × 104 for the D59P and W60C mutants, respectively), which reflects the broader basin observed in its FE surface (Figure 3a). For all three variants the first five eigenvectors (or PCs) account for ~77%–78% of the total MSF in the considered DMD trajectories at Tm. The contributions of each Cα atom in these PCs to the total RMSF (Figure 6b) show very similar behavior across the three protein variants, and highlight especially high RMSF associated the motion of the N- and C-terminal regions. These motions have much larger amplitudes than the equivalent motions in the native state.
The first five principal modes of the combined covariance matrices (Figure 6c) highlight the enhanced RMSF associated with the motion of the N- and C-terminal regions of the D59P mutant responsible for the transition from the native to the intermediate state. The N-terminal region should be the first to become detached from the protein core (along PC 1 to 3).
2.7. Is the Predicted Intermediate State Compatible with a Molten Globule Identified for Hβ2m at pH 4?
The importance of a MG state for Hβ2m in vitro fibrillogenesis was reported for pH = 3.6 . This MG state appears to be preferentially stabilized at pH ~ 4  reaching a maximum (equilibrium) population of 76% in those conditions . It was shown to be the precursor of short, curved, worm-like fibrils . Structurally speaking, it preserves the disulfide bond Cys25–Cys80, it exhibits a stable and compact core involving strands B, C, D, E and F, and highly unstructured terminal strands A and G [7,31–35]. These structural traits are—all of them—shared with the intermediate state reported in the present work (Figure 4b). Despite these remarkable structural similarities, we cannot conclude that the intermediate reported here would form and be thermodynamically stable at pH 4.0 because its prediction is based on a Gō modeling approach that does not take into account the effect of pH. However, in what follows, we present an argument that suggests that the population of our intermediate state may be favored at pH 4.0, leading to a higher thermodynamic stability at this pH. Therefore, we cannot exclude the possibility that the MG state detected in vitro and the intermediate state reported here represent the same species.
As the pH decreases from 7.0 to 4.0, the number of positive charges in the native states of the WT and mutant forms increases (Table S1). In particular, according to PROPKA  predictions, the native state’s N-terminal region (extending up to residue 21) acquires one additional positive charge, and loses one negative charge, while the C-terminal region (starting at His84), displays one additional positive charge. The increased load of positive charge at pH 4.0 is visible in the electrostatic isocontours of the native and intermediate conformations (Figure S4). In particular, the excess of positive charges in both termini in the native structure is likely to drive their separation, contributing to the formation of the intermediate state at pH 4.0. However, the detachment of both protein termini from the core exposes hydrophobic residues that account for an unfavorable change in the nonpolar free energy of solvation (Gsolv,nonpolar) (Table S1). Still, solvation calculations suggest that formation of the intermediate state is more likely to occur at pH 4 because at this pH the electrostatic contribution to the free energy of solvation (Gsolv,polar) in the intermediate state is considerably lower than at pH 7.0 (Table S1).
3. Experimental Section
3.1. Gō Model and Discrete Molecular Dynamics Simulations
We consider a representation of the protein where all non-hydrogen atoms are represented by hard spheres of unit mass. The size of each atom is defined by scaling the relevant van der Waals (vdW) radius  by a factor α < 1. Protein energetics contains excluded volume interactions, non-bonded interactions and bonded interactions, all of which are modeled by discontinuous, piecewise constant interaction potentials. To take into account excluded volume interactions, all atom-atom interactions at distances smaller than the sum of the hard-sphere radii (hard clashes) are strictly forbidden. A pair of atoms i and j separated by a distance rij is a hard clash if rij < α(r0i + r0j), where r0i is the vdW radius of atom i. Non-bonded (or contact) interactions are represented by a square-well potential whose depth is given by Gō energetics . This means that if atoms i and j are located in residues which are separated by more than two units of backbone distance, the interaction parameter between them, ɛij, is given by:
In the expression above, σ =α (r0i+ r0j) is the hard-core distance, λ is a scaling factor that controls the range of attractive interactions, and Δij = −1 if i and j are in contact in the native conformation and is 0 otherwise. We followed  in treating the energetics of the disulfide bond in the same manner as we treat the other contact interactions. We set α = 0.80 and λ = 1.6 in order to have a well-behaved folding transition [38,41]. This choice of parameters sets a cut-off distance of 4.7 Å (for methyl carbon), and leads to 1072, 1091, and 1063 native contacts in the WT Hβ2m (PDB ID: 1LDS), W60C (PDB ID: 3DHJ) and D59P forms (PDB ID: 3DHM). The total energy of a conformation is computed as the sum over all atom pairs,
Covalent and covalent-like bonds between adjacent atoms i and j are modeled by a narrow, infinitely high potential well,
with b0= 0.9 and b1= 1.1, so that the average covalent bond length is equal to 1 Å. Folding dynamics is modeled with discrete (also termed discontinuous) molecular dynamics simulations [42,43]. The evaluation of thermodynamic properties requires equilibrium sampling of the conformational space. In order to obtain reversible folding trajectories at different temperatures, we used replica-exchange (RE) Monte Carlo simulations . In order to compute thermodynamic properties (e.g., the heat capacity), we used thermally equilibrated and uncorrelated states from very long DMD trajectories (consisting of at least 40 billion events after thermal equilibration). The weighted histogram analysis method (WHAM)  was used to compute the free energy as a function of different reaction coordinates (energy, E, radius of gyration, Rg, and root-mean-square deviation, RMSD).
3.2. Structural Clustering
Apart from the DMD-RE simulations, we also ran extensively long (up to 4 × 1011 events) DMD simulations at fixed temperature T (with T located within the transition region of each variant). A total number of 3 (D59P) and 6 trajectories (W60C) were considered. From each set of simulations we constructed a conformational ensemble for each analyzed protein (with 1.2–2 × 105 elements) by picking up equilibrated and uncorrelated conformations (i.e. conformations recorded every 50,000 events beyond the first folding transition) from the DMD trajectories. Subsequently, each conformational ensemble was analyzed with the k-means clustering algorithm of Brooks and co-workers as implemented in the MMTSB toolset . Data recorded at fixed temperature was also used for principal component analysis (PCA).
3.3. Principal Component Analysis
The PCA method is commonly used to identify the so-called essential motions, i.e., the set of collective internal fluctuations that make up the most important contributions to protein dynamics on a large number of conformations taken from a molecular dynamics trajectory. The PCA analysis presented in the present work was carried out with the g_covar and g_anaeig modules of GROMACS 4.5.4 . The variance-covariance matrices were evaluated after optimally superimposing the analyzed conformations onto the respective X-ray native structure. Further details can be found in .
In this work, we used molecular dynamics simulations to explore the impact of two selected loop mutations (D59P and W60C) on the folding pathways of a rare conformer of Hβ2m lacking a beta-bulge in strand D. Despite its peculiar features, this conformation has been widely used to study Hβ2m aggregation. In particular, the mutant variants considered here were prepared by amino acid replacement using this conformation as a template. Experiments in vitro indicate that the D59P mutational variant is more amyloidogenic than the WT protein, and the W60C is less. At acidic pH (2.5), the WT and W60C forms exhibit similar aggregation efficiencies.
The simulation results reported here recapitulate the trend for the melting temperature observed in experiments in vitro. Namely, that the largest decrease in Tm, reflecting the most significant loss of thermal stability, occurs for the more aggregation prone mutant D59P. Furthermore, our structure-based model predicts that both mutations induce the population of a partially folded state with 60% of the native’s state enthalpy, which has some molten globule-like character. The population of this folding intermediate is larger for the more amyloidegenic variant. The structural characterization of the intermediate state shows that about 40% of its hydrophobic core residues become significantly solvent exposed. The exposition of hydrophobic patches is a classical hallmark of protein aggregation. In addition, the folding intermediate identified here also presents two unstructured termini. The relevance of this particular structural feature for protein aggregation has been recently acknowledged [37,48,49]. In fact, perturbation of the N- and C-terminal strands has been demonstrated to account for the generation of assembly-competent states of β2m [14–16,50]. Furthermore, an unstructured strand A has been previously linked with the onset of β2m fibrillogenesis given its involvement in one of the two types of dimer interfaces established between monomers within the β2m H13F hexamer . Thus, it is likely that the folding intermediate identified in this study is also an intermediate state for aggregation. In addition, it is also possible that it represents a structurally consolidated version of the intermediate I1 that facilitates its conformational transition into IT, a widely recognized amyloid competent species in the folding space of β2m.
The W60C mutant also populates the intermediate state identified here. However, the W60C mutant is less amyloidogenic than the WT despite exhibiting a larger intermediate population. This may be taken as an indirect indication that the large and bulky tryptophan 60 is an important hub for the establishment of intermolecular interactions. The importance of Trp60 in β2m aggregation has been widely acknowledged [9,19–22].
The principal component analysis carried out in this study showed that the principal modes of the intermediate state are associated with the termini. In particular, the N-terminus moves significantly and in an opposite direction from the rest of the protein. It is well known that high-amplitude motions become overdamped in the presence of water, which is the major source of friction for slow and especially, very slow modes . The fact that the high amplitude motions will be overdamped in water does not change one of the hypothesis that we put forward in our work, i.e., that the identified intermediate state is aggregation-prone. Indeed, the major driving force for aggregation is the exposition of hydrophobic core residues, and in the present study, this trait comes in association with the loss of secondary structure in the protein termini and not with their mobility. However, one can conjecture that a more mobile terminus is more likely to establish intermolecular interactions. In this regard, overdamping the termini movement may have a minor effect on the predicted aggregation potential of the identified intermediate state, especially in the case of the D59P mutant, for which these movements are particularly relevant.
The population of a molten globule state has been reported for several proteins [52,53], including β2m [7,31–35]. The molten globule state detected for β2m shares with the intermediate state identified here important structural features. Namely, the preservation of the disulfide bond Cys25–Cys80, a stable and compact core, and highly unstructured termini. It has been suggested that the protonation of His84 (which will happen at acid pH given its low pKa) is a key event that triggers the molten globule formation . We put forward a simple argument that suggests that pH 4 may favor the population of the intermediate state reported here. Thus, we cannot exclude the possibility that the intermediate state reported here and the in vitro detected molten globule state correspond to the same species. Since the intermediate state reported here displays aggregation-prone traits, the higher stabilization of the intermediate state at pH 4 may provide a rationale for the enhanced amyloidogenic potential of all the protein sequences studied here at acidic pH.
Three-dimensional structures of the scarcely-populated native-like species identified for two of the Hβ2m variants fitted to the original X-ray native structure. The (starting) N-terminus is colored in blue while the C-terminus is shown in red.
Additional structural traits of the intermediate state identified for the three Hβ2m variants. The graph displays on the left-hand side the distributions of Cα RMSD values of the core region, comprehended between β-strands B and F (residues 22–83), after fitting it to the corresponding native structure core. The right-hand side depicts the distributions of disulfide bridge bond-length values in the three variants.
Native contact maps of the intermediate state identified for each of the three Hβ2m variants. Bottom half: Native contacts by residue in the native state. Top half: Probability map of the ratios between the total number of native atomic contacts by pair of residues in the intermediate and native states.
Electrostatic isocontours of the WT native (N)—X-ray structure—and intermediate (I)—clustering representative—states in acidic (4.0) and neutral pH (7.0) conditions. Coloring from red to blue corresponds to electrostatic potentials of −5 to +5 kBT. The 21 hydrophobic core amino acids are represented in pink in the SASA representation/depiction of the protein at the bottom of the figure. The protonation states at both pH values were attributed with PROPKA  via the webserver PDB2PQR v1.8 . The Poisson-Boltzmann equation was solved using the Adaptive Poisson-Boltzmann Solver APBS v1.4  in VMD v1.8.7 . All isosurfaces were generated at −5 and +5 kBT.
PFNF and SGE thank the Fundação para a Ciência e a Tecnologia (FCT) for financial support through grant PTDC/SAU-GMG/098274/2008 (to PFNF). PFNF thanks the Fundação para a Ciência e a Tecnologia (FCT) for financial support through grant PTDC/FIS/113638/2009. SGE holds a post-doctoral fellowship (SFRH/BPD/46313/2008) funded by the FCT.
Conflicts of Interest
The authors declare no conflict of interest.
- Trinh, C.H.; Smith, D.P.; Kalverda, A.P.; Phillips, S.E.V.; Radford, S.E. Crystal structure of monomeric human β-2-microglobulin reveals clues to its amyloidogenic properties. Proc. Natl. Acad. Sci. USA 2002, 99, 9771–9776.
- Hasegawa, K.; Ohhashi, Y.; Yamaguchi, I.; Takahashi, N.; Tsutsumi, S.; Goto, Y.; Gejyo, F.; Naiki, H. Amyloidogenic synthetic peptides of β2-microglobulin—A role of the disulfide bond. Biochem. Biophys. Res. Commun 2003, 304, 101–106.
- Yamamoto, K.; Yagi, H.; Ozawa, D.; Sasahara, K.; Naiki, H.; Goto, Y. Thiol compounds inhibit the formation of amyloid fibrils by β2-microglobulin at neutral pH. J. Mol. Biol 2008, 376, 258–268.
- Bellotti, V.; Gallieni, M.; Giorgetti, S.; Brancaccio, D. Dynamic of β2-microglobulin fibril formation and reabsorption: The role of proteolysis. Semin. Dial 2001, 14, 117–122.
- Eakin, C.M.; Miranker, A.D. From chance to frequent encounters: Origins of β2-microglobulin fibrillogenesis. BBA-Proteins Proteom 2005, 1753, 92–99.
- Platt, G.W.; Radford, S.E. Glimpses of the molecular mechanisms of β2-microglobulin fibril formation in vitro: Aggregation on a complex energy landscape. FEBS Lett 2009, 583, 2623–2629.
- Smith, D.P.; Jones, S.; Serpell, L.C.; Sunde, M.; Radford, S.E. A systematic investigation into the effect of protein destabilisation on beta 2-microglobulin amyloid formation. J. Mol. Biol 2003, 330, 943–954.
- Ohhashi, Y.; Kihara, M.; Naiki, H.; Goto, Y. Ultrasonication-induced amyloid fibril formation of β2-microglobulin. J. Biol. Chem 2005, 280, 32843–32848.
- Calabrese, M.F.; Eakin, C.M.; Wang, J.M.; Miranker, A.D. A regulatable switch mediates self-association in an immunoglobulin fold. Nat. Struct. Mol. Biol 2008, 15, 965–971.
- Hodkinson, J.P.; Radford, S.E.; Ashcroft, A.E. The role of conformational flexibility in β2-microglobulin amyloid fibril formation at neutral pH. Rapid Commun. Mass Spectrom 2012, 26, 1783–1792.
- Chiti, F.; Mangione, P.; Andreola, A.; Giorgetti, S.; Stefani, M.; Dobson, C.M.; Bellotti, V.; Taddei, N. Detection of two partially structured species in the folding process of the amyloidogenic protein [beta]2-microglobulin. J. Mol. Biol. 2001, 307, 379–391.
- Chiti, F.; de Lorenzi, E.; Grossi, S.; Mangione, P.; Giorgetti, S.; Caccialanza, G.; Dobson, C.M.; Merlini, G.; Ramponi, G.; Bellotti, V. A partially structured species of [beta]2-microglobulin is significantly populated under physiological conditions and involved in fibrillogenesis. J. Biol. Chem 2001, 276, 46714–46721.
- Jahn, T.R.; Parker, M.J.; Homans, S.W.; Radford, S.E. Amyloid formation under physiological conditions proceeds via a native-like folding intermediate. Nat. Struct. Mol. Biol 2006, 13, 195–201.
- Eichner, T.; Radford, S.E. A generic mechanism of β2-microglobulin amyloid assembly at neutral pH involving a specific proline switch. J. Mol. Biol 2009, 386, 1312–1326.
- Esposito, G.; Michelutti, R.; Verdone, G.; Viglino, P.; Hernández, H.H.; Robinson, C.V.; Amoresano, A.; Piaz, F.D.; Monti, M.; Pucci, P.; et al. Removal of the N-terminal hexapeptide from human β2-microglobulin facilitates protein aggregation and fibril formation. Protein Sci 2000, 9, 831–845.
- Eichner, T.; Kalverda, A.P.; Thompson, G.S.; Homans, S.W.; Radford, S.E. Conformational conversion during amyloid formation at atomic resolution. Mol. Cell 2011, 41, 161–172.
- Routledge, K.E.; Tartaglia, G.G.; Platt, G.W.; Vendruscolo, M.; Radford, S.E. Competition between intramolecular and intermolecular interactions in an amyloid-forming protein. J. Mol. Biol 2009, 389, 776–786.
- Mendoza, V.L.; Antwi, K.; Barón-Rodríguez, M.A.; Blanco, C.; Vachet, R.W. Structure of the preamyloid dimer of β-2-microglobulin from covalent labeling and mass spectrometry. Biochemistry 2010, 49, 1522–1532.
- Colombo, M.; de Rosa, M.; Bellotti, V.; Ricagno, S.; Bolognesi, M. A recurrent D-strand association interface is observed in β-2 microglobulin oligomers. FEBS J 2012, 279, 1131–1143.
- Esposito, G.; Ricagno, S.; Corazza, A.; Rennella, E.; Gümral, D.; Mimmi, M.C.; Betto, E.; Pucillo, C.E.M.; Fogolari, F.; Viglino, P.; et al. The controlling roles of Trp60 and Trp95 in β2-microglobulin function, folding and amyloid aggregation properties. J. Mol. Biol 2008, 378, 887–897.
- Santambrogio, C.; Ricagno, S.; Colombo, M.; Barbiroli, A.; Bonomi, F.; Bellotti, V.; Bolognesi, M.; Grandori, R. DE-loop mutations affect beta2 microglobulin stability, oligomerization, and the low-pH unfolded form. Protein Sci 2010, 19, 1386–1394.
- Ricagno, S.; Colombo, M.; de Rosa, M.; Sangiovanni, E.; Giorgetti, S.; Raimondi, S.; Bellotti, V.; Bolognesi, M. DE loop mutations affect β2-microglobulin stability and amyloid aggregation. Biochem. Biophys. Res. Commun 2008, 377, 146–150.
- Raimondi, S.; Barbarini, N.; Mangione, P.; Esposito, G.; Ricagno, S.; Bolognesi, M.; Zorzoli, I.; Marchese, L.; Soria, C.; Bellazzi, R.; et al. The two tryptophans of beta2-microglobulin have distinct roles in function and folding and might represent two independent responses to evolutionary pressure. BMC Evol. Biol 2011, 11, 159.
- Kihara, M.; Chatani, E.; Iwata, K.; Yamamoto, K.; Matsuura, T.; Nakagawa, A.; Naiki, H.; Goto, Y. Conformation of amyloid fibrils of β2-microglobulin probed by tryptophan mutagenesis. J. Biol. Chem 2006, 281, 31061–31069.
- Estácio, S.G.; Fernandes, C.S.; Krobath, H.; Faísca, P.F.N.; Shakhnovich, E.I. Robustness of atomistic Gō models in predicting native-like folding intermediates. J. Chem. Phys. 2012, 137.
- Esposito, G.; Bellotti, V. Emerging Molecular Targets in the Therapy of Dialysis-related Amyloidosis. In Protein Misfolding Diseases; John Wiley & Sons, Inc: Hoboken, NJ, USA, 2010; pp. 843–865.
- Richardson, J.S.; Richardson, D.C. Natural β-sheet proteins use negative design to avoid edge-to-edge aggregation. Proc. Natl. Acad. Sci. USA 2002, 99, 2754–2759.
- Martinez, J.C.; Pisabarro, M.T.; Serrano, L. Obligatory steps in protein folding and the conformational diversity of the transition state. Nat. Struct. Mol. Biol 1998, 5, 721–729.
- Ohgushi, M.; Wada, A. “Molten-globule state”: A compact form of globular proteins with mobile side-chains. FEBS Lett 1983, 164, 21–24.
- Chiti, F. Relative Importance of Hydrophobicity, Net Charge, and Secondary Structure Propensities in Protein Aggregation. In Protein Misfolding, Aggregation, and Conformational Diseases—Protein Reviews; Springer: New York, NY, USA, 2006; Volume 4, pp. 43–59.
- McParland, V.J.; Kalverda, A.P.; Homans, S.W.; Radford, S.E. Structural properties of an amyloid precursor of [beta]2-microglobulin. Nat. Struct. Biol 2002, 9, 326–331.
- Mukaiyama, A.; Nakamura, T.; Makabe, K.; Maki, K.; Goto, Y.; Kuwajima, K. The molten globule of β2-microglobulin accumulated at pH 4 and its role in protein folding. J. Mol. Biol 2013, 425, 273–291.
- Mukaiyama, A.; Nakamura, T.; Makabe, K.; Maki, K.; Goto, Y.; Kuwajima, K. Native-state heterogeneity of β2-microglobulin as revealed by kinetic folding and real-time NMR experiments. J. Mol. Biol 2013, 425, 257–272.
- McParland, V.; Kad, N.; Kalverda, A.; Brown, A.; Kirwin-Jones, P.; Hunter, M.; Sunde, M.; Radford, S. Partially unfolded states of β2-microglobulin and amyloid formation in vitro. Biochemistry 2000, 39, 8735–8746.
- Platt, G.W.; McParland, V.J.; Kalverda, A.P.; Homans, S.W.; Radford, S.E. Dynamics in the unfolded state of [beta]2-microglobulin studied by NMR. J. Mol. Biol 2005, 346, 279–294.
- Li, H.; Robertson, A.D.; Jensen, J.H. Very fast empirical prediction and rationalization of protein pKa values. Proteins: Struct. Funct. Bioinf 2005, 61, 704–721.
- Krobath, H.; Estácio, S.G.; Faísca, P.F.N.; Shakhnovich, E.I. Identification of a conserved aggregation-prone intermediate state in the folding pathways of Spc-SH3 amyloidogenic variants. J. Mol. Biol 2012, 422, 705–722.
- Huang, L.; Shakhnovich, E.I. Is there an en route folding intermediate for cold shock proteins? Protein Sci 2012, 21, 677–685.
- Tsai, J.; Levitt, M.; Baker, D. Hierarchy of structure loss in MD simulations of src SH3 domain unfolding. J. Mol. Biol 1999, 291, 215–225.
- Gō, N.; Abe, H. Noninteracting local-structure model of folding and unfolding transition in globular proteins. I. Formulation. Biopolymers 1981, 20, 991–1011.
- Shimada, J.; Kussell, E.L.; Shakhnovich, E.I. The folding thermodynamics and kinetics of crambin using an all-atom Monte Carlo simulation. J. Mol. Biol 2001, 308, 79–95.
- Dokholyan, N.V.; Buldyrev, S.V.; Stanley, H.E.; Shakhnovich, E.I. Discrete molecular dynamics studies of the folding of a protein-like model. Fold. Des 1998, 3, 577–587.
- Smith, S.W.; Hall, C.K.; Freeman, B.D. Molecular dynamics for polymeric fluids using discontinuous potentials. J. Comput. Phys 1997, 134, 16–30.
- Sugita, Y.; Okamoto, Y. Replica-exchange molecular dynamics method for protein folding. Chem. Phys. Lett 1999, 314, 141–151.
- Chodera, J.D.; Swope, W.C.; Pitera, J.W.; Seok, C.; Dill, K.A. Use of the weighted histogram analysis method for the analysis of simulated and parallel tempering simulations. J. Chem. Theor. Comput 2007, 3, 26–41.
- Feig, M.; Karanicolas, J.; Brooks, C.L., III. MMTSB Tool Set: enhanced sampling and multiscale modeling methods for applications in structural biology. J. Mol. Graph. Model. 2004, 22, 377–395.
- Pronk, S.; Páll, S.; Schulz, R.; Larsson, P.; Bjelkmar, P.; Apostolov, R.; Shirts, M.R.; Smith, J.C.; Kasson, P.M.; van der Spoel, D.; et al. GROMACS 4.5: A high-throughput and highly parallel open source molecular simulation toolkit. Bioinformatics 2013, 29, 845–854.
- Neudecker, P.; Robustelli, P.; Cavalli, A.; Walsh, P.; Lundström, P.; Zarrine-Afsar, A.; Sharpe, S.; Vendruscolo, M.; Kay, L.E. Structure of an intermediate state in protein folding and aggregation. Science 2012, 336, 362–366.
- Chiti, F.; Dobson, C.M. Amyloid formation by globular proteins under native conditions. Nat. Chem. Biol 2009, 5, 15–22.
- Jones, S.; Smith, D.P.; Radford, S.E. Role of the N and C-terminal strands of [beta]2-microglobulin in amyloid formation at neutral pH. J. Mol. Biol 2003, 330, 935–941.
- Hinsen, K.; Kneller, G.R. Solvent effects in the slow dynamics of proteins. Proteins Struct. Funct. Bioinf 2008, 70, 1235–1242.
- Hughson, F.; Wright, P.; Baldwin, R. Structural characterization of a partly folded apomyoglobin intermediate. Science 1990, 249, 1544–1548.
- Park, S.H. Hydrophobic core variant ubiquitin forms a molten globule conformation at acidic pH. J. Biochem. Mol. Biol 2004, 37, 676–683.
- Dolinsky, T.; Czodrowski, P.; Li, H.; Nielsen, J.E.; Jensen, J.H.; Klebe, G.; Baker, N. PDB2PQR: Expanding and upgrading automated preparation of biomolecular structures for molecular simulations. Nucleic Acids Res 2004, 35, W522–W525.
- Baker, N.A.; Sept, D.; Joseph, S.; Holst, M.J.; McCammon, J.A. Electrostatics of nanosystems: Application to microtubules and the ribosome. Proc. Natl. Acad. Sci. USA 2001, 98, 10037–10041.
- Humphrey, W.; Dalke, A.; Schulten, K. VMD: Visual molecular dynamics. J. Mol. Graph 1996, 14, 33–38.
|Table 1. Native contacts established by β-strand and loop. The change (in percentage) in the number of native contacts relatively to the WT is shown in parentheses. The WT form and the D59P mutational variant display a similar number of native contacts in the DE-loop (44 and 46, respectively) in agreement with the similar conformations adopted by the DE-loop in both cases. The two additional native contacts observed in the D59P mutant may result from an increased rigidity brought by the proline side-chain. The Trp60 to Cys single-point mutation in the W60C form produces the most noticeable impact on the number of native contacts established by the DE-loop which decreases by −13.6% relatively to the WT. This observation reflects the substitution of Trp60 residue, a potential hub for intra-molecular interactions due to its large size and bulkiness. This decrease in the number of native contacts extends to the adjacent BC-loop. There is a loss of native contacts in the termini of the D59P variant relatively to the WT (3% in the N-terminus and 10% in the C-terminus). In the W60C variant, the loss of native contacts in the C-terminus increases up to 12.5%. In the case of the amyloidogenic variant (D59P), there is a noticeable decrease in the number of contacts established by the loops connecting the terminal strands. In the case of the less-amyloidogenic variant W60C, the number of contacts established by the terminal loops is only slightly lower than the ones displayed by the WT.|
|A (6–12)||140||149 (+6.4%)||136 (−2.8%)|
|B (22–30)||319||330 (+3.4%)||305 (−4.4%)|
|C (36–41)||187||195 (+4.3%)||187|
|C′ (44–45)||35||39 (+11.4%)||32 (−8.6%)|
|D (51–56)||108||108||102 (−5.6%)|
|E (62–69)||324||325 (+0.3%)||323 (−0.3%)|
|F (78–83)||204||197 (−3.4%)||199 (−2.4%)|
|G (91–94)||72||63 (−12.5%)||66 (−8.3%)|
|AB (13–21)||89||87 (−2.2%)||80 (−10.1%)|
|BC (31–35)||102||92 (−9.8%)||97 (−4.9%)|
|DE (57–61)||44||38 (−13.6%)||46 (+4.5%)|
|EF (70–77)||111||100 (−9.9%)||108 (−2.7%)|
|FG (84–90)||95||94 (−1.0%)||88 (−7.4%)|
© 2013 by the authors; licensee MDPI, Basel, Switzerland This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).