- freely available
- re-usable
Int. J. Mol. Sci. 2014, 15(5), 7841-7864; doi:10.3390/ijms15057841
Abstract
: Agonist binding is related to a series of motions in G protein-coupled receptors (GPCRs) that result in the separation of transmembrane helices III and VI at their cytosolic ends and subsequent G protein binding. A large number of smaller motions also seem to be associated with activation. Most helices in GPCRs are highly irregular and often contain kinks, with extensive literature already available about the role of prolines in kink formation and the precise function of these kinks. GPCR transmembrane helices also contain many α-bulges. In this article we aim to draw attention to the role of these α-bulges in ligand and G-protein binding, as well as their role in several aspects of the mobility associated with GPCR activation. This mobility includes regularization and translation of helix III in the extracellular direction, a rotation of the entire helix VI, an inward movement of the helices near the extracellular side, and a concerted motion of the cytosolic ends of the helices that makes their orientation appear more circular and that opens up space for the G protein to bind. In several cases, α-bulges either appear or disappear as part of the activation process.1. Introduction
G protein-coupled receptors (GPCRs) are important targets for the pharmaceutical industry and have consequently been studied extensively in vivo, in vitro, and in silico. Their importance is illustrated by the fact that PubMed [1] lists around 500 reviews relating to GPCRs every year. The first three-dimensional structure of a GPCR was solved in 2000 [2], and recent years have seen a flurry of GPCR structures [3] being solved, published and deposited in the PDB (Protein Data Bank) [4]. Nearly all the GPCR structures solved so far are from the rhodopsin-like family—normally referred to as the Class A GPCR family. This article exclusively examines Class A GPCRs, so each time the acronym GPCR is used, it should be interpreted as “Class A GPCR”.
GPCRs possess seven transmembrane helices that traverse the membrane as illustrated in Figure 1. The N-terminus is located extracellularly and the C-terminus is located in the cytosol. Each helix contains at least one highly conserved residue (see Figure 1). These conserved residues are commonly used to anchor GPCR sequence alignments using the soon to be abandoned concept that “there are no insertions and deletions in helices” [5,6].
In the year 2000, the first structure of a GPCR was solved experimentally [2]. This structure not only revealed a series of surprises but also confirmed [8] or falsified [9] a series of previous hypotheses. Other hypotheses [10] were proven right in concept but wrong in detail. After the second GPCR structure had been solved (the β2 adrenergic receptor [11]), the floodgates opened, with the result that over the last few years we have on average been able to study one new GPCR structure every few months. The most frequent source of newly solved GPCRs has been the Stevens Lab [3].
GPCRs are notoriously difficult to crystallize. They are membrane proteins and therefore have large hydrophobic surfaces that are prone to nonspecific binding. However, it is perhaps even more important to realize that most GPCRs exhibit constitutive activity, which means they are highly mobile, continuously moving in and out of the active state. Consequently, GPCR crystallographers not only need to cope with the classical membrane crystallization problem of sticky hydrophobic surfaces, they also need to fixate the GPCR in one of its many marginally stable conformations. The sticky surface problem is frequently addressed by adding amphiphiles, while increasing the hydrophilic surface is often done by cloning lysozyme covalently into the GPCR. This covalent cloning is typically done between helices V and VI (e.g., as in the β2 adrenoceptor [11]), but lysozyme has also been cloned-in at other locations (e.g., the β2 adrenoceptor in which lysozyme has been fused to the N-terminus [12]). Other molecules are also being used for this purpose [13]. The GPCR mobility issue has been reduced by adding strong binding ligands (e.g., PDBid = 3eml [14]), nanobodies (e.g., PDBid = 3p0g [15]), G proteins (e.g., PDBid = 3sn6 [11]), and by mutating the residues that are most involved in the activation process—for example, the β1 adrenoceptor (PDBid = 2vt4 [16]) or the adenosine A2A receptor (PDBid = 3rey [17]). All these modifications have an influence on the structure. Figure 2 shows a series of examples of non-native interactions observed in GPCR structures. An associated web page [18] holds pictures of the 84 GPCR structures presently available with the modifications and non-native interactions indicated.
When a GPCR structure deviates from the canonical situation shown in Figure 1, it is important to consider whether the deviation is an interesting finding or a crystallization artifact. For example, in the recently solved structure of the NTS1 neurotensin receptor (PDBid = 4grv [19]) we observed that helix VIII is barely present and what can be seen of it appears to be displaced. This could be an interesting finding, but it seems more likely that a large number of crystal packing contacts (see Figure 3) has caused helix I to occupy the canonical space of helix VIII, causing helix VIII to find another stable position packing against the cloned-in lysozyme structure [19].
Mason et al. [20] found that upon receptor activation, the volume of the ligand binding site decreased by ~40 Å3 for the β1 and β2 adrenoceptors and by ~90 Å3 for the purinergic A2A receptor. This indicates that agonists stabilize a receptor conformation in which the extracellular sides of certain helices are closer together. This finding, combined with the finding of the “see-saw” like motions of helices [21,22], results in a model that can be compared to the mechanism of a clothespin. This model is explained in Figure 4.
GPCR structures have been modified in many different ways to aid crystallization, but fortunately, because many different crystallization methods have been used, we can still extract an overall picture by observing trends in large numbers of GPCR structures. After dealing with sequence insertions and deletions caused by α-bulges and short stretches of π-helix and 310-helix, we can unambiguously assign 203 residues that are common to all GPCR structures, all of which we can be reasonably sure are homologous. We have measured the 20,503 pair-wise distances between these 203 residues in 69 GPCR structures, and we have analyzed these distances using the random forest (RF) method. One of the scientific possibilities offered by the RF method when applied to a very large set of observations is detection of those observations that are most supportive for a hypothesized classification. We produced a series of structure classifications, each consisting of either two or three groups. Examples are: dark-state rhodopsins versus other receptors in the inactive state; receptors with a G protein or nanobody bound at the cytosolic side versus all others; and adenosine receptors with bound inverse agonists, antagonists or agonists.
The results shed new light on the GPCR activation process, on the roles of many individual residues, and on the helix motions. They also shed new light on α-bulges and their role in the dynamic processes related to GPCR activation. Upon activation (either by G-protein binding or by other means) helices III and VI separate at the cytosolic side, while helix III translates slightly in the extracellular direction [18].
Helical distortions play a major role in the overall fold of GPCRs. Most GPCRs possess α-bulges [24–30] in helices II and V, but they are also observable in most other helices in one or more GPCRs; only in the lipid receptor structures did we not detect any α-bulges. The highly conserved α-bulges often mentioned in helices II and V are not present in all GPCR structures, and when they are present, their locations are not always strictly conserved. In the second transmembrane segment (TM2), for example, the proline pattern is not conserved in sequence space. When present, it may be located at position 232, 233, or 234. In 2009 Devillé et al. [29] proposed that an indel led to two structural motifs for helix 2: bulged receptors with a proline at position 233 or 234 or a kink in receptors with a proline at position 232. In the structures available today, though, the proline in TM2 is structurally conserved and the observed sequence variability is caused by the absence or presence of an α-bulge. This sequence motif is particularly important for predicting the presence of bulges near the ligand-binding site, which in turn is crucial for homology modeling. Figure 5 shows how in the trace amine subfamily in the GPCRDB [5] a bulge is observed near position 229 in about half of all family members.
α-Bulges tend to remain stable in molecular dynamics simulations, indicating that these bulges represent (at least) a local energy minimum (DE Gloriam, personal communication).
We will argue that the role of prolines near the kinks in the helices is different to that previously thought. Overall, our results lead to a series of new hypotheses that are amenable to experimental validation.
2. Results
The way we use the RF method to analyze GPCR structure characteristics in distance space requires that the hypotheses put forward logically lead to classifications of existing GPCR structures involving a limited number of groups. Any hypothesis related to the activation process will be a good candidate, because the GPCR structure community has been working hard to shed light on this process by solving the structures of GPCRs with bound agonists, partial agonists, etc. and GPCR structures in the presence and absence of G proteins.
Visual inspection reveals that residue Tyr733 is displaced the most between active and inactive structures, with widening of the gap between helices III and VI also being closely associated with activation. In total, 202 distance vectors relate to displacement of the Tyr733 residue. However, almost eight times this number of vectors contribute to the relative displacement of helices III and VI. This asymmetry between the number of distance vectors related to displacement of Tyr733 and the number of distance vectors related to displacement of helices III and VI causes artifacts in the RF computations [32,33]. We therefore iteratively searched for the pair of distance vectors that had the highest Pearson correlation coefficient, randomly removing one of the two vectors during each iteration. This process was repeated until no two distance vectors had a Pearson correlation coefficient higher than 0.90, resulting in removal of around 80% of the vectors. The RF determination of the distance vectors most representative for the differences between active and inactive structures are projected on 1F88 in Figure 6. The yellow lines in Figure 6 represent distances that are shorter in the active state GPCRs than in the inactive ones. Most of the yellow vectors in Figure 6 are caused by a small but systematic displacement of helix III towards the extracellular side in the active state structures. The magenta lines represent distances that increase upon activation. These mainly involve the cytosolic side of helix VI, which undergoes an outward motion away from the rest of the helix bundle upon activation.
Tyr733, which is located at the intracellular side of helix VII, is the most important single residue to classify receptors as either being in the active state or the inactive state. It has been hypothesized [2,8,34–36] that this highly conserved residue stabilizes the active state by stabilizing the open conformation of the “ionic lock” formed by Arg340 and Glu600. During activation, the local backbone around Tyr733 undergoes a complicated reorganization, part of which involves the appearance/disappearance of an α-bulge.
When we use RF to determine the most significant distance differences between active and inactive GPCR structures, we find that many of these involve a residue in helix III. Visual inspection of superposed structures in the active and inactive state reveals that helix III is slightly displaced towards the extracellular direction in the activated GPCRs. Helix III is also slightly more wound up in the activated state. We also observe a correlation between truncation of the N-terminus and an outward movement of the extracellular side of helix I, as illustrated in Figure 7.
Although many articles have been published about the role of prolines in inducing kinks in helices (e.g., [37–43]), we do not believe that prolines actively induce these kinks. We believe that they merely allow for them. Nevertheless, the suggestion that a relationship exists between prolines, kinks, and the mobility that is necessary for GPCR activation continues to be a prominent part of experimental studies. However, our structural comparisons between GPCRs in the active and inactive form do not fully support this suggestion, as is illustrated in Figure 8, in which the active form of the β2-adrenoceptor is superposed on its inactive form. Figure 8 illustrates that the structural differences observed in the helices are not related one-to-one to the presence of prolines. The major hinge point is located near the kink in helix VI, but this helix does not bend in the middle; it undergoes a full-helix rotation. The blue and red helix VI in Figure 8 can be superposed on all 28 Cα atoms (including the residues in the irregular area near the ligand binding site) with an RMS displacement of only 0.9 Å.
In every kinked helix, a highly conserved proline is found near the kink. Whenever there is a non-helical element in the middle of a helix, a proline starts the “rest” of that helix. These non-helical elements can take the form of an α-bulge or a short 310 helix. Our supposition is that prolines do not induce kinks, as has often been suggested, but merely facilitate the non-helical element in the middle of the helix and ensure that the transmembrane domain can, after the irregularity, continue as a normal α-helix. In Figure 8 we see that for all cases the irregularities are highly similar in the active and inactive form, except for helix VII. In the inactive state this helix has a large stretch of residues forming a highly irregular helix, whereas in the active state the helix is more regular. The fact that helix VI rotates in its entirety rather than bending at the kink near the proline further adds to the idea that the relationship between prolines and kinks is not as simple and direct as has often been suggested.
Table 1 lists the α-bulges observed in the seven transmembrane helices in the 84 GPCRs studied. Figure 9 shows the distribution of secondary structure types over all the residues that these 69 structures have in common. The often mentioned [24–30] α-bulges in helices II and V are present in nearly all GPCRs, but they are not as highly conserved as some other features, and their locations are not conserved. Especially in helix V, many more irregularities are observed—and the S1P-lipid receptors lack the α-bulge in helix V. Interestingly, this receptor does not have the otherwise conserved proline at position 520. This finding might even indicate an alternative way for the lipid ligand to enter the active site, because the flexible ligand can perhaps enter directly from the membrane between helices IV and V [18]. Figure 10 illustrates the variability of the bulges observed in helix V.
Most structures have an α-bulge in the middle of helix II, notable exceptions being opioid, CXCR4, CCR5, and lipid receptors. Visual inspection of structures containing the bulge in helix II does not suggest a functional role for it despite that it is located at the same depth in the membrane as the ligand binding site. Squid rhodopsin (PDBid = 2z73 [47]) has an α-bulge at the extracellular side of helix II that is absent in bovine rhodopsin.
The α-bulge after Tyr733 is present in all GPCR structures except the β1 and β2 adrenoceptors. The absence of this bulge in β1 and β2 adrenoceptors is correlated with the presence of a proline at location 808 (in the corner between helices VII and VIII). It therefore appears that both the bulge and the proline can facilitate a similar displacement of Tyr733. Figure 11 illustrates these effects. This bulge may be “needed” to allow tyrosine 733 to bend inwards and assist in stabilization of G protein binding to the GPCR. The word “needed” should be read with caution, as there is no indication of a causal relationship between the two observations.
Figure 12 shows the bulges in helix IV in the CXCR4 chemokine receptor (PDBid = 3odu [48]) and the M2 muscarinic acetylcholine receptor (PDBid = 3uon [49]). At present, it is not possible to evidentially associate these bulges with any particular function, although it could be speculated that they play a role in dimer formation.
For the adenosine 2A receptor, structures are available bound to inverse agonists, antagonists, and agonists. Unfortunately, no structure is available yet for this receptor with a bound G protein. Figure 13 shows the superposition of the ten available structures for the adenosine 2A receptor and indicates that in most cases the differences between structures with a bound inverse agonist and structures with a bound antagonist are small. However, the structural differences are larger when either of these two groups is compared to structures with a bound agonist. Most of these structural differences seem to agree with the clothespin mechanism [53] but the displacements seen for the cytosolic side of helix V are surprising. When going from a bound inverse agonist to an antagonist we observe an inward motion of helix V, and upon binding of the agonist we observe an outward motion. We have no explanation yet for this counter-intuitive phenomenon.
Figure 13 shows differences in the helices observed in globally superposed structures. Figure 13 shows the same helices in the same colors with the helices superposed locally. This local superposition reveals that helix VI rotates as a whole rather than its cytosolic side bending outwards (see Figure 14). Helix III becomes more regular and less bent upon binding an agonist. Helix V winds up more tightly going from the inverse agonist, via the antagonist, to the agonist bound structures.
Many structures are available for the β2 adrenoceptor family, including structures with bound agonists, antagonists, and inverse agonists. We also have a structure for a bound G protein trimer, and a structure with a nanobody bound in such a way that similar displacements are observed. This diversity of β2 adrenoceptor structures allows an in-depth study of how they move during different phases of the activation process. Figure 15 shows differences in the helices observed in globally superposed structures.
3. Discussion
Over one hundred GPCR structures are available in the PDB. These structures relate to more than ten different GPCRs that have been solved with and without nanobodies or G proteins bound to them; with different ligands bound to them, and with different modifications to achieve crystallization. All these structures contain at least some artifacts due to crystallization. Binding a ligand or a G protein leads to structural changes such as helix displacements or rotamer flips in residues, but it is sometimes hard to separate these endogenous activation-related structure changes from those caused by crystallization artifacts. We used the variable importance score of the Random Forest (RF) method to elucidate which distances differ systematically as a function of motions associated with different phases of the GPCR activation process. However, the results of this approach cannot be taken at face-value, because we manually defined which structures were in the active or inactive state (G protein bound, or nanobody bound at the same location) and then used the RF method to determine which interatomic distances differ most systematically between the defined states. Similarly, a comparison of structures with bound agonists, antagonists, or inverse agonists is only as good as the determination of whether the bound compounds indeed are an agonist, etc. In addition, there is the risk that we may have missed a confounding variable—for example, the possibility that inverse agonists only bind to structures in which a certain mutation has been introduced.
The comparison between active structures and inactive structures, and between structures with an agonist bound and structures with an inverse agonist bound, revealed several interesting systematic movements. The strongest systematically observed effect in GPCR activation is an inward motion of Tyr733. It has been hypothesized [2,8,34–36] that this residue stabilizes the active conformation by interacting with Arg340, thereby stabilizing the open conformation of the “ionic lock” comprising Arg340 and Glu600. The outward motion of the intracellular side of helix VI is also systematically related to stages of the activation process, and is more a rotation of the entire helix then it is a bending of the helix at the kink near W618 and P620.
Compared to structures in the inactive state, structures in the active state show an upward (towards the outside of the cell) displacement of helix III combined with a hard to describe screw motion that makes the extracellular part of helix III a more regular α-helix. The extent of this upward displacement differs from case to case.
Agonist binding brings helices V–VII closer to each other around the ligand-binding site. Activation and G-protein binding are associated with a displacement of the intracellular part of helix VI that creates the crevice in which the G protein binds. Helix VI, however, does not have a hinge-point where the bending angle of the helix changes. The helix itself stays rigid but it rotates in such a way that its cytosolic side moves outwards. Combined with the regularization of helix III, and the known constitutive activity of many GPCRs in the absence of agonists, we see similarity to a clothespin in the model for G protein activation. GPCRs are mostly in the inactive conformation (clothespin closed) and occasionally in the active conformation (clothespin open), in which state they can activate a G protein (catch the laundry). Agonist binding and G protein binding both stabilize the active conformation. The activating ligand keeps the extracellular sides of helices V–VII close together and straightens helix III, while also moving helix III upwards, towards the extracellular side. The other helices do not bend at their hinge points but maintain their same general local orientation. It could be hypothesized that the upward movement of helix III makes it easier to break the ionic lock. During GPCR activation, Arg340 flips upwards, where it is maintained in position by interactions with Tyr733 and Tyr528. This creates a crevice at the intracellular side of the receptor in which the C-terminal end of a G protein’s α-subunit can dock. After binding to the activated GPCR, the G protein exchanges its bound GDP for a GTP and activates downstream pathways. An inverse agonist keeps the GPCR in its inactive state by pushing the extracellular sides of the helices outward, thus keeping the cytosolic sides of the helices close to one another.
4. Methods
GPCR structures were extracted from the PDB [58] and stripped of water, crystallization additives, multimeric partners, G proteins, cloned-in lysozymes, antibodies, etc. They were renumbered using the GPCRDB numbering scheme [59] which has the advantage of allocating identical residue numbers to homologous residues, making it easier for many computer programs to deal with them. For example, the Utopia-GPCRDB intelligent PDF-reader [60] directly couples GPCR-related articles to the GPCRDB [61] information system, making it very easy to place information extracted from an article in the wider context of GPCR knowledge. For example, this article can be read most productively using the Utopia-GPCRDB intelligent PDF-reader, which can be downloaded from the GPCRDB [61].
Unless mentioned otherwise, global structure superpositions were performed using the YASARA (YASARA Biosciences, Vienna, Austria) [44] implementation of the Mustang structure alignment program [62]. Local superpositions were produced with the WHAT IF [63] superposition module [64] that is integrated into YASARA. The superposition of single helices using only Cα atoms of corresponding residues (thus neglecting bulge residues when one superposition partner does not possess that bulge) was performed using a variant of the WHAT IF superposition module specifically adapted for the purpose.
Analyses were performed using the randomForest module [65] in the R [66] package. Variable importances for predefined classes were determined using standard parameters and ntree = 1000, importance = TRUE. Variables were sorted with the Gini importance parameter as sorting key.
α-Bulges were detected with a rewritten version of DSSP [67]. This rewritten version, DSSP 2.0, produces >>99.9% the same results as the now thirty-year old original DSSP 1.0 program written by Kabsch and Sander when α-bulges (also often called π-helices [26]) are not counted. Table 1 lists the structures used and their α-bulges.
DSSP 2.0 is freely available through the Centre for Molecular Bacteriology and Infection (CMBI) structure facilities [68] web pages [69] or directly from the DSSP FTP site at [70]. All structures mentioned in this study, and an extensive study of the effects of mutations and crystallization additives on the structures, are available at [18]. This website also holds many methodological details that are beyond the scope of this article.
5. Conclusions
We have shown that α-bulges are a prominent feature of GPCRs and that the bulges in helices II and V are highly conserved, albeit varying in type, and in helix V also varying in location. We also observed bulges in helices IV and VII. In several cases, the presence or absence of these bulges is directly linked to the activation process, but in other cases this conclusion awaits experimental validation. Although α-bulges have so far largely eluded the interest of the GPCR research community, we hope that this short summary of our observations will stimulate experiments aimed at shedding light on their precise functional role.
Acknowledgments
We thank Ray Stevens, Bob Bywater, Laerte Oliveira, Thomas Lengauer, and Olga Kalinka for stimulating discussions. Maarten Hekkelman rewrote DSSP to properly account for bulges.
Conflicts of Interest
The authors declare no conflict of interest.
- Author ContributionsGV designed the project and supervised RvdK during the execution while RvdK was a master student in GV’s group at the CMBI. RvdK and GV jointly wrote the article.
References
- PubMed. Available online: http://www.ncbi.nlm.nih.gov/pubmed accessed on 19 April 2014.
- Palczewski, K.; Kumasaka, T.; Hori, T.; Behnke, C.A.; Motoshima, H.; Fox, B.A.; le Trong, I.; Teller, D.C.; Okada, T.; Stenkamp, R.E.; et al. Crystal structure of rhodopsin: A G protein-coupled receptor. Science 2000, 289, 739–745. [Google Scholar]
- GPCR Network. Available online: http://gpcr.scripps.edu/ accessed on 19 April 2014.
- Kouranov, A.; Xie, L.; de la Cruz, J.; Chen, L.; Westbrook, J.; Bourne, P.E.; Berman, H.M. The RCSB PDB information portal for structural genomics. Nucleic Acids Res 2006, 34, D302–D305. [Google Scholar]
- Vroling, B.; Sanders, M.; Baakman, C.; Borrmann, A.; Verhoeven, S.; Klomp, J.; Oliveira, L.; de Vlieg, J.; Vriend, G. GPCRDB: Information system for G protein-coupled receptors. Nucleic Acids Res 2011, 39, D309–D319. [Google Scholar]
- Horn, F.; Bettler, E.; Oliveira, L.; Campagne, F.; Cohen, F.E.; Vriend, G. GPCRDB information system for G protein-coupled receptors. Nucleic Acids Res 2003, 31, 294–297. [Google Scholar]
- Van Durme, J.; Horn, F.; Costagliola, S.; Vriend, G.; Vassart, G. GRIS: Glycoprotein-hormone receptor information system. Mol. Endocrinol 2006, 20, 2247–2255. [Google Scholar]
- Oliveira, L.; Paiva, A.C.; Vriend, G. A low resolution model for the interaction of G proteins with G protein-coupled receptors. Protein Eng 1999, 12, 1087–1095. [Google Scholar]
- Oliveira, L.; Hulsen, T.; Lutje Hulsik, D.; Paiva, A.C.; Vriend, G. Heavier-than-air flying machines are impossible. FEBS Lett 2004, 564, 269–273. [Google Scholar]
- Oliveira, L.; Paiva, A.C.; Sander, C.; Vriend, G. A common step for signal transduction in G protein-coupled receptors. Trends Pharmacol. Sci 1994, 15, 170–172. [Google Scholar]
- Rasmussen, S.G.; DeVree, B.T.; Zou, Y.; Kruse, A.C.; Chung, K.Y.; Kobilka, T.S.; Thian, F.S.; Chae, P.S.; Pardon, E.; Calinski, D.; et al. Crystal structure of the β2 adrenergic receptor-Gs protein complex. Nature 2011, 477, 549–555. [Google Scholar]
- Wacker, D.; Fenalti, G.; Brown, M.A.; Katritch, V.; Abagyan, R.; Cherezov, V.; Stevens, R.C. Conserved binding mode of human β2 adrenergic receptor inverse agonists and antagonist revealed by X-ray crystallography. J. Am. Chem. Soc 2010, 132, 11443–11445. [Google Scholar]
- Chun, E.; Thompson, A.A.; Liu, W.; Roth, C.B.; Griffith, M.T.; Katritch, V.; Kunken, J.; Xu, F.; Cherezov, V.; Hanson, M.A.; et al. Fusion partner toolchest for the stabilization and crystallization of G protein-coupled receptors. Structure 2012, 20, 967–976. [Google Scholar]
- Lebon, G.; Warne, T.; Edwards, P.C.; Bennett, K.; Langmead, C.J.; Leslie, A.G.; Tate, C.G. Agonist-bound adenosine A2A receptor structures reveal common features of GPCR activation. Nature 2011, 474, 521–525. [Google Scholar]
- Rasmussen, S.G.; Choi, H.J.; Fung, J.J.; Pardon, E.; Casarosa, P.; Chae, P.S.; Devree, B.T.; Rosenbaum, D.M.; Thian, F.S.; Kobilka, T.S.; et al. Structure of a nanobody-stabilized active state of the β2 adrenoceptor. Nature 2011, 469, 175–180. [Google Scholar]
- Warne, T.; Serrano-Vega, M.J.; Baker, J.G.; Moukhametzianov, R.; Edwards, P.C.; Henderson, R.; Leslie, A.G.; Tate, C.G.; Schertler, G.F. Structure of a β1-adrenergic G protein-coupled receptor. Nature 2008, 454, 486–491. [Google Scholar]
- Doré, A.S.; Robertson, N.; Errey, J.C.; Ng, I.; Hollenstein, K.; Tehan, B.; Hurrell, E.; Bennett, K.; Congreve, M.; Magnani, F.; et al. Structure of the adenosine A2A receptor in complex with ZM241385 and the xanthines XAC and caffeine. Structure 2011, 19, 1283–1293. [Google Scholar]
- GPCR Activation: What Moves Where? Van der Kant RWA, Available online: http://swift.cmbi.ru.nl/gv/GPCR/ accessed on 19 April 2014.
- White, J.F.; Noinaj, N.; Shibata, Y.; Love, J.; Kloss, B.; Xu, F.; Gvozdenovic-Jeremic, J.; Shah, P.; Shiloach, J.; Tate, C.G.; et al. Structure of the agonist-bound neurotensin receptor. Nature 2012, 490, 508–513. [Google Scholar]
- Mason, J.S.; Bortolato, A.; Congreve, M.; Marshall, F.H. New insights from structural biology into the druggability of G protein-coupled receptors. Trends Pharmacol. Sci 2012, 33, 249–260. [Google Scholar]
- Hino, T.; Arakawa, T.; Iwanari, H.; Yurugi-Kobayashi, T.; Ikeda-Suno, C.; Nakada-Nakura, Y.; Kusano-Arai, O.; Weyand, S.; Shimamura, T.; Nomura, N.; et al. G protein-coupled receptor inactivation by an allosteric inverse-agonist antibody. Nature 2012, 482, 237–240. [Google Scholar]
- Katritch, V.; Reynolds, K.A.; Cherezov, V.; Hanson, M.A.; Roth, C.B.; Yeager, M.; Abagyan, R. Analysis of full and partial agonists binding to β2-adrenergic receptor suggests a role of transmembrane helix V in agonist-specific conformational changes. J. Mol. Recognit 2009, 22, 307–318. [Google Scholar]
- Rasmussen, S.G.; Choi, H.J.; Rosenbaum, D.M.; Kobilka, T.S.; Thian, F.S.; Edwards, P.C.; Burghammer, M.; Ratnala, V.R.; Sanishvili, R.; Fischetti, R.F.; et al. Crystal structure of the human β2 adrenergic G protein-coupled receptor. Nature 2007, 450, 383–387. [Google Scholar]
- Rey, J.; Deville, J.; Chabbert, M. Structural determinants stabilizing helical distortions related to proline. J. Struct. Biol 2010, 171, 266–276. [Google Scholar]
- Van Arnam, E.B.; Lester, H.A.; Dougherty, D.A. Dissecting the functions of conserved prolines within transmembrane helices of the D2 dopamine receptor. ACS Chem. Biol 2011, 6, 1063–1068. [Google Scholar]
- Cartailler, J.P.; Luecke, H. Structural and functional characterization of pi bulges and other short intrahelical deformations. Structure 2004, 12, 133–144. [Google Scholar]
- Worth, C.L.; Kreuchwig, A.; Kleinau, G.; Krause, G. GPCR-SSFE: A comprehensive database of G protein-coupled receptor template predictions and homology models. BMC Bioinform 2011, 12, 185. [Google Scholar]
- Deupi, X. Quantification of structural distortions in the transmembrane helices of GPCRs. Methods Mol. Biol 2012, 914, 219–235. [Google Scholar]
- Devillé, J.; Rey, J.; Chabbert, M. An indel in transmembrane helix 2 helps to Trace the molecular evolution of class A G protein-coupled receptors. J. Mol. Evol 2009, 68, 475–489. [Google Scholar]
- Gonzalez, A.; Cordomí, A.; Caltabiano, G.; Pardo, L. Impact of helix irregularities on sequence alignment and homology modeling of G protein-coupled receptors. Chembiochem 2012, 13, 1393–1399. [Google Scholar]
- Isberg, V.; Vroling, B.; van der Kant, R.; Li, K.; Vriend, G.; Gloriam, D. GPCRDB: An information system for G protein-coupled receptors. Nucleic Acids Res 2014, 42, D422–D425. [Google Scholar]
- Strobl, C.; Boulesteix, A.L.; Kneib, T.; Augustin, T.; Zeileis, A. Conditional variable importance for random forests. BMC Bioinform 2008, 9, 307:1–307:11. [Google Scholar]
- Toloşi, L.; Lengauer, T. Classification with correlated features: Unreliability of feature ranking and solutions. Bioinformatics 2011, 27, 1986–1994. [Google Scholar]
- Hofmann, K.P.; Scheerer, P.; Hildebrand, P.W.; Choe, H.W.; Park, J.H.; Heck, M.; Ernst, O.P. A G protein-coupled receptor at work: The rhodopsin model. Trends Biochem. Sci 2009, 34, 540–552. [Google Scholar]
- Vogel, R.; Sakmar, T.P.; Sheves, M.; Siebert, F. Coupling of protonation switches during rhodopsin activation. Photochem. Photobiol 2007, 83, 286–292. [Google Scholar]
- Fritze, O.; Filipek, S.; Kuksa, V.; Palczewski, K.; Hofmann, K.P.; Ernst, O.P. Role of the conserved NPxxY(x)5,6F motif in the rhodopsin ground state and during activation. Proc. Natl. Acad. Sci. USA 2003, 100, 2290–2295. [Google Scholar]
- Yohannan, S.; Faham, S.; Yang, D.; Whitelegge, J.P.; Bowie, J.U. The evolution of transmembrane helix kinks and the structural diversity of G protein-coupled receptors. Proc. Natl. Acad. Sci. USA 2004, 101, 959–963. [Google Scholar]
- Ceruso, M.A.; Weinstein, H. Structural mimicry of proline kinks: Tertiary packing interactions support local structural distortions. J. Mol. Biol 2002, 318, 1237–1249. [Google Scholar]
- Riek, R.P.; Rigoutsos, I.; Novotny, J.; Graham, R.M. Non-α-helical elements modulate polytopic membrane protein architecture. J. Mol. Biol 2001, 306, 349–362. [Google Scholar]
- Hong, S.; Ryu, K.S.; Oh, M.S.; Ji, I.; Ji, T.H. Roles of transmembrane prolines and proline-induced kinks of the lutropin/choriogonadotropin receptor. J. Biol. Chem 1997, 272, 4166–4171. [Google Scholar]
- Geetha, V. Distortions in protein helices. Int. J. Biol. Macromol 1996, 19, 81–89. [Google Scholar]
- Von Heijne, G. Proline kinks in transmembrane α-helices. J. Mol. Biol 1991, 218, 499–503. [Google Scholar]
- Conner, A.C.; Hay, D.L.; Simms, J.; Howitt, S.G.; Schindler, M.; Smith, D.M.; Wheatley, M.; Poyner, D.R. A key role for transmembrane prolines in calcitonin receptor-like receptor agonist binding and signalling: Implications for family B G protein-coupled receptors. Mol. Pharmacol 2005, 67, 20–31. [Google Scholar]
- Krieger, E.; Vriend, G. Models@Home: Distributed computing in bioinformatics using a screensaver based approach. Bioinformatics 2002, 18, 315–318. [Google Scholar]
- Hanson, M.A.; Roth, C.B.; Jo, E.; Griffith, M.T.; Scott, F.L.; Reinhart, G.; Desale, H.; Clemons, B.; Cahalan, S.M.; Schuerer, S.C.; et al. Crystal structure of a lipid G protein-coupled receptor. Science 2012, 335, 851–855. [Google Scholar]
- Jaakola, V.P.; Griffith, M.T.; Hanson, M.A.; Cherezov, V.; Chien, E.Y.; Lane, J.R.; Ijzerman, A.P.; Stevens, R.C. The 2.6 Ångstrom crystal structure of a human A2A adenosine receptor bound to an antagonist. Science 2008, 322, 1211–1217. [Google Scholar]
- Murakami, M.; Kouyama, T. Crystal structure of squid rhodopsin. Nature 2008, 453, 363–367. [Google Scholar]
- Wu, B.; Chien, E.Y.; Mol, C.D.; Fenalti, G.; Liu, W.; Katritch, V.; Abagyan, R.; Brooun, A.; Wells, P.; Bi, F.C.; et al. Structures of the CXCR4 chemokine GPCR with small-molecule and cyclic peptide antagonists. Science 2010, 330, 1066–1071. [Google Scholar]
- Haga, K.; Kruse, A.C.; Asada, H.; Yurugi-Kobayashi, T.; Shiroishi, M.; Zhang, C.; Weis, W.I.; Okada, T.; Kobilka, B.K.; Haga, T.; et al. Structure of the human M2 muscarinic acetylcholine receptor bound to an antagonist. Nature 2012, 482, 547–551. [Google Scholar]
- Park, J.H.; Scheerer, P.; Hofmann, K.P.; Choe, H.W.; Ernst, O.P. Crystal structure of the ligand-free G protein-coupled receptor opsin. Nature 2008, 454, 183–187. [Google Scholar]
- Granier, S.; Manglik, A.; Kruse, A.C.; Kobilka, T.S.; Thian, F.S.; Weis, W.I.; Kobilka, B.K. Structure of the δ-opioid receptor bound to naltrindole. Nature 2012, 485, 400–404. [Google Scholar]
- Chien, E.Y.; Liu, W.; Zhao, Q.; Katritch, V.; Han, G.W.; Hanson, M.A.; Shi, L.; Newman, A.H.; Javitch, J.A.; Cherezov, V.; et al. Structure of the human dopamine D3 receptor in complex with a D2/D3 selective antagonist. Science 2010, 330, 1091–1095. [Google Scholar]
- Schwartz, T.W.; Frimurer, T.M.; Holst, B.; Rosenkilde, M.M.; Elling, C.E. Molecular mechanism of 7TM receptor activation—A global toggle switch model. Annu. Rev. Pharmacol. Toxicol 2006, 46, 481–519. [Google Scholar]
- Congreve, M.; Andrews, S.P.; Doré, A.S.; Hollenstein, K.; Hurrell, E.; Langmead, C.J.; Mason, J.S.; Ng, I.W.; Tehan, B.; Zhukov, A.; et al. Discovery of 1,2,4-triazine derivatives as adenosine A2A antagonists using structure based drug design. J. Med. Chem 2012, 55, 1898–1903. [Google Scholar]
- Xu, F.; Wu, H.; Katritch, V.; Han, G.W.; Jacobson, K.A.; Gao, Z.G.; Cherezov, V.; Stevens, R.C. Structure of an agonist-bound human A2A adenosine receptor. Science 2011, 332, 322–327. [Google Scholar]
- Cherezov, V.; Rosenbaum, D.M.; Hanson, M.A.; Rasmussen, S.G.; Thian, F.S.; Kobilka, T.S.; Choi, H.J.; Kuhn, P.; Weis, W.I.; Kobilka, B.K.; et al. High-resolution crystal structure of an engineered human β2-adrenergic G protein-coupled receptor. Science 2007, 318, 1258–1265. [Google Scholar]
- Hanson, M.A.; Cherezov, V.; Griffith, M.T.; Roth, C.B.; Jaakola, V.P.; Chien, E.Y.; Velasquez, J.; Kuhn, P.; Stevens, R.C. A specific cholesterol binding site is established by the 2.8 Å structure of the human β2-adrenergic receptor. Structure 2008, 16, 897–905. [Google Scholar]
- Berman, H.M.; Westbrook, J.; Feng, Z.; Gilliland, G.; Bhat, T.N.; Weissig, H.; Shindyalov, I.N.; Bourne, P.E. The protein data bank. Nucleic Acids Res 2000, 28, 235–242. [Google Scholar]
- Oliveira, L.; Paiva, A.C.M.; Vriend, G. A common motif in G protein-coupled seven transmembrane helix receptors. J. Comput-Aided Mol. Des 1993, 7, 649–658. [Google Scholar]
- Vroling, B.; Thorne, D.; McDermott, P.; Attwood, T.K.; Vriend, G.; Pettifer, S. Integrating GPCR-specific information with full text articles. BMC Bioinform 2011, 12, 362:1–362:10. [Google Scholar]
- GPCRDB. Available online: http://www.gpcr.org/7tm/ accessed on 19 April 2014.
- Konagurthu, A.S.; Whisstock, J.C.; Stuckey, P.J.; Lesk, A.M. MUSTANG: A multiple structural alignment algorithm. Proteins Struct. Funct. Bioinform 2006, 64, 559–574. [Google Scholar]
- Vriend, G. WHAT IF A molecular modelling and drug design program. J. Mol. Graph 1990, 8, 52–56. [Google Scholar]
- Vriend, G.; Sander, C. Detection of common three-dimensional substructures in proteins. Proteins 1991, 11, 52–58. [Google Scholar]
- Liaw, A.; Wiener, M. Classification and regression by randomForest. R News 2002, 2, 18–22. [Google Scholar]
- R Development Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2008. Available online: http://www.R-project.org accessed on 19 April 2014.
- Kabsch, W.; Sander, C. Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 1983, 22, 2577–2637. [Google Scholar]
- Joosten, R.P.; te Beek, T.A.; Krieger, E.; Hekkelman, M.L.; Hooft, R.W.; Schneider, R.; Sander, C.; Vriend, G. A series of PDB related databases for everyday needs. Nucleic Acids Res 2011, 39, D411–D419. [Google Scholar]
- CMBI Protein Structure Bioinformatics Facilities, Available online: http://swift.cmbi.ru.nl/gv/facilities/ accessed on 19 April 2014.
- Index of Software DSSP, Available online: ftp://ftp.cmbi.ru.nl/pub/software/dssp/ accessed on 19 April 2014.















| Species | ID | TM1 | TM2 | TM3 | TM4 | TM5 | TM6 | TM7 |
|---|---|---|---|---|---|---|---|---|
| Rhodopsin | 1f88, 1gzm, 1hzx, 1l9h, 1u19, 2g87, 2hzy, 2i35, 2i36, 2i37, 2j4y, 2ped, 3c9l, 3c9m, 3oax | 0 | 1 | 0 | 0 | 1 | 0 | 0 |
| Opsin | 2x72, 3cap, 3dqb, 3pqr, 3pxo, 4a4m | 0 | 1 | 0 | 0 | 1 | 0 | 1 |
| Rhod squid | 2z73, 2ziy, 3aym, 3ayn | 0 | 2 | 0 | 0 | 1 | 0 | 0 |
| β1 AR | 2vt4, 2y00, 2y01, 2y02, 2y03, 2y04, 2ycw, 2ycx, 2ycy, 2ycz, 4ami, 4amj, 4gpo | 0 | 1 | 0 | 0 | 1 | 0 | −1 |
| β2 AR inact | 2rh1, 3d4s, 3ny8, 3ny9, 3nya | 0 | 1 | 0 | 0 | 1 | 0 | −1 |
| β2 AR act | 3p0g, 3pds, 3sn6, 4lde, 4ldl, 4ldo | 0 | 1 | 0 | 0 | 1 | 0 | 0 |
| A2A inact | 3eml, 3pwh, 3rey, 3rfm, 3uza, 3uac, 3vg9, 3vga, 4eiy | 0 | 1 | 0 | 0 | 2 * | 0 | 0 |
| A2A act | 2ydo, 2ydv, 3qak | 0 | 1 | 0 | 0 | 2 * | 0 | 1 |
| CXCR4 | 3odu, 3oe0, 3oe6, 3oe8, 3oe9 | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
| Opioid | 4djh, 4dkl, 4ea3, 4ej4 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| Lipid | 3v2w, 3v2y | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Serotonin 1B | 4iaq, 4iar | 0 | 1 | 0 | 0 | 1 | 0 | 0 |
| Serotonin 2B | 4ib4 | 1 | 1 | 0 | 0 | 1 | 0 | 0 |
| CCR5 | 4mbs | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| PAR1 | 3vw7 | 0 | 0 | 0 | 0 | 1 | 1 | 0 |
| NTSR1 | 4grv | 0 | 1 | 0 | 0 | 1 | 0 | 0 |
| Muscarinic | 3uon, 4daj | 0 | 1 | 0 | 1 | 1 | 0 | 0 |
| Histamine H1 | 3rze | 0 | 1 | 0 | 0 | 1 | 0 | 0 |
| Dopamine D3 | 3pbl | 0 | 1 | 0 | 0 | 1 | 0 | 0 |
*Bulge in Helix V of the A2A receptors is twice as long as in other receptors.
© 2014 by the authors; licensee MDPI, Basel, Switzerland This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).