<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="en" article-type="review-article" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">ijms</journal-id>
<journal-title>International Journal of Molecular Sciences</journal-title>
<abbrev-journal-title>Int. J. Mol. Sci.</abbrev-journal-title>
<issn pub-type="epub">1422-0067</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/ijms141020635</article-id>
<article-id pub-id-type="publisher-id">ijms-14-20635</article-id>
<article-categories>
<subj-group>
<subject>Review</subject></subj-group></article-categories>
<title-group>
<article-title>Mass Spectrometry Coupled Experiments and Protein Structure Modeling Methods</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Pi</surname><given-names>Jaewoo</given-names></name><xref rid="af1-ijms-14-20635" ref-type="aff">1</xref><xref rid="af2-ijms-14-20635" ref-type="aff">2</xref></contrib>
<contrib contrib-type="author">
<name><surname>Sael</surname><given-names>Lee</given-names></name><xref rid="af1-ijms-14-20635" ref-type="aff">1</xref><xref rid="af2-ijms-14-20635" ref-type="aff">2</xref><xref rid="c1-ijms-14-20635" ref-type="corresp">&#x0002A;</xref></contrib></contrib-group>
<aff id="af1-ijms-14-20635">
<label>1</label>Department of Computer Science, Stony Brook University, Stony Brook, NY 11794, USA</aff>
<aff id="af2-ijms-14-20635">
<label>2</label>Department of Computer Science, State University of New York Korea, Incheon 406-840, Korea; E-Mail: <email>jwpi@sunykorea.ac.kr</email></aff>
<author-notes>
<corresp id="c1-ijms-14-20635">
<label>&#x0002A;</label>Author to whom correspondence should be addressed; E-Mail: <email>sael@sunykorea.ac.kr</email>; Tel.: &#x0002B;81-32-626-1215; Fax: &#x0002B;81-32-626-1198.</corresp></author-notes>
<pub-date pub-type="collection">
<month>10</month>
<year>2013</year></pub-date>
<pub-date pub-type="epub">
<day>15</day>
<month>10</month>
<year>2013</year></pub-date>
<volume>14</volume>
<issue>10</issue>
<fpage>20635</fpage>
<lpage>20657</lpage>
<history>
<date date-type="received">
<day>30</day>
<month>07</month>
<year>2013</year></date>
<date date-type="rev-recd">
<day>17</day>
<month>09</month>
<year>2013</year></date>
<date date-type="accepted">
<day>19</day>
<month>09</month>
<year>2013</year></date></history>
<permissions>
<copyright-statement>&#x000A9; 2013 by the authors; licensee MDPI, Basel, Switzerland</copyright-statement>
<copyright-year>2013</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/">
<p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>With the accumulation of next generation sequencing data, there is increasing interest in the study of intra-species difference in molecular biology, especially in relation to disease analysis. Furthermore, the dynamics of the protein is being identified as a critical factor in its function. Although accuracy of protein structure prediction methods is high, provided there are structural templates, most methods are still insensitive to amino-acid differences at critical points that may change the overall structure. Also, predicted structures are inherently static and do not provide information about structural change over time. It is challenging to address the sensitivity and the dynamics by computational structure predictions alone. However, with the fast development of diverse mass spectrometry coupled experiments, low-resolution but fast and sensitive structural information can be obtained. This information can then be integrated into the structure prediction process to further improve the sensitivity and address the dynamics of the protein structures. For this purpose, this article focuses on reviewing two aspects: the types of mass spectrometry coupled experiments and structural data that are obtainable through those experiments; and the structure prediction methods that can utilize these data as constraints. Also, short review of current efforts in integrating experimental data in the structural modeling is provided.</p></abstract>
<kwd-group>
<kwd>constraint-base structure prediction</kwd>
<kwd>integrative structure prediction</kwd>
<kwd>sequence variants</kwd>
<kwd>protein dynamics</kwd>
<kwd>mass spectrometry</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>In the post-genomics period, more researches are focused on functional and conformational analysis of proteins in a genomic scale &#x0005B;<xref rid="b1-ijms-14-20635" ref-type="bibr">1</xref>&#x0005D;. Although experimental methods, such as nuclear magnetic resonance (NMR) spectroscopy and X-ray crystallography, have advanced in the past two decades, these methods are still labor intensive, high cost, and it can take weeks to months to solve a three dimensional structure &#x0005B;<xref rid="b2-ijms-14-20635" ref-type="bibr">2</xref>&#x0005D;. Due to the difficulties associated with the experimental methods, the number of protein structures that have been solved is much smaller than the number of protein sequences. With advancements in the sequencing machines, the gap between the numbers is growing even faster (<xref rid="f1-ijms-14-20635" ref-type="fig">Figure 1</xref>). Structure prediction approaches can be used to overcome this chasm. By determining three dimensional (3D) molecular structures &#x0005B;<xref rid="b3-ijms-14-20635" ref-type="bibr">3</xref>,<xref rid="b4-ijms-14-20635" ref-type="bibr">4</xref>&#x0005D;, they can be used to analyze structural interactions between biomolecules &#x0005B;<xref rid="b5-ijms-14-20635" ref-type="bibr">5</xref>,<xref rid="b6-ijms-14-20635" ref-type="bibr">6</xref>&#x0005D; and to determine the functionality of a protein or protein complexes &#x0005B;<xref rid="b7-ijms-14-20635" ref-type="bibr">7</xref>,<xref rid="b8-ijms-14-20635" ref-type="bibr">8</xref>&#x0005D;. However, prediction of precise structures in the presence of variations (or mutations) remains challenging. Determination of their atomic level dynamics also remains difficult.</p>
<p>Integration of proteomics results, such as mass spectrometry (MS) coupled experiments, can reduce the difficulties associated with structural modeling. MS-coupled methods such as hydrogen-deuterium exchange (HDX), hydroxyl-radical mediated covalent labeling (protein footprinting), chemical cross-linking, ion mobility spectrometry, and native methods have emerged as structural proteomics techniques for analyzing the protein complexes, for identifying structural change up-on binding, and for detection of post-translational modifications. MS-coupled experiments provide fast and highly sensitive spatial information of the structure being analyzed. Much of the spatial information can be integrated into the structure prediction methods. They can be used to choose the structure that is most consistent with the MS-coupled experiments. They can be also used directly in the structure optimization procedure. MS-coupled methods, in addition to being fast and highly sensitive, require less mass of sample to extract the structural information compared to traditional structure solvers. This means that multiple experiments can be done without being limited by the available sample.</p>
<p>Although there are many studies on both the MS-coupled experiments and the structure prediction methods, the integration of the experimental data with the computation methods is still not widely explored. Developments in the integrative methods will provide advancements in the structural biology area. For this reasons, this review focuses on the mass spectrometry for studying the structural and dynamics of biomolecules &#x0005B;<xref rid="b9-ijms-14-20635" ref-type="bibr">9</xref>,<xref rid="b10-ijms-14-20635" ref-type="bibr">10</xref>&#x0005D; and structure prediction methods to promote integrative method development and researches in structural bioinformatics. The review is organized as follows. First, types and characteristics of MS-coupled experiments are overviewed. Then, a review of structure prediction methods is provided. In the last section, existing integrative methods are described with suggestions for further integrations.</p></sec>
<sec sec-type="other">
<label>2.</label>
<title>Mass Spectrometry Techniques</title>
<p>Mass spectrometry experiment (MS) is a high-throughput experimental method that characterizes molecules by their mass-to-charge (<italic>m</italic>/<italic>z</italic>) ratio. The MS is composed of sample preparation, molecular ionization, detection, and instrumentation analysis processes &#x0005B;<xref rid="b11-ijms-14-20635" ref-type="bibr">11</xref>&#x0005D;. MS is beneficial in that it is generally fast, requires a small amount of sample, and provides high accuracy measurements. For these reasons, MS alone or combined with other structural proteomics techniques is widely used for various molecular biology analysis purposes. Examples of the analysis include post-translations modifications in proteins, identification of vibrational components in proteins, and analysis of protein conformation and dynamics &#x0005B;<xref rid="b12-ijms-14-20635" ref-type="bibr">12</xref>&#x0005D;. We will focus on MS-coupled methods that provide information about conformation and dynamics of the protein being studied (<xref rid="t1-ijms-14-20635" ref-type="table">Table 1</xref>). For a comprehensive review on MS procedures, refer to &#x0005B;<xref rid="b12-ijms-14-20635" ref-type="bibr">12</xref>&#x0005D;, and for a review on various types of MS-coupled methods, refer to &#x0005B;<xref rid="b9-ijms-14-20635" ref-type="bibr">9</xref>&#x0005D;.</p>
<sec>
<label>2.1.</label>
<title>Hydrogen/Deuterium Exchange Mass Spectrometry</title>
<p>Hydrogen/deuterium exchange mass spectrometry (HDX-MS) exploits the chemical exchange pattern of amide hydrogens, <italic>i.e.</italic>, hydrogens that are attached to the backbone nitrogen in proteins &#x0005B;<xref rid="b13-ijms-14-20635" ref-type="bibr">13</xref>&#x0005D;. In a HDX experiment, proteins are placed in a solution containing deuterated water (D<sub>2</sub>O). Inside the solution, the amide hydrogens (H) exchange with the deuterium (D). This exchange increases the mass of proteins. The proteins can then be treated for the MS analysis to find out the overall mass change. Alternatively, the protein can be fragmented and fragments can be treated for the MS analysis to find out the mass change for each of the fragments.</p>
<p>The location and rate of the exchange depends on the solvent accessibility, hydrogen-bonding, pH level, and temperature. Assuming that the pH level and the temperature can be controlled, the solvent accessibility and hydrogen-bonding can be detected through analysis of the change in mass. The hydrogen exchange event occurs primarily in the amide hydrogen of residues on the solvent accessible region of the protein. However, not all solvent accessible residues have amide hydrogen available for the exchange event. Amide hydrogen also plays a role in constructing secondary structures such as alpha-helices and beta-sheets. When a secondary structure is formed, hydrogen bonding occurs between amide hydrogen and electro-negative atom in the side chains of other residues. A stable structure makes hydrogen exchange in the amide hydrogen less likely.</p>
<p>Depending on the availability and stability of the (local) structures, the rate of the exchange differs. Amide hydrogens that are exposed on the surface exchange hydrogen with deuterium quickly, while those buried in the core have much slower exchange rates. For the amide hydrogens that are solvent accessible but are part of hydrogen bonding, the exchange happens much slower through low-frequency vibration motions of the proteins.</p>
<p>Some of the successful applications of HDX include detecting binding affinity between HIV-1 Nef and Lyn SH3 &#x0005B;<xref rid="b17-ijms-14-20635" ref-type="bibr">17</xref>&#x0005D;, detecting conformational dynamics of the scaffold protein in the presence and absence of lipid &#x0005B;<xref rid="b18-ijms-14-20635" ref-type="bibr">18</xref>&#x0005D;, and examining the structural changes in the binding cites of the vitamin D receptor when bound to its natural ligand, 1&#x003B1;,25-dihydroxyvitamin D3, and two analogs ligands, alfacalcidol and ED-71 &#x0005B;<xref rid="b19-ijms-14-20635" ref-type="bibr">19</xref>&#x0005D;.</p></sec>
<sec>
<label>2.2.</label>
<title>Hydroxyl-Radical Mediated Covalent Labeling Mass Spectrometry</title>
<p>Hydroxyl-radical mediated covalent labeling, or protein footprinting, is a MS-coupled technique that is conceptually similar to HDX-MS. Similar to HDX-MS, protein footprinting also probes the solvent accessible residue and makes modifications to the accessible residues. The major difference between the HDX-MS and the protein footprinting is that HDX-MS targets the backbone amide hydrogen whereas the protein footprinting targets the side chains of the residues. In the protein footprinting, relative hydroxyl radicals, which have water-like solvent properties, interact with the side-chains of the solvent accessible residues and form stable covalent modifications that are detectable by MS &#x0005B;<xref rid="b14-ijms-14-20635" ref-type="bibr">14</xref>&#x0005D;. More specifically, side-chains of the solvent accessible residues are exposed to hydroxyl radicals and undergo covalent oxidation. The oxidation of the side chains results in mass shift which can be detected by MS. The comparison between the unmodified and modified proteins reveals which residues are solvent accessible &#x0005B;<xref rid="b10-ijms-14-20635" ref-type="bibr">10</xref>&#x0005D;. Protein footprinting provides a more direct measurement of solvent accessibility compared to the HDX-MS experiments.</p>
<p>The location and rate of the oxidation differs depending on the solvent accessibility of the residues and the reactivity of the side chains to hydroxyl radicals. Solvent accessibility of the protein structures can be evaluated through analyzing the correlation between the accessibility and the oxidation level for each type of amino acid &#x0005B;<xref rid="b20-ijms-14-20635" ref-type="bibr">20</xref>&#x0005D;. The relative reactivity of residue to hydroxyl radical depends on the side-chain chemistry, that can be listed by order of reactivity as follows: Cys &gt; Met &gt; Trp &gt; Tyr &gt; Phe &gt; Cystine (two disulfide bonded Cys) &gt; His &gt; Leu ~ Ile &gt; Arg ~ Lys ~ Val &gt; Ser ~ Thr ~ Pro &gt; Gln ~ Glu &gt; Asp ~Asn &gt; Ala &gt; Gly &#x0005B;<xref rid="b21-ijms-14-20635" ref-type="bibr">21</xref>&#x0005D;. Of these residues, Gly, Ala, Asp, and Asn have low reactivity, and thus, are not useful. In addition to the reactivity, mass change after oxidation needs to be large enough for MS to detect. For this reason, although Ser and Thr are reactive, they cannot be used for detection of solvent accessibility. In summary, 14 residues out of the 20 amino-acids can be used to detect structural properties via the protein footprinting method &#x0005B;<xref rid="b22-ijms-14-20635" ref-type="bibr">22</xref>&#x0005D;.</p>
<p>Detection of solvent accessibility enables the protein footprinting to be an attractive method for identifying interaction regions &#x0005B;<xref rid="b23-ijms-14-20635" ref-type="bibr">23</xref>&#x0005D;. One of the first uses of protein footprinting was in characterizing DNA-protein interactions such as detecting sequence-specific interactions of I12-X86 lac repressor with non-operator DNA &#x0005B;<xref rid="b24-ijms-14-20635" ref-type="bibr">24</xref>&#x0005D;. Protein footprinting has also been used to study the structural aspects of transmembrane proteins such as G protein-coupled receptors. In one study, protein footprinting was used to provide evidence that water molecules embedded and conserved in the G protein-coupled receptors are likely to be functionally important &#x0005B;<xref rid="b25-ijms-14-20635" ref-type="bibr">25</xref>&#x0005D;.</p>
<p>However, the preferential interaction quality makes the analysis of the MS results challenging. In order to apply the protein footprinting method for solvent accessibility analysis, accurate analysis of the correlation between the solvent accessibility and the reactivity of the residues is needed &#x0005B;<xref rid="b10-ijms-14-20635" ref-type="bibr">10</xref>&#x0005D;. For further details on various protein footprinting techniques, readers can refer to a review by Kiselar and Chance &#x0005B;<xref rid="b14-ijms-14-20635" ref-type="bibr">14</xref>&#x0005D;.</p></sec>
<sec>
<label>2.3.</label>
<title>Chemical Cross-Linking</title>
<p>Chemical cross-linking combined with a MS analysis is another important proteomics technique for structural analysis. Chemical cross-linking experiments are used to detect spatial closeness between residues in a protein for structure analysis purposes. They are also used to detect interacting region between proteins &#x0005B;<xref rid="b15-ijms-14-20635" ref-type="bibr">15</xref>&#x0005D;. Chemical cross-linking involves the use of a special reagent called cross-linkers, most often lysine linkers, to covalently attach two residues within a protein or between proteins that are spatially close. After the chemical cross-linkage process, MS analysis is performed to detect the cross-linked regions &#x0005B;<xref rid="b14-ijms-14-20635" ref-type="bibr">14</xref>&#x0005D;. The identified cross-link location information can be transferred to as distance constraints between residues. A sufficient number of distance constraints is known to provide important clues about the 3D structure of the protein.</p>
<p>Cross-links are generally formed by chemical reactions that are initiated by various factors, such as change in pH, heat, and radiation. The type of activator differs by the type of cross-linking reagents and results in cross-links of different characteristics. <xref rid="f2-ijms-14-20635" ref-type="fig">Figure 2</xref> shows four types of cross-linking reagents in a cartoon form. In a homo-bifunctional cross-linking, two of the same types of reactive groups are linked by a carbon-chain spacer arm (<xref rid="f2-ijms-14-20635" ref-type="fig">Figure 2A</xref>). In a hetero-bifunctional cross-linker, two different types of reactive groups are linked by a spacer arm (<xref rid="f2-ijms-14-20635" ref-type="fig">Figure 2B</xref>). In a zero-length cross-linking, cross-linking agents mediate amide or a phosphoramidate bond formation of the two reactive groups without the intermediate spacer (<xref rid="f2-ijms-14-20635" ref-type="fig">Figure 2C</xref>). The zero-length cross-linker is especially useful when we want to detect residues that are within 3&#x000C5; in space. There is also a hetero-trifunctional cross-linking agent, where three types of reactive group can be cross-linked. In the trifunctional cross-linking, a third reactive group from a protein can be attached or can be used for affinity purification purposes in case a biotin moiety is incorporated &#x0005B;<xref rid="b26-ijms-14-20635" ref-type="bibr">26</xref>&#x0005D;.</p>
<p>Cross-linking coupled MS has been successfully used to determine interactions between proteins. It has been used to identify interaction sites between heat shock protein and substrates &#x0005B;<xref rid="b27-ijms-14-20635" ref-type="bibr">27</xref>&#x0005D;, determine the structural organization of 19S regulatory particles in the 26S proteasome &#x0005B;<xref rid="b28-ijms-14-20635" ref-type="bibr">28</xref>&#x0005D;, and assess dynamic structures of viral capsid by identifying residue specific inter- and intra-subunit interactions in the viral capsid precursor &#x0005B;<xref rid="b29-ijms-14-20635" ref-type="bibr">29</xref>&#x0005D;. There are various advantages of chemical cross-linking experiments including the importance of the distance constraint information obtainable from the experiment and the ease of the cross-link experiment. However, due to the complexity in the cross-linking chemistry, the MS analysis is considered to be challenging and requires advances in both the experimental and computational analysis strategies &#x0005B;<xref rid="b14-ijms-14-20635" ref-type="bibr">14</xref>&#x0005D;. A survey of chemical cross-linking technique can be found in a review by Sinz &#x0005B;<xref rid="b26-ijms-14-20635" ref-type="bibr">26</xref>&#x0005D;.</p></sec>
<sec>
<label>2.4.</label>
<title>Ion Mobility-Mass Spectrometry</title>
<p>Ion mobility-mass spectrometry (IM-MS) is a multi-dimensional separation method that combines the ion-mobility spectrometry experiment with the MS experiment to identify components in the test sample. The major contribution of IM-MS in the proteomics studies is the capability to separate molecules by their size and shape, which enables the discrimination and determination of heterogeneity in the biomolecules &#x0005B;<xref rid="b16-ijms-14-20635" ref-type="bibr">16</xref>&#x0005D;.</p>
<p>In the IM-MS process, the ion mobility spectrometry experiment (IM) separates the initial batch of ionized test sample according to their mobility in the gas phase. The mobility depends on the size and shape of each ion. Other factors, such as structural heterogeneity and flexibility that effects the orientation and distribution of charges on the ion, also play important roles in the mobility of ions &#x0005B;<xref rid="b16-ijms-14-20635" ref-type="bibr">16</xref>,<xref rid="b30-ijms-14-20635" ref-type="bibr">30</xref>&#x0005D;. However, comprehensive list of factors and their mechanisms are not yet known. Known factors are controlled and utilized to analysis the characteristics of the molecule. After IM process, the ions are further separated by their mass-to-charge ratio (<italic>m</italic>/<italic>z</italic>) by the MS analysis. The MS process, in most case, is done in vacuum conditions and utilizes the distinctive properties of ions to determine their mass.</p>
<p>Since IM-MS experiment is executed in gas and vacuum states, the molecules being studied are more dynamic compared to when they are in a crystalline state, which is a required state for X-ray crystallography. This property allows for better analysis of the dynamics of the proteins being studied as well as providing more native-like information about the fold of the proteins &#x0005B;<xref rid="b31-ijms-14-20635" ref-type="bibr">31</xref>&#x0005D;. Also, diffusion cross-section data obtained in the IM process provides information about the radius of gyration of the protein &#x0005B;<xref rid="b32-ijms-14-20635" ref-type="bibr">32</xref>&#x0005D;.</p>
<p>IM-MS has been successfully used to identify the ring-like topology of trp RNA binding protein, composed of 11 members, by determining its collision cross section&#x0005B;<xref rid="b33-ijms-14-20635" ref-type="bibr">33</xref>&#x0005D; to study the relative population of oligomers of 42-residue amyloid beta-protein and its alloform with 19th residue substituted to proline &#x0005B;<xref rid="b34-ijms-14-20635" ref-type="bibr">34</xref>&#x0005D;, and to characterize the oligomeric population detected during the formation of fibrils of &#x003B2;(2)-microglobulin. This helped to identify the properties of transient, oligomeric intermediates formed during assembly of the fibrils &#x0005B;<xref rid="b35-ijms-14-20635" ref-type="bibr">35</xref>&#x0005D;. Diverse types of IM and MS exist that can be combined to form the IM-MS technique. A review on the types of IM methods coupled with MS can be found in &#x0005B;<xref rid="b36-ijms-14-20635" ref-type="bibr">36</xref>&#x0005D;. Also, further description and application of IM-MS method in the context of applications to structural biology can be found in &#x0005B;<xref rid="b16-ijms-14-20635" ref-type="bibr">16</xref>&#x0005D;.</p></sec>
<sec>
<label>2.5.</label>
<title>Native Mass Spectrometry</title>
<p>Native MS is a group of MS-coupled experiments that focuses on the structure, dynamics, and subcomponent interaction of intact biomolecular complex in a native-like state &#x0005B;<xref rid="b37-ijms-14-20635" ref-type="bibr">37</xref>&#x0005D;. Native MS is often combined with various MS-coupled methods, such as electrospray ionization MS (ESI-MS) and ion mobile MS (IM-MS), and structure optimization programs to determine the topology and dynamics of quaternary structures in their native-like state &#x0005B;<xref rid="b38-ijms-14-20635" ref-type="bibr">38</xref>&#x0005D;. Native MS in itself is a low resolution structure determination technology. However, compared to the traditional structure determination technologies such as X-ray crystallography and NMR, it is more sensitive, faster, and allows higher selectivity as well as providing information on stoichiometry, stability, and spatial arrangement of the subunits in the complex &#x0005B;<xref rid="b38-ijms-14-20635" ref-type="bibr">38</xref>,<xref rid="b39-ijms-14-20635" ref-type="bibr">39</xref>&#x0005D;. The higher sensitivity comes from the environmental property of native MS that preserves the native-like conditions of the native structure and dynamics of the complex.</p>
<p>There is diversity in the methodology and the application of the native MS. However, only the key characteristics are pointed out to enhance understanding of its usefulness in structural modeling. Native MS has special properties such as non-denaturing ionization of electrospray ionization (ESI) &#x0005B;<xref rid="b40-ijms-14-20635" ref-type="bibr">40</xref>&#x0005D;. The electrospray ionization of native MS involves the dispersion of the liquid state solution into nano-droplets which are then reduced to maximal surface charge of molecular till a certain size and composition is reached. Then, the ion-free state is accomplished through uses of volatile ESI compatible buffers under native-like conditions. More details of the electrospray ionization process can be found on review by Kebarle and Verkerk &#x0005B;<xref rid="b41-ijms-14-20635" ref-type="bibr">41</xref>&#x0005D;. After this process, the complex can be decomposed to sub-complexes and subunits. Denaturing MS can be used to find the mass of subunits and subcomplexes, revealing the topology of the complex &#x0005B;<xref rid="b40-ijms-14-20635" ref-type="bibr">40</xref>&#x0005D;. Tandem MS can be used additionally to validate the subunits and also to identify peripheral subunits. Ion mobility MS is a rather young addition to the native MS pipeline that can be used to determine the shape and cross-section of intact complexes and subcomplexes &#x0005B;<xref rid="b38-ijms-14-20635" ref-type="bibr">38</xref>,<xref rid="b42-ijms-14-20635" ref-type="bibr">42</xref>&#x0005D;.</p>
<p>Native MS has been used to characterize the structure of 20S proteasome &#x0005B;<xref rid="b43-ijms-14-20635" ref-type="bibr">43</xref>,<xref rid="b44-ijms-14-20635" ref-type="bibr">44</xref>&#x0005D;, confirm the subcomponents and stoichiometry of RNA polymerase II and III &#x0005B;<xref rid="b45-ijms-14-20635" ref-type="bibr">45</xref>&#x0005D;, and study the endogenously expressed protein complexes including exosome &#x0005B;<xref rid="b46-ijms-14-20635" ref-type="bibr">46</xref>&#x0005D;. More details in the application of native MS for structural analysis will be described in Section 4.</p></sec></sec>
<sec sec-type="methods">
<label>3.</label>
<title>Structure Prediction Methods</title>
<p>In this section, we focus on the structure prediction methods which often act as prerequisites of function annotation of protein or protein complexes. We take special interest in properties that have the potential for being integrated with the MS experimental data for sensitive modeling of structure and dynamics of biomolecules of interest.</p>
<p>Protein structure prediction falls into three categories depending on the availability of solved structures: homology (comparative) modeling, threading (fold recognition), and free (<italic>ab initio</italic>) modeling &#x0005B;<xref rid="b47-ijms-14-20635" ref-type="bibr">47</xref>&#x0005D;. Comparative modeling builds a model using experimentally solved 3D structures (templates) that have high sequence similarity to the protein being analyzed. Threading involves the alignment of the target sequence directly to 3D structures of proteins utilizing structural and biochemical similarities detectable between the target sequence and 3D structures in the database. This allows for relaxation of sequential similarity between the target and the template. Free modeling, or the <italic>ab initio</italic> method, predicts a model without a template structure, utilizing the force fields and knowledge-based potentials of the target sequence. <xref rid="t2-ijms-14-20635" ref-type="table">Table 2</xref> summarizes the three types of modeling methods.</p>
<sec>
<label>3.1.</label>
<title>Homology Modeling</title>
<p>The structure prediction process of homology modeling, according to Mart&#x000ED;-Renom <italic>et al.</italic> &#x0005B;<xref rid="b59-ijms-14-20635" ref-type="bibr">59</xref>&#x0005D;, is composed of four sequential steps: (1) fold assignment and template selection; (2) target-template alignment; (3) model building; and (4) model evaluation. Templates are selected based on the sequence similarities that are analyzable after the sequence alignments. The first two steps can be executed together using fast but accurate alignment methods. It has been shown that homology modeling can achieve accuracy up to backbone RMSD of 1&#x02013;2 &#x000C5; when a template of 50&#x00025; or higher sequence identity is found and used &#x0005B;<xref rid="b60-ijms-14-20635" ref-type="bibr">60</xref>&#x0005D;.</p>
<p>Template selection and alignment are two of the most important components in comparative modeling. Thus, development of sequence alignment methods with high sensitivity and specificity is critical &#x0005B;<xref rid="b61-ijms-14-20635" ref-type="bibr">61</xref>&#x0005D;. Template selection and alignment methods have evolved towards improving the balance between the two criteria. Earlier approaches used pairwise alignment methods, such as FASTA &#x0005B;<xref rid="b62-ijms-14-20635" ref-type="bibr">62</xref>&#x0005D; and BLAST &#x0005B;<xref rid="b63-ijms-14-20635" ref-type="bibr">63</xref>&#x0005D;, to compare sequence similarity between target sequence and sequences on the database. Nowadays, multiple sequence alignments are being used. Multiple sequence alignments are shown to improve the sensitivity of alignment without sacrificing the selectivity. Multiple sequence alignment also has been shown to be better in preserving structural similarities &#x0005B;<xref rid="b64-ijms-14-20635" ref-type="bibr">64</xref>&#x0005D;. They are also used to find highly conserved region, such as ligand binding sites. Some of the available multiple sequence alignment tools are MUSCLE &#x0005B;<xref rid="b65-ijms-14-20635" ref-type="bibr">65</xref>&#x0005D;, ClustalW &#x0005B;<xref rid="b66-ijms-14-20635" ref-type="bibr">66</xref>&#x0005D;, PSI-BLAST &#x0005B;<xref rid="b67-ijms-14-20635" ref-type="bibr">67</xref>&#x0005D;, and HHsearch (as part of the HH-suite &#x0005B;<xref rid="b50-ijms-14-20635" ref-type="bibr">50</xref>&#x0005D;).</p>
<p>Once the model has been aligned, the next step is the model building. This involves initial assignment of Cartesian coordinates to the target. The idea of conventional model building started from copying 3D coordinates from a database of templates. The easiest, yet widely used approach is called rigid body assembly. In the rigid body assembly, first a conserved core region from a small number of templates is constructed by superposing and averaging coordinates of C&#x003B1; atoms or backbone molecules &#x0005B;<xref rid="b68-ijms-14-20635" ref-type="bibr">68</xref>&#x0005D;. After initial assignment is made, the model rebuilds non-core regions such as side chains. Loop regions are often optimized further since the structure of those areas are less conserved &#x0005B;<xref rid="b69-ijms-14-20635" ref-type="bibr">69</xref>,<xref rid="b70-ijms-14-20635" ref-type="bibr">70</xref>&#x0005D;. Alternative model building methods utilize the segment-matching approach. The segment-matching approach is an extension of rigid body assembly that utilizes the coordinates of small segments that best align with the protein of interest. Unger and co-workers &#x0005B;<xref rid="b71-ijms-14-20635" ref-type="bibr">71</xref>&#x0005D; introduced and experimented the &#x0201C;building blocks&#x0201D; approach on hexametric structures. The building block approach first builds a model from the representative segments (blocks), then replaces them by another segment within the cluster whose RMSD is smaller. Similarly, Levitt &#x0005B;<xref rid="b72-ijms-14-20635" ref-type="bibr">72</xref>&#x0005D; first divided the target sequence into short segments, then matched fragments from the database using energetic or geometrical criteria, which are: Sequence similarity, conformational similarity (secondary structure and atomic coordinates), and compatibility (van der Waals interactions). Modern homology modeling has evolved into much more sophisticated approach, conjoined with global energy minimization procedure. This approach is called modeling by satisfaction of spatial restraints &#x0005B;<xref rid="b59-ijms-14-20635" ref-type="bibr">59</xref>,<xref rid="b70-ijms-14-20635" ref-type="bibr">70</xref>&#x0005D;.</p></sec>
<sec>
<label>3.2.</label>
<title>Threading</title>
<p>Threading shares many methodological similarities with that of homology modeling. The difference lies in the properties used for target-template alignment. Unlike homology modeling that relies solely on the sequence information, threading aligns the target protein sequences and target structures by their statistical similarity between sequence and structural properties. This idea expanded from the observation that the diversity of sequences is higher than that of the folds. An earlier threading approach by Bowie <italic>et al.</italic> &#x0005B;<xref rid="b73-ijms-14-20635" ref-type="bibr">73</xref>,<xref rid="b74-ijms-14-20635" ref-type="bibr">74</xref>&#x0005D; introduced the sequence-structure profile matching method. The method generates structural profiles from the environmental factors of the residues in the 3D structure. The environmental factors include the area of the residue buried in the protein which is inaccessible to solvent, the fraction of side-chain area that is covered by polar atoms, and the local secondary structure. The 3D profiles are aligned with dynamic programing based on the statistical compatibility with the 1D target sequence independent of the template sequence information.</p>
<p>One of the representative threading algorithms, PROSPECT &#x0005B;<xref rid="b75-ijms-14-20635" ref-type="bibr">75</xref>&#x0005D;, utilizes residue-residue contacts information. PROSPECT finds globally optimal threading alignment between the target sequence and the template structure with a divide and conquer approach. It first divides the template into small substructures in the form of a tree, and then an iterative procedure of alignment and local optimization is performed until the whole template is considered and the total energy is minimized. Their scoring function is a weighted linear function consisting of four energy terms &#x0005B;<xref rid="b76-ijms-14-20635" ref-type="bibr">76</xref>&#x0005D;: mutation, singleton, pair-contact potential, and alignment gap penalty. The mutation energy term is a compatibility measurement for substituting the template amino acids by target acids. The singleton energy term measures the compatibility of aligning the target amino acid onto the template position base. More specifically, the singleton term examines the likelihood of substituting one residue to another and the preference of secondary structure and solvent accessibility for the particular residue. Pairwise-contact potential energy is a statistical term reflecting the likelihood of the residue of types <italic>i</italic> and <italic>j</italic> to be in contact, <italic>i.e.</italic>, resides within 9&#x000C5; but separated by three or more residues in sequence. The alignment gap penalty energy term gives more penalties to larger gaps in alignment.</p>
<p>One exemplar of recently developed threading algorithm is MUSTER &#x0005B;<xref rid="b52-ijms-14-20635" ref-type="bibr">52</xref>&#x0005D;. MUSTER also uses dynamic programming to identify the best match between the target and the template sequences. The scoring function of MUSTER consists of seven energy terms. The first term is the sequence profile, which denotes the frequency of the residue types at a position in template and can be acquired by PSI-BLAST multiple sequence alignment. The second term indicates secondary structure match between the residues of the target, predicted by PSI-PRED, and the analyzed secondary structure of the residues of the template. The third term is the structure profile that is derived by a depth-dependent structure analysis. The depth of the residue is the measurement of the depth from the protein surface to the residue by calculating average distance of a residue from the solvent water molecule. Unlike solvent accessibility, it can distinguish atoms just below the surface and those in the core &#x0005B;<xref rid="b77-ijms-14-20635" ref-type="bibr">77</xref>,<xref rid="b78-ijms-14-20635" ref-type="bibr">78</xref>&#x0005D;. MUSTER splits the initial templates with nine residues by a gapless threading. Then fragments with similar depth (those with smaller RMSD) from the database are collected to calculate the frequency profile at each position of the template. Fourth term is solvent accessibility term, which compares the solvent accessibility of the template assigned by STRIDE &#x0005B;<xref rid="b79-ijms-14-20635" ref-type="bibr">79</xref>&#x0005D; and the solvent accessibility of the target predicted by the two-state neural network machine. The fifth and sixth term assigns scores based on the similarity between the two torsion (psi and phi) angles of the template and the torsion angles of the target predicted by support vector regression. The last term is from hydrophobic scoring matrix, matching the hydrophobic patterns of target and template.</p></sec>
<sec>
<label>3.3.</label>
<title><italic>Ab Initio</italic> Method</title>
<p><italic>Ab initio</italic> method, alternatively called <italic>de novo</italic> or free modeling, is a structural modeling approach that does not rely on template structures. Although homology modeling and threading can achieve higher prediction accuracy, <italic>ab initio</italic> methods are needed when there are no detectable template structures in the database. There have been numerous advances in the <italic>ab initio</italic> methods. However, computation time cost is still high and building models with more than 150 residues are still challenging in terms of accuracy &#x0005B;<xref rid="b4-ijms-14-20635" ref-type="bibr">4</xref>,<xref rid="b60-ijms-14-20635" ref-type="bibr">60</xref>&#x0005D;.</p>
<p>There are two directions in the <italic>ab initio</italic> methods, one is more physics-based and other is more knowledge-based. Physics-based methods are generally more interested in the fold dynamics themselves while knowledge-based methods are focused on the accuracy of the final structure. Physics-based methods are often integrated with molecular dynamics simulations using physics-based force fields. Representative examples of modeling systems using all-atom physics based force fields include CHARMM &#x0005B;<xref rid="b80-ijms-14-20635" ref-type="bibr">80</xref>&#x0005D;, AMBER &#x0005B;<xref rid="b81-ijms-14-20635" ref-type="bibr">81</xref>&#x0005D;, and OPLS &#x0005B;<xref rid="b82-ijms-14-20635" ref-type="bibr">82</xref>&#x0005D;. Their force fields share potential terms including intra-molecular terms such as bond lengths, angles and torsion angles, as well as non-bonded terms such as Coulomb potential and Lennard-Johns. Knowledge-based methods are focused on the resulting structure rather than the actual fold mechanism. For this reason, they use knowledge-based potential energy functions in addition to simple energy terms. Also, reduced models of the residues are often used to speed up the computation and increase the conformational search space &#x0005B;<xref rid="b58-ijms-14-20635" ref-type="bibr">58</xref>,<xref rid="b83-ijms-14-20635" ref-type="bibr">83</xref>&#x0005D;. Knowledge-based <italic>ab initio</italic> methods rely on the efficient structure space sampling algorithms as well as the effective scoring functions. It is not feasible to consider all possible conformations a structure can have. Thus, often variants of Monte Carlo sampling methods are used to search for possible conformations. Scoring function integrated with the sampling methods is also important for finding the most native-like structures. Following are some of the energy terms used in the scoring functions of <italic>ab initio</italic> methods, including SimFold &#x0005B;<xref rid="b56-ijms-14-20635" ref-type="bibr">56</xref>&#x0005D; and QUARK &#x0005B;<xref rid="b4-ijms-14-20635" ref-type="bibr">4</xref>&#x0005D;.</p>
<sec>
<label>3.3.1.</label>
<title>Backbone Torsion Angles (Dihedral Angles) Potential</title>
<p>Many structure prediction approaches take advantage of statistically probable phi/psi angle distributions. Ramachandran plot&#x02014;<italic>i.e.</italic>, plot of psi and phi angle present in a structure&#x02014;is useful tool for visualizing the torsion angles of a conformation and determining if they fall into a native-like psi-phi distribution. Both SimFold and QUARK defined this energy function as the sum of probability of phi and psi angles:</p>
<disp-formula id="fd1">
<label>(1)</label>
<mml:math id="m1" display='block'>
<mml:semantics id="sm1">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>E</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
<mml:mi>h</mml:mi></mml:mrow></mml:msub>
<mml:mo>&#x003D;</mml:mo>
<mml:mo>&#x002D;</mml:mo>
<mml:munder>
<mml:mo>&#x02211;</mml:mo>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>d</mml:mi>
<mml:mi>u</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>L</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
<mml:mo>&#x002D;</mml:mo>
<mml:mn>2</mml:mn></mml:mrow></mml:munder>
<mml:mrow>
<mml:mtext>log&#x02009;</mml:mtext>
<mml:mi>P</mml:mi>
<mml:mo stretchy='false'>&#x0028;</mml:mo>
<mml:mi>&#x003C6;</mml:mi>
<mml:mo>&#x002C;</mml:mo>
<mml:mi>&#x003C8;</mml:mi>
<mml:mo stretchy='false'>&#x0029;</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>Weights in SimFold depend on the type of amino acids and their position in the &#x0201C;quadrant&#x0201D; bin of a Ramachandran plot. Each quadrant corresponds to alpha-helix, beta-strand, alpha<sub>L</sub>, and rare regions &#x0005B;<xref rid="b56-ijms-14-20635" ref-type="bibr">56</xref>&#x0005D;. In QUARK, probabilities of phi and psi angles are conditioned on residue type and secondary structure type. For this purpose, 60 different Ramachandran plots of each condition pairs are generated (20 amino acid types &#x000D7; 3 secondary structure types) and used &#x0005B;<xref rid="b4-ijms-14-20635" ref-type="bibr">4</xref>&#x0005D;.</p></sec>
<sec>
<label>3.3.2.</label>
<title>Hydrogen Bond Potentials</title>
<p>Hydrogen bonds are one of the dominant energy factors for forming the secondary structure and the global topology of a protein structure. SimFold defines the hydrogen bond potential as the summation of following terms: (i) hydrogen bond interaction between any two atoms in the backbone (N, C&#x003B1;, C); (ii) four-body hydrogen bond characteristic in the &#x003B2;-sheet, which incorporates two hydrogen bonds in neighboring &#x003B2;-sheets; and (iii) the Born- or Self-energy term which is effected by charged and polar groups that determines the propensity for residue to be buried or exposed to solvent &#x0005B;<xref rid="b84-ijms-14-20635" ref-type="bibr">84</xref>&#x0005D;. In contrast, QUARK algorithm does not compute hydrogen bonds directly. Instead, QUARK utilizes the geometric features governed by the hydrogen bones between the two closest by residues <italic>i</italic> and <italic>j</italic>: the distance between O<italic><sub>i</sub></italic> and H<italic><sub>j</sub></italic>, the inner angle between C<italic><sub>i</sub></italic>, O<italic><sub>i</sub></italic>, and H<italic><sub>j</sub></italic>, the inner angle between O<italic><sub>i</sub></italic>, H<italic><sub>j</sub></italic>, and N<italic><sub>j</sub></italic>, and the torsion angle between C<italic><sub>i</sub></italic>, O<italic><sub>i</sub></italic>, H<italic><sub>j</sub></italic>, and N<italic><sub>j</sub></italic> &#x0005B;<xref rid="b4-ijms-14-20635" ref-type="bibr">4</xref>&#x0005D;.</p></sec>
<sec>
<label>3.3.3.</label>
<title>Solvent Accessibility</title>
<p>Solvent accessibilities are the extent to which a protein structure interacts with the solvent &#x0005B;<xref rid="b85-ijms-14-20635" ref-type="bibr">85</xref>&#x0005D;. Explicit computation of the solvent accessibility involves various types of factors including electrostatic potential, hydrophobicity, and van der Waals force. Thus, the calculations are computationally intractable and require approximation algorithms. Provided the protein structures, solvent accessibility of biomolecules is often inferred by calculating the solvent-accessible surface area (SASA) or per-residues solvent-accessibility surface area (rSASA) of the structure. The typical method of calculating the precise SASA is done by rolling spherical probe around the bimolecular. The probe size is often 1.4 &#x000C5; to represent the size of water molecule. rSASAs are often divided by the surface area of each type of residue after assigning the SASA for each of the residues of the biomolecules. Readers interested in extensive discussion of the SASA calculation methods are referred to work by Durham <italic>et al.</italic> &#x0005B;<xref rid="b85-ijms-14-20635" ref-type="bibr">85</xref>&#x0005D;. QUARK estimates the solvent accessibility in their optimization process while SimFold does not explicitly account for them in their scoring function &#x0005B;<xref rid="b4-ijms-14-20635" ref-type="bibr">4</xref>&#x0005D;.</p>
<p>There are also several other energy terms used such as van der Waals interaction, solvation, radius of gyration, and secondary structure packing. Also, spatial constraints are used to avoid collisions, to preserve distance between residues, and to form globular structure. In an integrative structural modeling, energy and spatial constraints can be obtained through experiments instead of prediction from sequences. Application of experiment data will thus increase the accuracy of modeling.</p></sec></sec>
<sec>
<label>3.4.</label>
<title>Composite Protein Structure Prediction</title>
<p>Recently, many structure prediction methods consist of a combination of all three types of structure prediction methods &#x0005B;<xref rid="b60-ijms-14-20635" ref-type="bibr">60</xref>&#x0005D;. In homology modeling and threading, modification of unconfident regions such as loops are done in <italic>ab initio</italic> fashion. Also, many <italic>ab initio</italic> approaches have adapted the uses of spatial restrains or structural fragments detectable by threading &#x0005B;<xref rid="b4-ijms-14-20635" ref-type="bibr">4</xref>,<xref rid="b86-ijms-14-20635" ref-type="bibr">86</xref>&#x0005D;. Threading relies more on multiple sequence alignment and sequentially conserved properties to align the sequence to structures. In general, a composite protein structure prediction will first search the template library to determine the availability of homolog structures. If the templates are found, coordinates are assigned to aligned regions between the target and template. Unaligned regions and evolutionarily diverse regions are modeled by <italic>ab initio</italic> methods. If the templates are not found, <italic>ab initio</italic> modeling is performed on all the areas. After the initial prediction, models are evaluated and selected. Then, the full atomic coordinates of side-chains are assigned and optimized &#x0005B;<xref rid="b60-ijms-14-20635" ref-type="bibr">60</xref>&#x0005D;.</p>
<p>We take a closer look at two structure prediction pipelines of the top CASP predictors: I-TASSER &#x0005B;<xref rid="b4-ijms-14-20635" ref-type="bibr">4</xref>,<xref rid="b87-ijms-14-20635" ref-type="bibr">87</xref>&#x0005D; and Rosetta &#x0005B;<xref rid="b54-ijms-14-20635" ref-type="bibr">54</xref>,<xref rid="b86-ijms-14-20635" ref-type="bibr">86</xref>&#x0005D;. Both methods are threading-integrated model free structure prediction methods. The flowcharts of the two methods are shown in <xref rid="f3-ijms-14-20635" ref-type="fig">Figure 3</xref>. Common steps for both methods are fragment generation process, modeling assembly, and atom-level refinement.</p>
<sec>
<label>3.4.1.</label>
<title>Fragment Generation</title>
<p>Sophisticated threading algorithms are used to generate and score target-template alignments. Many algorithms use a combination of both sequence and structure information. Fragments are generated for each segment of the query sequence using the profile that best aligns the sequence segments &#x0005B;<xref rid="b4-ijms-14-20635" ref-type="bibr">4</xref>&#x0005D;. I-TASSER builds fragments with continuous lengths from 1 to 20; Rosetta builds possible 3-residue and 9-residue fragments for each of the sequence segments. In <xref rid="t3-ijms-14-20635" ref-type="table">Table 3</xref>, we show terms used in the scoring function of Picker &#x0005B;<xref rid="b88-ijms-14-20635" ref-type="bibr">88</xref>&#x0005D; from Baker&#x02019;s group and MUSTER &#x0005B;<xref rid="b52-ijms-14-20635" ref-type="bibr">52</xref>&#x0005D; from Zhang&#x02019;s group.</p></sec>
<sec>
<label>3.4.2.</label>
<title>Initial Model Assembly</title>
<p>Reduced model of protein is generally used in the initial assembly. With knowledgebase force field and efficient search algorithm, conformational search is done by Monte Carlo algorithms that iteratively update and optimize confirmation to native structure by energy function. I-TASSER model assembly starts from single decoy and generates many reasonable (<italic>i.e.</italic>, global energy is low and close to zero) decoys by fine tuning C&#x003B1; atom positions and torsion angles. In contrast, Rosetta fragment assembly finds combinations out of candidate fragments that minimize global energy. Commonly used energy functions are shown on <xref rid="t4-ijms-14-20635" ref-type="table">Table 4</xref>. A number of possible models are generated as result of the initial model assembly. Those models are then clustered into few categories and structures in the cluster centroids are chosen for further refinement.</p></sec>
<sec>
<label>3.4.3.</label>
<title>Atom-level Refinement</title>
<p>Detailed backbones and side chains of protein are represented and refined. In the previous step, knowledge based force field that is based on the statistics of the known structures are used. In the atom-level refinement step, realistic potential energy terms are used for model refinement. Other terms such as bond length, angle constraints, steric overlaps and hydrogen-bonding network are also used for refinement.</p></sec></sec></sec>
<sec sec-type="other">
<label>4.</label>
<title>Integration of Proteomics Data and Structural Modeling</title>
<p>Computational methods that integrate structure prediction and experimental methods are emerging strategies in the structural biology field. There are notably many efforts in integrating low resolution structure analysis methods with computational methods, such as docking substructure to cryo-electron microscopy (cryo-EM) images and small angle X-ray scattering (SAXS) profiles in structure determination. However, there are not many attempts to integrate MS-coupled experiment with structure prediction. For readers interested in the integrative structure modeling methods using cryo-EM images or SAXS profiles, detailed reviews can be found in &#x0005B;<xref rid="b89-ijms-14-20635" ref-type="bibr">89</xref>&#x0005D; and &#x0005B;<xref rid="b90-ijms-14-20635" ref-type="bibr">90</xref>&#x0005D;, respectively. In this section, we first cover some of the existing researches on the application of MS experiments to structural modeling. Then, we provide suggestions on possible MS experimental results that can be used in the structural modeling process to analyze the structure and dynamics of biomolecules.</p>
<sec>
<label>4.1.</label>
<title>Chemical Cross-Linking Experiment Integrated Structure Modeling</title>
<p>Chemical cross-linking based MS experiments that provide information about molecules close in distance are one of the earliest and most intuitive MS-coupled experiments that can be integrated to the structure modeling. Using the result for the chemical cross-linking to extract distance constraints, structure modeling can use the distance constraints to either refine the structures in comparative modeling or use in order to limit the sample space in <italic>ab initio</italic> modeling.</p>
<p>In the early work by Young <italic>et al.</italic> &#x0005B;<xref rid="b91-ijms-14-20635" ref-type="bibr">91</xref>&#x0005D;, intra-molecular cross-linking, MS and threading are used to identify the structure or the fold of a bovine basic fibroblast growth factor (FGF)-2. Using a lysine-specific cross-linking agent, they identify the eight lysine-lysine links in the FGF-2 that are validated with the MS. With the distance constraints from the cross-linking experiment combined with the threading method, they were able to correctly identify the fold type of FGF-2 as the b-trefoil fold. They were also able to model the FGF-2 with homology modeling with backbone RMSD of 4.8 &#x000C5;.</p>
<p>Chemical cross-linking has been applied for determining the topology of macromolecules that are difficult to detect by the traditional structure solution techniques. Chen <italic>et al</italic>. &#x0005B;<xref rid="b92-ijms-14-20635" ref-type="bibr">92</xref>&#x0005D; applied the chemical cross-linkage information to determine the architecture of RNA polymerase II with the transcriptional initiation factor (TFIIF) at a peptide resolution. With the cross-linking coupled with the MS, they were able to identify 253 inter-protein and 149 intra-protein links. The subcomponents of the complex were predicted by homology modeling, when the crystal structures are not available. Then, the linkage information was applied to determine the distance constraints used to manually reconstruct the complex using the structures of the 15 subunits.</p>
<p>Chemical cross-linkage can also be used to determine the fold of a single protein. In the work by Fioramonte <italic>et al.</italic> &#x0005B;<xref rid="b15-ijms-14-20635" ref-type="bibr">15</xref>&#x0005D;, the utility of the intra-protein chemical cross-linking in determining the secondary structure of polypeptides without any homology information was illustrated. They exploit the geometric characteristics of alpha-helix and beta-sheets, such as the tendency of residues with bulky side chains to form beta-sheets and distance between the linkages formations used to derive cross-linking rules. Cross-linkage rules are then used to determine the secondary structure of polypeptides or proteins. More recent researches exploit the technical advances in the cross-linking. Instead of being restricted to lysine cross-linking, cross-linking is now possible between divers residue types. This increases the detectable number of distance constraints. A review on chemical cross-linking applied to structure modeling can be found in &#x0005B;<xref rid="b93-ijms-14-20635" ref-type="bibr">93</xref>,<xref rid="b94-ijms-14-20635" ref-type="bibr">94</xref>&#x0005D;.</p></sec>
<sec>
<label>4.2.</label>
<title>Native Mass Spectrometry Integrated Structural Solvers</title>
<p>Native MS is an emerging technique for macromolecular structure determination. As described in the previous section, native MS is a combinatorial method that involves several MS-coupled methods to detect large molecular complexes that are often not detectable by traditional structure determination methods. It is often combined with computational modeling methods to integrate the exiting knowledge of structure with the experimental results. Heck &#x0005B;<xref rid="b37-ijms-14-20635" ref-type="bibr">37</xref>&#x0005D; points out that a native MS can be used to bridge the gap between the interactomics&#x02014;the study of biomolecular interaction&#x02014;and the structural biology. The study of the interaction of molecules is traditionally performed by yeast two-hybrid screening or by affinity purification MS. These methods are often high throughput and can be applied in massive scales. Although native MS is currently unable to scale up to high throughput studies, both in time and size, it has been often shown to be successful in determining interaction between subcomponents of complex structures. Successful applications of macromolecules, such as virus &#x0005B;<xref rid="b95-ijms-14-20635" ref-type="bibr">95</xref>&#x0005D;, yeast exosome &#x0005B;<xref rid="b46-ijms-14-20635" ref-type="bibr">46</xref>,<xref rid="b96-ijms-14-20635" ref-type="bibr">96</xref>&#x0005D;, proteasome structure &#x0005B;<xref rid="b43-ijms-14-20635" ref-type="bibr">43</xref>,<xref rid="b44-ijms-14-20635" ref-type="bibr">44</xref>&#x0005D;, RNA polymerase structure &#x0005B;<xref rid="b97-ijms-14-20635" ref-type="bibr">97</xref>&#x0005D;, and therapeutic antibodies &#x0005B;<xref rid="b98-ijms-14-20635" ref-type="bibr">98</xref>&#x0005D;, have been shown.</p>
<p>Taverner <italic>et al.</italic> &#x0005B;<xref rid="b42-ijms-14-20635" ref-type="bibr">42</xref>&#x0005D; propose an integrative modeling method for identifying the subunit architecture for intact protein complexes using MS and homology modeling. In their method, complex of interest is first isolated using affinity tag and column chromatography. Then, gel electrophoresis and tryptic digestion is performed to determine the subunit composition of the complex. The masses of the complex and identified subunits were then determined by a spectrum of the denatured proteasome lid. Mass of subunits and their stoichiometry was searched against the known units in the database using their search engine, SUMMIT, to identify the actual subunits. Also, interaction network was built using their subunit interaction information. Homology modeling method was used to model the structure of subunits and the structure of the complex was derived manually based on the interaction information obtained through native MS experiment. Then, the structural fit to the experimental result was evaluated.</p>
<p>There are various applications of native MS in analysis of oligomeric structures. Most of the focus is not on computational structural modeling, although homology modeling of subunits is used in several cases &#x0005B;<xref rid="b42-ijms-14-20635" ref-type="bibr">42</xref>,<xref rid="b99-ijms-14-20635" ref-type="bibr">99</xref>,<xref rid="b100-ijms-14-20635" ref-type="bibr">100</xref>&#x0005D;. There are several reviews on native MS and applications to structure modeling &#x0005B;<xref rid="b37-ijms-14-20635" ref-type="bibr">37</xref>,<xref rid="b38-ijms-14-20635" ref-type="bibr">38</xref>,<xref rid="b40-ijms-14-20635" ref-type="bibr">40</xref>&#x0005D;.</p></sec>
<sec>
<label>4.3.</label>
<title>Multiple-Experiment Combination for Structural Modeling</title>
<p>Lasker <italic>et al.</italic> &#x0005B;<xref rid="b101-ijms-14-20635" ref-type="bibr">101</xref>&#x0005D; suggested an automatic iterative four-step integrative structure modeling procedure that can be used to combine experimental methods in structural modeling. The four steps consist of (1) finding available information about the structure of interest; (2) designing systems that will extract spatial restraints from the available experiments; (3) computing candidate structures that satisfy the spatial restraints; and (4) evaluating the candidate structures. They proved the usefulness of this procedure by predicting the architecture of the human RNA polymerase II and verifying the prediction against a known experimentally solved complex. The initial data used, in addition to the experimental data, were 12 homology modeled subcomponents found in the MODBASE &#x0005B;<xref rid="b102-ijms-14-20635" ref-type="bibr">102</xref>&#x0005D;, proteomics data including affinity capture MS proteomics data for yeast RNA polymerase II subunits extracted from the BioGRID &#x0005B;<xref rid="b103-ijms-14-20635" ref-type="bibr">103</xref>&#x0005D;, and an electron density map of human RNA polymerase at 20 &#x000C5; resolution found in the EMDataBank &#x0005B;<xref rid="b104-ijms-14-20635" ref-type="bibr">104</xref>&#x0005D;, which were processed to extract spatial restraints.</p>
<p>Zhou and Robinson &#x0005B;<xref rid="b105-ijms-14-20635" ref-type="bibr">105</xref>&#x0005D; also review how superimposition of high resolution subunits into low resolution complex extracted from various MS experiments, including ion mobility (IM)-MS, and cyro-EM image, can be done. Benesch <italic>et al.</italic> &#x0005B;<xref rid="b106-ijms-14-20635" ref-type="bibr">106</xref>&#x0005D; provide a comprehensive review on gas phase (native state) proteomics methods that can be applied to analyze protein complexes.</p></sec>
<sec>
<label>4.4.</label>
<title>Constraints Common in MS-Coupled Experiments and Structure Modeling Methods</title>
<p>Structural proteomics is becoming more practical with the advancement of computational models and proteomic methods &#x0005B;<xref rid="b107-ijms-14-20635" ref-type="bibr">107</xref>&#x0005D;. However, they are still either experiment-dominant, not exploring the benefits of computation methods, or computation-dominant, being limited by the available experimental data. Also, most experiments are used to find the topology of the complex structure or their structural change altered by binding. However, we argue that experimental methods can also be used to model individual structure focusing on their change in structure and dynamics upon mutation and/or modification. To promote balanced integration of both experimental and computational methods, we identify some of the constraints that can be used to model structures as shown in <xref rid="t5-ijms-14-20635" ref-type="table">Table 5</xref>.</p></sec></sec>
<sec sec-type="conclusions">
<label>5.</label>
<title>Conclusions</title>
<p>Although there have been various efforts for integrating proteomics data into the structural modeling, they are not enough. In this review, we identified and reviewed both the MS-coupled experiments and structure prediction methods such that researchers working on one field (MS or structure prediction) will have a better understanding of the other. Examples of efforts in integrative structure modeling were provided to argue that integrative methods can be successful. However, there are not many methods that are available to directly apply the experimental results in the structural optimization process. There are several reasons for the limitations. One reason is that translating the experimental result to spatial constraints that are addressable by structural modeling has not been investigated enough. Another reason is that availability of the MS-coupled data is limited. More efforts in sharing the MS-coupled data will promote advances in the constraint modeling and also in the integrative structural optimization methods.</p>
<p>To promote the idea of integrating the two methods, we have also listed out some of the constraints that are used in the structure prediction methods and ones that are available through the experiments. By listing out information for both the MS-coupled experiments and the structure prediction methods, we showed that there are still wide possibilities in the marriage between the proteomics studies and the structure prediction. Advances in the constraint modeling methods of experimental data and developments of integrative structural modeling methods that are flexible in integrating various constraints will greatly promote the structural genomics. This in turn will enhance our understanding of biology as well as disease mechanisms that are unable to be detected by genomics alone.</p></sec></body>
<back>
<sec sec-type="display-objects">
<title>Figures and Tables</title>
<fig id="f1-ijms-14-20635" position="float">
<label>Figure 1</label>
<caption>
<p>Number of solved structures <italic>versus</italic> number of identified protein sequences. Numbers of sequences and protein structures are obtained through Uniprot (<ext-link xlink:href="http://www.ebi.ac.uk/uniprot/" ext-link-type="uri">http://www.ebi.ac.uk/uniprot/</ext-link>) and RCBS PDB (<ext-link xlink:href="http://www.rcsb.org" ext-link-type="uri">http://www.rcsb.org</ext-link>), respectively.</p></caption>
<graphic xlink:href="ijms-14-20635f1.gif"/></fig>
<fig id="f2-ijms-14-20635" position="float">
<label>Figure 2</label>
<caption>
<p>Four types of cross-links (adapted from <xref rid="f3-ijms-14-20635" ref-type="fig">figure 3</xref> of &#x0005B;<xref rid="b26-ijms-14-20635" ref-type="bibr">26</xref>&#x0005D;). (<bold>A</bold>) Homo-bifunctional; (<bold>B</bold>) Hetero-bifunctional; (<bold>C</bold>) zero-length; and (<bold>D</bold>) hetero-trifunctional cross-link.</p></caption>
<graphic xlink:href="ijms-14-20635f2.gif"/></fig>
<fig id="f3-ijms-14-20635" position="float">
<label>Figure 3</label>
<caption>
<p>Structure prediction pipeline (<bold>A</bold>) Rosetta &#x0005B;<xref rid="b54-ijms-14-20635" ref-type="bibr">54</xref>&#x0005D;; and (<bold>B</bold>) I-TASSER pipeline (adapted from <xref rid="f1-ijms-14-20635" ref-type="fig">Figure 1</xref> of &#x0005B;<xref rid="b55-ijms-14-20635" ref-type="bibr">55</xref>&#x0005D;).</p></caption>
<graphic xlink:href="ijms-14-20635f3.gif"/></fig>
<table-wrap id="t1-ijms-14-20635" position="float">
<label>Table 1</label>
<caption>
<p>Types and characteristics of mass spectrometry-coupled experiments.</p></caption>
<table frame="hsides" rules="rows">
<thead>
<tr>
<th align="center" valign="bottom">MS-coupled methods</th>
<th align="center" valign="bottom">Types of information detected</th>
<th align="center" valign="bottom">Characteristics</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="middle">HDX &#x0005B;<xref rid="b13-ijms-14-20635" ref-type="bibr">13</xref>&#x0005D;</td>
<td align="center" valign="middle">-Solvent accessibility<break/>-Binding stoichiometry,<break/>-Affinity for protein-ligand interactions</td>
<td align="center" valign="middle">-Exchange target backbone nitrogen</td></tr>
<tr>
<td align="center" valign="middle">Protein footprinting &#x0005B;<xref rid="b14-ijms-14-20635" ref-type="bibr">14</xref>&#x0005D;</td>
<td align="center" valign="middle">-Solvent accessibility</td>
<td align="center" valign="middle">-Labeling reagents target side-chains</td></tr>
<tr>
<td align="center" valign="middle">Chemical cross-linking &#x0005B;<xref rid="b15-ijms-14-20635" ref-type="bibr">15</xref>&#x0005D;</td>
<td align="center" valign="middle">-Distance between protein subunits<break/>-Subcomplex topology</td>
<td align="center" valign="middle">-Type of activator differs by the type of cross-linking reagents</td></tr>
<tr>
<td align="center" valign="middle">Ion mobility (IM)-MS &#x0005B;<xref rid="b16-ijms-14-20635" ref-type="bibr">16</xref>&#x0005D;</td>
<td align="center" valign="middle">-Protein complex shape and size<break/>-Subcomplex topology<break/>-Radius of Gyration</td>
<td align="center" valign="middle">-Analyzed in the gas phase</td></tr>
<tr>
<td align="center" valign="middle">All four methods</td>
<td align="center" valign="middle">-Conformational change</td>
<td align="center" valign="middle">-Can detect changes on a wide timescale<break/>-Requires very little sample<break/>-Crystallization is not required</td></tr></tbody></table></table-wrap>
<table-wrap id="t2-ijms-14-20635" position="float">
<label>Table 2</label>
<caption>
<p>Structure prediction methods and their limitations.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom"/>
<th align="center" valign="bottom">Accuracy range</th>
<th align="center" valign="bottom">Protein size limit</th>
<th align="center" valign="bottom">Structure prediction methods</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">Homology modeling</td>
<td align="center" valign="top">1&#x02013;2 &#x000C5;</td>
<td align="center" valign="top">NA</td>
<td align="center" valign="top">MODELLER &#x0005B;<xref rid="b48-ijms-14-20635" ref-type="bibr">48</xref>&#x0005D;, SWISS-MODEL &#x0005B;<xref rid="b49-ijms-14-20635" ref-type="bibr">49</xref>&#x0005D;</td></tr>
<tr>
<td align="center" valign="top">Threading</td>
<td align="center" valign="top">2&#x02013;6 &#x000C5;</td>
<td align="center" valign="top">NA</td>
<td align="center" valign="top">HHpred &#x0005B;<xref rid="b50-ijms-14-20635" ref-type="bibr">50</xref>&#x0005D;, RaptorX &#x0005B;<xref rid="b51-ijms-14-20635" ref-type="bibr">51</xref>&#x0005D;, MUSTER &#x0005B;<xref rid="b52-ijms-14-20635" ref-type="bibr">52</xref>&#x0005D;, Sparks-X &#x0005B;<xref rid="b53-ijms-14-20635" ref-type="bibr">53</xref>&#x0005D;</td></tr>
<tr>
<td align="center" valign="top"><italic>Ab initio</italic></td>
<td align="center" valign="top">4&#x02013;8 &#x000C5;</td>
<td align="center" valign="top">150</td>
<td align="center" valign="top">Rosetta &#x0005B;<xref rid="b54-ijms-14-20635" ref-type="bibr">54</xref>&#x0005D;, I-TASSER &#x0005B;<xref rid="b55-ijms-14-20635" ref-type="bibr">55</xref>&#x0005D;, SimFold &#x0005B;<xref rid="b56-ijms-14-20635" ref-type="bibr">56</xref>,<xref rid="b57-ijms-14-20635" ref-type="bibr">57</xref>&#x0005D;, QUARK &#x0005B;<xref rid="b4-ijms-14-20635" ref-type="bibr">4</xref>&#x0005D;, CABS &#x0005B;<xref rid="b58-ijms-14-20635" ref-type="bibr">58</xref>&#x0005D;</td></tr></tbody></table></table-wrap>
<table-wrap id="t3-ijms-14-20635" position="float">
<label>Table 3</label>
<caption>
<p>Scoring criteria of two fragment generators.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom"/>
<th align="center" valign="bottom">Rosetta (Picker)</th>
<th align="center" valign="bottom">I-TASSER (MUSTER)</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">Amino Acid Sequence</td>
<td align="center" valign="top">&#x025CF;</td>
<td align="center" valign="top"/></tr>
<tr>
<td align="center" valign="top">Query Sequence Profile</td>
<td align="center" valign="top">&#x025CF;</td>
<td align="center" valign="top">&#x025CF;</td></tr>
<tr>
<td align="center" valign="top">Secondary Structure</td>
<td align="center" valign="top">&#x025CF;</td>
<td align="center" valign="top">&#x025CF;</td></tr>
<tr>
<td align="center" valign="top">Chemical Shifts</td>
<td align="center" valign="top">&#x025CF;</td>
<td align="center" valign="top">&#x025CF;</td></tr>
<tr>
<td align="center" valign="top">Distance Restraints</td>
<td align="center" valign="top">&#x025CF;</td>
<td align="center" valign="top"/></tr>
<tr>
<td align="center" valign="top">Dihedral Restraints</td>
<td align="center" valign="top">&#x025CF;</td>
<td align="center" valign="top">&#x025CF;</td></tr>
<tr>
<td align="center" valign="top">Solvent Accessibility</td>
<td align="center" valign="top"/>
<td align="center" valign="top">&#x025CF;</td></tr></tbody></table></table-wrap>
<table-wrap id="t4-ijms-14-20635" position="float">
<label>Table 4</label>
<caption>
<p>Energy functions used in structure prediction.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom">Type</th>
<th align="center" valign="bottom">Energy Function</th>
<th align="center" valign="bottom">Description</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="middle" rowspan="3">Physics</td>
<td align="center" valign="middle">Van der Waals</td>
<td align="center" valign="middle">Non-bonded Energy</td></tr>
<tr>
<td align="center" valign="middle">Electrostatics</td>
<td align="center" valign="middle">Coulomb Potential</td></tr>
<tr>
<td align="center" valign="middle">Atomic Bond Length</td>
<td align="center" valign="middle">Equilibrium of Bonds</td></tr>
<tr>
<td colspan="3" align="left" valign="middle">
<hr/></td></tr>
<tr>
<td align="center" valign="middle" rowspan="5">Knowledge</td>
<td align="center" valign="middle">Backbone Torsion Angle</td>
<td align="center" valign="middle">From Ramachandran Plot</td></tr>
<tr>
<td align="center" valign="middle">Hydrogen Bonds</td>
<td align="center" valign="middle">Secondary Structure</td></tr>
<tr>
<td align="center" valign="middle">Radius of Gyration</td>
<td align="center" valign="middle">Structure Compactness</td></tr>
<tr>
<td align="center" valign="middle">Fragment Distance</td>
<td align="center" valign="middle">Distance between Fragments</td></tr>
<tr>
<td align="center" valign="middle">Solvent Accessibility</td>
<td align="center" valign="middle">Tertiary Structure</td></tr></tbody></table></table-wrap>
<table-wrap id="t5-ijms-14-20635" position="float">
<label>Table 5</label>
<caption>
<p>Constraints and energy terms and their availability.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom">Constraints and energy</th>
<th align="center" valign="bottom">MS-coupled experiments</th>
<th align="left" valign="bottom">Structure prediction methods</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">Solvent accessibility</td>
<td align="center" valign="top">HDX, protein footprinting</td>
<td align="left" valign="top">I-TASSER, QUARK, SimFold, Rosetta, PROSPECT, RaptorX, MUSTER</td></tr>
<tr>
<td align="center" valign="top">Pair-wise distance constraints</td>
<td align="center" valign="top">Chemical cross-linking</td>
<td align="left" valign="top">I-TASSER, QUARK, SimFold, Rosetta, PROSPECT, CABS</td></tr>
<tr>
<td align="center" valign="top">Secondary structure</td>
<td align="center" valign="top">HDX, chemical cross-linking</td>
<td align="left" valign="top">I-TASSER, QUARK, SimFold, Rosetta, PROSPECT, RaptorX, MUSTER Sparks-X, Swiss-Model</td></tr>
<tr>
<td align="center" valign="top">Radius of gyration</td>
<td align="center" valign="top">Ion mobility</td>
<td align="left" valign="top">I-TASSER, QUARK, SimFold, Rosetta</td></tr>
<tr>
<td align="center" valign="top">Topology</td>
<td align="center" valign="top">Ion mobility, chemical cross-linking</td>
<td align="left" valign="top">I-TASSER, QUARK, Rosetta</td></tr></tbody></table></table-wrap></sec>
<ack>
<title>Acknowledgments</title>
<p>This research was supported by the MSIP (Ministry of Science, ICT and Future Planning), Korea, under the &#x0201C;IT Consilience Creative Program&#x0201D; (NIPA-2013-H0203-13-100) supervised by the NIPA (National IT Industry Promotion Agency) and by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT and Future Planning (2013005259).</p></ack>
<notes>
<title>Conflicts of Interest</title>
<p>The authors declare no conflict of interest.</p></notes>
<ref-list>
<title>References</title>
<ref id="b1-ijms-14-20635"><label>1</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Apweiler</surname><given-names>R.</given-names></name><name><surname>Bairoch</surname><given-names>A.</given-names></name><name><surname>Wu</surname><given-names>C.H.</given-names></name></person-group><article-title>Protein sequence databases</article-title><source>Curr. Opin. Chem. Biol</source><year>2004</year><volume>8</volume><fpage>76</fpage><lpage>80</lpage></citation></ref>
<ref id="b2-ijms-14-20635"><label>2</label><citation citation-type="thesis"><person-group person-group-type="author"><name><surname>Gao</surname><given-names>X</given-names></name></person-group><article-title>Towards Automating Protein Structure Determination from NMR Data</article-title><source>Ph.D. Thesis</source><publisher-name>University of Waterloo</publisher-name><publisher-loc>Waterloo, ON, Canada</publisher-loc><year>2009</year></citation></ref>
<ref id="b3-ijms-14-20635"><label>3</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Skolnick</surname><given-names>J.</given-names></name><name><surname>Zhang</surname><given-names>Y.</given-names></name><name><surname>Arakaki</surname><given-names>A.K.</given-names></name><name><surname>Kolinski</surname><given-names>A.</given-names></name><name><surname>Boniecki</surname><given-names>M.</given-names></name><name><surname>Szil&#x000E1;gyi</surname><given-names>A.</given-names></name><name><surname>Kihara</surname><given-names>D</given-names></name></person-group><article-title>TOUCHSTONE: A unified approach to protein structure prediction</article-title><source>Proteins</source><year>2003</year><volume>53</volume><fpage>469</fpage><lpage>479</lpage></citation></ref>
<ref id="b4-ijms-14-20635"><label>4</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname><given-names>D.</given-names></name><name><surname>Zhang</surname><given-names>Y</given-names></name></person-group><article-title><italic>Ab initio</italic> protein structure assembly using continuous structure fragments and optimized knowledge-based force field</article-title><source>Proteins</source><year>2012</year><volume>80</volume><fpage>1715</fpage><lpage>1735</lpage></citation></ref>
<ref id="b5-ijms-14-20635"><label>5</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Venkatraman</surname><given-names>V.</given-names></name><name><surname>Yang</surname><given-names>Y.D.</given-names></name><name><surname>Sael</surname><given-names>L.</given-names></name><name><surname>Kihara</surname><given-names>D</given-names></name></person-group><article-title>Protein-protein docking using region-based 3D Zernike descriptors</article-title><source>BMC Bioinf</source><year>2009</year><volume>10</volume><fpage>407</fpage></citation></ref>
<ref id="b6-ijms-14-20635"><label>6</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kihara</surname><given-names>D.</given-names></name><name><surname>Sael</surname><given-names>L.</given-names></name><name><surname>Chikhi</surname><given-names>R.</given-names></name><name><surname>Esquivel-Rodriguez</surname><given-names>J</given-names></name></person-group><article-title>Molecular surface representation using 3D Zernike descriptors for protein shape comparison and docking</article-title><source>Curr. Protein Pept. Sci</source><year>2011</year><volume>12</volume><fpage>520</fpage><lpage>530</lpage></citation></ref>
<ref id="b7-ijms-14-20635"><label>7</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sael</surname><given-names>L.</given-names></name><name><surname>Chitale</surname><given-names>M.</given-names></name><name><surname>Kihara</surname><given-names>D</given-names></name></person-group><article-title>Structure- and sequence-based function prediction for non-homologous proteins</article-title><source>J. Struct. Funct. Genomics</source><year>2012</year><volume>13</volume><fpage>111</fpage><lpage>123</lpage></citation></ref>
<ref id="b8-ijms-14-20635"><label>8</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sael</surname><given-names>L.</given-names></name><name><surname>Kihara</surname><given-names>D</given-names></name></person-group><article-title>Binding ligand prediction for proteins using partial matching of local surface patches</article-title><source>Int. J. Mol. Sci</source><year>2010</year><volume>11</volume><fpage>5009</fpage><lpage>5026</lpage></citation></ref>
<ref id="b9-ijms-14-20635"><label>9</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benesch</surname><given-names>J.L.P.</given-names></name><name><surname>Ruotolo</surname><given-names>B.T.</given-names></name></person-group><article-title>Mass spectrometry: Come of age for structural and dynamical biology</article-title><source>Curr. Opin. Struct. Biol</source><year>2011</year><volume>21</volume><fpage>641</fpage><lpage>649</lpage></citation></ref>
<ref id="b10-ijms-14-20635"><label>10</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Kaur</surname><given-names>P.</given-names></name><name><surname>Chance</surname><given-names>M</given-names></name></person-group><article-title>The Utility of Mass Spectrometry Based Structural Proteomics in Biopharmaceutical Biologics Development</article-title><source>Integrative Proteomics</source><person-group person-group-type="editor"><name><surname>Leung</surname><given-names>H.-C.</given-names></name></person-group><publisher-name>InTech</publisher-name><publisher-loc>Cleveland, OH, USA</publisher-loc><year>2012</year><fpage>340</fpage><lpage>412</lpage></citation></ref>
<ref id="b11-ijms-14-20635"><label>11</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dobson</surname><given-names>C.M.</given-names></name></person-group><article-title>Protein folding and misfolding</article-title><source>Nature</source><year>2003</year><volume>426</volume><fpage>884</fpage><lpage>890</lpage></citation></ref>
<ref id="b12-ijms-14-20635"><label>12</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Glish</surname><given-names>G.L.</given-names></name><name><surname>Vachet</surname><given-names>R.W.</given-names></name></person-group><article-title>The basics of mass spectrometry in the twenty-first century</article-title><source>Nat. Rev. Drug Discovery</source><year>2003</year><volume>2</volume><fpage>140</fpage><lpage>150</lpage></citation></ref>
<ref id="b13-ijms-14-20635"><label>13</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Konermann</surname><given-names>L.</given-names></name><name><surname>Pan</surname><given-names>J.</given-names></name><name><surname>Liu</surname><given-names>Y.-H.</given-names></name></person-group><article-title>Hydrogen exchange mass spectrometry for studying protein structure and dynamics</article-title><source>Chem. Soc. Rev</source><year>2011</year><volume>40</volume><fpage>1224</fpage><lpage>1234</lpage></citation></ref>
<ref id="b14-ijms-14-20635"><label>14</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kiselar</surname><given-names>J.G.</given-names></name><name><surname>Chance</surname><given-names>M.R.</given-names></name></person-group><article-title>Future directions of structural mass spectrometry using hydroxyl radical footprinting</article-title><source>J. Mass Spectrom</source><year>2010</year><volume>45</volume><fpage>1373</fpage><lpage>1382</lpage></citation></ref>
<ref id="b15-ijms-14-20635"><label>15</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fioramonte</surname><given-names>M.</given-names></name><name><surname>dos Santos</surname><given-names>A.M.</given-names></name><name><surname>McIlwain</surname><given-names>S.</given-names></name><name><surname>Noble</surname><given-names>W.S.</given-names></name><name><surname>Franchini</surname><given-names>K.G.</given-names></name><name><surname>Gozzo</surname><given-names>F.C.</given-names></name></person-group><article-title>Analysis of secondary structure in proteins by chemical cross-linking coupled to MS</article-title><source>Proteomics</source><year>2012</year><volume>12</volume><fpage>2746</fpage><lpage>2752</lpage></citation></ref>
<ref id="b16-ijms-14-20635"><label>16</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Uetrecht</surname><given-names>C.</given-names></name><name><surname>Rose</surname><given-names>R.J.</given-names></name><name><surname>van Duijn</surname><given-names>E.</given-names></name><name><surname>Lorenzen</surname><given-names>K.</given-names></name><name><surname>Heck</surname><given-names>A.J.R.</given-names></name></person-group><article-title>Ion mobility mass spectrometry of proteins and protein assemblies</article-title><source>Chem. Soc. Rev</source><year>2010</year><volume>39</volume><fpage>1633</fpage><lpage>1655</lpage></citation></ref>
<ref id="b17-ijms-14-20635"><label>17</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Trible</surname><given-names>R.P.</given-names></name><name><surname>Emert-Sedlak</surname><given-names>L.</given-names></name><name><surname>Wales</surname><given-names>T.E.</given-names></name><name><surname>Ayyavoo</surname><given-names>V.</given-names></name><name><surname>Engen</surname><given-names>J.R.</given-names></name><name><surname>Smithgall</surname><given-names>T.E.</given-names></name></person-group><article-title>Allosteric loss-of-function mutations in HIV-1 Nef from a long-term non-progressor</article-title><source>J. Mol. Biol</source><year>2007</year><volume>374</volume><fpage>121</fpage><lpage>129</lpage></citation></ref>
<ref id="b18-ijms-14-20635"><label>18</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morgan</surname><given-names>C.R.</given-names></name><name><surname>Hebling</surname><given-names>C.M.</given-names></name><name><surname>Rand</surname><given-names>K.D.</given-names></name><name><surname>Stafford</surname><given-names>D.W.</given-names></name><name><surname>Jorgenson</surname><given-names>J.W.</given-names></name><name><surname>Engen</surname><given-names>J.R.</given-names></name></person-group><article-title>Conformational transitions in the membrane scaffold protein of phospholipid bilayer nanodiscs</article-title><source>Mol. Cell. Proteomics</source><year>2011</year><volume>10</volume><pub-id pub-id-type="doi">10.1074/mcp.M111.010876.</pub-id></citation></ref>
<ref id="b19-ijms-14-20635"><label>19</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>J.</given-names></name><name><surname>Chalmers</surname><given-names>M.J.</given-names></name><name><surname>Stayrook</surname><given-names>K.R.</given-names></name><name><surname>Burris</surname><given-names>L.L.</given-names></name><name><surname>Garcia-Ordonez</surname><given-names>R.D.</given-names></name><name><surname>Pascal</surname><given-names>B.D.</given-names></name><name><surname>Burris</surname><given-names>T.P.</given-names></name><name><surname>Dodge</surname><given-names>J.A.</given-names></name><name><surname>Griffin</surname><given-names>P.R.</given-names></name></person-group><article-title>Hydrogen/deuterium exchange reveals distinct agonist/partial agonist receptor dynamics within vitamin D receptor/retinoid X receptor heterodimer</article-title><source>Structure</source><year>2010</year><volume>18</volume><fpage>1332</fpage><lpage>1341</lpage></citation></ref>
<ref id="b20-ijms-14-20635"><label>20</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Charv&#x000E1;tov&#x000E1;</surname><given-names>O.</given-names></name><name><surname>Foley</surname><given-names>B.L.</given-names></name><name><surname>Bern</surname><given-names>M.W.</given-names></name><name><surname>Sharp</surname><given-names>J.S.</given-names></name><name><surname>Orlando</surname><given-names>R.</given-names></name><name><surname>Woods</surname><given-names>R.J.</given-names></name></person-group><article-title>Quantifying protein interface footprinting by hydroxyl radical oxidation and molecular dynamics simulation: Application to galectin-1</article-title><source>J. Am. Soc. Mass Spectrom</source><year>2008</year><volume>19</volume><fpage>1692</fpage><lpage>1705</lpage></citation></ref>
<ref id="b21-ijms-14-20635"><label>21</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname><given-names>G.</given-names></name><name><surname>Chance</surname><given-names>M.R.</given-names></name></person-group><article-title>Radiolytic modification and reactivity of amino acid residues serving as structural probes for protein footprinting</article-title><source>Anal. Chem</source><year>2005</year><volume>77</volume><fpage>4549</fpage><lpage>4555</lpage></citation></ref>
<ref id="b22-ijms-14-20635"><label>22</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Takamoto</surname><given-names>K.</given-names></name><name><surname>Chance</surname><given-names>M.R.</given-names></name></person-group><article-title>Radiolytic protein footprinting with mass spectrometry to probe the structure of macromolecular complexes</article-title><source>Annu. Rev. Biophys. Biomol. Struct</source><year>2006</year><volume>35</volume><fpage>251</fpage><lpage>276</lpage></citation></ref>
<ref id="b23-ijms-14-20635"><label>23</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname><given-names>L.</given-names></name><name><surname>Qin</surname><given-names>Y.</given-names></name><name><surname>Ilchenko</surname><given-names>S.</given-names></name><name><surname>Bohon</surname><given-names>J.</given-names></name><name><surname>Shi</surname><given-names>W.</given-names></name><name><surname>Cho</surname><given-names>M.W.</given-names></name><name><surname>Takamoto</surname><given-names>K.</given-names></name><name><surname>Chance</surname><given-names>M.R.</given-names></name></person-group><article-title>Structural analysis of a highly glycosylated and unliganded gp120-based antigen using mass spectrometry</article-title><source>Biochemistry</source><year>2010</year><volume>49</volume><fpage>9032</fpage><lpage>9045</lpage></citation></ref>
<ref id="b24-ijms-14-20635"><label>24</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmitz</surname><given-names>A.</given-names></name><name><surname>Galas</surname><given-names>D.J.</given-names></name></person-group><article-title>Sequence-specific interactions of the tight-binding I12-X86 lac repressor with non-operator DNA</article-title><source>Nucleic Acids Res</source><year>1980</year><volume>8</volume><fpage>487</fpage><lpage>506</lpage></citation></ref>
<ref id="b25-ijms-14-20635"><label>25</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Angel</surname><given-names>T.E.</given-names></name><name><surname>Chance</surname><given-names>M.R.</given-names></name><name><surname>Palczewski</surname><given-names>K</given-names></name></person-group><article-title>Conserved waters mediate structural and functional activation of family A (rhodopsin-like) G protein-coupled receptors</article-title><source>Proc. Natl. Acad. Sci. USA</source><year>2009</year><volume>106</volume><fpage>8555</fpage><lpage>8560</lpage></citation></ref>
<ref id="b26-ijms-14-20635"><label>26</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sinz</surname><given-names>A</given-names></name></person-group><article-title>Chemical cross-linking and mass spectrometry to map three-dimensional protein structures and protein-protein interactions</article-title><source>Mass Spectrom. Rev</source><year>2006</year><volume>25</volume><fpage>663</fpage><lpage>682</lpage></citation></ref>
<ref id="b27-ijms-14-20635"><label>27</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jaya</surname><given-names>N.</given-names></name><name><surname>Garcia</surname><given-names>V.</given-names></name><name><surname>Vierling</surname><given-names>E</given-names></name></person-group><article-title>Substrate binding site flexibility of the small heat shock protein molecular chaperones</article-title><source>Proc. Natl. Acad. Sci. USA</source><year>2009</year><volume>106</volume><fpage>15604</fpage><lpage>15609</lpage></citation></ref>
<ref id="b28-ijms-14-20635"><label>28</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharon</surname><given-names>M.</given-names></name><name><surname>Taverner</surname><given-names>T.</given-names></name><name><surname>Ambroggio</surname><given-names>X.I.</given-names></name><name><surname>Deshaies</surname><given-names>R.J.</given-names></name><name><surname>Robinson</surname><given-names>C.V.</given-names></name></person-group><article-title>Structural organization of the 19S proteasome lid: Insights from MS of intact complexes</article-title><source>PLoS Biol</source><year>2006</year><volume>4</volume><fpage>e267</fpage></citation></ref>
<ref id="b29-ijms-14-20635"><label>29</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kang</surname><given-names>S.</given-names></name><name><surname>Hawkridge</surname><given-names>A.M.</given-names></name><name><surname>Johnson</surname><given-names>K.L.</given-names></name><name><surname>Muddiman</surname><given-names>D.C.</given-names></name><name><surname>Prevelige</surname><given-names>P.E.</given-names></name></person-group><article-title>Identification of subunit-subunit interactions in bacteriophage P22 procapsids by chemical cross-linking and mass spectrometry</article-title><source>J. Proteome Res</source><year>2006</year><volume>5</volume><fpage>370</fpage><lpage>377</lpage></citation></ref>
<ref id="b30-ijms-14-20635"><label>30</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pacholarz</surname><given-names>K.J.</given-names></name><name><surname>Garlish</surname><given-names>R.A.</given-names></name><name><surname>Taylor</surname><given-names>R.J.</given-names></name><name><surname>Barran</surname><given-names>P.E.</given-names></name></person-group><article-title>Mass spectrometry based tools to investigate protein-ligand interactions for drug discovery</article-title><source>Chem. Soc. Rev</source><year>2012</year><volume>41</volume><fpage>4335</fpage><lpage>4355</lpage></citation></ref>
<ref id="b31-ijms-14-20635"><label>31</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jurneczko</surname><given-names>E.</given-names></name><name><surname>Barran</surname><given-names>P.E.</given-names></name></person-group><article-title>How useful is ion mobility mass spectrometry for structural biology? The relationship between protein crystal structures and their collision cross sections in the gas phase</article-title><source>Analyst</source><year>2011</year><volume>136</volume><fpage>20</fpage><lpage>28</lpage></citation></ref>
<ref id="b32-ijms-14-20635"><label>32</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Calvo</surname><given-names>F.</given-names></name><name><surname>Chirot</surname><given-names>F.</given-names></name><name><surname>Albrieux</surname><given-names>F.</given-names></name><name><surname>Lemoine</surname><given-names>J.</given-names></name><name><surname>Tsybin</surname><given-names>Y.O.</given-names></name><name><surname>Pernot</surname><given-names>P.</given-names></name><name><surname>Dugourd</surname><given-names>P</given-names></name></person-group><article-title>Statistical analysis of ion mobility spectrometry. II. Adaptively biased methods and shape correlations</article-title><source>J. Am. Soc. Mass Spectrom</source><year>2012</year><volume>23</volume><fpage>1279</fpage><lpage>1288</lpage></citation></ref>
<ref id="b33-ijms-14-20635"><label>33</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ruotolo</surname><given-names>B.T.</given-names></name><name><surname>Giles</surname><given-names>K.</given-names></name><name><surname>Campuzano</surname><given-names>I.</given-names></name><name><surname>Sandercock</surname><given-names>A.M.</given-names></name><name><surname>Bateman</surname><given-names>R.H.</given-names></name><name><surname>Robinson</surname><given-names>C.V.</given-names></name></person-group><article-title>Evidence for macromolecular protein rings in the absence of bulk water</article-title><source>Science</source><year>2005</year><volume>310</volume><fpage>1658</fpage><lpage>1661</lpage></citation></ref>
<ref id="b34-ijms-14-20635"><label>34</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bernstein</surname><given-names>S.L.</given-names></name><name><surname>Wyttenbach</surname><given-names>T.</given-names></name><name><surname>Baumketner</surname><given-names>A.</given-names></name><name><surname>Shea</surname><given-names>J.-E.</given-names></name><name><surname>Bitan</surname><given-names>G.</given-names></name><name><surname>Teplow</surname><given-names>D.B.</given-names></name><name><surname>Bowers</surname><given-names>M.T.</given-names></name></person-group><article-title>Amyloid beta-protein: Monomer structure and early aggregation states of Abeta42 and its Pro19 alloform</article-title><source>J. Am. Chem. Soc</source><year>2005</year><volume>127</volume><fpage>2075</fpage><lpage>2084</lpage></citation></ref>
<ref id="b35-ijms-14-20635"><label>35</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smith</surname><given-names>D.P.</given-names></name><name><surname>Woods</surname><given-names>L.A.</given-names></name><name><surname>Radford</surname><given-names>S.E.</given-names></name><name><surname>Ashcroft</surname><given-names>A.E.</given-names></name></person-group><article-title>Structure and dynamics of oligomeric intermediates in &#x003B2;2-microglobulin self-assembly</article-title><source>Biophys. J</source><year>2011</year><volume>101</volume><fpage>1238</fpage><lpage>1247</lpage></citation></ref>
<ref id="b36-ijms-14-20635"><label>36</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kanu</surname><given-names>A.B.</given-names></name><name><surname>Dwivedi</surname><given-names>P.</given-names></name><name><surname>Tam</surname><given-names>M.</given-names></name><name><surname>Matz</surname><given-names>L.</given-names></name><name><surname>Hill</surname><given-names>H.H.</given-names></name></person-group><article-title>Ion mobility-mass spectrometry</article-title><source>J. Mass Spectrom</source><year>2008</year><volume>43</volume><fpage>1</fpage><lpage>22</lpage></citation></ref>
<ref id="b37-ijms-14-20635"><label>37</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van den Heuvel</surname><given-names>R.H.H.</given-names></name><name><surname>Heck</surname><given-names>A.J.R.</given-names></name></person-group><article-title>Native protein mass spectrometry: From intact oligomers to functional machineries</article-title><source>Curr. Opin. Chem. Biol</source><year>2004</year><volume>8</volume><fpage>519</fpage><lpage>526</lpage></citation></ref>
<ref id="b38-ijms-14-20635"><label>38</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heck</surname><given-names>A.J.R.</given-names></name></person-group><article-title>Native mass spectrometry: A bridge between interactomics and structural biology</article-title><source>Nat. Methods</source><year>2008</year><volume>5</volume><fpage>927</fpage><lpage>933</lpage></citation></ref>
<ref id="b39-ijms-14-20635"><label>39</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Duijn</surname><given-names>E</given-names></name></person-group><article-title>Current limitations in native mass spectrometry based structural biology</article-title><source>J. Am. Soc. Mass Spectrom</source><year>2010</year><volume>21</volume><fpage>971</fpage><lpage>978</lpage></citation></ref>
<ref id="b40-ijms-14-20635"><label>40</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Konijnenberg</surname><given-names>A.</given-names></name><name><surname>Butterer</surname><given-names>A.</given-names></name><name><surname>Sobott</surname><given-names>F</given-names></name></person-group><article-title>Native ion mobility-mass spectrometry and related methods in structural biology</article-title><source>Biochim. Biophys. Acta</source><year>2012</year><volume>1834</volume><fpage>1239</fpage><lpage>1256</lpage></citation></ref>
<ref id="b41-ijms-14-20635"><label>41</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kebarle</surname><given-names>P.</given-names></name><name><surname>Verkerk</surname><given-names>U.H.</given-names></name></person-group><article-title>Electrospray: From ions in solution to ions in the gas phase, what we know now</article-title><source>Mass Spectrom. Rev</source><year>2009</year><volume>28</volume><fpage>898</fpage><lpage>917</lpage></citation></ref>
<ref id="b42-ijms-14-20635"><label>42</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Taverner</surname><given-names>T.</given-names></name><name><surname>Hern&#x000E1;ndez</surname><given-names>H.</given-names></name><name><surname>Sharon</surname><given-names>M.</given-names></name><name><surname>Ruotolo</surname><given-names>B.T.</given-names></name><name><surname>Matak-Vinkovi&#x00107;</surname><given-names>D.</given-names></name><name><surname>Devos</surname><given-names>D.</given-names></name><name><surname>Russell</surname><given-names>R.B.</given-names></name><name><surname>Robinson</surname><given-names>C.V.</given-names></name></person-group><article-title>Subunit architecture of intact protein complexes from mass spectrometry and homology modeling</article-title><source>Acc. Chem. Res</source><year>2008</year><volume>41</volume><fpage>617</fpage><lpage>627</lpage></citation></ref>
<ref id="b43-ijms-14-20635"><label>43</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Loo</surname><given-names>J.A.</given-names></name><name><surname>Berhane</surname><given-names>B.</given-names></name><name><surname>Kaddis</surname><given-names>C.S.</given-names></name><name><surname>Wooding</surname><given-names>K.M.</given-names></name><name><surname>Xie</surname><given-names>Y.</given-names></name><name><surname>Kaufman</surname><given-names>S.L.</given-names></name><name><surname>Chernushevich</surname><given-names>I.V.</given-names></name></person-group><article-title>Electrospray ionization mass spectrometry and ion mobility analysis of the 20S proteasome complex</article-title><source>J. Am.Soc. Mass Spectrom</source><year>2005</year><volume>16</volume><fpage>998</fpage><lpage>1008</lpage></citation></ref>
<ref id="b44-ijms-14-20635"><label>44</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharon</surname><given-names>M.</given-names></name><name><surname>Witt</surname><given-names>S.</given-names></name><name><surname>Felderer</surname><given-names>K.</given-names></name><name><surname>Rockel</surname><given-names>B.</given-names></name><name><surname>Baumeister</surname><given-names>W.</given-names></name><name><surname>Robinson</surname><given-names>C.V.</given-names></name></person-group><article-title>20S proteasomes have the potential to keep substrates in store for continual degradation</article-title><source>J. Biol. Chem</source><year>2006</year><volume>281</volume><fpage>9569</fpage><lpage>9575</lpage></citation></ref>
<ref id="b45-ijms-14-20635"><label>45</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lorenzen</surname><given-names>K.</given-names></name><name><surname>Vannini</surname><given-names>A.</given-names></name><name><surname>Cramer</surname><given-names>P.</given-names></name><name><surname>Heck</surname><given-names>A.J.R.</given-names></name></person-group><article-title>Structural biology of RNA polymerase III: Mass spectrometry elucidates subcomplex architecture</article-title><source>Structure</source><year>2007</year><volume>15</volume><fpage>1237</fpage><lpage>1245</lpage></citation></ref>
<ref id="b46-ijms-14-20635"><label>46</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Synowsky</surname><given-names>S.A.</given-names></name><name><surname>van den Heuvel</surname><given-names>R.H.H.</given-names></name><name><surname>Mohammed</surname><given-names>S.</given-names></name><name><surname>Pijnappel</surname><given-names>P.W.W.M.</given-names></name><name><surname>Heck</surname><given-names>A.J.R.</given-names></name></person-group><article-title>Probing genuine strong interactions and post-translational modifications in the heterogeneous yeast exosome protein complex</article-title><source>Mol.Cell. Proteomics</source><year>2006</year><volume>5</volume><fpage>1581</fpage><lpage>1592</lpage></citation></ref>
<ref id="b47-ijms-14-20635"><label>47</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>Y</given-names></name></person-group><article-title>Protein structure prediction: When is it useful?</article-title><source>Curr. Opin. Struct. Biol</source><year>2009</year><volume>19</volume><fpage>145</fpage><lpage>155</lpage></citation></ref>
<ref id="b48-ijms-14-20635"><label>48</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sali</surname><given-names>A.</given-names></name><name><surname>Blundell</surname><given-names>T.L.</given-names></name></person-group><article-title>Comparative protein modelling by satisfaction of spatial restraints</article-title><source>J. Mol. Biol</source><year>1993</year><volume>234</volume><fpage>779</fpage><lpage>815</lpage></citation></ref>
<ref id="b49-ijms-14-20635"><label>49</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arnold</surname><given-names>K.</given-names></name><name><surname>Bordoli</surname><given-names>L.</given-names></name><name><surname>Kopp</surname><given-names>J.</given-names></name><name><surname>Schwede</surname><given-names>T</given-names></name></person-group><article-title>The SWISS-MODEL workspace: A web-based environment for protein structure homology modelling</article-title><source>Bioinformatics</source><year>2006</year><volume>22</volume><fpage>195</fpage><lpage>201</lpage></citation></ref>
<ref id="b50-ijms-14-20635"><label>50</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>S&#x000F6;ding</surname><given-names>J</given-names></name></person-group><article-title>Protein homology detection by HMM-HMM comparison</article-title><source>Bioinformatics</source><year>2005</year><volume>21</volume><fpage>951</fpage><lpage>960</lpage></citation></ref>
<ref id="b51-ijms-14-20635"><label>51</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peng</surname><given-names>J.</given-names></name><name><surname>Xu</surname><given-names>J</given-names></name></person-group><article-title>RaptorX: Exploiting structure information for protein alignment by statistical inference</article-title><source>Proteins</source><year>2011</year><volume>79</volume><fpage>161</fpage><lpage>171</lpage></citation></ref>
<ref id="b52-ijms-14-20635"><label>52</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname><given-names>S.</given-names></name><name><surname>Zhang</surname><given-names>Y</given-names></name></person-group><article-title>MUSTER: Improving protein sequence profile-profile alignments by using multiple sources of structure information</article-title><source>Proteins</source><year>2008</year><volume>72</volume><fpage>547</fpage><lpage>556</lpage></citation></ref>
<ref id="b53-ijms-14-20635"><label>53</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>Y.</given-names></name><name><surname>Faraggi</surname><given-names>E.</given-names></name><name><surname>Zhao</surname><given-names>H.</given-names></name><name><surname>Zhou</surname><given-names>Y</given-names></name></person-group><article-title>Improving protein fold recognition and template-based modeling by employing probabilistic-based matching between predicted one-dimensional structural properties of query and corresponding native properties of templates</article-title><source>Bioinformatics</source><year>2011</year><volume>27</volume><fpage>2076</fpage><lpage>2082</lpage></citation></ref>
<ref id="b54-ijms-14-20635"><label>54</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Das</surname><given-names>R.</given-names></name><name><surname>Qian</surname><given-names>B.</given-names></name><name><surname>Raman</surname><given-names>S.</given-names></name><name><surname>Vernon</surname><given-names>R.</given-names></name><name><surname>Thompson</surname><given-names>J.</given-names></name><name><surname>Bradley</surname><given-names>P.</given-names></name><name><surname>Khare</surname><given-names>S.</given-names></name><name><surname>Tyka</surname><given-names>M.D.</given-names></name><name><surname>Bhat</surname><given-names>D.</given-names></name><name><surname>Chivian</surname><given-names>D.</given-names></name><etal/></person-group><article-title>Structure prediction for CASP7 targets using extensive all-atom refinement with Rosetta@home</article-title><source>Proteins</source><year>2007</year><volume>69</volume><fpage>118</fpage><lpage>128</lpage></citation></ref>
<ref id="b55-ijms-14-20635"><label>55</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roy</surname><given-names>A.</given-names></name><name><surname>Kucukural</surname><given-names>A.</given-names></name><name><surname>Zhang</surname><given-names>Y</given-names></name></person-group><article-title>I-TASSER: A unified platform for automated protein structure and function prediction</article-title><source>Nat. Protoc</source><year>2010</year><volume>5</volume><fpage>725</fpage><lpage>738</lpage></citation></ref>
<ref id="b56-ijms-14-20635"><label>56</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fujitsuka</surname><given-names>Y.</given-names></name><name><surname>Chikenji</surname><given-names>G.</given-names></name><name><surname>Takada</surname><given-names>S</given-names></name></person-group><article-title>SimFold energy function for de novo protein structure prediction: Consensus with Rosetta</article-title><source>Proteins</source><year>2006</year><volume>62</volume><fpage>381</fpage><lpage>398</lpage></citation></ref>
<ref id="b57-ijms-14-20635"><label>57</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Takada</surname><given-names>S</given-names></name></person-group><article-title>Protein folding simulation with solvent-induced force field: Folding pathway ensemble of three-helix-bundle proteins</article-title><source>Proteins</source><year>2001</year><volume>42</volume><fpage>85</fpage><lpage>98</lpage></citation></ref>
<ref id="b58-ijms-14-20635"><label>58</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kolinski</surname><given-names>A</given-names></name></person-group><article-title>Protein modeling and structure prediction with a reduced representation</article-title><source>Acta Biochim</source><year>2004</year><volume>51</volume><fpage>349</fpage><lpage>371</lpage></citation></ref>
<ref id="b59-ijms-14-20635"><label>59</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mart&#x000ED;-Renom</surname><given-names>M.A.</given-names></name><name><surname>Stuart</surname><given-names>A.C.</given-names></name><name><surname>Fiser</surname><given-names>A.</given-names></name><name><surname>S&#x000E1;nchez</surname><given-names>R.</given-names></name><name><surname>Melo</surname><given-names>F.</given-names></name><name><surname>Sali</surname><given-names>A</given-names></name></person-group><article-title>Comparative protein structure modeling of genes and genomes</article-title><source>Annu. Rev. Biophys. Biomol. Struct</source><year>2000</year><volume>29</volume><fpage>291</fpage><lpage>325</lpage></citation></ref>
<ref id="b60-ijms-14-20635"><label>60</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Roy</surname><given-names>A.</given-names></name><name><surname>Zhang</surname><given-names>Y</given-names></name></person-group><source>Protein Structure Prediction</source><comment>eLS</comment><publisher-name>John Wiley &amp; Sons, Ltd</publisher-name><publisher-loc>Chichester, UK</publisher-loc><year>2007</year></citation></ref>
<ref id="b61-ijms-14-20635"><label>61</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Apostolico</surname><given-names>A.</given-names></name><name><surname>Giancarlo</surname><given-names>R</given-names></name></person-group><article-title>Sequence alignment in molecular biology</article-title><source>J. Comput. Biol</source><year>1998</year><volume>5</volume><fpage>173</fpage><lpage>196</lpage></citation></ref>
<ref id="b62-ijms-14-20635"><label>62</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pearson</surname><given-names>W.R.</given-names></name><name><surname>Lipman</surname><given-names>D.J.</given-names></name></person-group><article-title>Improved tools for biological sequence comparison</article-title><source>Proc. Natl. Acad. Sci. USA</source><year>1988</year><volume>85</volume><fpage>2444</fpage><lpage>2448</lpage></citation></ref>
<ref id="b63-ijms-14-20635"><label>63</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altschul</surname><given-names>S.F.</given-names></name><name><surname>Gish</surname><given-names>W.</given-names></name><name><surname>Miller</surname><given-names>W.</given-names></name><name><surname>Myers</surname><given-names>E.W.</given-names></name><name><surname>Lipman</surname><given-names>D.J.</given-names></name></person-group><article-title>Basic local alignment search tool</article-title><source>J. Mol. Biol.</source><year>1990</year><volume>215</volume><fpage>403</fpage><lpage>410</lpage></citation></ref>
<ref id="b64-ijms-14-20635"><label>64</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lipman</surname><given-names>D.J.</given-names></name><name><surname>Altschul</surname><given-names>S.F.</given-names></name><name><surname>Kececioglu</surname><given-names>J.D.</given-names></name></person-group><article-title>A tool for multiple sequence alignment</article-title><source>Proc. Natl. Acad.Sci. USA</source><year>1989</year><volume>86</volume><fpage>4412</fpage><lpage>4415</lpage></citation></ref>
<ref id="b65-ijms-14-20635"><label>65</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Edgar</surname><given-names>R.C.</given-names></name></person-group><article-title>MUSCLE: Multiple sequence alignment with high accuracy and high throughput</article-title><source>Nucleic Acids Res</source><year>2004</year><volume>32</volume><fpage>1792</fpage><lpage>1797</lpage></citation></ref>
<ref id="b66-ijms-14-20635"><label>66</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Larkin</surname><given-names>M.A.</given-names></name><name><surname>Blackshields</surname><given-names>G.</given-names></name><name><surname>Brown</surname><given-names>N.P.</given-names></name><name><surname>Chenna</surname><given-names>R.</given-names></name><name><surname>McGettigan</surname><given-names>P.A.</given-names></name><name><surname>McWilliam</surname><given-names>H.</given-names></name><name><surname>Valentin</surname><given-names>F.</given-names></name><name><surname>Wallace</surname><given-names>I.M.</given-names></name><name><surname>Wilm</surname><given-names>A.</given-names></name><name><surname>Lopez</surname><given-names>R.</given-names></name><etal/></person-group><article-title>Clustal W and Clustal X version 2.0</article-title><source>Bioinformatics</source><year>2007</year><volume>23</volume><fpage>2947</fpage><lpage>2948</lpage></citation></ref>
<ref id="b67-ijms-14-20635"><label>67</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altschul</surname><given-names>S.F.</given-names></name><name><surname>Madden</surname><given-names>T.L.</given-names></name><name><surname>Sch&#x000E4;ffer</surname><given-names>A.A.</given-names></name><name><surname>Zhang</surname><given-names>J.</given-names></name><name><surname>Zhang</surname><given-names>Z.</given-names></name><name><surname>Miller</surname><given-names>W.</given-names></name><name><surname>Lipman</surname><given-names>D.J.</given-names></name></person-group><article-title>Gapped BLAST and PSI-BLAST: A new generation of protein database search programs</article-title><source>Nucleic Acids Res</source><year>1997</year><volume>25</volume><fpage>3389</fpage><lpage>3402</lpage></citation></ref>
<ref id="b68-ijms-14-20635"><label>68</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blundell</surname><given-names>T.L.</given-names></name><name><surname>Sibanda</surname><given-names>B.L.</given-names></name><name><surname>Sternberg</surname><given-names>M.J.E.</given-names></name><name><surname>Thornton</surname><given-names>J.M.</given-names></name></person-group><article-title>Knowledge-based prediction of protein structures and the design of novel molecules</article-title><source>Nature</source><year>1987</year><volume>326</volume><fpage>347</fpage><lpage>352</lpage></citation></ref>
<ref id="b69-ijms-14-20635"><label>69</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baker</surname><given-names>D.</given-names></name><name><surname>Sali</surname><given-names>A</given-names></name></person-group><article-title>Protein structure prediction and structural genomics</article-title><source>Science</source><year>2001</year><volume>294</volume><fpage>93</fpage><lpage>96</lpage></citation></ref>
<ref id="b70-ijms-14-20635"><label>70</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wallner</surname><given-names>B.</given-names></name><name><surname>Elofsson</surname><given-names>A</given-names></name></person-group><article-title>All are not equal: A benchmark of different homology modeling programs</article-title><source>Protein Sci</source><year>2005</year><volume>14</volume><fpage>1315</fpage><lpage>1327</lpage></citation></ref>
<ref id="b71-ijms-14-20635"><label>71</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Unger</surname><given-names>R.</given-names></name><name><surname>Harel</surname><given-names>D.</given-names></name><name><surname>Wherland</surname><given-names>S.</given-names></name><name><surname>Sussman</surname><given-names>J</given-names></name></person-group><article-title>A 3D building blocks approach to analyzing and predicting structure of proteins</article-title><source>Proteins</source><year>1989</year><volume>5</volume><fpage>355</fpage><lpage>373</lpage></citation></ref>
<ref id="b72-ijms-14-20635"><label>72</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Levitt</surname><given-names>M</given-names></name></person-group><article-title>Accurate modeling of protein conformation by automatic segment matching</article-title><source>J. Mol. Biol</source><year>1992</year><volume>226</volume><fpage>507</fpage><lpage>533</lpage></citation></ref>
<ref id="b73-ijms-14-20635"><label>73</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bowie</surname><given-names>J.</given-names></name><name><surname>Clarke</surname><given-names>N.</given-names></name><name><surname>Pabo</surname><given-names>C.</given-names></name><name><surname>Sauer</surname><given-names>R</given-names></name></person-group><article-title>Identification of protein folds: Matching hydrophobicity patterns of sequence sets with solvent accessibility patterns of known structures</article-title><source>Proteins</source><year>1990</year><volume>7</volume><fpage>257</fpage><lpage>264</lpage></citation></ref>
<ref id="b74-ijms-14-20635"><label>74</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bowie</surname><given-names>J.</given-names></name><name><surname>Luthy</surname><given-names>R.</given-names></name><name><surname>Eisenberg</surname><given-names>D</given-names></name></person-group><article-title>A method to identify protein sequences that fold into a known three-dimensional structure</article-title><source>Science</source><year>1991</year><volume>253</volume><fpage>164</fpage><lpage>170</lpage></citation></ref>
<ref id="b75-ijms-14-20635"><label>75</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname><given-names>Y.</given-names></name><name><surname>Xu</surname><given-names>D</given-names></name></person-group><article-title>Protein threading using PROSPECT: Design and evaluation</article-title><source>Proteins: Struct., Funct., Bioinf</source><year>2000</year><volume>354</volume><fpage>343</fpage><lpage>354</lpage></citation></ref>
<ref id="b76-ijms-14-20635"><label>76</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname><given-names>Y.</given-names></name><name><surname>Xu</surname><given-names>D.</given-names></name><name><surname>Uberbacher</surname><given-names>E.C.</given-names></name></person-group><article-title>An efficient computational method for globally optimal threading</article-title><source>J. Comput. Biol</source><year>1998</year><volume>5</volume><fpage>597</fpage><lpage>614</lpage></citation></ref>
<ref id="b77-ijms-14-20635"><label>77</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname><given-names>H.</given-names></name><name><surname>Zhou</surname><given-names>Y</given-names></name></person-group><article-title>Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments</article-title><source>Proteins</source><year>2005</year><volume>58</volume><fpage>321</fpage><lpage>328</lpage></citation></ref>
<ref id="b78-ijms-14-20635"><label>78</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chakravarty</surname><given-names>S.</given-names></name><name><surname>Varadarajan</surname><given-names>R</given-names></name></person-group><article-title>Residue depth: A novel parameter for the analysis of protein structure and stability</article-title><source>Structure</source><year>1999</year><volume>7</volume><fpage>723</fpage><lpage>732</lpage></citation></ref>
<ref id="b79-ijms-14-20635"><label>79</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Frishman</surname><given-names>D.</given-names></name><name><surname>Argos</surname><given-names>P</given-names></name></person-group><article-title>Knowledge-based protein secondary structure assignment</article-title><source>Proteins</source><year>1995</year><volume>23</volume><fpage>566</fpage><lpage>579</lpage></citation></ref>
<ref id="b80-ijms-14-20635"><label>80</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brooks</surname><given-names>B.</given-names></name><name><surname>Bruccoleri</surname><given-names>R</given-names></name></person-group><article-title>CHARMM: A program for macromolecular energy, minimization, and dynamics calculations</article-title><source>J. Comput. Chem</source><year>1983</year><volume>4</volume><fpage>187</fpage><lpage>217</lpage></citation></ref>
<ref id="b81-ijms-14-20635"><label>81</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Weiner</surname><given-names>S.J.</given-names></name><name><surname>Kollman</surname><given-names>P.A.</given-names></name><name><surname>Case</surname><given-names>D.A.</given-names></name><name><surname>Singh</surname><given-names>U.C.</given-names></name><name><surname>Ghio</surname><given-names>C.</given-names></name><name><surname>Alagona</surname><given-names>G.</given-names></name><name><surname>Profeta</surname><given-names>S.</given-names></name><name><surname>Weiner</surname><given-names>P</given-names></name></person-group><article-title>A new force field for molecular mechanical simulation of nucleic acids and proteins</article-title><source>J. Am. Chem. Soc</source><year>1984</year><volume>106</volume><fpage>765</fpage><lpage>784</lpage></citation></ref>
<ref id="b82-ijms-14-20635"><label>82</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jorgensen</surname><given-names>W.</given-names></name><name><surname>Tirado-Rives</surname><given-names>J</given-names></name></person-group><article-title>The OPLS potential functions for proteins, energy minimizations for crystals of cyclic peptides and crambin</article-title><source>J. Am. Chem. Soc</source><year>1988</year><volume>110</volume><fpage>1657</fpage><lpage>1666</lpage></citation></ref>
<ref id="b83-ijms-14-20635"><label>83</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liwo</surname><given-names>A.</given-names></name><name><surname>Pincus</surname><given-names>M.R.</given-names></name><name><surname>Wawak</surname><given-names>R.J.</given-names></name><name><surname>Rackovsky</surname><given-names>S.</given-names></name><name><surname>Scheraga</surname><given-names>H.A.</given-names></name></person-group><article-title>Calculation of protein backbone geometry from alpha-carbon coordinates based on peptide-group dipole alignment</article-title><source>Protein Sci</source><year>1993</year><volume>2</volume><fpage>1697</fpage><lpage>1714</lpage></citation></ref>
<ref id="b84-ijms-14-20635"><label>84</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lazaridis</surname><given-names>T.</given-names></name><name><surname>Karplus</surname><given-names>M</given-names></name></person-group><article-title>Effective energy function for proteins in solution</article-title><source>Proteins</source><year>1999</year><volume>35</volume><fpage>133</fpage><lpage>152</lpage></citation></ref>
<ref id="b85-ijms-14-20635"><label>85</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Durham</surname><given-names>E.</given-names></name><name><surname>Dorr</surname><given-names>B.</given-names></name><name><surname>Woetzel</surname><given-names>N.</given-names></name><name><surname>Staritzbichler</surname><given-names>R.</given-names></name><name><surname>Meiler</surname><given-names>J</given-names></name></person-group><article-title>Solvent accessible surface area approximations for rapid and accurate protein structure prediction</article-title><source>J. Mol. Model</source><year>2009</year><volume>15</volume><fpage>1093</fpage><lpage>1108</lpage></citation></ref>
<ref id="b86-ijms-14-20635"><label>86</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bradley</surname><given-names>P.</given-names></name><name><surname>Misura</surname><given-names>K.M.S.</given-names></name><name><surname>Baker</surname><given-names>D</given-names></name></person-group><article-title>Toward high-resolution de novo structure prediction for small proteins</article-title><source>Science</source><year>2005</year><volume>309</volume><fpage>1868</fpage><lpage>1871</lpage></citation></ref>
<ref id="b87-ijms-14-20635"><label>87</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname><given-names>D.</given-names></name><name><surname>Zhang</surname><given-names>J.</given-names></name><name><surname>Roy</surname><given-names>A.</given-names></name><name><surname>Zhang</surname><given-names>Y</given-names></name></person-group><article-title>Automated protein structure modeling in CASP9 by I-TASSER pipeline combined with QUARK-based <italic>ab initio</italic> folding and FG-MD-based structure refinement</article-title><source>Proteins</source><year>2011</year><volume>79</volume><fpage>147</fpage><lpage>160</lpage></citation></ref>
<ref id="b88-ijms-14-20635"><label>88</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gront</surname><given-names>D.</given-names></name><name><surname>Kulp</surname><given-names>D.W.</given-names></name><name><surname>Vernon</surname><given-names>R.M.</given-names></name><name><surname>Strauss</surname><given-names>C.E.M.</given-names></name><name><surname>Baker</surname><given-names>D</given-names></name></person-group><article-title>Generalized fragment picking in Rosetta: Design, protocols and applications</article-title><source>PLoS One</source><year>2011</year><volume>6</volume><fpage>e23294</fpage></citation></ref>
<ref id="b89-ijms-14-20635"><label>89</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Topf</surname><given-names>M.</given-names></name><name><surname>Sali</surname><given-names>A</given-names></name></person-group><article-title>Combining electron microscopy and comparative protein structure modeling</article-title><source>Curr. Opin. Struct. Biol</source><year>2005</year><volume>15</volume><fpage>578</fpage><lpage>585</lpage></citation></ref>
<ref id="b90-ijms-14-20635"><label>90</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schneidman-Duhovny</surname><given-names>D.</given-names></name><name><surname>Kim</surname><given-names>S.J.</given-names></name><name><surname>Sali</surname><given-names>A</given-names></name></person-group><article-title>Integrative structural modeling with small angle X-ray scattering profiles</article-title><source>BMC Struct. Biol</source><year>2012</year><volume>12</volume><fpage>17</fpage></citation></ref>
<ref id="b91-ijms-14-20635"><label>91</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Young</surname><given-names>M.M.</given-names></name><name><surname>Tang</surname><given-names>N.</given-names></name><name><surname>Hempel</surname><given-names>J.C.</given-names></name><name><surname>Oshiro</surname><given-names>C.M.</given-names></name><name><surname>Taylor</surname><given-names>E.W.</given-names></name><name><surname>Kuntz</surname><given-names>I.D.</given-names></name><name><surname>Gibson</surname><given-names>B.W.</given-names></name><name><surname>Dollinger</surname><given-names>G</given-names></name></person-group><article-title>High throughput protein fold identification by using experimental constraints derived from intramolecular cross-links and mass spectrometry</article-title><source>Proc. Natl. Acad. Sci. USA</source><year>2000</year><volume>97</volume><fpage>5802</fpage><lpage>5806</lpage></citation></ref>
<ref id="b92-ijms-14-20635"><label>92</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>Z.A.</given-names></name><name><surname>Jawhari</surname><given-names>A.</given-names></name><name><surname>Fischer</surname><given-names>L.</given-names></name><name><surname>Buchen</surname><given-names>C.</given-names></name><name><surname>Tahir</surname><given-names>S.</given-names></name><name><surname>Kamenski</surname><given-names>T.</given-names></name><name><surname>Rasmussen</surname><given-names>M.</given-names></name><name><surname>Lariviere</surname><given-names>L.</given-names></name><name><surname>Bukowski-Wills</surname><given-names>J.-C.</given-names></name><name><surname>Nilges</surname><given-names>M.</given-names></name><etal/></person-group><article-title>Architecture of the RNA polymerase II-TFIIF complex revealed by cross-linking and mass spectrometry</article-title><source>EMBO J</source><year>2010</year><volume>29</volume><fpage>717</fpage><lpage>726</lpage></citation></ref>
<ref id="b93-ijms-14-20635"><label>93</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stengel</surname><given-names>F.</given-names></name><name><surname>Aebersold</surname><given-names>R.</given-names></name><name><surname>Robinson</surname><given-names>C.V.</given-names></name></person-group><article-title>Joining forces: Integrating proteomics and cross-linking with the mass spectrometry of intact complexes</article-title><source>Mol. Cell. Proteomics</source><year>2012</year><volume>11</volume><pub-id pub-id-type="doi">10.1074/mcp.R1111.014027</pub-id></citation></ref>
<ref id="b94-ijms-14-20635"><label>94</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Petrotchenko</surname><given-names>E.V.</given-names></name><name><surname>Borchers</surname><given-names>C.H.</given-names></name></person-group><article-title>Crosslinking combined with mass spectrometry for structural proteomics</article-title><source>Mass Spectrom.Rev</source><year>2010</year><volume>29</volume><fpage>862</fpage><lpage>876</lpage></citation></ref>
<ref id="b95-ijms-14-20635"><label>95</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bereszczak</surname><given-names>J.Z.</given-names></name><name><surname>Barbu</surname><given-names>I.M.</given-names></name><name><surname>Tan</surname><given-names>M.</given-names></name><name><surname>Xia</surname><given-names>M.</given-names></name><name><surname>Jiang</surname><given-names>X.</given-names></name><name><surname>van Duijn</surname><given-names>E.</given-names></name><name><surname>Heck</surname><given-names>A.J.R.</given-names></name></person-group><article-title>Structure, stability and dynamics of norovirus P domain derived protein complexes studied by native mass spectrometry</article-title><source>J. Struct. Biol.</source><year>2012</year><volume>177</volume><fpage>273</fpage><lpage>282</lpage></citation></ref>
<ref id="b96-ijms-14-20635"><label>96</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hern&#x000E1;ndez</surname><given-names>H.</given-names></name><name><surname>Dziembowski</surname><given-names>A.</given-names></name><name><surname>Taverner</surname><given-names>T.</given-names></name><name><surname>S&#x000E9;raphin</surname><given-names>B.</given-names></name><name><surname>Robinson</surname><given-names>C.V.</given-names></name></person-group><article-title>Subunit architecture of multimeric complexes isolated directly from cells</article-title><source>EMBO Rep</source><year>2006</year><volume>7</volume><fpage>605</fpage><lpage>610</lpage></citation></ref>
<ref id="b97-ijms-14-20635"><label>97</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ilag</surname><given-names>L.L.</given-names></name><name><surname>Westblade</surname><given-names>L.F.</given-names></name><name><surname>Deshayes</surname><given-names>C.</given-names></name><name><surname>Kolb</surname><given-names>A.</given-names></name><name><surname>Busby</surname><given-names>S.J.W.</given-names></name><name><surname>Robinson</surname><given-names>C.V.</given-names></name></person-group><article-title>Mass spectrometry of Escherichia coli RNA polymerase: Interactions of the core enzyme with sigma70 and Rsd protein</article-title><source>Structure</source><year>2004</year><volume>12</volume><fpage>269</fpage><lpage>275</lpage></citation></ref>
<ref id="b98-ijms-14-20635"><label>98</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thompson</surname><given-names>N.J.</given-names></name><name><surname>Rosati</surname><given-names>S.</given-names></name><name><surname>Heck</surname><given-names>A.J.R.</given-names></name></person-group><article-title>Performing native mass spectrometry analysis on therapeutic antibodies</article-title><source>Methods</source><year>2013</year><pub-id pub-id-type="doi">10.1016/j.ymeth.2013.05.003.</pub-id></citation></ref>
<ref id="b99-ijms-14-20635"><label>99</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Levy</surname><given-names>E.D.</given-names></name><name><surname>Boeri Erba</surname><given-names>E.</given-names></name><name><surname>Robinson</surname><given-names>C.V.</given-names></name><name><surname>Teichmann</surname><given-names>S.A.</given-names></name></person-group><article-title>Assembly reflects evolution of protein complexes</article-title><source>Nature</source><year>2008</year><volume>453</volume><fpage>1262</fpage><lpage>1265</lpage></citation></ref>
<ref id="b100-ijms-14-20635"><label>100</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walzthoeni</surname><given-names>T.</given-names></name><name><surname>Leitner</surname><given-names>A.</given-names></name><name><surname>Stengel</surname><given-names>F.</given-names></name><name><surname>Aebersold</surname><given-names>R</given-names></name></person-group><article-title>Mass spectrometry supported determination of protein complex structure</article-title><source>Curr. Opin. Struct. Biol</source><year>2013</year><volume>23</volume><fpage>252</fpage><lpage>260</lpage></citation></ref>
<ref id="b101-ijms-14-20635"><label>101</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lasker</surname><given-names>K.</given-names></name><name><surname>Phillips</surname><given-names>J.L.</given-names></name><name><surname>Russel</surname><given-names>D.</given-names></name><name><surname>Vel&#x000E1;zquez-Muriel</surname><given-names>J.</given-names></name><name><surname>Schneidman-Duhovny</surname><given-names>D.</given-names></name><name><surname>Tjioe</surname><given-names>E.</given-names></name><name><surname>Webb</surname><given-names>B.</given-names></name><name><surname>Schlessinger</surname><given-names>A.</given-names></name><name><surname>Sali</surname><given-names>A</given-names></name></person-group><article-title>Integrative structure modeling of macromolecular assemblies from proteomics data</article-title><source>Mol. Cell. Proteomics</source><year>2010</year><volume>9</volume><fpage>1689</fpage><lpage>1702</lpage></citation></ref>
<ref id="b102-ijms-14-20635"><label>102</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pieper</surname><given-names>U.</given-names></name><name><surname>Webb</surname><given-names>B.M.</given-names></name><name><surname>Barkan</surname><given-names>D.T.</given-names></name><name><surname>Schneidman-Duhovny</surname><given-names>D.</given-names></name><name><surname>Schlessinger</surname><given-names>A.</given-names></name><name><surname>Braberg</surname><given-names>H.</given-names></name><name><surname>Yang</surname><given-names>Z.</given-names></name><name><surname>Meng</surname><given-names>E.C.</given-names></name><name><surname>Pettersen</surname><given-names>E.F.</given-names></name><name><surname>Huang</surname><given-names>C.C.</given-names></name><etal/></person-group><article-title>MODBASE, a database of annotated comparative protein structure models, and associated resources</article-title><source>Nucleic Acids Res</source><year>2011</year><volume>39</volume><fpage>D465</fpage><lpage>D474</lpage></citation></ref>
<ref id="b103-ijms-14-20635"><label>103</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stark</surname><given-names>C.</given-names></name><name><surname>Breitkreutz</surname><given-names>B.-J.</given-names></name><name><surname>Reguly</surname><given-names>T.</given-names></name><name><surname>Boucher</surname><given-names>L.</given-names></name><name><surname>Breitkreutz</surname><given-names>A.</given-names></name><name><surname>Tyers</surname><given-names>M</given-names></name></person-group><article-title>BioGRID: A general repository for interaction datasets</article-title><source>Nucleic Acids Res.</source><year>2006</year><volume>34</volume><fpage>D535</fpage><lpage>D539</lpage></citation></ref>
<ref id="b104-ijms-14-20635"><label>104</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lawson</surname><given-names>C.L.</given-names></name><name><surname>Baker</surname><given-names>M.L.</given-names></name><name><surname>Best</surname><given-names>C.</given-names></name><name><surname>Bi</surname><given-names>C.</given-names></name><name><surname>Dougherty</surname><given-names>M.</given-names></name><name><surname>Feng</surname><given-names>P.</given-names></name><name><surname>van Ginkel</surname><given-names>G.</given-names></name><name><surname>Devkota</surname><given-names>B.</given-names></name><name><surname>Lagerstedt</surname><given-names>I.</given-names></name><name><surname>Ludtke</surname><given-names>S.J.</given-names></name><etal/></person-group><article-title>EMDataBank.org: Unified data resource for CryoEM</article-title><source>Nucleic Acids Res</source><year>2011</year><volume>39</volume><fpage>D456</fpage><lpage>D464</lpage></citation></ref>
<ref id="b105-ijms-14-20635"><label>105</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname><given-names>M.</given-names></name><name><surname>Robinson</surname><given-names>C.V.</given-names></name></person-group><article-title>When proteomics meets structural biology</article-title><source>Trends Biochem. Sci</source><year>2010</year><volume>35</volume><fpage>522</fpage><lpage>529</lpage></citation></ref>
<ref id="b106-ijms-14-20635"><label>106</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benesch</surname><given-names>J.L.P.</given-names></name><name><surname>Ruotolo</surname><given-names>B.T.</given-names></name><name><surname>Simmons</surname><given-names>D.A.</given-names></name><name><surname>Robinson</surname><given-names>C.V.</given-names></name></person-group><article-title>Protein complexes in the gas phase: Technology for structural genomics and proteomics</article-title><source>Chem. Rev</source><year>2007</year><volume>107</volume><fpage>3544</fpage><lpage>3567</lpage></citation></ref>
<ref id="b107-ijms-14-20635"><label>107</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hyung</surname><given-names>S.-J.</given-names></name><name><surname>Ruotolo</surname><given-names>B.T.</given-names></name></person-group><article-title>Integrating mass spectrometry of intact protein complexes into structural proteomics</article-title><source>Proteomics</source><year>2012</year><volume>12</volume><fpage>1547</fpage><lpage>1564</lpage></citation></ref></ref-list></back></article>
