<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">ijms</journal-id>
<journal-title>International Journal of Molecular Sciences</journal-title>
<abbrev-journal-title>Int. J. Mol. Sci.</abbrev-journal-title>
<issn pub-type="epub">1422-0067</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/ijms12128449</article-id>
<article-id pub-id-type="publisher-id">ijms-12-08449</article-id>
<article-categories>
<subj-group>
<subject>Article</subject></subj-group></article-categories>
<title-group>
<article-title>A Comparative Study of the Second-Order Hydrophobic Moments for Globular Proteins: The Consensus Scale of Hydrophobicity and the CHARMM Partial Atomic Charges</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Tsai</surname><given-names>Cheng-Fang</given-names></name><xref ref-type="aff" rid="af1-ijms-12-08449">1</xref></contrib>
<contrib contrib-type="author">
<name><surname>Lee</surname><given-names>Kuei-Jen</given-names></name><xref ref-type="aff" rid="af2-ijms-12-08449">2</xref><xref ref-type="corresp" rid="c1-ijms-12-08449">*</xref></contrib></contrib-group>
<aff id="af1-ijms-12-08449">
<label>1</label>Department of Biotechnology, Asia University, Taichung 413, Taiwan; E-Mail: <email>tsaicf@asia.edu.tw</email></aff>
<aff id="af2-ijms-12-08449">
<label>2</label>Department of Bioinformatics, Asia University, Taichung 413, Taiwan</aff>
<author-notes>
<corresp id="c1-ijms-12-08449">
<label>*</label>Author to whom correspondence should be addressed; E-Mail: <email>kjlee@asia.edu.tw</email>; Tel.: +886-4-23398316; Fax: +886-4-23320718.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2011</year></pub-date>
<pub-date pub-type="epub">
<day>29</day>
<month>11</month>
<year>2011</year></pub-date>
<volume>12</volume>
<issue>12</issue>
<fpage>8449</fpage>
<lpage>8465</lpage>
<history>
<date date-type="received">
<day>15</day>
<month>9</month>
<year>2011</year></date>
<date date-type="accepted">
<day>15</day>
<month>11</month>
<year>2011</year></date></history>
<permissions>
<copyright-statement>© 2011 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
<copyright-year>2011</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0">
<p>This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>In this paper, the second-order hydrophobic moment for fifteen globular proteins in 150 nonhomologous protein chains was performed in a comparative study involving two sets of hydrophobicity: one selected from the consensus scale and the other derived from the CHARMM partial atomic charges. These proteins were divided into three groups, based on their number of residues (<italic>N</italic>) and the asphericity (<italic>δ</italic>). Proteins in Group I were spherical and those in Groups II and III were prolate. The size of the proteins is represented by the mean radius of gyration (<italic>R</italic><italic><sub>g</sub></italic> ), which follows the Flory scaling law, <italic>R</italic><italic><sub>g</sub></italic> ∝ <italic>N</italic><sup>ν</sup>. The mean value of <italic>v</italic> was 0.35, which is similar to a polymer chain in a poor solvent. The spatial distributions of the second-order moment for each of the proteins, obtained from the two sets of hydrophobicity, were compared using the Pearson correlation coefficient; the results reveal that there is a strong correlation between the two data sets for each protein structure when the CHARMM partial atomic charges, |<italic>q</italic><italic><sub>i</sub></italic><italic>|</italic> ≥ 0.3, assigned for polar atoms, are used. The locations at which these distributions vanish and approach a negative value are at approximately 50% of the percentage of solvent accessibility, indicating that there is a transition point from hydrophobic interior to hydrophilic exterior in the proteins. This may suggest that there is a position for the proteins to determine the residues at exposed sites beyond this range.</p></abstract>
<kwd-group>
<kwd>hydrophobicity consensus scale</kwd>
<kwd>CHARMM partial atomic charges</kwd>
<kwd>Secondorder hydrophobic moment</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<title>1. Introduction</title>
<p>Protein structures are stabilized by several non-covalent interactions, such as hydrophobic, van der Waals, electrostatic, and hydrogen bonding interactions [<xref ref-type="bibr" rid="b1-ijms-12-08449">1</xref>,<xref ref-type="bibr" rid="b2-ijms-12-08449">2</xref>]. Of all the interactions that take place in proteins, hydrophobic interaction is a dominant force that drives protein folding. The interaction between non-polar amino-acid residues and the aqueous environment provides a strong hydrophobic force for protein folding [<xref ref-type="bibr" rid="b1-ijms-12-08449">1</xref>,<xref ref-type="bibr" rid="b3-ijms-12-08449">3</xref>], forming a hydrophobic core in the protein interior. Thus, the variation in hydrophobicity from the hydrophobic core to hydrophilic exterior is the spatial variation for most globular proteins. To quantify the spatial transition of hydrophobicity between the interior and the surface of globular proteins of known structure, Silverman [<xref ref-type="bibr" rid="b4-ijms-12-08449">4</xref>] introduced the second-order hydrophobic moment, using the hydrophobicity consensus scale of Eisenberg [<xref ref-type="bibr" rid="b5-ijms-12-08449">5</xref>]. The second-order hydrophobic moment, or quadrupole moment, is similar to the electrostatic quadrupole, and it was first proposed by Eisenberg [<xref ref-type="bibr" rid="b5-ijms-12-08449">5</xref>] to describe the hydrophobic distributions of proteins. A second-order hydrophobic moment has also been applied to the systems [<xref ref-type="bibr" rid="b6-ijms-12-08449">6</xref>] including native and decoy protein structures, and multi-domain proteins [<xref ref-type="bibr" rid="b7-ijms-12-08449">7</xref>]. In contrast, the first-order hydrophobic moment, or hydrophobic dipole moment, is similar to the dipole moment of a molecule. The first-order hydrophobic moment has been used to measure the amphiphilicity of primary and secondary structures either in globular proteins [<xref ref-type="bibr" rid="b5-ijms-12-08449">5</xref>,<xref ref-type="bibr" rid="b8-ijms-12-08449">8</xref>–<xref ref-type="bibr" rid="b10-ijms-12-08449">10</xref>] or in transmembrane proteins [<xref ref-type="bibr" rid="b11-ijms-12-08449">11</xref>,<xref ref-type="bibr" rid="b12-ijms-12-08449">12</xref>]. Moreover, the zeroth-order hydrophobic moment is the total hydrophobicity of the amino-acid residues in a protein, and it is similar to a net molecular charge.</p>
<p>Residues classified as buried or exposed are conventionally described by a geometric parameter calculated using the solvent-accessible surface area (ASA) [<xref ref-type="bibr" rid="b13-ijms-12-08449">13</xref>], which is generated by rolling a spherical probe with a radius of 1.4 Å over the surface of a protein. The ASA values obtained are in absolute values, and these can be changed to relative values, which are also known as the percentage of solvent accessibility <italic>p</italic> (%) [<xref ref-type="bibr" rid="b14-ijms-12-08449">14</xref>,<xref ref-type="bibr" rid="b15-ijms-12-08449">15</xref>]. These values have been calculated using the ASA value of each amino-acid residue in the native state normalized with respect to the ASA value of the corresponding residue X in the extended state either of Gly-X-Gly [<xref ref-type="bibr" rid="b16-ijms-12-08449">16</xref>,<xref ref-type="bibr" rid="b17-ijms-12-08449">17</xref>] or of Ala-X-Ala [<xref ref-type="bibr" rid="b14-ijms-12-08449">14</xref>,<xref ref-type="bibr" rid="b15-ijms-12-08449">15</xref>]. Recently, the prediction of solvent accessibility, based on protein sequences, has also been developed using support vector regression (SVR) [<xref ref-type="bibr" rid="b18-ijms-12-08449">18</xref>–<xref ref-type="bibr" rid="b21-ijms-12-08449">21</xref>]. Gromiha <italic>et al</italic>. [<xref ref-type="bibr" rid="b14-ijms-12-08449">14</xref>,<xref ref-type="bibr" rid="b15-ijms-12-08449">15</xref>] have classified these relative values to the locations of protein residues as follows: 0–2% as completely buried, 2–20% as a location between buried and partially buried, 20–50% as partially buried, and greater than 50% as completely exposed. This classification is used to study the stability changes for protein mutants.</p>
<p>Silverman [<xref ref-type="bibr" rid="b4-ijms-12-08449">4</xref>] used an ellipsoidal representation for the shapes of proteins. The extent of an ellipsoid was defined by a distance, <italic>d</italic>, which was calculated from the molecular moments of geometry [<xref ref-type="bibr" rid="b4-ijms-12-08449">4</xref>]. The second-order hydrophobic moment per residue in a protein was calculated from a small value of <italic>d</italic> to a larger value by increasing the value of <italic>d</italic> until all residues in the protein have been collected to obtain its spatial distribution of the second-order hydrophobic moment. In contrast, the percentage of solvent accessibility, <italic>p</italic> (%), is adopted in this work, and we employ successive increases of solvent accessibility; that is, increases from 0 to 100% are used to study the spatial distributions of the second-order hydrophobic moment of the protein structures. The second-order hydrophobic moment per residue is calculated from a space defined by a small value of <italic>p</italic> (%), which collected the residues at the hydrophobic core of a protein to the larger value of <italic>p</italic> (%) at which all residues will be collected. To investigate spatial characteristics, the distances from the origin of a protein to the centroids of the protein residues in a space defined by <italic>p</italic> (%) are also considered. Here, we use the center of mass of each protein as the origin. In this work, two sets of hydrophobic parameters for each of the amino-acid residues in the protein structures are used in a comparative study of the second-order hydrophobic moments: one is based on the hydrophobicity consensus scale of Eisenberg [<xref ref-type="bibr" rid="b5-ijms-12-08449">5</xref>], and the other is derived from the partial atomic charges assigned to atoms of a protein in the CHARMM program [<xref ref-type="bibr" rid="b22-ijms-12-08449">22</xref>].</p>
<p>In this work, a different approach from the works of Silverman [<xref ref-type="bibr" rid="b4-ijms-12-08449">4</xref>] is used to define the spaces in a protein; <italic>i.e.</italic>, we employ <italic>p</italic> (%) to define the spaces to calculate the spatial distributions of the second-order hydrophobic moment of a protein. The purpose of this study is to examine whether the hydrophobicity of each residue in a protein can be obtained using molecular modeling, and whether the values of hydrophobicity derived from the partial atomic charges assigned to atoms of a protein in the CHARMM program [<xref ref-type="bibr" rid="b22-ijms-12-08449">22</xref>] are comparable with those of the consensus scale of Eisenberg [<xref ref-type="bibr" rid="b5-ijms-12-08449">5</xref>] by using the spatial distributions of the second-order hydrophobic moment in the new definition of spaces in the protein. This work may provide an alternative way to calculate the hydrophobicity of each residue in a protein. Since the hydrophobicity of an amino-acid residue cannot be defined and measured easily, it is usually obtained from the free energy changes calculated by transferring amino-acid side chains from aqueous to non-aqueous media [<xref ref-type="bibr" rid="b23-ijms-12-08449">23</xref>,<xref ref-type="bibr" rid="b24-ijms-12-08449">24</xref>] or from non-aqueous to aqueous media [<xref ref-type="bibr" rid="b25-ijms-12-08449">25</xref>].</p></sec>
<sec sec-type="materials|methods">
<title>2. Materials and Methods</title>
<p>A set of 150 nonhomologous protein chains was randomly selected from PDBSELECT [<xref ref-type="bibr" rid="b26-ijms-12-08449">26</xref>], which included more than 4,000 protein chains. The sequence identity in this set of proteins is lower than 25%, and their single-chain protein crystal structures have been determined at a resolution of less than 1.8Å and at an R-factor of less than 0.18. All the structures of the proteins were obtained from the PDB [<xref ref-type="bibr" rid="b27-ijms-12-08449">27</xref>]. A figure of asphericity (<italic>δ</italic>) and the index of 150 proteins are shown in <xref ref-type="fig" rid="f1-ijms-12-08449">Figure 1</xref>. In order to select protein structures with diverse sizes and shapes, fifteen structures, as shown in <xref ref-type="table" rid="t1-ijms-12-08449">Table 1</xref>, were selected from these 150 proteins and divided into three groups, depending on the number of residues (<italic>N</italic>) and the extent of asphericity (<italic>δ</italic>).</p>
<p>The shape of a protein can be characterized in terms of eigenvalues of the radius of gyration tensor. The radius of gyration tensor <italic>T</italic> is defined as follows [<xref ref-type="bibr" rid="b28-ijms-12-08449">28</xref>,<xref ref-type="bibr" rid="b29-ijms-12-08449">29</xref>]:</p>
<disp-formula id="FD1">
<label>(1)</label>
<mml:math id="mm1" display="block">
<mml:semantics id="sm1">
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>n</mml:mi></mml:mfrac>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>n</mml:mi></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:msubsup>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>t</mml:mi></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mi>x</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mi>y</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mi>z</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>x</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>y</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mi>z</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>z</mml:mi>
<mml:mi>x</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>z</mml:mi>
<mml:mi>y</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>z</mml:mi>
<mml:mi>z</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>where <italic>T</italic><italic><sub>i</sub></italic> = col (<italic>x</italic><italic><sub>i</sub></italic><italic>, y</italic><italic><sub>i</sub></italic><italic>, z</italic><italic><sub>i</sub></italic>) is the vector going from the origin at the protein center of mass vector to the position of the centroid of residue <italic>i</italic>. The center of mass vector, <bold>r</bold><italic><sub>CM</sub></italic>, is defined by [<xref ref-type="bibr" rid="b4-ijms-12-08449">4</xref>]</p>
<disp-formula id="FD2">
<label>(2)</label>
<mml:math id="mm2" display="block">
<mml:semantics id="sm2">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="bold">r</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>C</mml:mi>
<mml:mi>M</mml:mi></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi></mml:mrow>
<mml:mi>r</mml:mi></mml:msub></mml:mrow></mml:mfrac>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi></mml:mrow>
<mml:mi>r</mml:mi></mml:msub></mml:mrow></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="bold">r</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>where <italic>N</italic><italic><sub>r</sub></italic> is the number of residues in the protein, and <bold>r</bold><italic><sub>i</sub></italic> is the position vector for the centroids of the amino-acid residues. The matrix <italic>T</italic> can be diagonalized to obtain the three eigenvalues, α<italic><sub>i</sub></italic> (<italic>i =</italic> 1, 2, 3), of this tensor. The asphericity (<italic>δ</italic>) [<xref ref-type="bibr" rid="b28-ijms-12-08449">28</xref>–<xref ref-type="bibr" rid="b31-ijms-12-08449">31</xref>] has been used to characterize protein shapes, and δ is computed as</p>
<disp-formula id="FD3">
<label>(3)</label>
<mml:math id="mm3" display="block">
<mml:semantics id="sm3">
<mml:mrow>
<mml:mi>δ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>-</mml:mo>
<mml:mn>3</mml:mn>
<mml:mrow>
<mml:mo>〈</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mrow>
<mml:mo>〉</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>If <italic>δ</italic> = 0, the shape is a perfect sphere, the extent of asphericity can be referred as 0 &lt; <italic>δ &lt;</italic> 1, and if <italic>δ</italic> = 1, the shape is a rod. The shape parameter (<italic>S</italic>) [<xref ref-type="bibr" rid="b30-ijms-12-08449">30</xref>,<xref ref-type="bibr" rid="b31-ijms-12-08449">31</xref>] has been used to quantify the whole shape of a protein, and <italic>S</italic> is computed as</p>
<disp-formula id="FD4">
<label>(4)</label>
<mml:math id="mm4" display="block">
<mml:semantics id="sm4">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>27</mml:mn>
<mml:mrow>
<mml:mo>〈</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>∏</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>3</mml:mn></mml:munderover></mml:mstyle>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>-</mml:mo>
<mml:mover accent="true">
<mml:mi>α</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>3</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mrow>
<mml:mo>〉</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>Where</p>
<disp-formula id="FD5">
<label>(5)</label>
<mml:math id="mm5" display="block">
<mml:semantics id="sm5">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>α</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>α</mml:mi></mml:mrow>
<mml:mn>3</mml:mn></mml:msub></mml:mrow>
<mml:mn>3</mml:mn></mml:mfrac></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>If <italic>S = 0</italic>, the shape is a perfect sphere, if <italic>S &lt; 0</italic>, the shape is oblate ellipsoid, and if <italic>S &gt; 0</italic> the shape is prolate ellipsoid. The shape of proteins can also be represented by semiaxes [<xref ref-type="bibr" rid="b32-ijms-12-08449">32</xref>]; <italic>i.e.</italic>, <italic>a</italic> = [5(<italic>α</italic><sub>1</sub> +<italic>α</italic><sub>2</sub>)/2]<sup>1/2</sup> and <italic>b</italic> = (5<italic>α</italic><sub>3</sub>)<sup>1/2</sup>, where <italic>α</italic><sub>i</sub> was sorted according to increasing magnitude. If <italic>a</italic> ≈ <italic>b</italic>, the shape is close to a sphere, and if <italic>a</italic> &lt; <italic>b</italic> the shape is a prolate ellipsoid.</p>
<p>The molecular size of a protein can be probed using its mean square radius of gyration, 〈<italic>R</italic><italic><sub>g</sub></italic><sup>2</sup>〉, defined as follows [<xref ref-type="bibr" rid="b33-ijms-12-08449">33</xref>]:</p>
<disp-formula id="FD6">
<label>(6)</label>
<mml:math id="mm6" display="block">
<mml:semantics id="sm6">
<mml:mrow>
<mml:mrow>
<mml:mo>〈</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mi>g</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow>
<mml:mo>〉</mml:mo></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi></mml:mrow>
<mml:mi>r</mml:mi></mml:msub></mml:mrow></mml:mfrac>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi></mml:mrow>
<mml:mi>r</mml:mi></mml:msub></mml:mrow></mml:munderover>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="bold">r</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="bold">r</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>C</mml:mi>
<mml:mi>M</mml:mi></mml:mrow></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>where <italic>N</italic><italic><sub>r</sub></italic>, <bold>r</bold><italic><sub>i</sub></italic>, and <bold>r</bold><italic><sub>CM</sub></italic> are defined in <xref rid="FD2" ref-type="disp-formula">Equation (2)</xref>.</p>
<p>The solvent accessible surface area (ASA) for each atom or for each residue in the proteins of interest was obtained using the computer program ASC [<xref ref-type="bibr" rid="b34-ijms-12-08449">34</xref>] with default parameters. The percentage of solvent accessibility, <italic>p</italic> (%), was calculated as follows:</p>
<disp-formula id="FD7">
<label>(7)</label>
<mml:math id="mm7" display="block">
<mml:semantics id="sm7">
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">ASA</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>X</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>f</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>l</mml:mi>
<mml:mi>d</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">ASA</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>X</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>G</mml:mi>
<mml:mo>-</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo>-</mml:mo>
<mml:mi>G</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>where <italic>ASA</italic><italic><sub>X,folded</sub></italic> is the ASA of X residue in the folded state of a given protein and <italic>ASA</italic><italic><sub>X,G–X–G</sub></italic> is the ASA of the corresponding residue X in tripeptide (Gly-X-Gly) with the extended state. The extended state (<italic>ϕ</italic> = <italic>Ψ</italic> = 180°) coordinates were generated using InsightII program [<xref ref-type="bibr" rid="b35-ijms-12-08449">35</xref>]. The Gly residues to either side of interest residue X can provide steric screening effects of neighboring residues in the simulated models, and such steric screening effects can reduce the values of ASAs calculated for the X residues in the extended state. The results of the ASA for each residue type X are shown in <xref ref-type="table" rid="t2-ijms-12-08449">Table 2</xref></p>
<p>Polar atoms were defined on the basis of the partial charges, which were assigned to atoms in a protein by the CHARMM program [<xref ref-type="bibr" rid="b22-ijms-12-08449">22</xref>]. In this work, three sets of partial charges, namely, |<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.25, 0.27, or 0.3 were assigned as polar atoms, and the remaining atoms were considered apolar. These three sets of partial charges were used to examine which one has the best correlation with the hydrophobicity consensus scale of Eisenberg [<xref ref-type="bibr" rid="b5-ijms-12-08449">5</xref>]. Polar and apolar ASAs of each residue were calculated by summing the ASA of the respective polar and apolar atoms in the residue. The hydrophobicity, <italic>h</italic><italic><sub>i</sub></italic>, for each amino-acid residue, <italic>i</italic>, in a protein was calculated as follows:</p>
<disp-formula id="FD8">
<label>(8)</label>
<mml:math id="mm8" display="block">
<mml:semantics id="sm8">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">ASA</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>p</mml:mi>
<mml:mi>o</mml:mi></mml:mrow></mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">ASA</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mi>o</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">ASA</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>p</mml:mi>
<mml:mi>o</mml:mi></mml:mrow></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">ASA</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mi>o</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>where <italic>ASA</italic><italic><sub>apo</sub></italic> and <italic>ASA</italic><italic><sub>po</sub></italic> are the apolar and polar ASAs, respectively, in the residue. For a given protein, the <italic>h</italic><italic><sub>i</sub></italic> value of each residue, either selected from the hydrophobicity consensus scale of Eisenberg [<xref ref-type="bibr" rid="b5-ijms-12-08449">5</xref>], as shown in <xref ref-type="table" rid="t2-ijms-12-08449">Table 2</xref>, or calculated according to <xref rid="FD8" ref-type="disp-formula">Equation (8)</xref> was normalized by the following equation:</p>
<disp-formula id="FD9">
<label>(9)</label>
<mml:math id="mm9" display="block">
<mml:semantics id="sm9">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>h</mml:mi></mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>h</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>-</mml:mo>
<mml:mover accent="true">
<mml:mi>h</mml:mi>
<mml:mo>¯</mml:mo></mml:mover></mml:mrow>
<mml:mrow>
<mml:mi>σ</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>h</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>where <italic>h̄</italic> is the mean of the <italic>h</italic><italic><sub>i</sub></italic> for all residues in the protein, and σ(<italic>h</italic>) is their standard deviation. In this work, the second-order hydrophobic moment, <italic>H</italic><sub>2</sub>(d<italic><sub>i</sub></italic>), per residue is similar to the works of Silverman [<xref ref-type="bibr" rid="b4-ijms-12-08449">4</xref>], given by</p>
<disp-formula id="FD10">
<label>(10)</label>
<mml:math id="mm10" display="block">
<mml:semantics id="sm10">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>d</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>n</mml:mi></mml:mrow>
<mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:mfrac>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mi>i</mml:mi></mml:munder>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>h</mml:mi></mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mi> </mml:mi>
<mml:msubsup>
<mml:mrow>
<mml:mi>d</mml:mi></mml:mrow>
<mml:mi>i</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>where <italic>n</italic><sub>p</sub> is the number of residues collected in the space defined by the percentage of solvent accessibility <italic>p</italic> (%), and <italic>d</italic><italic><sub>i</sub></italic> is the distance from the protein centroid to the centroid of residue <italic>i</italic>, in the space. Here, the protein centroid is the coordinates of its center of mass (CM). In this work, the space surface is dependent on <italic>p</italic> (%). When one selects a particular value of <italic>P</italic> (%), the number of residues <italic>n</italic><sub>p</sub>, is collected within the space. When a value of <italic>p</italic> (%) increases, the volume of the space is also increased and the number of residues <italic>n</italic><sub>p</sub>, residing in the space increases as well. The increment of <italic>p</italic> (%) is from 0 to 100%. Therefore, the spatial distribution of the second-order hydrophobic moment per residue, <italic>H</italic><sub>2</sub>(d<italic><sub>i</sub></italic>), is represented as a function of <italic>p</italic> (%).</p>
<p>The Pearson correlation coefficient [<xref ref-type="bibr" rid="b36-ijms-12-08449">36</xref>], R, was used to measure the correlation between the second-order moments as follows:</p>
<disp-formula id="FD11">
<label>(11)</label>
<mml:math id="mm11" display="block">
<mml:semantics id="sm11">
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mi>i</mml:mi></mml:munder></mml:mstyle>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>-</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>y</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>-</mml:mo>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mrow>
<mml:msqrt>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mi>i</mml:mi></mml:munder></mml:mstyle>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>-</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:msqrt>
<mml:msqrt>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mi>i</mml:mi></mml:munder></mml:mstyle>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>y</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>-</mml:mo>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>¯</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:msqrt></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>where <italic>x</italic> denotes the second-order hydrophobic moment calculated with the <italic>h</italic><italic><sub>i</sub></italic> selected from the consensus scale (<xref ref-type="table" rid="t2-ijms-12-08449">Table 2</xref>), and <italic>y</italic> denotes the second-order hydrophobic moment calculated with the <italic>h</italic><italic><sub>i</sub></italic> derived from the CHARMM partial atomic charges using <xref rid="FD8" ref-type="disp-formula">Equation (8)</xref>. These <italic>x</italic> and <italic>y</italic> were normalized by <xref rid="FD9" ref-type="disp-formula">Equation (9)</xref>. <italic>x̄</italic> and <italic>ȳ</italic> denote the mean of <italic>x</italic> and <italic>y</italic>, respectively. In this work, this equation is also used to test whether the correlation coefficient between two data sets is statistically significant to determine the equation of the regression line. The correlation coefficient calculated with <xref rid="FD11" ref-type="disp-formula">Equation (11)</xref> is a linear correlation between the two data sets. If the correlation coefficient is close to 1, there is a very strong correlation between the two data sets.</p></sec>
<sec sec-type="results|discussion">
<title>3. Results and Discussion</title>
<sec>
<title>3.1. Molecular Shape and Size</title>
<p>Fifteen protein structures (<xref ref-type="table" rid="t1-ijms-12-08449">Table 1</xref>) in 150 nonhomologous protein chains were divided into three groups, depending on their asphericity (<italic>δ</italic>) as shown in <xref ref-type="table" rid="t3-ijms-12-08449">Tables 3</xref>: Group I ranged from 1.49 × 10<sup>−2</sup> to 2.48 × 10<sup>−2</sup>, Group II from 1.05 × 10<sup>−1</sup> to 1.81 × 10<sup>−1</sup>, and Group III from 2.23 × 10<sup>−1</sup> to 3.54 × 10<sup>−1</sup>. This division was also dependent on the number of residues (N), as shown in <xref ref-type="table" rid="t1-ijms-12-08449">Table 1</xref>, to ensure a diverse range in molecular size. A suitable representation of molecular shape may be the gyration tensor, as shown in <xref rid="FD1" ref-type="disp-formula">Equation (1)</xref>. Diagonalization [<xref ref-type="bibr" rid="b36-ijms-12-08449">36</xref>] of the gyration tensor gives three eigenvalues (<italic>α</italic><sub>1</sub>, <italic>α</italic><sub>2</sub>, and <italic>α</italic><sub>3</sub>) as shown in columns 2, 3, and 4 of <xref ref-type="table" rid="t3-ijms-12-08449">Table 3</xref>, respectively. Columns 5 and 6 of <xref ref-type="table" rid="t3-ijms-12-08449">Table 3</xref> show asphericity (<italic>δ</italic>) and the shape parameter (<italic>S</italic>), respectively. The values of <italic>δ</italic> were calculated with <xref rid="FD3" ref-type="disp-formula">Equation (3)</xref>, and those of <italic>S</italic> were calculated with <xref rid="FD4" ref-type="disp-formula">Equation (4)</xref>. The data show that the values of <italic>δ</italic> and of <italic>S</italic> for proteins in Group I are close to zero, suggesting that the shape of these proteins is close to a sphere. The shape of proteins can also be represented by semiaxes [<xref ref-type="bibr" rid="b32-ijms-12-08449">32</xref>]. As shown in columns 7 and 8 of <xref ref-type="table" rid="t3-ijms-12-08449">Table 3</xref>, the values of <italic>a</italic> and <italic>b</italic> in Group I are very close to each other, indicating that the molecular shape can also be identified as a sphere. As shown in <xref ref-type="table" rid="t3-ijms-12-08449">Table 3</xref>, <italic>α</italic><sub>3</sub> is the largest eigenvalue for each protein structure in Groups II and III, compared with α<sub>1</sub> and α<sub>2</sub>, so that the shape of proteins in Groups II and III can be concluded as a prolate ellipsoid [<xref ref-type="bibr" rid="b32-ijms-12-08449">32</xref>]. Based on <italic>δ</italic> &gt; 0 and <italic>S</italic> &gt; 0, and <italic>a</italic> &lt; <italic>b</italic>, one can also infer that the shape of the proteins in Groups II and III is a prolate ellipsoid.</p>
<p>The last column in <xref ref-type="table" rid="t3-ijms-12-08449">Table 3</xref> lists the square roots of the mean square radius of gyration calculated according to <xref rid="FD6" ref-type="disp-formula">Equation (6)</xref>. The size of folded protein structures can be represented as the mean radius of gyration (<bold><italic>R</italic></bold><italic><sub>g</sub></italic>), which may follow the Flory scaling law [<xref ref-type="bibr" rid="b37-ijms-12-08449">37</xref>–<xref ref-type="bibr" rid="b39-ijms-12-08449">39</xref>], namely, <italic>R</italic><italic><sub>g</sub></italic> ∝ <italic>N</italic><sup>ν</sup> (Å), where <italic>N</italic> is the number of amino-acid residues in a protein and <italic>v</italic> is a scaling exponent. As shown in <xref ref-type="fig" rid="f2-ijms-12-08449">Figure 2</xref>, the values of the mean radius of gyration (<bold><italic>R</italic></bold><italic><sub>g</sub></italic>) and the number of residues (<italic>N</italic>) were linearized by applying the Flory scaling law on a double logarithmic scale and were well fitted to linear lines (<italic>y = mx + b</italic>) by using the least-squares method. The Pearson correlation coefficient between each pair of parameters (log<italic>N</italic> and log<italic>R</italic><italic><sub>g</sub></italic>) was calculated using <xref rid="FD11" ref-type="disp-formula">Equation (11)</xref>. The values for the correlation coefficient are 0.99, 0.98, and 0.98 for proteins in Group I, Group II, and Group III, respectively, indicating that the correlation between N and <italic>R</italic><italic><sub>g</sub></italic> is highly significant. The values of <italic>v</italic>, obtained using the least-squares method, are 0.34, 0.39, and 0.32 for protein structures in Groups I, II, and III, respectively. These values are found to have a mean value of 0.35, which is close to 0.34 as predicted by the Flory scaling law for a polymer chain in a poor solvent [<xref ref-type="bibr" rid="b40-ijms-12-08449">40</xref>–<xref ref-type="bibr" rid="b42-ijms-12-08449">42</xref>]. Thus, these polypeptide chains may form a collapsed globule to minimize contact with the solvent molecules [<xref ref-type="bibr" rid="b42-ijms-12-08449">42</xref>].</p></sec>
<sec>
<title>3.2. Distance Distribution Functions</title>
<p>The distances d (Å) from the protein centroid to the centroids of amino-acid residues in a protein, were grouped into 10 bins. The proportion of distances in each bin was plotted against d (Å) to obtain the distance distribution W (d), as shown in (<xref ref-type="fig" rid="f3-ijms-12-08449">Figures 3a, 3b, and 3c</xref>). Depicted in the figures are the results for protein structures in Groups I, II, and III, respectively. This scheme has been used to investigate the end-to-end distance distributions for poly(oxyethylene) chains [<xref ref-type="bibr" rid="b43-ijms-12-08449">43</xref>]. The distance distributions may associate with the persistence length of a polymer chain described by the wormlike chain model [<xref ref-type="bibr" rid="b31-ijms-12-08449">31</xref>]. The molecular size can also be characterized by the distance distributions. As shown in (<xref ref-type="fig" rid="f3-ijms-12-08449">Figures 3a, 3b, and 3c</xref>), these distributions all have a similar appearance, showing a single Gaussian-like distribution. These distributions, fluctuated about their most probable values, can be denoted as <italic>d̄ ±</italic> δ <italic>d</italic> (Å). A mean value, <italic>d̄</italic> (Å), which is associated with the mean radius of gyration, denotes the precise location of the most probable appearance of a distance distribution, and a value of δ<italic>d</italic> (Å) represents the breadth of its fluctuation. An increased extent of protein asphericity is correlated with increased fluctuations. For example, as shown in <xref ref-type="table" rid="t1-ijms-12-08449">Table 1</xref>, proteins 1E70, 1JAE, and 1LAM all have a comparable number of residues. However, the fluctuations from their most probable values, as shown in <xref ref-type="fig" rid="f3-ijms-12-08449">Figures 3a, 3b, and 3c</xref>, have diverse values, such as 18.48 ± 9.65 Å, 22.10 ± 11.54 Å, and 24.46 ± 12.77 Å for proteins 1E70, 1JAE, and 1LAM, respectively.</p></sec>
<sec>
<title>3.3. The Second-Order Hydrophobic Moment</title>
<p>The detailed results for the second-order moment per residue, calculated according to <xref rid="FD10" ref-type="disp-formula">Equation (10)</xref> with hydrophobicity <italic>h</italic><italic><sub>i</sub></italic><italic>′</italic> selected from the consensus scale of Eisenberg (<xref ref-type="table" rid="t2-ijms-12-08449">Table 2</xref>), are represented in <xref ref-type="table" rid="t4-ijms-12-08449">Table 4</xref> and <xref ref-type="fig" rid="f4-ijms-12-08449">Figure 4</xref>. <xref ref-type="table" rid="t2-ijms-12-08449">Table 2</xref> was normalized by <xref rid="FD9" ref-type="disp-formula">Equation (9)</xref>. <xref ref-type="table" rid="t4-ijms-12-08449">Table 4</xref> shows the details of the spatial distribution of the second-order hydrophobic moment per residue for protein 3PYP (see Table1) at a 2% resolution. The spatial distribution was calculated from the space collecting the residues that are completely buried (0%) to that collecting the residues that are completely exposed (100%) to the solvent. <xref ref-type="fig" rid="f4-ijms-12-08449">Figure 4</xref> shows the distribution of the second-order hydrophobic moment per residue, <italic>H</italic><sub>2</sub>(d<italic><sub>i</sub></italic>), as shown in <xref ref-type="table" rid="t4-ijms-12-08449">Table 4</xref>, as a function of <italic>p</italic> (%); it is clear that the distribution transfers from the hydrophobic core with positive values of <italic>H</italic><sub>2</sub>(d<italic><sub>i</sub></italic>) to the hydrophilic sites with negative values of <italic>H</italic><sub>2</sub>(d<italic><sub>i</sub></italic>).</p>
<p><xref ref-type="table" rid="t5-ijms-12-08449">Table 5</xref> shows the results of the Pearson correlation coefficient calculated using <xref rid="FD11" ref-type="disp-formula">Eq. (11)</xref> for the correlation between two data sets of the second-order hydrophobic moments, calculated according to <xref rid="FD10" ref-type="disp-formula">Equation (10)</xref> using two different sets of hydrophobic parameters: one is the hydrophobicity, <italic>h</italic><italic><sub>i</sub></italic>′, selected from the consensus scale (<xref ref-type="table" rid="t2-ijms-12-08449">Table 2</xref>) and the other is the <italic>h</italic><italic><sub>i</sub></italic>′ calculated according to <xref rid="FD8" ref-type="disp-formula">Equation (8)</xref> using the CHARMM partial atomic charges assigned to polar atoms with |<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.25, 0.27, or 0.3. The both data sets were normalized by <xref rid="FD9" ref-type="disp-formula">Equation (9)</xref>. Comparing the values in each row of <xref ref-type="table" rid="t5-ijms-12-08449">Table 5</xref> clearly indicates that the values in the last column of <xref ref-type="table" rid="t5-ijms-12-08449">Table 5</xref> give the highest correlations, which were calculated with |<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.3, for individual proteins. The mean values of correlation coefficient shown in the last column of <xref ref-type="table" rid="t5-ijms-12-08449">Table 5</xref> are 0.91 ± 0.02, 0.85 ± 0.03, and 0.86 ± 0.07 for proteins in Groups I, II, and III, respectively, showing that Group I with the lowest values of asphericity gives the strongest correlation.</p>
<p><xref ref-type="fig" rid="f5-ijms-12-08449">Figures 5a, 5b, and 5c</xref> show the comparative plots for the spatial distributions of the second-order hydrophobic moment for protein structures IE70, 1JAE, and 1LAM, respectively, using <xref rid="FD10" ref-type="disp-formula">Equations (10)</xref> and <xref rid="FD11" ref-type="disp-formula">(11)</xref>. Two data sets of second-order hydrophobic moments depicted in each of these figures were calculated using the hydrophobicity, <italic>h</italic><italic><sub>i</sub></italic>′, selected from the consensus scale (<xref ref-type="table" rid="t2-ijms-12-08449">Table 2</xref>) and the <italic>h</italic><italic><sub>i</sub></italic>′ calculated according to <xref rid="FD8" ref-type="disp-formula">Equation (8)</xref> using the CHARMM partial atomic charges assigned to polar atoms with |<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.3, respectively. As shown in (<xref ref-type="fig" rid="f5-ijms-12-08449">Figures 5a, 5b, and 5c</xref>), the values of <italic>h</italic><italic><sub>i</sub></italic>′ calculated using the CHARMM partial atomic charges overestimate the values of the second-order hydrophobic moments when <italic>p</italic> (%) is around zero. This is due to the fact that the denominator (<italic>ASA</italic><italic><sub>apo</sub></italic> + <italic>ASA</italic><italic><sub>po</sub></italic>) of <xref rid="FD8" ref-type="disp-formula">Equation (8)</xref> yields small values at the hydrophobic core. In contrast, the other areas of <italic>p</italic> (%) give rise to a very strong Pearson correlation, having the values are 0.91, 0.91, and 0.90. Thus, <italic>h</italic><italic><sub>i</sub></italic>′ calculated using the CHARMM partial atomic charges may provide an alternative way to assign the hydrophobicity for each amino-acid residue in a protein. Moreover, as shown in (<xref ref-type="fig" rid="f5-ijms-12-08449">Figures 5a, 5b, and 5c</xref>), the second-order hydrophobic moments give an overall shape of spatial distributions from the hydrophobic interior to the hydrophilic exterior. These results are in agreement with those of Silverman [<xref ref-type="bibr" rid="b4-ijms-12-08449">4</xref>].</p></sec>
<sec>
<title>3.4. The Locations of the Spatial Profile of the Proteins at p_ (%) or at d̄_ (Å)</title>
<p><xref ref-type="table" rid="t6-ijms-12-08449">Table 6</xref> shows the locations at which the spatial distribution of the second-order hydrophobic moments for the fifteen proteins vanishes and approaches a negative value. These locations are denoted as <italic>p</italic>_ (%) or <italic>d̄</italic>_ (Å) for the space defined by the percentage of solvent accessibility, <italic>p</italic> (%), or the average distance from the protein centroid to the centroids of the amino-acid residues in the same space, respectively. These locations show the transition from the hydrophobic core of the proteins to their hydrophilic exterior. Data listed in columns 2 and 3 of <xref ref-type="table" rid="t6-ijms-12-08449">Table 6</xref> are the values of the locations, <italic>p</italic>_ (%), calculated from the CHARMM partial atomic charges, |<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.3, assigned for polar atoms, and from the consensus scale (<xref ref-type="table" rid="t2-ijms-12-08449">Table 2</xref>), respectively, using the spaces defined by the percentage of solvent accessibility, <italic>p</italic> (%). The two data sets were normalized by <xref rid="FD9" ref-type="disp-formula">Equation (9)</xref>. Values in columns 4 and 5 of <xref ref-type="table" rid="t6-ijms-12-08449">Table 6</xref> are the average distances, <italic>d̄</italic>_ (Å), calculated using the same parameters of <italic>h</italic><italic><sub>i</sub></italic>′ as those shown in columns 2 and 3 of <xref ref-type="table" rid="t6-ijms-12-08449">Table 6</xref>, in the space corresponding to the percentage of solvent accessibility, <italic>p</italic>_ (%), shown for reference. As shown in <xref ref-type="table" rid="t6-ijms-12-08449">Table 6</xref>, the values of <italic>p</italic>_ (%) in column 2 have an average value of 46.5 ± 7.9 %, whereas those of <italic>p</italic>_ (%) in column 3 have an average value of 57.9 ± 8.3 %. Thus, the hydrophobicity, <italic>h</italic><italic><sub>i</sub></italic>′, calculated with the consensus scale yields a higher average value than that calculated with |<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.3. The result of 46.5 ± 7.9 % may provide a criterion to determine whether residues in a protein are buried or exposed to a solvent. Gromiha <italic>et al</italic>. [<xref ref-type="bibr" rid="b14-ijms-12-08449">14</xref>,<xref ref-type="bibr" rid="b15-ijms-12-08449">15</xref>] have studied mutations in different solvent accessibilities and have considered the mutations that are exposed when the percentage of solvent accessibility, <italic>p</italic> (%), is greater than 50%.</p></sec></sec>
<sec sec-type="conclusions">
<title>4. Conclusions</title>
<p>A set of 150 nonhomologous protein chains has been randomly selected from PDBSELECT [<xref ref-type="bibr" rid="b26-ijms-12-08449">26</xref>], which consisted of more than 4,000 protein chains. Fifteen protein structures have been selected from a set of 150 nonhomologous protein chains, based on the number of residues (<italic>N</italic>) and the extent of asphericity (<italic>δ</italic>), to ensure diversity of the molecular sizes and shapes. These fifteen protein structures have been the subject of a comparative study using second-order hydrophobic moments. This may provide an alternative approach to the assignment of hydrophobicity to each amino-acid residue in the proteins by using the CHARMM partial atomic charges. The Pearson correlation coefficient for each protein structure has been used to compare the spatial distributions of second-order hydrophobic moments calculated using the hydrophobicity, <italic>h</italic><italic><sub>i</sub></italic>, obtained from the consensus scale of Eisenberg [<xref ref-type="bibr" rid="b5-ijms-12-08449">5</xref>] with that calculated using <xref rid="FD8" ref-type="disp-formula">Equation (8)</xref> for the <italic>h</italic><italic><sub>i</sub></italic> derived from CHARMM partial atomic charges assigned to atoms in a protein. These two data sets were normalized by <xref rid="FD9" ref-type="disp-formula">Equation (9)</xref>. The Pearson correlation coefficient between the two data sets shows a strong correlation for each of these fifteen protein structures when the absolute values of the CHARMM partial atomic charges are greater than 0.3, assigned for polar atoms. These comparative results suggest that the hydrophobicity of each type of amino-acid residue in the proteins can probably be obtained using the CHARMM partial charges assigned to atoms in amino-acid residues. The spatial distributions of the second-order hydrophobic moment have the overall shape of the transition from hydrophobic interior to hydrophilic exterior. The transition at a position where the value of the second-order hydrophobic moment vanishes and approaches a negative value was denoted as <italic>p</italic>_ (%), and the average value of <italic>p</italic>_ (%) is 46.5 ± 7.9%, which has been calculated with CHARMM partial atomic charges, |<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.3, assigned for polar atoms. This may give the value of the transition point from the buried sites to exposed sites for amino-acid residues in the protein structures.</p></sec></body>
<back>
<ack>
<title>Acknowledgments</title>
<p>The National Center for High-Performance Computing in Taiwan is gratefully acknowledged for providing computer facilities. Chuan-Mei Lee of Harvard University has helped with the English editing of this manuscript.</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-ijms-12-08449"><label>1</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dill</surname><given-names>K.A.</given-names></name></person-group><article-title>Dominant forces in protein folding</article-title><source>Biochemistry</source><year>1990</year><volume>29</volume><fpage>7133</fpage><lpage>7155</lpage><pub-id pub-id-type="doi">10.1021/bi00483a001</pub-id><pub-id pub-id-type="pmid">2207096</pub-id></citation></ref>
<ref id="b2-ijms-12-08449"><label>2</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rose</surname><given-names>G.D.</given-names></name><name><surname>Wolfenden</surname><given-names>R.</given-names></name></person-group><article-title>Hydrogen bonding, hydrophobicity, packing, and protein folding</article-title><source>Annu. Rev. Biophys. Biomol. Struct</source><year>1993</year><volume>22</volume><fpage>381</fpage><lpage>415</lpage><pub-id pub-id-type="doi">10.1146/annurev.bb.22.060193.002121</pub-id><pub-id pub-id-type="pmid">8347995</pub-id></citation></ref>
<ref id="b3-ijms-12-08449"><label>3</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chan</surname><given-names>H.S.</given-names></name><name><surname>Dill</surname><given-names>K.A.</given-names></name></person-group><article-title>Origins of structure in globular proteins</article-title><source>Proc. Natl. Acad. Sci. USA</source><year>1990</year><volume>87</volume><fpage>6388</fpage><lpage>6392</lpage><pub-id pub-id-type="doi">10.1073/pnas.87.16.6388</pub-id><pub-id pub-id-type="pmid">2385597</pub-id></citation></ref>
<ref id="b4-ijms-12-08449"><label>4</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Silverman</surname><given-names>B.D.</given-names></name></person-group><article-title>Hydrophobic moments of protein structures: spatially profiling the distribution</article-title><source>Proc. Natl. Acad. Sci. USA</source><year>2001</year><volume>98</volume><fpage>4996</fpage><lpage>5001</lpage><pub-id pub-id-type="doi">10.1073/pnas.081086198</pub-id><pub-id pub-id-type="pmid">11309489</pub-id></citation></ref>
<ref id="b5-ijms-12-08449"><label>5</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eisenberg</surname><given-names>D.</given-names></name><name><surname>Weiss</surname><given-names>R.M.</given-names></name><name><surname>Terwilliger</surname><given-names>T.C.</given-names></name><name><surname>Wilcox</surname><given-names>W.</given-names></name></person-group><article-title>Hydrophobic moments and protein structure</article-title><source>Faraday Symp. Chem. Soc</source><year>1982</year><volume>17</volume><fpage>109</fpage><lpage>120</lpage><pub-id pub-id-type="doi">10.1039/fs9821700109</pub-id></citation></ref>
<ref id="b6-ijms-12-08449"><label>6</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname><given-names>R.</given-names></name><name><surname>Silverman</surname><given-names>B.D.</given-names></name><name><surname>Royyuru</surname><given-names>A.K.</given-names></name><name><surname>Athma</surname><given-names>P.</given-names></name></person-group><article-title>Spatial profiling of protein hydrophobicity: native vs. decoy structures</article-title><source>Proteins</source><year>2003</year><volume>52</volume><fpage>561</fpage><lpage>572</lpage><pub-id pub-id-type="doi">10.1002/prot.10419</pub-id><pub-id pub-id-type="pmid">12910456</pub-id></citation></ref>
<ref id="b7-ijms-12-08449"><label>7</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname><given-names>R.</given-names></name><name><surname>Royyuru</surname><given-names>A.</given-names></name><name><surname>Athma</surname><given-names>P.</given-names></name><name><surname>Suits</surname><given-names>F.</given-names></name><name><surname>Silverman</surname><given-names>B.D.</given-names></name></person-group><article-title>Magnitude and spatial orientation of the hydrophobic moments of multi-domain proteins</article-title><source>Int. J. Bioinform. Res. Appl</source><year>2006</year><volume>2</volume><fpage>161</fpage><lpage>176</lpage><pub-id pub-id-type="pmid">18048160</pub-id></citation></ref>
<ref id="b8-ijms-12-08449"><label>8</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eisenberg</surname><given-names>D.</given-names></name><name><surname>Weiss</surname><given-names>R.M.</given-names></name><name><surname>Terwilliger</surname><given-names>T.C.</given-names></name></person-group><article-title>The helical hydrophobic moment: a measure of the amphiphilicity of a helix</article-title><source>Nature</source><year>1982</year><volume>299</volume><fpage>371</fpage><lpage>374</lpage><pub-id pub-id-type="doi">10.1038/299371a0</pub-id><pub-id pub-id-type="pmid">7110359</pub-id></citation></ref>
<ref id="b9-ijms-12-08449"><label>9</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eisenberg</surname><given-names>D.</given-names></name><name><surname>Weiss</surname><given-names>R.M.</given-names></name><name><surname>Terwilliger</surname><given-names>T.C.</given-names></name></person-group><article-title>The hydrophobic moment detects periodicity in protein hydrophobicity</article-title><source>Proc. Natl. Acad. Sci. USA</source><year>1984</year><volume>81</volume><fpage>140</fpage><lpage>144</lpage><pub-id pub-id-type="doi">10.1073/pnas.81.1.140</pub-id><pub-id pub-id-type="pmid">6582470</pub-id></citation></ref>
<ref id="b10-ijms-12-08449"><label>10</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Silverman</surname><given-names>B.D.</given-names></name></person-group><article-title>Hydrophobic moments of tertiary protein structures</article-title><source>Proteins</source><year>2003</year><volume>53</volume><fpage>880</fpage><lpage>888</lpage><pub-id pub-id-type="doi">10.1002/prot.10494</pub-id><pub-id pub-id-type="pmid">14635130</pub-id></citation></ref>
<ref id="b11-ijms-12-08449"><label>11</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eisenberg</surname><given-names>D.</given-names></name><name><surname>Schwarz</surname><given-names>E.</given-names></name><name><surname>Komaromy</surname><given-names>M.</given-names></name><name><surname>Wall</surname><given-names>R.</given-names></name></person-group><article-title>Analysis of membrane and surface protein sequences with the hydrophobic moment plot</article-title><source>J. Mol. Biol</source><year>1984</year><volume>179</volume><fpage>125</fpage><lpage>142</lpage><pub-id pub-id-type="doi">10.1016/0022-2836(84)90309-7</pub-id><pub-id pub-id-type="pmid">6502707</pub-id></citation></ref>
<ref id="b12-ijms-12-08449"><label>12</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wallace</surname><given-names>J.</given-names></name><name><surname>Daman</surname><given-names>O.A.</given-names></name><name><surname>Harris</surname><given-names>F.</given-names></name><name><surname>Phoenix</surname><given-names>D.A.</given-names></name></person-group><article-title>Investigation of hydrophobic moment and hydrophobicity properties for transmembrane α-helices</article-title><source>Theor. Biol. Med. Model</source><year>2004</year><volume>1</volume><fpage>5</fpage><lpage>15</lpage><pub-id pub-id-type="doi">10.1186/1742-4682-1-5</pub-id><pub-id pub-id-type="pmid">15312230</pub-id></citation></ref>
<ref id="b13-ijms-12-08449"><label>13</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname><given-names>B.</given-names></name><name><surname>Richards</surname><given-names>F.M.</given-names></name></person-group><article-title>The interpretation of protein structures: estimation of static accessibility</article-title><source>J. Mol. Biol</source><year>1971</year><volume>55</volume><fpage>379</fpage><lpage>400</lpage><pub-id pub-id-type="doi">10.1016/0022-2836(71)90324-X</pub-id><pub-id pub-id-type="pmid">5551392</pub-id></citation></ref>
<ref id="b14-ijms-12-08449"><label>14</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gromiha</surname><given-names>M.M.</given-names></name><name><surname>Oobatake</surname><given-names>M.</given-names></name><name><surname>Kono</surname><given-names>H.</given-names></name><name><surname>Uedaira</surname><given-names>H.</given-names></name><name><surname>Sarai</surname><given-names>H.</given-names></name></person-group><article-title>Role of structural and sequence information in the prediction of protein stability changes: comparison between buried and partially buried mutations</article-title><source>Protein Eng</source><year>1999</year><volume>12</volume><fpage>549</fpage><lpage>555</lpage><pub-id pub-id-type="doi">10.1093/protein/12.7.549</pub-id><pub-id pub-id-type="pmid">10436080</pub-id></citation></ref>
<ref id="b15-ijms-12-08449"><label>15</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gromiha</surname><given-names>M.M.</given-names></name></person-group><article-title>Factors influencing the thermal stability of buried protein mutants</article-title><source>Polymer</source><year>2003</year><volume>44</volume><fpage>4061</fpage><lpage>4066</lpage><pub-id pub-id-type="doi">10.1016/S0032-3861(03)00354-9</pub-id></citation></ref>
<ref id="b16-ijms-12-08449"><label>16</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vogt</surname><given-names>G.</given-names></name><name><surname>Woell</surname><given-names>S.</given-names></name><name><surname>Argos</surname><given-names>P.</given-names></name></person-group><article-title>Protein thermal stability, hydrogen bonds, and ion pairs</article-title><source>J. Mol. Biol</source><year>1997</year><volume>269</volume><fpage>631</fpage><lpage>643</lpage><pub-id pub-id-type="doi">10.1006/jmbi.1997.1042</pub-id><pub-id pub-id-type="pmid">9217266</pub-id></citation></ref>
<ref id="b17-ijms-12-08449"><label>17</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Samanta</surname><given-names>U.</given-names></name><name><surname>Bahadur</surname><given-names>R.P.</given-names></name><name><surname>Chakrabarti</surname><given-names>P.</given-names></name></person-group><article-title>Quantifying the accessible surface area of protein residues in their local environment</article-title><source>Protein Eng</source><year>2002</year><volume>15</volume><fpage>659</fpage><lpage>667</lpage><pub-id pub-id-type="doi">10.1093/protein/15.8.659</pub-id><pub-id pub-id-type="pmid">12364580</pub-id></citation></ref>
<ref id="b18-ijms-12-08449"><label>18</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>H.</given-names></name><name><surname>Zhou</surname><given-names>H.-X.</given-names></name></person-group><article-title>Prediction of solvent accessibility and sites of deleterious mutations from protein sequence</article-title><source>Nucleic Acids Res</source><year>2005</year><volume>33</volume><fpage>3193</fpage><lpage>3199</lpage><pub-id pub-id-type="doi">10.1093/nar/gki633</pub-id><pub-id pub-id-type="pmid">15937195</pub-id></citation></ref>
<ref id="b19-ijms-12-08449"><label>19</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yuan</surname><given-names>Z.</given-names></name><name><surname>Huang</surname><given-names>B.</given-names></name></person-group><article-title>Prediction of protein accessible surface areas by support vector regression</article-title><source>Proteins</source><year>2004</year><volume>57</volume><fpage>558</fpage><lpage>564</lpage><pub-id pub-id-type="doi">10.1002/prot.20234</pub-id><pub-id pub-id-type="pmid">15382233</pub-id></citation></ref>
<ref id="b20-ijms-12-08449"><label>20</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nguyen</surname><given-names>M.N.</given-names></name><name><surname>Rajapakse</surname><given-names>J.C.</given-names></name></person-group><article-title>Two-stage support vector regression approach for predicting accessible surface areas of amino acids</article-title><source>Proteins</source><year>2006</year><volume>63</volume><fpage>542</fpage><lpage>550</lpage><pub-id pub-id-type="doi">10.1002/prot.20883</pub-id><pub-id pub-id-type="pmid">16456847</pub-id></citation></ref>
<ref id="b21-ijms-12-08449"><label>21</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yuan</surname><given-names>Z.</given-names></name><name><surname>Zhang</surname><given-names>F.</given-names></name><name><surname>Davis</surname><given-names>M.J.</given-names></name><name><surname>Bodén</surname><given-names>M.</given-names></name><name><surname>Teasdale</surname><given-names>R.D.</given-names></name></person-group><article-title>predicting the solvent accessibility of transmembrane residues from protein sequence</article-title><source>J. Proteome Res</source><year>2006</year><volume>5</volume><fpage>1063</fpage><lpage>1070</lpage><pub-id pub-id-type="doi">10.1021/pr050397b</pub-id><pub-id pub-id-type="pmid">16674095</pub-id></citation></ref>
<ref id="b22-ijms-12-08449"><label>22</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brooks</surname><given-names>B.R.</given-names></name><name><surname>Bruccoleri</surname><given-names>R.E.</given-names></name><name><surname>Olafson</surname><given-names>B.D.</given-names></name><name><surname>States</surname><given-names>D.J.</given-names></name><name><surname>Swaminathan</surname><given-names>S.</given-names></name><name><surname>Karplus</surname><given-names>M.</given-names></name></person-group><article-title>CHARMM: A program for macromolecular energy, minimization, and dynamics calculations</article-title><source>J. Comput. Chem</source><year>1983</year><volume>4</volume><fpage>187</fpage><lpage>217</lpage><pub-id pub-id-type="doi">10.1002/jcc.540040211</pub-id></citation></ref>
<ref id="b23-ijms-12-08449"><label>23</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nozaki</surname><given-names>Y.</given-names></name><name><surname>Tanford</surname><given-names>C.</given-names></name></person-group><article-title>The solubility of amino acids and two glycine peptides in aqueous ethanol and dioxane solutions</article-title><source>J. Biol. Chem</source><year>1971</year><volume>246</volume><fpage>2211</fpage><lpage>2217</lpage><pub-id pub-id-type="pmid">5555568</pub-id></citation></ref>
<ref id="b24-ijms-12-08449"><label>24</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kyte</surname><given-names>J.</given-names></name><name><surname>Doolittle</surname><given-names>R.F.</given-names></name></person-group><article-title>A simple method for displaying the hydropathic character of a protein</article-title><source>J. Mol. Biol</source><year>1982</year><volume>157</volume><fpage>105</fpage><lpage>132</lpage><pub-id pub-id-type="doi">10.1016/0022-2836(82)90515-0</pub-id><pub-id pub-id-type="pmid">7108955</pub-id></citation></ref>
<ref id="b25-ijms-12-08449"><label>25</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharp</surname><given-names>K.A.</given-names></name><name><surname>Nicholls</surname><given-names>A.</given-names></name><name><surname>Friedman</surname><given-names>R.</given-names></name><name><surname>Honig</surname><given-names>B.</given-names></name></person-group><article-title>Extracting hydrophobic free energies from experimental data: relationship to protein folding and theoretical models</article-title><source>Biochemistry</source><year>1991</year><volume>30</volume><fpage>9686</fpage><lpage>9697</lpage><pub-id pub-id-type="doi">10.1021/bi00104a017</pub-id><pub-id pub-id-type="pmid">1911756</pub-id></citation></ref>
<ref id="b26-ijms-12-08449"><label>26</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Griep</surname><given-names>S.</given-names></name><name><surname>Hobohm</surname><given-names>U.</given-names></name></person-group><article-title>PDBselect 1992-2009 and PDBfilter-select</article-title><source>Nucleic Acids Res</source><year>2010</year><volume>38</volume><fpage>D318</fpage><lpage>D319</lpage><pub-id pub-id-type="doi">10.1093/nar/gkp786</pub-id><pub-id pub-id-type="pmid">19783827</pub-id></citation></ref>
<ref id="b27-ijms-12-08449"><label>27</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berman</surname><given-names>H.M.</given-names></name><name><surname>Westbrook</surname><given-names>J.</given-names></name><name><surname>Feng</surname><given-names>Z.</given-names></name><name><surname>Gilliland</surname><given-names>G.</given-names></name><name><surname>Bhat</surname><given-names>T.N.</given-names></name><name><surname>Weissig</surname><given-names>H.</given-names></name><name><surname>Shindyalov</surname><given-names>I.N.</given-names></name><name><surname>Bourne</surname><given-names>P.E.</given-names></name></person-group><article-title>The protein data bank</article-title><source>Nucleic Acids Res</source><year>2000</year><volume>28</volume><fpage>235</fpage><lpage>242</lpage><pub-id pub-id-type="doi">10.1093/nar/28.1.235</pub-id><pub-id pub-id-type="pmid">10592235</pub-id></citation></ref>
<ref id="b28-ijms-12-08449"><label>28</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname><given-names>T.T.</given-names></name><name><surname>Zhang</surname><given-names>L.X.</given-names></name></person-group><article-title>Effect of secondary structure on the conformations and folding behaviors of protein-like chains</article-title><source>Polymer</source><year>2004</year><volume>45</volume><fpage>7759</fpage><lpage>7766</lpage><pub-id pub-id-type="doi">10.1016/j.polymer.2004.08.069</pub-id></citation></ref>
<ref id="b29-ijms-12-08449"><label>29</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yi</surname><given-names>W.Q.</given-names></name><name><surname>Zhang</surname><given-names>L.X.</given-names></name></person-group><article-title>Conformational properties and dynamics of protein-like chains confined in an infinite cylinder</article-title><source>Eur. Polym. J</source><year>2006</year><volume>42</volume><fpage>573</fpage><lpage>579</lpage><pub-id pub-id-type="doi">10.1016/j.eurpolymj.2005.09.002</pub-id></citation></ref>
<ref id="b30-ijms-12-08449"><label>30</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tran</surname><given-names>H.T.</given-names></name><name><surname>Pappu</surname><given-names>R.V.</given-names></name></person-group><article-title>Toward an accurate theoretical framework for describing ensembles for proteins under strongly denaturing conditions</article-title><source>Biophys. J</source><year>2006</year><volume>91</volume><fpage>1868</fpage><lpage>1886</lpage><pub-id pub-id-type="doi">10.1529/biophysj.106.086264</pub-id><pub-id pub-id-type="pmid">16766618</pub-id></citation></ref>
<ref id="b31-ijms-12-08449"><label>31</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hyeon</surname><given-names>C.</given-names></name><name><surname>Dima</surname><given-names>R.I.</given-names></name><name><surname>Thirumalai</surname><given-names>D.</given-names></name></person-group><article-title>Size, shape, and flexibility of RNA structures</article-title><source>J. Chem. Phys</source><year>2006</year><volume>125</volume><fpage>194905</fpage><lpage>194914</lpage><pub-id pub-id-type="doi">10.1063/1.2364190</pub-id><pub-id pub-id-type="pmid">17129165</pub-id></citation></ref>
<ref id="b32-ijms-12-08449"><label>32</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fernandes</surname><given-names>M.X.</given-names></name><name><surname>Bernadó</surname><given-names>P.</given-names></name><name><surname>Pons</surname><given-names>M.</given-names></name><name><surname>de la Torre</surname><given-names>J.G.</given-names></name></person-group><article-title>An analytical solution to the problem of the orientation of rigid particles by planar obstacles. Application to membrane systems and to the calculation of dipolar couplings in protein NMR spectroscopy</article-title><source>J. Am. Chem. Soc</source><year>2001</year><volume>123</volume><fpage>12037</fpage><lpage>12047</lpage><pub-id pub-id-type="doi">10.1021/ja011361x</pub-id><pub-id pub-id-type="pmid">11724612</pub-id></citation></ref>
<ref id="b33-ijms-12-08449"><label>33</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname><given-names>S.K.</given-names></name><name><surname>Vacatello</surname><given-names>M.</given-names></name><name><surname>Yoon</surname><given-names>D.Y.</given-names></name></person-group><article-title>Off-lattice Monte Carlo simulations of polymer melts confined between two plates</article-title><source>J. Chem. Phys</source><year>1988</year><volume>89</volume><fpage>5206</fpage><lpage>5215</lpage><pub-id pub-id-type="doi">10.1063/1.455611</pub-id></citation></ref>
<ref id="b34-ijms-12-08449"><label>34</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eisenhaber</surname><given-names>F.</given-names></name><name><surname>Argos</surname><given-names>P.</given-names></name></person-group><article-title>Improved strategy in analytic surface calculation for molecular systems: Handling of singularities and computational efficiency</article-title><source>J. Comput. Chem</source><year>1993</year><volume>14</volume><fpage>1272</fpage><lpage>1280</lpage><pub-id pub-id-type="doi">10.1002/jcc.540141103</pub-id></citation></ref>
<ref id="b35-ijms-12-08449"><label>35</label><citation citation-type="book"><source>InsightII, version 2000</source><publisher-name>Accelrys, Inc</publisher-name><publisher-loc>San Diego, CA, USA</publisher-loc><year>2001</year></citation></ref>
<ref id="b36-ijms-12-08449"><label>36</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Press</surname><given-names>W.H.</given-names></name><name><surname>Flannery</surname><given-names>B.P.</given-names></name><name><surname>Teukolsky</surname><given-names>S.A.</given-names></name><name><surname>Vetterling</surname><given-names>W.T.</given-names></name></person-group><source>Numerical recipes in FORTRAN</source><publisher-name>Cambridge University Press</publisher-name><publisher-loc>New York, NY, USA</publisher-loc><year>1986</year></citation></ref>
<ref id="b37-ijms-12-08449"><label>37</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Flory</surname><given-names>P.J.</given-names></name></person-group><source>Principles of Polymer Chemistry</source><publisher-name>Cornel University Press</publisher-name><publisher-loc>Ithaca, NY, USA</publisher-loc><year>1953</year></citation></ref>
<ref id="b38-ijms-12-08449"><label>38</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>de Gennes</surname><given-names>P.G.</given-names></name></person-group><source>Scaling Concepts in Polymer Physics</source><publisher-name>Cornel University Press</publisher-name><publisher-loc>Ithaca, NY, USA</publisher-loc><year>1979</year></citation></ref>
<ref id="b39-ijms-12-08449"><label>39</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Doi</surname><given-names>M.</given-names></name><name><surname>Edwards</surname><given-names>S.F.</given-names></name></person-group><source>The Theory of Polymer Dynamics</source><publisher-name>Clarendon Press</publisher-name><publisher-loc>Oxford, UK</publisher-loc><year>1986</year></citation></ref>
<ref id="b40-ijms-12-08449"><label>40</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sanchez</surname><given-names>I.C.</given-names></name></person-group><article-title>Phase transition behavior of the isolated polymer chain</article-title><source>Macromolecules</source><year>1979</year><volume>12</volume><fpage>980</fpage><lpage>988</lpage><pub-id pub-id-type="doi">10.1021/ma60071a040</pub-id></citation></ref>
<ref id="b41-ijms-12-08449"><label>41</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chan</surname><given-names>H.S.</given-names></name><name><surname>Dill</surname><given-names>K.A.</given-names></name></person-group><article-title>Polymer principles in protein structure and stability</article-title><source>Annu. Rev. Biophys. Biophys. Chem</source><year>1991</year><volume>20</volume><fpage>447</fpage><lpage>490</lpage><pub-id pub-id-type="doi">10.1146/annurev.bb.20.060191.002311</pub-id><pub-id pub-id-type="pmid">1867723</pub-id></citation></ref>
<ref id="b42-ijms-12-08449"><label>42</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Uzawa</surname><given-names>T.</given-names></name><name><surname>Kimura</surname><given-names>T.</given-names></name><name><surname>Ishimori</surname><given-names>K.</given-names></name><name><surname>Morishima</surname><given-names>I.</given-names></name><name><surname>Matsui</surname><given-names>T.</given-names></name><name><surname>Ikeda-Saito</surname><given-names>M.</given-names></name><name><surname>Takahashi</surname><given-names>S.</given-names></name><name><surname>Akiyama</surname><given-names>S.</given-names></name><name><surname>Fujisawa</surname><given-names>T.</given-names></name></person-group><article-title>Time-resolved small-angle X-ray scattering investigation of the folding dynamics of heme oxygenase: implication of the scaling relationship for the submillisecond intermediates of protein folding</article-title><source>J. Mol. Biol</source><year>2006</year><volume>35</volume><fpage>997</fpage><lpage>1008</lpage></citation></ref>
<ref id="b43-ijms-12-08449"><label>43</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname><given-names>K.-J.</given-names></name><name><surname>Eichinger</surname><given-names>B.E.</given-names></name></person-group><article-title>Computer simulation of end-linked networks: telechelic poly(oxyethylene) cross-linked with plurifunctional isocyanates</article-title><source>Macromolecules</source><year>1989</year><volume>22</volume><fpage>1441</fpage><lpage>1448</lpage><pub-id pub-id-type="doi">10.1021/ma00193a073</pub-id></citation></ref></ref-list>
<sec sec-type="display-objects">
<title>Figures and Tables</title>
<fig id="f1-ijms-12-08449" position="float">
<label>Figure 1</label>
<caption>
<p>The asphericity (<italic>δ</italic>) as a function of protein indices.</p></caption>
<graphic xlink:href="ijms-12-08449f1.gif"/></fig>
<fig id="f2-ijms-12-08449" position="float">
<label>Figure 2</label>
<caption>
<p>Plots of log (<italic>R</italic><italic><sub>g</sub></italic>) <italic>versus</italic> log (<italic>N</italic>), where R<italic><sub>g</sub></italic> is the mean radius of gyration and N is the number of residues, for proteins in Group I (filled squares), Group II (open circles), and Group III (filled circles). The solid line (<italic>R</italic> = 0.99), the dashed line (<italic>R</italic> = 0.98), and the dotted line (<italic>R</italic> = 0.98) are the regression lines for proteins in Groups I, II, and III, respectively.</p></caption>
<graphic xlink:href="ijms-12-08449f2.gif"/></fig>
<fig id="f3-ijms-12-08449" position="float">
<label>Figure 3</label>
<caption>
<p>The distance distribution W (d) as a function of d (Å) for the protein structures, as shown in <xref ref-type="table" rid="t1-ijms-12-08449">Table 1</xref>: (<bold>a</bold>) Proteins in Group I; (<bold>b</bold>) proteins in Group II; and (<bold>c</bold>) proteins in Group III. The indices, as shown in the legends of these figures, correspond to those shown in column 2 of <xref ref-type="table" rid="t1-ijms-12-08449">Table 1</xref>.</p></caption>
<graphic xlink:href="ijms-12-08449f3.gif"/></fig>
<fig id="f4-ijms-12-08449" position="float">
<label>Figure 4</label>
<caption>
<p>The spatial distribution of the second-order hydrophobic moment per residue, <italic>H</italic><sub>2</sub>(d<italic><sub>i</sub></italic>), for the protein 3PYP, plotted as a function of the percentage of solvent accessibility, <italic>p</italic> (%). The second-order hydrophobic moment was calculated using <italic>h</italic><italic><sub>i</sub></italic>′ selected from the consensus scales (<xref ref-type="table" rid="t2-ijms-12-08449">Table 2</xref>).</p></caption>
<graphic xlink:href="ijms-12-08449f4.gif"/></fig>
<fig id="f5-ijms-12-08449" position="float">
<label>Figure 5</label>
<caption>
<p>Comparative plots of the spatial distributions of the second-order hydrophobic moment per residue, <italic>H</italic><sub>2</sub>(d<italic><sub>i</sub></italic>), <italic>versus</italic> the percentage of solvent accessibility, <italic>p</italic> (%), for (a) protein IE70, (b) protein 1JAE, and (c) protein 1LAM. The solid line and dashed line represent, respectively, the second-order hydrophobic moments calculated using <italic>h</italic><italic><sub>i</sub></italic>′ selected from the consensus scales (<xref ref-type="table" rid="t2-ijms-12-08449">Table 2</xref>) and the <italic>h</italic><italic><sub>i</sub></italic>′ calculated according to <xref rid="FD8" ref-type="disp-formula">Equation (8)</xref> with the CHARMM partial atomic charges, |<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.3, assigned for polar atoms. The Pearson correlation coefficients between the spatial distributions of the second-order hydrophobic moments (solid and dashed lines) are 0.91, 0.91, and 0.90 for proteins IE70, 1JAE, and 1LAM, respectively.</p></caption>
<graphic xlink:href="ijms-12-08449f5a.gif"/>
<graphic xlink:href="ijms-12-08449f5b.gif"/></fig>
<table-wrap id="t1-ijms-12-08449" position="float">
<label>Table 1</label>
<caption>
<p>List of proteins used in this work.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom">Protein</th>
<th align="center" valign="bottom">Protein index</th>
<th align="center" valign="bottom">PDB code</th>
<th align="center" valign="bottom">No. residues</th></tr></thead>
<tbody>
<tr>
<td colspan="4" align="left" valign="top">I</td></tr>
<tr>
<td align="left" valign="top">Photoactive yellow protein</td>
<td align="center" valign="top">4</td>
<td align="center" valign="top">3PYP</td>
<td align="center" valign="top">125</td></tr>
<tr>
<td align="left" valign="top">Exonuclease III</td>
<td align="center" valign="top">125</td>
<td align="center" valign="top">1AKO</td>
<td align="center" valign="top">268</td></tr>
<tr>
<td align="left" valign="top">Carboxypeptidase A</td>
<td align="center" valign="top">39</td>
<td align="center" valign="top">2CTC</td>
<td align="center" valign="top">307</td></tr>
<tr>
<td align="left" valign="top">Cellulase cela</td>
<td align="center" valign="top">96</td>
<td align="center" valign="top">1CEM</td>
<td align="center" valign="top">363</td></tr>
<tr>
<td align="left" valign="top">Myrosinase</td>
<td align="center" valign="top">55</td>
<td align="center" valign="top">1E70</td>
<td align="center" valign="top">499</td></tr>
<tr>
<td colspan="4" align="left" valign="top">II</td></tr>
<tr>
<td align="left" valign="top">Lysozyme</td>
<td align="center" valign="top">2</td>
<td align="center" valign="top">3LTZ</td>
<td align="center" valign="top">129</td></tr>
<tr>
<td align="left" valign="top">Dienoyl-coa isomerase</td>
<td align="center" valign="top">57</td>
<td align="center" valign="top">1DCIA</td>
<td align="center" valign="top">275</td></tr>
<tr>
<td align="left" valign="top">Sulfate-Binding protein</td>
<td align="center" valign="top">139</td>
<td align="center" valign="top">1SBP</td>
<td align="center" valign="top">309</td></tr>
<tr>
<td align="left" valign="top">Maltodextrin-binding protein</td>
<td align="center" valign="top">122</td>
<td align="center" valign="top">3MBP</td>
<td align="center" valign="top">370</td></tr>
<tr>
<td align="left" valign="top">Alpha-amylase</td>
<td align="center" valign="top">118</td>
<td align="center" valign="top">1JAE</td>
<td align="center" valign="top">470</td></tr>
<tr>
<td colspan="4" align="left" valign="top">III</td></tr>
<tr>
<td align="left" valign="top">Cytochrome C′</td>
<td align="center" valign="top">108</td>
<td align="center" valign="top">1CPQ</td>
<td align="center" valign="top">129</td></tr>
<tr>
<td align="left" valign="top">Molybdate transport protein</td>
<td align="center" valign="top">135</td>
<td align="center" valign="top">1AMF</td>
<td align="center" valign="top">231</td></tr>
<tr>
<td align="left" valign="top">L-arabinose-binding protein</td>
<td align="center" valign="top">81</td>
<td align="center" valign="top">1ABE</td>
<td align="center" valign="top">305</td></tr>
<tr>
<td align="left" valign="top">Phosphate-binding protein</td>
<td align="center" valign="top">133</td>
<td align="center" valign="top">2ABH</td>
<td align="center" valign="top">321</td></tr>
<tr>
<td align="left" valign="top">Leucine aminopeptidase</td>
<td align="center" valign="top">83</td>
<td align="center" valign="top">1LAM</td>
<td align="center" valign="top">484</td></tr></tbody></table></table-wrap>
<table-wrap id="t2-ijms-12-08449" position="float">
<label>Table 2</label>
<caption>
<p>Values of ASA (Å<sup>2</sup>) and the consensus scale for each amino-acid residue.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">Residue</th>
<th align="center" valign="bottom">ASA <xref ref-type="table-fn" rid="tfn1-ijms-12-08449">a</xref></th>
<th align="center" valign="bottom">Consensus scale (<italic>h</italic><italic><sub>i</sub></italic>) <xref ref-type="table-fn" rid="tfn2-ijms-12-08449">b</xref></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">Ala</td>
<td align="center" valign="top">116.5</td>
<td align="center" valign="top">0.25</td></tr>
<tr>
<td align="left" valign="top">Arg</td>
<td align="center" valign="top">249.7</td>
<td align="center" valign="top">−1.80</td></tr>
<tr>
<td align="left" valign="top">Asn</td>
<td align="center" valign="top">163.2</td>
<td align="center" valign="top">−0.64</td></tr>
<tr>
<td align="left" valign="top">Asp</td>
<td align="center" valign="top">156.2</td>
<td align="center" valign="top">−0.72</td></tr>
<tr>
<td align="left" valign="top">Cys</td>
<td align="center" valign="top">140.2</td>
<td align="center" valign="top">0.04</td></tr>
<tr>
<td align="left" valign="top">Gln</td>
<td align="center" valign="top">188.9</td>
<td align="center" valign="top">−0.69</td></tr>
<tr>
<td align="left" valign="top">Glu</td>
<td align="center" valign="top">183.8</td>
<td align="center" valign="top">−0.62</td></tr>
<tr>
<td align="left" valign="top">Gly</td>
<td align="center" valign="top">84.9</td>
<td align="center" valign="top">0.16</td></tr>
<tr>
<td align="left" valign="top">His</td>
<td align="center" valign="top">204.3</td>
<td align="center" valign="top">−0.40</td></tr>
<tr>
<td align="left" valign="top">Ile</td>
<td align="center" valign="top">183.6</td>
<td align="center" valign="top">0.73</td></tr>
<tr>
<td align="left" valign="top">Leu</td>
<td align="center" valign="top">193.5</td>
<td align="center" valign="top">0.53</td></tr>
<tr>
<td align="left" valign="top">Lys</td>
<td align="center" valign="top">211.3</td>
<td align="center" valign="top">−1.10</td></tr>
<tr>
<td align="left" valign="top">Met</td>
<td align="center" valign="top">205.9</td>
<td align="center" valign="top">0.26</td></tr>
<tr>
<td align="left" valign="top">Phe</td>
<td align="center" valign="top">223.7</td>
<td align="center" valign="top">0.61</td></tr>
<tr>
<td align="left" valign="top">Pro</td>
<td align="center" valign="top">148.3</td>
<td align="center" valign="top">−0.07</td></tr>
<tr>
<td align="left" valign="top">Ser</td>
<td align="center" valign="top">124.8</td>
<td align="center" valign="top">−0.26</td></tr>
<tr>
<td align="left" valign="top">Thr</td>
<td align="center" valign="top">144.1</td>
<td align="center" valign="top">−0.18</td></tr>
<tr>
<td align="left" valign="top">Trp</td>
<td align="center" valign="top">266.3</td>
<td align="center" valign="top">0.37</td></tr>
<tr>
<td align="left" valign="top">Tyr</td>
<td align="center" valign="top">236.0</td>
<td align="center" valign="top">0.02</td></tr>
<tr>
<td align="left" valign="top">Val</td>
<td align="center" valign="top">158.0</td>
<td align="center" valign="top">0.54</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn1-ijms-12-08449">
<label>a</label>
<p>Solvent-accessible surface area (ASA) of each residue type X in the tripeptide (Gly-X-Gly) at the extended state, calculated using ASC [<xref ref-type="bibr" rid="b34-ijms-12-08449">34</xref>];</p></fn><fn id="tfn2-ijms-12-08449">
<label>b</label>
<p>The hydrophobicity consensus scale of Eisenburg [<xref ref-type="bibr" rid="b5-ijms-12-08449">5</xref>].</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="t3-ijms-12-08449" position="float">
<label>Table 3</label>
<caption>
<p>The results of molecular shape and size.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom">PDB code</th>
<th align="center" valign="bottom"><italic>α</italic><sub>1</sub><xref ref-type="table-fn" rid="tfn3-ijms-12-08449">a</xref></th>
<th align="center" valign="bottom"><italic>α</italic><sub>2</sub><xref ref-type="table-fn" rid="tfn3-ijms-12-08449">a</xref></th>
<th align="center" valign="bottom"><italic>α</italic><sub>3</sub><xref ref-type="table-fn" rid="tfn3-ijms-12-08449">a</xref></th>
<th align="center" valign="bottom">0<sup>2</sup> ×<italic>δ</italic> <xref ref-type="table-fn" rid="tfn4-ijms-12-08449">b</xref></th>
<th align="center" valign="bottom">10<sup>3</sup>× <italic>S</italic> <xref ref-type="table-fn" rid="tfn5-ijms-12-08449">c</xref></th>
<th align="center" valign="bottom"><italic>a</italic> <xref ref-type="table-fn" rid="tfn6-ijms-12-08449">d</xref></th>
<th align="center" valign="bottom"><italic>b</italic> <xref ref-type="table-fn" rid="tfn6-ijms-12-08449">d</xref></th>
<th align="center" valign="bottom"><italic>R</italic><italic><sub>g</sub></italic><xref ref-type="table-fn" rid="tfn7-ijms-12-08449">e</xref></th></tr></thead>
<tbody>
<tr>
<td colspan="9" align="left" valign="top">I</td></tr>
<tr>
<td align="left" valign="top">3PYP</td>
<td align="center" valign="top">41.11</td>
<td align="center" valign="top">61.82</td>
<td align="center" valign="top">71.06</td>
<td align="center" valign="top">2.33</td>
<td align="center" valign="top">−4.33</td>
<td align="center" valign="top">16.04</td>
<td align="center" valign="top">18.85</td>
<td align="center" valign="top">13.19</td></tr>
<tr>
<td align="left" valign="top">1AKO</td>
<td align="center" valign="top">78.40</td>
<td align="center" valign="top">96.64</td>
<td align="center" valign="top">129.61</td>
<td align="center" valign="top">2.18</td>
<td align="center" valign="top">3.05</td>
<td align="center" valign="top">20.92</td>
<td align="center" valign="top">25.46</td>
<td align="center" valign="top">17.45</td></tr>
<tr>
<td align="left" valign="top">2CTC</td>
<td align="center" valign="top">84.32</td>
<td align="center" valign="top">100.38</td>
<td align="center" valign="top">141.79</td>
<td align="center" valign="top">2.48</td>
<td align="center" valign="top">5.30</td>
<td align="center" valign="top">21.49</td>
<td align="center" valign="top">26.63</td>
<td align="center" valign="top">18.07</td></tr>
<tr>
<td align="left" valign="top">1CEM</td>
<td align="center" valign="top">93.46</td>
<td align="center" valign="top">116.77</td>
<td align="center" valign="top">143.17</td>
<td align="center" valign="top">1.49</td>
<td align="center" valign="top">0.39</td>
<td align="center" valign="top">22.93</td>
<td align="center" valign="top">26.76</td>
<td align="center" valign="top">18.80</td></tr>
<tr>
<td align="left" valign="top">1E70</td>
<td align="center" valign="top">110.18</td>
<td align="center" valign="top">169.06</td>
<td align="center" valign="top">176.62</td>
<td align="center" valign="top">1.91</td>
<td align="center" valign="top">−5.02</td>
<td align="center" valign="top">26.42</td>
<td align="center" valign="top">29.72</td>
<td align="center" valign="top">21.35</td></tr>
<tr>
<td colspan="9" align="left" valign="top">II</td></tr>
<tr>
<td align="left" valign="top">3LTZ</td>
<td align="center" valign="top">35.84</td>
<td align="center" valign="top">48.75</td>
<td align="center" valign="top">101.55</td>
<td align="center" valign="top">10.49</td>
<td align="center" valign="top">57.61</td>
<td align="center" valign="top">14.54</td>
<td align="center" valign="top">22.53</td>
<td align="center" valign="top">13.64</td></tr>
<tr>
<td align="left" valign="top">1DCIA</td>
<td align="center" valign="top">72.74</td>
<td align="center" valign="top">96.28</td>
<td align="center" valign="top">223.74</td>
<td align="center" valign="top">12.84</td>
<td align="center" valign="top">83.38</td>
<td align="center" valign="top">20.56</td>
<td align="center" valign="top">33.45</td>
<td align="center" valign="top">19.82</td></tr>
<tr>
<td align="left" valign="top">1SBP</td>
<td align="center" valign="top">56.12</td>
<td align="center" valign="top">105.62</td>
<td align="center" valign="top">206.44</td>
<td align="center" valign="top">12.79</td>
<td align="center" valign="top">51.45</td>
<td align="center" valign="top">20.17</td>
<td align="center" valign="top">32.13</td>
<td align="center" valign="top">19.21</td></tr>
<tr>
<td align="left" valign="top">3MBP</td>
<td align="center" valign="top">74.47</td>
<td align="center" valign="top">115.72</td>
<td align="center" valign="top">242.46</td>
<td align="center" valign="top">12.28</td>
<td align="center" valign="top">65.10</td>
<td align="center" valign="top">21.81</td>
<td align="center" valign="top">34.82</td>
<td align="center" valign="top">20.80</td></tr>
<tr>
<td align="left" valign="top">1JAE</td>
<td align="center" valign="top">88.54</td>
<td align="center" valign="top">105.91</td>
<td align="center" valign="top">312.60</td>
<td align="center" valign="top">18.13</td>
<td align="center" valign="top">151.02</td>
<td align="center" valign="top">22.05</td>
<td align="center" valign="top">39.53</td>
<td align="center" valign="top">22.52</td></tr>
<tr>
<td colspan="9" align="left" valign="top">III</td></tr>
<tr>
<td align="left" valign="top">1CPQ</td>
<td align="center" valign="top">26.97</td>
<td align="center" valign="top">39.53</td>
<td align="center" valign="top">178.83</td>
<td align="center" valign="top">35.41</td>
<td align="center" valign="top">410.89</td>
<td align="center" valign="top">12.89</td>
<td align="center" valign="top">29.90</td>
<td align="center" valign="top">15.66</td></tr>
<tr>
<td align="left" valign="top">1AMF</td>
<td align="center" valign="top">35.21</td>
<td align="center" valign="top">75.05</td>
<td align="center" valign="top">208.33</td>
<td align="center" valign="top">24.30</td>
<td align="center" valign="top">188.55</td>
<td align="center" valign="top">16.60</td>
<td align="center" valign="top">32.27</td>
<td align="center" valign="top">17.85</td></tr>
<tr>
<td align="left" valign="top">1ABE</td>
<td align="center" valign="top">53.21</td>
<td align="center" valign="top">89.04</td>
<td align="center" valign="top">256.31</td>
<td align="center" valign="top">22.25</td>
<td align="center" valign="top">183.73</td>
<td align="center" valign="top">18.86</td>
<td align="center" valign="top">35.80</td>
<td align="center" valign="top">19.96</td></tr>
<tr>
<td align="left" valign="top">2ABH</td>
<td align="center" valign="top">65.13</td>
<td align="center" valign="top">68.09</td>
<td align="center" valign="top">262.66</td>
<td align="center" valign="top">24.53</td>
<td align="center" valign="top">242.78</td>
<td align="center" valign="top">18.25</td>
<td align="center" valign="top">36.24</td>
<td align="center" valign="top">19.90</td></tr>
<tr>
<td align="left" valign="top">1LAM</td>
<td align="center" valign="top">44.08</td>
<td align="center" valign="top">141.19</td>
<td align="center" valign="top">388.73</td>
<td align="center" valign="top">28.90</td>
<td align="center" valign="top">211.84</td>
<td align="center" valign="top">21.46</td>
<td align="center" valign="top">44.09</td>
<td align="center" valign="top">23.94</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn3-ijms-12-08449">
<label>a</label>
<p>The eigenvalues of gyration tensor, obtained by diagonalization of <xref rid="FD1" ref-type="disp-formula">Equation (1)</xref>. The values were sorted according to increasing magnitude;</p></fn><fn id="tfn4-ijms-12-08449">
<label>b</label>
<p>The asphericity (<italic>δ</italic>), calculated according to <xref rid="FD3" ref-type="disp-formula">Equation (3)</xref>;</p></fn><fn id="tfn5-ijms-12-08449">
<label>c</label>
<p>The shape parameter (S), calculated according to <xref rid="FD4" ref-type="disp-formula">Equation (4)</xref>;</p></fn><fn id="tfn6-ijms-12-08449">
<label>d</label>
<p>Semiaxes (in Å) [<xref ref-type="bibr" rid="b32-ijms-12-08449">32</xref>], calculated with <italic>a</italic> = [5(<italic>α</italic><sub>1</sub> +<italic>α</italic><sub>2</sub>)/2]<sup>1/2</sup> and <italic>b</italic> = (5<italic>α</italic><sub>3</sub>)<sup>1/2</sup>;</p></fn><fn id="tfn7-ijms-12-08449">
<label>e</label>
<p>The mean radius of gyrations (in Å), calculated according <xref rid="FD6" ref-type="disp-formula">Equation (6)</xref>.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="t4-ijms-12-08449" position="float">
<label>Table 4</label>
<caption>
<p>The details of the spatial distribution of the second-order moment for 3PYP.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom"><italic>P</italic> (%)</th>
<th align="center" valign="bottom"><italic>n</italic><italic><sub>p</sub></italic> <xref ref-type="table-fn" rid="tfn8-ijms-12-08449">a</xref></th>
<th align="center" valign="bottom"><italic>H</italic><sub>2</sub>(d<italic><sub>i</sub></italic>) <xref ref-type="table-fn" rid="tfn9-ijms-12-08449">b</xref></th>
<th align="center" valign="bottom"><italic>P</italic> (%)</th>
<th align="center" valign="bottom"><italic>n</italic><italic><sub>p</sub></italic> <xref ref-type="table-fn" rid="tfn8-ijms-12-08449">a</xref></th>
<th align="center" valign="bottom"><italic>H</italic><sub>2</sub>(d<italic><sub>i</sub></italic>) <xref ref-type="table-fn" rid="tfn9-ijms-12-08449">b</xref></th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">0.00</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">51.14</td>
<td align="center" valign="top">47.0</td>
<td align="center" valign="top">88</td>
<td align="center" valign="top">14.18</td></tr>
<tr>
<td align="center" valign="top">0.40</td>
<td align="center" valign="top">17</td>
<td align="center" valign="top">60.63</td>
<td align="center" valign="top">49.0</td>
<td align="center" valign="top">90</td>
<td align="center" valign="top">7.79</td></tr>
<tr>
<td align="center" valign="top">0.60</td>
<td align="center" valign="top">18</td>
<td align="center" valign="top">53.46</td>
<td align="center" valign="top">51.0</td>
<td align="center" valign="top">92</td>
<td align="center" valign="top">1.01</td></tr>
<tr>
<td align="center" valign="top">1.00</td>
<td align="center" valign="top">21</td>
<td align="center" valign="top">55.51</td>
<td align="center" valign="top">53.0</td>
<td align="center" valign="top">98</td>
<td align="center" valign="top">−8.13</td></tr>
<tr>
<td align="center" valign="top">3.00</td>
<td align="center" valign="top">24</td>
<td align="center" valign="top">60.15</td>
<td align="center" valign="top">55.0</td>
<td align="center" valign="top">99</td>
<td align="center" valign="top">−11.95</td></tr>
<tr>
<td align="center" valign="top">5.00</td>
<td align="center" valign="top">30</td>
<td align="center" valign="top">62.87</td>
<td align="center" valign="top">57.0</td>
<td align="center" valign="top">102</td>
<td align="center" valign="top">−12.64</td></tr>
<tr>
<td align="center" valign="top">7.00</td>
<td align="center" valign="top">30</td>
<td align="center" valign="top">62.87</td>
<td align="center" valign="top">59.0</td>
<td align="center" valign="top">103</td>
<td align="center" valign="top">−17.51</td></tr>
<tr>
<td align="center" valign="top">9.00</td>
<td align="center" valign="top">33</td>
<td align="center" valign="top">61.17</td>
<td align="center" valign="top">61.0</td>
<td align="center" valign="top">104</td>
<td align="center" valign="top">−16.08</td></tr>
<tr>
<td align="center" valign="top">11.0</td>
<td align="center" valign="top">39</td>
<td align="center" valign="top">64.58</td>
<td align="center" valign="top">63.0</td>
<td align="center" valign="top">106</td>
<td align="center" valign="top">−12.87</td></tr>
<tr>
<td align="center" valign="top">13.0</td>
<td align="center" valign="top">42</td>
<td align="center" valign="top">66.37</td>
<td align="center" valign="top">65.0</td>
<td align="center" valign="top">109</td>
<td align="center" valign="top">−15.63</td></tr>
<tr>
<td align="center" valign="top">15.0</td>
<td align="center" valign="top">44</td>
<td align="center" valign="top">72.18</td>
<td align="center" valign="top">67.0</td>
<td align="center" valign="top">111</td>
<td align="center" valign="top">−19.49</td></tr>
<tr>
<td align="center" valign="top">17.0</td>
<td align="center" valign="top">48</td>
<td align="center" valign="top">62.50</td>
<td align="center" valign="top">69.0</td>
<td align="center" valign="top">111</td>
<td align="center" valign="top">−19.49</td></tr>
<tr>
<td align="center" valign="top">19.0</td>
<td align="center" valign="top">50</td>
<td align="center" valign="top">60.74</td>
<td align="center" valign="top">71.0</td>
<td align="center" valign="top">114</td>
<td align="center" valign="top">−19.64</td></tr>
<tr>
<td align="center" valign="top">21.0</td>
<td align="center" valign="top">54</td>
<td align="center" valign="top">46.35</td>
<td align="center" valign="top">73.0</td>
<td align="center" valign="top">114</td>
<td align="center" valign="top">−19.64</td></tr>
<tr>
<td align="center" valign="top">23.0</td>
<td align="center" valign="top">57</td>
<td align="center" valign="top">45.69</td>
<td align="center" valign="top">75.0</td>
<td align="center" valign="top">114</td>
<td align="center" valign="top">−19.64</td></tr>
<tr>
<td align="center" valign="top">25.0</td>
<td align="center" valign="top">63</td>
<td align="center" valign="top">41.04</td>
<td align="center" valign="top">77.0</td>
<td align="center" valign="top">117</td>
<td align="center" valign="top">−19.73</td></tr>
<tr>
<td align="center" valign="top">27.0</td>
<td align="center" valign="top">64</td>
<td align="center" valign="top">41.57</td>
<td align="center" valign="top">79.0</td>
<td align="center" valign="top">119</td>
<td align="center" valign="top">−22.16</td></tr>
<tr>
<td align="center" valign="top">29.0</td>
<td align="center" valign="top">66</td>
<td align="center" valign="top">33.47</td>
<td align="center" valign="top">81.0</td>
<td align="center" valign="top">120</td>
<td align="center" valign="top">−20.88</td></tr>
<tr>
<td align="center" valign="top">31.0</td>
<td align="center" valign="top">68</td>
<td align="center" valign="top">32.77</td>
<td align="center" valign="top">83.0</td>
<td align="center" valign="top">121</td>
<td align="center" valign="top">−18.87</td></tr>
<tr>
<td align="center" valign="top">33.0</td>
<td align="center" valign="top">70</td>
<td align="center" valign="top">32.65</td>
<td align="center" valign="top">85.0</td>
<td align="center" valign="top">121</td>
<td align="center" valign="top">−18.87</td></tr>
<tr>
<td align="center" valign="top">35.0</td>
<td align="center" valign="top">73</td>
<td align="center" valign="top">37.21</td>
<td align="center" valign="top">87.0</td>
<td align="center" valign="top">123</td>
<td align="center" valign="top">−24.80</td></tr>
<tr>
<td align="center" valign="top">37.0</td>
<td align="center" valign="top">76</td>
<td align="center" valign="top">30.78</td>
<td align="center" valign="top">89.0</td>
<td align="center" valign="top">124</td>
<td align="center" valign="top">−28.33</td></tr>
<tr>
<td align="center" valign="top">39.0</td>
<td align="center" valign="top">77</td>
<td align="center" valign="top">30.14</td>
<td align="center" valign="top">91.0</td>
<td align="center" valign="top">125</td>
<td align="center" valign="top">−29.56</td></tr>
<tr>
<td align="center" valign="top">41.0</td>
<td align="center" valign="top">81</td>
<td align="center" valign="top">25.23</td>
<td align="center" valign="top">93.0</td>
<td align="center" valign="top">125</td>
<td align="center" valign="top">−29.56</td></tr>
<tr>
<td align="center" valign="top">43.0</td>
<td align="center" valign="top">82</td>
<td align="center" valign="top">26.02</td>
<td align="center" valign="top">95.0</td>
<td align="center" valign="top">125</td>
<td align="center" valign="top">−29.56</td></tr>
<tr>
<td align="center" valign="top">45.0</td>
<td align="center" valign="top">86</td>
<td align="center" valign="top">17.47</td>
<td align="center" valign="top">97.0</td>
<td align="center" valign="top">125</td>
<td align="center" valign="top">−29.56</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn8-ijms-12-08449">
<label>a</label>
<p>Number of residues collected in the space defined by <italic>p</italic> (%);</p></fn><fn id="tfn9-ijms-12-08449">
<label>b</label>
<p>The second-order hydrophobic moment per residue, calculated according <xref rid="FD10" ref-type="disp-formula">Equation (10)</xref>.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="t5-ijms-12-08449" position="float">
<label>Table 5</label>
<caption>
<p>The Pearson correlation coefficient between two data sets: the spatial distribution of the second-order hydrophobic moment calculated with the consensus scale and that calculated with <xref rid="FD8" ref-type="disp-formula">Equation (8)</xref> derived from the CHARMM partial atomic charges.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom">PDB code</th>
<th colspan="3" align="center" valign="bottom">Correlation coefficient (R) <xref ref-type="table-fn" rid="tfn10-ijms-12-08449">a</xref>
<hr/></th></tr>
<tr>
<th align="center" valign="top"/>
<th align="center" valign="bottom">|<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.25 <xref ref-type="table-fn" rid="tfn11-ijms-12-08449">b</xref></th>
<th align="center" valign="bottom">|<italic>q</italic><italic><sub>i</sub></italic>|≥ 0.27 <xref ref-type="table-fn" rid="tfn12-ijms-12-08449">c</xref></th>
<th align="center" valign="bottom">|<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.3 <xref ref-type="table-fn" rid="tfn13-ijms-12-08449">d</xref></th></tr></thead>
<tbody>
<tr>
<td colspan="4" align="left" valign="top">I</td></tr>
<tr>
<td align="left" valign="top">3PYP</td>
<td align="center" valign="top">0.89</td>
<td align="center" valign="top">0.89</td>
<td align="center" valign="top">0.94</td></tr>
<tr>
<td align="left" valign="top">1AKO</td>
<td align="center" valign="top">0.78</td>
<td align="center" valign="top">0.77</td>
<td align="center" valign="top">0.87</td></tr>
<tr>
<td align="left" valign="top">2CTC</td>
<td align="center" valign="top">0.86</td>
<td align="center" valign="top">0.87</td>
<td align="center" valign="top">0.91</td></tr>
<tr>
<td align="left" valign="top">1CEM</td>
<td align="center" valign="top">0.85</td>
<td align="center" valign="top">0.86</td>
<td align="center" valign="top">0.92</td></tr>
<tr>
<td align="left" valign="top">1E70</td>
<td align="center" valign="top">0.88</td>
<td align="center" valign="top">0.87</td>
<td align="center" valign="top">0.91</td></tr>
<tr>
<td colspan="4" align="left" valign="top">II</td></tr>
<tr>
<td align="left" valign="top">3LTZ</td>
<td align="center" valign="top">0.63</td>
<td align="center" valign="top">0.63</td>
<td align="center" valign="top">0.85</td></tr>
<tr>
<td align="left" valign="top">1DCIA</td>
<td align="center" valign="top">0.78</td>
<td align="center" valign="top">0.79</td>
<td align="center" valign="top">0.82</td></tr>
<tr>
<td align="left" valign="top">1SBP</td>
<td align="center" valign="top">0.70</td>
<td align="center" valign="top">0.70</td>
<td align="center" valign="top">0.86</td></tr>
<tr>
<td align="left" valign="top">3MBP</td>
<td align="center" valign="top">0.59</td>
<td align="center" valign="top">0.59</td>
<td align="center" valign="top">0.82</td></tr>
<tr>
<td align="left" valign="top">1JAE</td>
<td align="center" valign="top">0.86</td>
<td align="center" valign="top">0.87</td>
<td align="center" valign="top">0.91</td></tr>
<tr>
<td colspan="4" align="left" valign="top">III</td></tr>
<tr>
<td align="left" valign="top">1CPQ</td>
<td align="center" valign="top">0.81</td>
<td align="center" valign="top">0.81</td>
<td align="center" valign="top">0.96</td></tr>
<tr>
<td align="left" valign="top">1AMF</td>
<td align="center" valign="top">0.70</td>
<td align="center" valign="top">0.70</td>
<td align="center" valign="top">0.88</td></tr>
<tr>
<td align="left" valign="top">1ABE</td>
<td align="center" valign="top">0.72</td>
<td align="center" valign="top">0.74</td>
<td align="center" valign="top">0.81</td></tr>
<tr>
<td align="left" valign="top">2ABH</td>
<td align="center" valign="top">0.62</td>
<td align="center" valign="top">0.62</td>
<td align="center" valign="top">0.77</td></tr>
<tr>
<td align="left" valign="top">1LAM</td>
<td align="center" valign="top">0.87</td>
<td align="center" valign="top">0.87</td>
<td align="center" valign="top">0.90</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn10-ijms-12-08449">
<label>a</label>
<p>The Pearson correlation coefficient, calculated according to <xref rid="FD11" ref-type="disp-formula">Equation (11)</xref>;</p></fn><fn id="tfn11-ijms-12-08449">
<label>b</label>
<p>The values of the coefficient calculated using the CHARMM partial atomic charges, |<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.25, assigned for polar atoms in the protein structures;</p></fn><fn id="tfn12-ijms-12-08449">
<label>c</label>
<p>The values of the coefficient calculated using the CHARMM partial atomic charges, |<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.27, assigned for polar atoms in the protein structures;</p></fn><fn id="tfn13-ijms-12-08449">
<label>d</label>
<p>The values of the coefficient calculated using the CHARMM partial atomic charges, |<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.3, assigned for polar atoms in the protein structures.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="t6-ijms-12-08449" position="float">
<label>Table 6</label>
<caption>
<p>The results of the spatial distributions of the second-order hydrophobic moment at <italic>p</italic>_ (%) or at <italic>d̄</italic>_ (Å).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom"><bold>PDB code</bold></th>
<th colspan="2" align="center" valign="bottom"><italic>p</italic>_(%)<xref ref-type="table-fn" rid="tfn14-ijms-12-08449">a</xref></th>
<th colspan="2" align="center" valign="bottom"><italic>d̄</italic> _ <bold>(Å)</bold> <xref ref-type="table-fn" rid="tfn15-ijms-12-08449">b</xref></th></tr>
<tr>
<th align="left" valign="bottom"/>
<th colspan="2" align="left" valign="bottom">
<hr/></th>
<th colspan="2" align="left" valign="bottom">
<hr/></th></tr>
<tr>
<th align="left" valign="bottom"/>
<th align="center" valign="bottom">|<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.3 <xref ref-type="table-fn" rid="tfn16-ijms-12-08449">c</xref></th>
<th align="center" valign="bottom"><bold>C scale</bold> <xref ref-type="table-fn" rid="tfn17-ijms-12-08449">d</xref></th>
<th align="center" valign="bottom">|<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.3 <xref ref-type="table-fn" rid="tfn18-ijms-12-08449">e</xref></th>
<th align="center" valign="bottom"><bold>C scale</bold> <xref ref-type="table-fn" rid="tfn19-ijms-12-08449">f</xref></th></tr></thead>
<tbody>
<tr>
<td colspan="5" align="left" valign="top">I</td></tr>
<tr>
<td align="left" valign="top">3PYP</td>
<td align="center" valign="top">49</td>
<td align="center" valign="top">53</td>
<td align="center" valign="top">11.45</td>
<td align="center" valign="top">11.79</td></tr>
<tr>
<td align="left" valign="top">1AKO</td>
<td align="center" valign="top">59</td>
<td align="center" valign="top">53</td>
<td align="center" valign="top">15.79</td>
<td align="center" valign="top">15.45</td></tr>
<tr>
<td align="left" valign="top">2CTC</td>
<td align="center" valign="top">47</td>
<td align="center" valign="top">49</td>
<td align="center" valign="top">16.14</td>
<td align="center" valign="top">16.24</td></tr>
<tr>
<td align="left" valign="top">1CEM</td>
<td align="center" valign="top">41</td>
<td align="center" valign="top">49</td>
<td align="center" valign="top">16.80</td>
<td align="center" valign="top">17.10</td></tr>
<tr>
<td align="left" valign="top">1E70</td>
<td align="center" valign="top">33</td>
<td align="center" valign="top">51</td>
<td align="center" valign="top">18.52</td>
<td align="center" valign="top">19.57</td></tr>
<tr>
<td colspan="5" align="left" valign="top">II</td></tr>
<tr>
<td align="left" valign="top">3LTZ</td>
<td align="center" valign="top">39</td>
<td align="center" valign="top">55</td>
<td align="center" valign="top">11.56</td>
<td align="center" valign="top">12.16</td></tr>
<tr>
<td align="left" valign="top">1DCIA</td>
<td align="center" valign="top">49</td>
<td align="center" valign="top">57</td>
<td align="center" valign="top">16.66</td>
<td align="center" valign="top">17.33</td></tr>
<tr>
<td align="left" valign="top">1SBP</td>
<td align="center" valign="top">61</td>
<td align="center" valign="top">61</td>
<td align="center" valign="top">17.43</td>
<td align="center" valign="top">17.43</td></tr>
<tr>
<td align="left" valign="top">3MBP</td>
<td align="center" valign="top">51</td>
<td align="center" valign="top">61</td>
<td align="center" valign="top">18.72</td>
<td align="center" valign="top">19.07</td></tr>
<tr>
<td align="left" valign="top">1JAE</td>
<td align="center" valign="top">49</td>
<td align="center" valign="top">53</td>
<td align="center" valign="top">20.56</td>
<td align="center" valign="top">20.65</td></tr>
<tr>
<td colspan="5" align="left" valign="top">III</td></tr>
<tr>
<td align="left" valign="top">1CPQ</td>
<td align="center" valign="top">53</td>
<td align="center" valign="top">81</td>
<td align="center" valign="top">13.92</td>
<td align="center" valign="top">14.70</td></tr>
<tr>
<td align="left" valign="top">1AMF</td>
<td align="center" valign="top">49</td>
<td align="center" valign="top">59</td>
<td align="center" valign="top">15.49</td>
<td align="center" valign="top">15.77</td></tr>
<tr>
<td align="left" valign="top">1ABE</td>
<td align="center" valign="top">41</td>
<td align="center" valign="top">71</td>
<td align="center" valign="top">17.23</td>
<td align="center" valign="top">18.39</td></tr>
<tr>
<td align="left" valign="top">2ABH</td>
<td align="center" valign="top">37</td>
<td align="center" valign="top">61</td>
<td align="center" valign="top">17.00</td>
<td align="center" valign="top">17.95</td></tr>
<tr>
<td align="left" valign="top">1LAM</td>
<td align="center" valign="top">43</td>
<td align="center" valign="top">55</td>
<td align="center" valign="top">20.67</td>
<td align="center" valign="top">21.20</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn14-ijms-12-08449">
<label>a</label>
<p>The space defined by the percentage of solvent accessibility, <italic>p</italic>_(%), at which the spatial distribution of the second-order moment vanishes and approaches a negative value;</p></fn><fn id="tfn15-ijms-12-08449">
<label>b</label>
<p>The average distance from the protein centroid to the centroids of the amino-acid residues in the same space defined by (%) <italic>p</italic>_ as shown in footnote (a);</p></fn><fn id="tfn16-ijms-12-08449">
<label>c</label>
<p>The values of <italic>p</italic>_(%), calculated using <italic>h</italic><italic><sub>i</sub></italic>′ [<xref rid="FD9" ref-type="disp-formula">Equation (9)</xref>] derived from the CHARMM partial atomic charges, |<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.3, assigned for polar atoms in the protein structures.</p></fn><fn id="tfn17-ijms-12-08449">
<label>d</label>
<p>The values of <italic>p</italic>_ (%), calculated using the consensus scales.</p></fn><fn id="tfn18-ijms-12-08449">
<label>e</label>
<p>The values of <italic>d̄</italic>_ (Å), calculated using <italic>h</italic><italic><sub>i</sub></italic>′ [ <xref rid="FD9" ref-type="disp-formula">Equation (9)</xref>] derived from the CHARMM partial atomic charges, |<italic>q</italic><italic><sub>i</sub></italic>| ≥ 0.3, assigned for polar atoms in the protein structures.</p></fn><fn id="tfn19-ijms-12-08449">
<label>f</label>
<p>The values of <italic>d̄</italic>_ (Å), calculated using the consensus scales.</p></fn></table-wrap-foot></table-wrap></sec></back></article>
