<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">ijms</journal-id>
<journal-title>International Journal of Molecular Sciences</journal-title>
<abbrev-journal-title>Int. J. Mol. Sci.</abbrev-journal-title>
<issn pub-type="epub">1422-0067</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/ijms13067354</article-id>
<article-id pub-id-type="publisher-id">ijms-13-07354</article-id>
<article-categories>
<subj-group>
<subject>Article</subject></subj-group></article-categories>
<title-group>
<article-title>Solution NMR Structure of Hypothetical Protein CV_2116 Encoded by a Viral Prophage Element in <italic>Chromobacterium violaceum</italic></article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Yang</surname><given-names>Yunhuang</given-names></name><xref ref-type="aff" rid="af1-ijms-13-07354">1</xref></contrib>
<contrib contrib-type="author">
<name><surname>Ramelot</surname><given-names>Theresa A.</given-names></name><xref ref-type="aff" rid="af1-ijms-13-07354">1</xref></contrib>
<contrib contrib-type="author">
<name><surname>Cort</surname><given-names>John R.</given-names></name><xref ref-type="aff" rid="af2-ijms-13-07354">2</xref></contrib>
<contrib contrib-type="author">
<name><surname>Garcia</surname><given-names>Maite</given-names></name><xref ref-type="aff" rid="af3-ijms-13-07354">3</xref></contrib>
<contrib contrib-type="author">
<name><surname>Yee</surname><given-names>Adelinda</given-names></name><xref ref-type="aff" rid="af3-ijms-13-07354">3</xref></contrib>
<contrib contrib-type="author">
<name><surname>Arrowsmith</surname><given-names>Cheryl H.</given-names></name><xref ref-type="aff" rid="af3-ijms-13-07354">3</xref></contrib>
<contrib contrib-type="author">
<name><surname>Kennedy</surname><given-names>Michael A.</given-names></name><xref ref-type="aff" rid="af1-ijms-13-07354">1</xref><xref ref-type="corresp" rid="c1-ijms-13-07354">*</xref></contrib></contrib-group>
<aff id="af1-ijms-13-07354">
<label>1</label>Department of Chemistry and Biochemistry and the Northeast Structural Genomics Consortium, Miami University, Oxford, OH 45056, USA; E-Mails: <email>yangy9@muohio.edu</email> (Y.Y.); <email>ramelota@muohio.edu</email> (T.A.R.)</aff>
<aff id="af2-ijms-13-07354">
<label>2</label>Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA; E-Mail: <email>john.cort@pnl.gov</email></aff>
<aff id="af3-ijms-13-07354">
<label>3</label>Campbell Family Cancer Research Institute, Ontario Cancer Institute and the Northeast Structural Genomics Consortium, University Health Network, Toronto, ON, M5G 2C4, Canada; E-Mails: <email>mgarcia@uhnres.utoronto.ca</email> (M.G.); <email>aayee@uhnres.utoronto.ca</email> (A.Y.); <email>carrow@uhnres.utoronto.ca</email> (C.H.A.)</aff>
<author-notes>
<corresp id="c1-ijms-13-07354">
<label>*</label>Author to whom correspondence should be addressed; E-Mail: <email>kennedm4@muohio.edu</email>; Tel.: +1-513-529-8267; Fax: +1-513-529-5715.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2012</year></pub-date>
<pub-date pub-type="epub">
<day>14</day>
<month>06</month>
<year>2012</year></pub-date>
<volume>13</volume>
<issue>6</issue>
<fpage>7354</fpage>
<lpage>7364</lpage>
<history>
<date date-type="received">
<day>20</day>
<month>04</month>
<year>2012</year></date>
<date date-type="rev-recd">
<day>25</day>
<month>05</month>
<year>2012</year></date>
<date date-type="accepted">
<day>04</day>
<month>06</month>
<year>2012</year></date></history>
<permissions>
<copyright-statement>© 2012 by the authors; licensee Molecular Diversity Preservation International, Basel, Switzerland.</copyright-statement>
<copyright-year>2012</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0">
<p>This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>CV_2116 is a small hypothetical protein of 82 amino acids from the Gram-negative coccobacillus <italic>Chromobacterium violaceum</italic>. A PSI-BLAST search using the CV_2116 sequence as a query identified only one hit (E = 2e<sup>−07</sup>) corresponding to a hypothetical protein OR16_04617 from <italic>Cupriavidus basilensis</italic> OR16, which failed to provide insight into the function of CV_2116. The <italic>CV_2116</italic> gene was cloned into the p15TvLic expression plasmid, transformed into <italic>E. coli</italic>, and <sup>13</sup>C- and <sup>15</sup>N-labeled NMR samples of CV_2116 were overexpressed in <italic>E. coli</italic> and purified for structure determination using NMR spectroscopy. The resulting high-quality solution NMR structure of CV_2116 revealed a novel α + β fold containing two anti-parallel β-sheets in the <italic>N</italic>-terminal two-thirds of the protein and one α-helix in the <italic>C</italic>-terminal third of the protein. CV_2116 does not belong to any known protein sequence family and a Dali search indicated that no similar structures exist in the protein data bank. Although no function of CV_2116 could be derived from either sequence or structural similarity searches, the neighboring genes of <italic>CV_2116</italic> encode various proteins annotated as similar to bacteriophage tail assembly proteins. Interestingly, <italic>C. violaceum</italic> exhibits an extensive network of bacteriophage tail-like structures that likely result from lateral gene transfer by incorporation of viral DNA into its genome (prophages) due to bacteriophage infection. Indeed, <italic>C. violaceum</italic> has been shown to contain four prophage elements and <italic>CV_2116</italic> resides in the fourth of these elements. Analysis of the putative operon in which CV_2116 resides indicates that CV_2116 might be a component of the bacteriophage tail-like assembly that occurs in <italic>C. violaceum</italic>.</p></abstract>
<kwd-group>
<kwd>CV_2116</kwd>
<kwd><italic>Chromobacterium violaceum</italic></kwd>
<kwd>NMR</kwd>
<kwd>solution structure</kwd>
<kwd>structural genomics</kwd>
<kwd>bacteriophage tail assembly</kwd>
<kwd>prophage</kwd>
<kwd>lateral gene transfer</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<title>1. Introduction</title>
<p><italic>Chromobacterium violaceum</italic> is a Gram-negative non-sporulating coccobacillus. It produces a natural antibiotic called violacein, which may be useful for treatment of colon and other cancers [<xref ref-type="bibr" rid="b1-ijms-13-07354">1</xref>]. The complete genome of <italic>C. violaceum</italic>, which was published in 2003 [<xref ref-type="bibr" rid="b2-ijms-13-07354">2</xref>], contains four identified integrated viral genomes (prophages) [<xref ref-type="bibr" rid="b3-ijms-13-07354">3</xref>]. Among these prophages, CvP4 is composed of 51 ORFs spanning <italic>CV2114</italic> to <italic>CV2150</italic>, including <italic>CV_2116</italic>, which encodes a hypothetical protein of 82 residues (UniProtKB/TrEMBL ID Q7NW74_CHRVO). To date, CV_2116 does not belong to any known protein families (Pfam 26.0, November, 2011) and is simply annotated as a putative uncharacterized protein [<xref ref-type="bibr" rid="b4-ijms-13-07354">4</xref>]. This hypothetical protein was selected for structure determination as a target by the Northeast Structural Genomics (NESG) consortium (NESG Target ID CvT4). A goal of the United States National Institutes of Health’s Protein Structure Initiative (PSI) during the 10-year period spanning the PSI-1 and PSI-2 phases was to establish an understanding of all protein sequence/protein fold relationships that occur in nature with an emphasis on determining at least one experimental protein structure for every identified protein sequence family. However, not all proteins predicted by genome sequencing efforts belong to existing sequence families. Such proteins are referred to as “singletons”, which is currently the case for CV_2116. Therefore, despite not belonging to an existing protein family, knowledge of the three-dimensional structure of the hypothetical CV_2116 protein helps to fill in a missing part of the sequence-protein fold space that occurs in nature.</p></sec>
<sec sec-type="results|discussion">
<title>2. Results and Discussion</title>
<p>A sequence similarity search against CV_2116 using PSI-BLAST identified only one hit, a hypothetical protein OR16_04617 from <italic>Cupriavidus basilensis</italic> OR16 (E-value of 2e<sup>−07</sup>) with sequence identity of 43% over all 82 residues. Likewise, a PSI-BLAST search using OR16_04617 as the query identified CV_2116 as the only significant sequence match. <xref ref-type="fig" rid="f1-ijms-13-07354">Figure 1</xref> shows the sequence alignment of CV_2116 to OR16_04617. The other two proteins identified by PSI-BLAST had E-values greater than 1.0, including GFO_0138 (E-value of 1.3) from <italic>Gramella forsetti</italic> (UniProtKB/TrEMBL ID A0LXN2_GRAFK) [<xref ref-type="bibr" rid="b5-ijms-13-07354">5</xref>] with sequence identity of 33% over 60 residues, and Ribonuclease HII (E-value of 6.3) from <italic>Bacillus coagulans</italic> (UniProtKB/TrEMBL ID G2TJM8_BACCO) [<xref ref-type="bibr" rid="b6-ijms-13-07354">6</xref>] with sequence identity of 36% over 42 resides. The low sequence similarity to other proteins indicated that CV_2116 is not a highly conserved protein domain, and the single PSI-BLAST hit to OR16_04617 failed to provide any insight into the function of CV_2116.</p>
<p>A high quality solution structure of CV_2116 was determined by NMR spectroscopy. CV_2116 exhibited an excellent <sup>1</sup>H-<sup>15</sup>N-HSQC spectrum (<xref ref-type="fig" rid="f2-ijms-13-07354">Figure 2A</xref>) indicative of a well-behaved and structured protein in solution. The rotational correlation time of CV_2116 was determined from average <sup>15</sup>N relaxation times extracted from 1D <sup>15</sup>N-edited <italic>T</italic><sub>1</sub> and <italic>T</italic><sub>2</sub> experiments of the NC sample recorded on a 600 MHz spectrometer at 298 K. The overall isotropic rotational correlation time (τ<sub>c</sub>) of 6.2 ± 0.5 ns derived from <italic>T</italic><sub>1</sub> and <italic>T</italic><sub>2</sub> measurements confirmed its monomeric state in solution based on a curve of molecular weight versus τ<sub>c</sub> from other proteins characterized in our laboratory with molecular weights under 20 kDa as shown in <xref ref-type="fig" rid="f2-ijms-13-07354">Figure 2B</xref> [<xref ref-type="bibr" rid="b8-ijms-13-07354">8</xref>].</p>
<p><xref ref-type="fig" rid="f3-ijms-13-07354">Figure 3A</xref> shows the stereoview of the superimposition of the 20 lowest energy structures of CV_2116. The secondary structural elements are locally and globally well defined. CV_2116 contains two anti-parallel β sheets in the <italic>N</italic>-terminal two-thirds of the protein and one α helix in the <italic>C</italic>-terminal third of the protein (<xref ref-type="fig" rid="f3-ijms-13-07354">Figure 3B</xref>). The first β sheet is composed of four β-strands (β1, A4-Y6; β2, Y9-E12; β5, L29-K32; β6, I41-T44) in the order of β1-β2-β5-β6. The short β6 β-strand is present, as predicted by PROCHECK 3.5.4 [<xref ref-type="bibr" rid="b9-ijms-13-07354">9</xref>,<xref ref-type="bibr" rid="b10-ijms-13-07354">10</xref>], in 13 out of 20 structures in the final NMR ensemble.</p>
<p>Two β-strands (β3, H15-R18 and β4, K23-P26) compose the second anti-parallel β sheet. The 21-residue helix (E52-D72) in the <italic>C</italic>-terminal third of the protein flanks the two anti-parallel β sheets. Analysis of the NOEs and tertiary structure of CV_2116 revealed that Y6 from β1, Y9 from β2, I30 from β5, T44 from β6, and A63, I66, and I67 from α-helix constitute the hydrophobic core contacts to the first β sheet, whereas few NOEs between Y24 from β4 and A55 from the α-helix were observed to the second β sheet. Two longer loops, one between β5 and β6 (V33-P40) and one between β6 and α-helix (E52-D72), and the <italic>C</italic>-terminal tail (R73-G82) were flexible and disordered in the NMR structure. Structural statistics for CV_2116 are presented in <xref ref-type="table" rid="t1-ijms-13-07354">Table 1</xref>.</p>
<p>CV_2116 adopted an apparently novel fold without any current matches in the CATH [<xref ref-type="bibr" rid="b11-ijms-13-07354">11</xref>] and SCOP [<xref ref-type="bibr" rid="b12-ijms-13-07354">12</xref>] structural classification databases. Comparison of the three-dimensional structure of CV_2116 using Dali (v.3) [<xref ref-type="bibr" rid="b13-ijms-13-07354">13</xref>] revealed little similarity to any known protein structures in the PDB. Three proteins, peptide chain release factor (PDB ID: 3D5C-X), UDP-<italic>N</italic>-acetylmuramate-alanine ligase (PDB ID: 1P31-A), and hyponastic leave 1 (PDB ID: 2L2N-A), had the highest <italic>Z</italic> scores of 3.7, despite the fact that they had low sequence identities of 8%, 8% and 5%, respectively. No biochemical or biological functions could be derived from the structural comparisons with the known structures in the PDB.</p>
<p>The electrostatic surface diagram of CV_2116 is shown in <xref ref-type="fig" rid="f3-ijms-13-07354">Figure 3C</xref>. Positive and negative regions were observed on one side of the protein while one negative region was present on the opposite side. So far, no potential functions of these regions have been identified.</p>
<p>Although the function of the hypothetical CV_2116 protein could not be predicted by either sequence or structure similarity searches, the <italic>CV_2116</italic> gene from <italic>C. violaceum</italic> is sandwiched between several neighboring genes that encode for apparent bacteriophage tail-like assembly proteins. In fact, <italic>C. violaceum</italic> has been observed to exhibit an unusual and extensive organization of bacteriophage tail-like particles [<xref ref-type="bibr" rid="b14-ijms-13-07354">14</xref>]. The locus two genes before <italic>CV_2116</italic>, namely <italic>CV_2114,</italic> encodes a probable pyocin R2_PP tail fiber protein. R-type pyocins are a component of the non-flexible tail structure of bacteriophage [<xref ref-type="bibr" rid="b15-ijms-13-07354">15</xref>] The gene immediately preceding <italic>CV_2116</italic>, namely <italic>CV_2115</italic>, encodes a protein homologous to other bacteriophage J tail proteins belonging to the protein family PF04865, which are involved in baseplate assembly. Among the genes immediately following <italic>CV_2116</italic>, <italic>CV_2117</italic> encodes a protein homologous to other bacteriophage V proteins (PF04717), which is also involved in baseplate assembly [<xref ref-type="bibr" rid="b16-ijms-13-07354">16</xref>] and <italic>CV_2118</italic> is homologous to several members of PF05954, which are phage late control gene D proteins. Hence, it is reasonable to speculate that CV_2116 is a component of a bacteriophage-like tail assembly specific to <italic>C. violaceum</italic>.</p>
<p>Previous analysis of the <italic>C. violaceum</italic> genome has shown that it contains four prophages denoted CvP1 through CvP4 [<xref ref-type="bibr" rid="b3-ijms-13-07354">3</xref>]. The latter, CvP4, is composed of 51 ORFs (<italic>CV_2114 to CV_2150</italic>) and is thought to be a chimeric prophage containing ORFs with similarity to different phages. Specifically, it has been suggested that the tail and tail fiber genes, which include <italic>CV_2114-CV_2118</italic>, are similar to P2-like phage genes [<xref ref-type="bibr" rid="b14-ijms-13-07354">14</xref>], and in particular, that these genes exhibit similarity to those found in a P2-like prophage of <italic>Salmonella enterica</italic> subsp. <italic>enterica</italic> serovar Typhi, which are the so-called CT18 phages [<xref ref-type="bibr" rid="b14-ijms-13-07354">14</xref>,<xref ref-type="bibr" rid="b17-ijms-13-07354">17</xref>,<xref ref-type="bibr" rid="b18-ijms-13-07354">18</xref>]. Collectively, it has been speculated that CvP4 was responsible for introducing a number of genes from <italic>S. enterica</italic> into the <italic>C. violaceum</italic> genome. A PSI-BLAST search of CV_2117 indicated ~100 hits to proteins with &gt;30% identity, including NP_456044 from <italic>S. enterica</italic> subsp. <italic>enterica</italic> serovar Typhi str. CT18, which had 41% identity with 96% coverage and E = 9e<sup>−40</sup>. Examination of the genes surrounding NP_456044 indicated a putative operon containing 10 consecutive genes spanning NP_456044 to NP_456053, all encoding proteins involved in bacteriophage tail-like assembly. Examination of the strongest hit PSI-BLAST hit (E = 1e<sup>−105</sup>, 72% sequence identity) against CV_2117, namely YP_003745638 from <italic>Ralstonia solanacearum</italic> CFBP2957, which is an aerobic non-sporing, Gram-negative plant pathogenic bacterium, also revealed a putative operon with a collection of genes spanning YP_003745640 to YP_003745645 that are all annotated as bacteriophage tail-like assembly proteins. These comparisons further support the speculation that CV_2116 is likely a novel gene of prophage origin that plays a structural or functional role in bacteriophage tail-like assembly in <italic>C. violaceum.</italic></p></sec>
<sec>
<title>3. Experimental Section</title>
<sec>
<title>3.1. Preparation of Protein Samples</title>
<p>The full-length gene for CV_2116 from <italic>C. violaceum</italic> was cloned into the expression plasmid p15TvLic including an additional <italic>N</italic>-terminal hexa-histidine tag and TEV cleavage site (22 residues, MGSSHHHHHHSSGRENLYFQGH), and the plasmid was transformed into <italic>E. coli</italic> BL21 (DE3)-RIPL (Stratagene). <italic>E. coli</italic> cells were grown in 0.5 L of 2 × M9 minimal medium containing <sup>15</sup>NH<sub>4</sub>Cl and <sup>13</sup>C-glucose as the sole nitrogen and carbon sources, and supplemented with ZnSO<sub>4</sub>, thiamine, and biotin. Cells were grown at 37 °C to an OD<sub>600</sub> of 1.0, followed by addition of 1 mM isopropyl β-<sc>d</sc>-thiogalactoside. The temperature was reduced to 15 °C, and cells grew overnight before harvesting. Cell pellets were lysed by sonication in a buffer containing 500 mM NaCl, 20 mM Tris and 5 mM imidazole (pH 8.0). The proteins were isolated from the lysate by nickel affinity chromatography (Qiagen) after washing the beads three times with five column volumes of 500 mM NaCl, 20 mM Tris (pH 8.0), 30 mM imidazole. Protein was eluted with five column volumes of 500 mM imidazole added to the washing buffer. For NMR structure determination, the hexahistidine tag of CV_2116 was cleaved with TEV protease and the mixture passed through a nickel affinity column. The purified <sup>13</sup>C/<sup>15</sup>N labeled protein (referred to as the NC sample) was concentrated, and buffer was exchanged by ultrafiltration and dilution/reconcentration into the NMR buffer containing 10 mM Tris (pH 7.0), 300 mM NaCl, 10 mM DTT, 1 mM benzamidine, 0.01% NaN<sub>3</sub>, 1× inhibitor cocktail (Roche Applied Science), 95% H<sub>2</sub>O/5% D<sub>2</sub>O. Following the same procedure, a uniformly 100% <sup>15</sup>N-labeled and 7% biosynthetically directed <sup>13</sup>C (NC7) CV_2116 sample was also prepared for stereospecific assignments of isopropyl methyl groups of Val and Leu residues.</p></sec>
<sec sec-type="methods">
<title>3.2. NMR Spectroscopy and Data Collection</title>
<p>NMR spectra were acquired on 250 μL NC and NC7 samples in 5 mm Shigemi NMR tubes at 293 K on Varian Inova 600 and 750 MHz NMR spectrometers. Backbone and sidechain resonance assignments were obtained from the analysis of <sup>15</sup>N-HSQC (<xref ref-type="fig" rid="f3-ijms-13-07354">Figure 3A</xref>), <sup>13</sup>C-HSQC, HNCO, HNCA, HN(CO)CA, HNCACB, CBCA(CO)NH, C(CO)NH, HC(CO)NH, HBHA(CO)NH, HCCH-COSY, HCCH-TOCSY and (H)CCH-TOCSY spectra. Stereospecific assignments of isopropyl methyl groups of Val and Leu residues were determined from characteristic <sup>1</sup>H-<sup>13</sup>C coupling patterns in a high resolution <sup>1</sup>H-<sup>13</sup>C HSQC of the NC7 sample [<xref ref-type="bibr" rid="b19-ijms-13-07354">19</xref>]. NOE distance restraints were derived from a <sup>15</sup>N-edited NOESY-HSQC (mixing time τ<sub>m</sub> = 70 ms) and two <sup>13</sup>C-edited NOESY-HSQC (τ<sub>m</sub> = 70 ms) optimized for either aliphatic or aromatic carbon detection. Additional NOEs were assigned from a 4D <sup>13</sup>C-<sup>13</sup>C-HMQC-NOESY-HMQC (τ<sub>m</sub> = 70 ms) recorded after lyophilization and dissolution into pure D<sub>2</sub>O. Spectra were processed by NMRPipe and analyzed with Sparky 3.110 [<xref ref-type="bibr" rid="b20-ijms-13-07354">20</xref>]. Nearly complete resonance assignments (99.3%) were determined. NMR assignments, raw NOESY FIDs, and NOESY peak lists have been deposited in the BioMagResBank (BMRB accession number 16,521).</p></sec>
<sec>
<title>3.3. NMR Structure Calculation</title>
<p>The solution NMR structure of CV_2116 was initially calculated using both AutoStructure [<xref ref-type="bibr" rid="b21-ijms-13-07354">21</xref>] and CYANA 2.1 [<xref ref-type="bibr" rid="b22-ijms-13-07354">22</xref>]. Inputs for calculation were resonance assignments, NOESY peak lists from the four NOESY spectra that included peak intensities, and TALOS-derived dihedral restraints for ϕ and ψ dihedral angles. NOE assignments were initially made automatically in AutoStructure and CYANA and the assignments refined manually. NMR RPF quality assessment scores were used to assess “goodness of fit” between calculated structures and NOESY peaks lists and were used to guide manual and iterative refinement of NOESY peak picking. The NOE-derived distance constraints, dihedral constraints, and hydrogen-bond constraints derived from AutoStructure were converted to Xplor/CNS format using PDBStat and the upper bounds for the NOEs were increased by 10%. These constraints were used to calculate 150 structures using Xplor-NIH (version 2.25) with a standard simulated annealing protocol followed by refinement of the 20 lowest energy structure using the Xplor-NIH protocol [<xref ref-type="bibr" rid="b23-ijms-13-07354">23</xref>,<xref ref-type="bibr" rid="b24-ijms-13-07354">24</xref>]. For the final NMR ensemble, the 20 lowest energy structures were deposited in the Protein Data Bank (PDB ID, 2KON). NMR structure quality analysis and structural statistics were performed using the PSVS [<xref ref-type="bibr" rid="b25-ijms-13-07354">25</xref>] and RPF [<xref ref-type="bibr" rid="b26-ijms-13-07354">26</xref>] software packages. Structural statistics for CV_2116 are presented in <xref ref-type="table" rid="t1-ijms-13-07354">Table 1</xref>.</p></sec></sec>
<sec sec-type="conclusions">
<title>4. Conclusions</title>
<p>In conclusion, protein sequence analysis indicated that the hypothetical protein CV_2116 from <italic>C. violaceum</italic> is not highly conserved among known protein families. The three-dimensional structure of CV_2116 was determined using solution state NMR spectroscopy and analysis of the structure indicated that CV_2116 adopted a novel fold containing two anti-parallel β-sheets and one α-helix. Despite determination of a high quality three-dimensional structure, no biochemical or biological function of CV_2116 could be predicted based on sequence similarity or structural alignment searches. However, analysis of the genes neighboring <italic>CV_2116</italic> indicated that it resides in a putative operon that encodes for five proteins that are likely involved in bacteriophage tail-like assembly, and that the gene and its associated operon is of prophage origin, thereby giving <italic>C. violaceum</italic> distinctly bacteriophage-like characteristics that include the occurrence of a bacteriophage tail-like assembly. In closing, the analysis of CV_2116 from <italic>C. violaceum</italic>, which is both a hypothetical protein and a singleton, is typical of the challenge that is often encountered when researchers attempt to construct a hypothesis regarding the biochemical or cellular function of a hypothetical and/or singleton protein exclusively from sequence and structural information when little other knowledge about the protein is available.</p></sec></body>
<back>
<ack>
<title>Acknowledgments</title>
<p>This work was supported by the National Institute of General Medical Sciences; Protein Structure Initiative-Biology Program; Grant Number: U54-GM074958 and U54-GM094597. The majority of the data collection was conducted at the Ohio Center of Excellence in Biomedicine in Structural Biology and Metabonomics at Miami University. A portion of the NMR experiments was performed in the Environmental Molecular Sciences Laboratory, a national scientific user facility sponsored by the U.S. Department of Energy’s Office of Biological and Environmental Research and located at Pacific Northwest National Laboratory. The authors acknowledge Gaetano T. Montelione and Shuisong Ni for helpful discussions.</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-ijms-13-07354"><label>1</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kodach</surname><given-names>L.L.</given-names></name><name><surname>Bos</surname><given-names>C.L.</given-names></name><name><surname>Durán</surname><given-names>N.</given-names></name><name><surname>Peppelenbosch</surname><given-names>M.P.</given-names></name><name><surname>Ferreira</surname><given-names>C.V.</given-names></name><name><surname>Hardwick</surname><given-names>J.C.H.</given-names></name></person-group><article-title>Violacein synergistically increases 5-fluorouracil cytotoxicity, induces apoptosis and inhibits Akt-mediated signal transduction in human colorectal cancer cells</article-title><source>Carcinogenesis</source><year>2006</year><volume>27</volume><fpage>508</fpage><lpage>516</lpage><pub-id pub-id-type="pmid">16344270</pub-id></citation></ref>
<ref id="b2-ijms-13-07354"><label>2</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>De Almeida</surname><given-names>D.F.</given-names></name><name><surname>Hungria</surname><given-names>M.</given-names></name><name><surname>Guimarães</surname><given-names>C.T.</given-names></name><name><surname>Antônio</surname><given-names>R.V.</given-names></name><name><surname>Almeida</surname><given-names>F.C.</given-names></name><name><surname>de Almeida</surname><given-names>L.G.P.</given-names></name><name><surname>de Almeida</surname><given-names>R.</given-names></name><name><surname>Andrade</surname><given-names>E.M.</given-names></name><name><surname>Araripe</surname><given-names>J.</given-names></name><name><surname>de Araújo</surname><given-names>M.F.F.</given-names></name><etal/></person-group><article-title>The complete genome sequence of <italic>Chromobacterium violaceum</italic> reveals remarkable and exploitable bacterial adaptability</article-title><source>Proc. Nat. Acad. Sci. USA</source><year>2003</year><volume>100</volume><fpage>11660</fpage><lpage>11665</lpage><pub-id pub-id-type="doi">10.1073/pnas.1832124100</pub-id><pub-id pub-id-type="pmid">14500782</pub-id></citation></ref>
<ref id="b3-ijms-13-07354"><label>3</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>De Almeida</surname><given-names>R.</given-names></name><name><surname>Trevilato</surname><given-names>P.B.</given-names></name><name><surname>Bartoleti</surname><given-names>L.A.</given-names></name><name><surname>Proença-Módena</surname><given-names>J.L.</given-names></name><name><surname>Hanna</surname><given-names>E.S.</given-names></name><name><surname>Gregoracci</surname><given-names>G.B.</given-names></name><name><surname>Brocchi</surname><given-names>M.</given-names></name></person-group><article-title>Bacteriophages and insertion sequences of <italic>Chromobacterium violaceum</italic> ATCC 12472</article-title><source>Genet. Mol. Res</source><year>2004</year><volume>3</volume><fpage>76</fpage><lpage>84</lpage><pub-id pub-id-type="pmid">15100989</pub-id></citation></ref>
<ref id="b4-ijms-13-07354"><label>4</label><citation citation-type="web"><source>Protein: Q7NW74_CHRVO (Q7NW74)</source><comment>Availbale online: <ext-link xlink:href="http://pfam.sanger.ac.uk/protein/Q7NW74" ext-link-type="uri">http://pfam.sanger.ac.uk/protein/Q7NW74</ext-link></comment><access-date>accessed on 15 February 2012</access-date></citation></ref>
<ref id="b5-ijms-13-07354"><label>5</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bauer</surname><given-names>M.</given-names></name><name><surname>Kube</surname><given-names>M.</given-names></name><name><surname>Teeling</surname><given-names>H.</given-names></name><name><surname>Richter</surname><given-names>M.</given-names></name><name><surname>Lombardot</surname><given-names>T.</given-names></name><name><surname>Allers</surname><given-names>E.</given-names></name><name><surname>Würdemann</surname><given-names>C.A.</given-names></name><name><surname>Quast</surname><given-names>C.</given-names></name><name><surname>Kuhl</surname><given-names>H.</given-names></name><name><surname>Knaust</surname><given-names>F.</given-names></name><etal/></person-group><article-title>Whole genome analysis of the marine Bacteroidetes “<italic>Gramella forsetii</italic>” reveals adaptations to degradation of polymeric organic matter</article-title><source>Environ. Microbiol</source><year>2006</year><volume>8</volume><fpage>2201</fpage><lpage>2213</lpage><pub-id pub-id-type="doi">10.1111/j.1462-2920.2006.01152.x</pub-id><pub-id pub-id-type="pmid">17107561</pub-id></citation></ref>
<ref id="b6-ijms-13-07354"><label>6</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rhee</surname><given-names>M.S.</given-names></name><name><surname>Moritz</surname><given-names>B.E.</given-names></name><name><surname>Xie</surname><given-names>G.</given-names></name><name><surname>del Rio</surname><given-names>T.G.</given-names></name><name><surname>Dalin</surname><given-names>E.</given-names></name><name><surname>Tice</surname><given-names>H.</given-names></name><name><surname>Bruce</surname><given-names>D.</given-names></name><name><surname>Goodwin</surname><given-names>L.</given-names></name><name><surname>Chertkov</surname><given-names>O.</given-names></name><name><surname>Brettin</surname><given-names>T.</given-names></name><etal/></person-group><article-title>Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, <italic>Bacillus coagulans</italic> strain 36D1</article-title><source>Stand. Genomic Sci</source><year>2011</year><volume>5</volume><pub-id pub-id-type="doi">10.4056/sigs.2365342</pub-id></citation></ref>
<ref id="b7-ijms-13-07354"><label>7</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Larkin</surname><given-names>M.A.</given-names></name><name><surname>Blackshields</surname><given-names>G.</given-names></name><name><surname>Brown</surname><given-names>N.P.</given-names></name><name><surname>Chenna</surname><given-names>R.</given-names></name><name><surname>McGettigan</surname><given-names>P.A.</given-names></name><name><surname>McWilliam</surname><given-names>H.</given-names></name><name><surname>Valentin</surname><given-names>F.</given-names></name><name><surname>Wallace</surname><given-names>I.M.</given-names></name><name><surname>Wilm</surname><given-names>A.</given-names></name><name><surname>Lopez</surname><given-names>R.</given-names></name><etal/></person-group><article-title>Clustal W and Clustal X version 2.0</article-title><source>Bioinformatics</source><year>2007</year><volume>23</volume><fpage>2947</fpage><lpage>2948</lpage><pub-id pub-id-type="doi">10.1093/bioinformatics/btm404</pub-id><pub-id pub-id-type="pmid">17846036</pub-id></citation></ref>
<ref id="b8-ijms-13-07354"><label>8</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>Y.</given-names></name><name><surname>Ramelot</surname><given-names>T.A.</given-names></name><name><surname>Cort</surname><given-names>J.R.</given-names></name><name><surname>Wang</surname><given-names>H.</given-names></name><name><surname>Ciccosanti</surname><given-names>C.</given-names></name><name><surname>Jiang</surname><given-names>M.</given-names></name><name><surname>Janjua</surname><given-names>H.</given-names></name><name><surname>Acton</surname><given-names>T.B.</given-names></name><name><surname>Xiao</surname><given-names>R.</given-names></name><name><surname>Everett</surname><given-names>J.K.</given-names></name><etal/></person-group><article-title>Solution NMR structure of Dsy0195 homodimer from <italic>Desulfitobacterium hafniense</italic>: First structure representative of the YabP domain family of proteins involved in spore coat assembly</article-title><source>J. Struct. Funct. Genomics</source><year>2011</year><volume>12</volume><fpage>175</fpage><lpage>179</lpage><pub-id pub-id-type="doi">10.1007/s10969-011-9117-z</pub-id><pub-id pub-id-type="pmid">21904870</pub-id></citation></ref>
<ref id="b9-ijms-13-07354"><label>9</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Laskowski</surname><given-names>R.A.</given-names></name><name><surname>MacArthur</surname><given-names>M.W.</given-names></name><name><surname>Moss</surname><given-names>D.S.</given-names></name><name><surname>Thornton</surname><given-names>J.M.</given-names></name></person-group><article-title>PROCHECK—A program to check the stereochemical quality of protein structures</article-title><source>J. Appl. Cryst</source><year>1993</year><volume>26</volume><fpage>283</fpage><lpage>291</lpage><pub-id pub-id-type="doi">10.1107/S0021889892009944</pub-id></citation></ref>
<ref id="b10-ijms-13-07354"><label>10</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Laskowski</surname><given-names>R.A.</given-names></name><name><surname>Rullmannn</surname><given-names>J.A.</given-names></name><name><surname>MacArthur</surname><given-names>M.W.</given-names></name><name><surname>Kaptein</surname><given-names>R.</given-names></name><name><surname>Thornton</surname><given-names>J.M.</given-names></name></person-group><article-title>AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR</article-title><source>J. Biomol. NMR</source><year>1996</year><volume>8</volume><fpage>477</fpage><lpage>486</lpage><pub-id pub-id-type="pmid">9008363</pub-id></citation></ref>
<ref id="b11-ijms-13-07354"><label>11</label><citation citation-type="web"><source>CATH: Protein Structure Classification Database</source><comment>Available online: <ext-link xlink:href="http://www.cathdb.info/" ext-link-type="uri">http://www.cathdb.info/</ext-link></comment><access-date>accessed on 15 February 2012</access-date></citation></ref>
<ref id="b12-ijms-13-07354"><label>12</label><citation citation-type="web"><source>SCOP: Structural Classification of Proteins</source><comment>Available online: <ext-link xlink:href="http://scop.mrc-lmb.cam.ac.uk/scop/" ext-link-type="uri">http://scop.mrc-lmb.cam.ac.uk/scop/</ext-link></comment><access-date>accessed on 15 February 2012</access-date></citation></ref>
<ref id="b13-ijms-13-07354"><label>13</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holm</surname><given-names>L.</given-names></name><name><surname>Rosenstrom</surname><given-names>P.</given-names></name></person-group><article-title>Dali server: Conservation mapping in 3D</article-title><source>Nucleic. Acids. Res</source><year>2010</year><volume>38</volume><fpage>W545</fpage><lpage>W549</lpage><pub-id pub-id-type="doi">10.1093/nar/gkq366</pub-id><pub-id pub-id-type="pmid">20457744</pub-id></citation></ref>
<ref id="b14-ijms-13-07354"><label>14</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumazaki</surname><given-names>T.</given-names></name><name><surname>Ishii</surname><given-names>S.</given-names></name></person-group><article-title>Comparative study on fibers isolated from four R-type pyocins, phage-tail-like bacteriocins of <italic>Pseudomonas aeruginosa</italic></article-title><source>J. Biochem</source><year>1982</year><volume>92</volume><fpage>1559</fpage><lpage>1566</lpage><pub-id pub-id-type="pmid">6818225</pub-id></citation></ref>
<ref id="b15-ijms-13-07354"><label>15</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haggard-Ljungquist</surname><given-names>E.</given-names></name><name><surname>Jacobsen</surname><given-names>E.</given-names></name><name><surname>Rishovd</surname><given-names>S.</given-names></name><name><surname>Six</surname><given-names>E.W.</given-names></name><name><surname>Sunshine</surname><given-names>M.G.</given-names></name><name><surname>Lindqvist</surname><given-names>B.H.</given-names></name><name><surname>Kim</surname><given-names>K.-J.</given-names></name><name><surname>Barreiro</surname><given-names>V.</given-names></name><name><surname>Koonin</surname><given-names>E.V.</given-names></name><name><surname>Calendar</surname><given-names>R.</given-names></name><etal/></person-group><article-title>Bacteriophage P2: Genes involved in baseplate assembly</article-title><source>Virology</source><year>1995</year><volume>213</volume><fpage>109</fpage><lpage>121</lpage><pub-id pub-id-type="doi">10.1006/viro.1995.1551</pub-id><pub-id pub-id-type="pmid">7483254</pub-id></citation></ref>
<ref id="b16-ijms-13-07354"><label>16</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rucinsky</surname><given-names>T.E.</given-names></name><name><surname>Gregory</surname><given-names>J.P.</given-names></name><name><surname>Cota-Robles</surname><given-names>E.H.</given-names></name></person-group><article-title>Organization of bacteriophage tail-like particles in cells of <italic>Chromobacterium violaceum</italic></article-title><source>J. Bacteriol</source><year>1972</year><volume>110</volume><fpage>754</fpage><lpage>757</lpage><pub-id pub-id-type="pmid">4112258</pub-id></citation></ref>
<ref id="b17-ijms-13-07354"><label>17</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Canchaya</surname><given-names>C.</given-names></name><name><surname>Proux</surname><given-names>C.</given-names></name><name><surname>Fournous</surname><given-names>G.</given-names></name><name><surname>Bruttin</surname><given-names>A.</given-names></name><name><surname>Brüssow</surname><given-names>H.</given-names></name></person-group><article-title>Prophage genomics</article-title><source>Microbiol. Mol. Biol. Rev</source><year>2003</year><volume>67</volume><fpage>238</fpage><lpage>276</lpage><pub-id pub-id-type="doi">10.1128/MMBR.67.2.238-276.2003</pub-id><pub-id pub-id-type="pmid">12794192</pub-id></citation></ref>
<ref id="b18-ijms-13-07354"><label>18</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parkhill</surname><given-names>J.</given-names></name><name><surname>Dougan</surname><given-names>G.</given-names></name><name><surname>James</surname><given-names>K.D.</given-names></name><name><surname>Thomson</surname><given-names>N.R.</given-names></name><name><surname>Pickard</surname><given-names>D.</given-names></name><name><surname>Wain</surname><given-names>J.</given-names></name><name><surname>Churcher</surname><given-names>C.</given-names></name><name><surname>Mungall</surname><given-names>K.L.</given-names></name><name><surname>Bentley</surname><given-names>S.D.</given-names></name><name><surname>Holden</surname><given-names>M.T.G.</given-names></name><etal/></person-group><article-title>Complete genome sequence of a multiple drug resistant <italic>Salmonella enterica</italic> serovar Typhi CT18</article-title><source>Nature</source><year>2001</year><volume>413</volume><fpage>848</fpage><lpage>852</lpage><pub-id pub-id-type="doi">10.1038/35101607</pub-id><pub-id pub-id-type="pmid">11677608</pub-id></citation></ref>
<ref id="b19-ijms-13-07354"><label>19</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Neri</surname><given-names>D.</given-names></name><name><surname>Szyperski</surname><given-names>T.</given-names></name><name><surname>Otting</surname><given-names>G.</given-names></name><name><surname>Senn</surname><given-names>H.</given-names></name><name><surname>Wuethrich</surname><given-names>K.</given-names></name></person-group><article-title>Stereospecific nuclear magnetic resonance assignments of the methyl groups of valine and leucine in the DNA-binding domain of the 434 repressor by biosynthetically directed fractional 13C labeling</article-title><source>Biochemistry</source><year>1989</year><volume>28</volume><fpage>7510</fpage><lpage>7516</lpage><pub-id pub-id-type="doi">10.1021/bi00445a003</pub-id><pub-id pub-id-type="pmid">2692701</pub-id></citation></ref>
<ref id="b20-ijms-13-07354"><label>20</label><citation citation-type="web"><person-group person-group-type="author"><name><surname>Goddard</surname><given-names>T.</given-names></name><name><surname>Kneller</surname><given-names>D.</given-names></name></person-group><source>SPARKY, verison 3</source><publisher-name>University of California</publisher-name><publisher-loc>San Fransisco, CA, USA</publisher-loc><year>2008</year><comment>Available online: <ext-link xlink:href="http://www.cgl.ucsf.edu/home/sparky/" ext-link-type="uri">http://www.cgl.ucsf.edu/home/sparky/</ext-link></comment><access-date>accessed on 15 February 2012</access-date></citation></ref>
<ref id="b21-ijms-13-07354"><label>21</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname><given-names>Y.J.</given-names></name><name><surname>Tejero</surname><given-names>R.</given-names></name><name><surname>Powers</surname><given-names>R.</given-names></name><name><surname>Montelione</surname><given-names>G.T.</given-names></name></person-group><article-title>A topology-constrained distance network algorithm for protein structure determination from NOESY data</article-title><source>Proteins</source><year>2006</year><volume>62</volume><fpage>587</fpage><lpage>603</lpage><pub-id pub-id-type="pmid">16374783</pub-id></citation></ref>
<ref id="b22-ijms-13-07354"><label>22</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guntert</surname><given-names>P.</given-names></name></person-group><article-title>Automated NMR structure calculation with CYANA</article-title><source>Methods Mol. Biol</source><year>2004</year><volume>278</volume><fpage>353</fpage><lpage>378</lpage><pub-id pub-id-type="pmid">15318003</pub-id></citation></ref>
<ref id="b23-ijms-13-07354"><label>23</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schwieters</surname><given-names>C.D.</given-names></name><name><surname>Kuszewski</surname><given-names>J.J.</given-names></name><name><surname>Clore</surname><given-names>G.M.</given-names></name></person-group><article-title>Using Xplor-NIH for NMR molecular structure determination</article-title><source>Progr. Nucl. Magn. Reson. Spectrosc</source><year>2006</year><volume>48</volume><fpage>47</fpage><lpage>62</lpage><pub-id pub-id-type="doi">10.1016/j.pnmrs.2005.10.001</pub-id></citation></ref>
<ref id="b24-ijms-13-07354"><label>24</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cai</surname><given-names>M.</given-names></name><name><surname>Huang</surname><given-names>Y.</given-names></name><name><surname>Suh</surname><given-names>J.-Y.</given-names></name><name><surname>Louis</surname><given-names>J.M.</given-names></name><name><surname>Ghirlando</surname><given-names>R.</given-names></name><name><surname>Craigie</surname><given-names>R.</given-names></name><name><surname>Clore</surname><given-names>J.M.</given-names></name></person-group><article-title>Solution NMR structure of the barrier-to-autointegration factor-Emerin complex</article-title><source>J. Biol. Chem</source><year>2007</year><volume>282</volume><fpage>14525</fpage><lpage>14535</lpage><pub-id pub-id-type="doi">10.1074/jbc.M700576200</pub-id><pub-id pub-id-type="pmid">17355960</pub-id></citation></ref>
<ref id="b25-ijms-13-07354"><label>25</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bhattacharya</surname><given-names>A.</given-names></name><name><surname>Tejero</surname><given-names>R.</given-names></name><name><surname>Montelione</surname><given-names>G.T.</given-names></name></person-group><article-title>Evaluating protein structures determined by structural genomics consortia</article-title><source>Proteins</source><year>2007</year><volume>66</volume><fpage>778</fpage><lpage>795</lpage><pub-id pub-id-type="pmid">17186527</pub-id></citation></ref>
<ref id="b26-ijms-13-07354"><label>26</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname><given-names>Y.J.</given-names></name><name><surname>Powers</surname><given-names>R.</given-names></name><name><surname>Montelione</surname><given-names>G.T.</given-names></name></person-group><article-title>Protein NMR recall, precision, and F-measure scores (RPF scores): Structure quality assessment measures based on information retrieval statistics</article-title><source>J. Am. Chem. Soc</source><year>2005</year><volume>127</volume><fpage>1665</fpage><lpage>1674</lpage><pub-id pub-id-type="doi">10.1021/ja047109h</pub-id><pub-id pub-id-type="pmid">15701001</pub-id></citation></ref></ref-list>
<sec sec-type="display-objects">
<title>Figures and Table</title>
<fig id="f1-ijms-13-07354" position="float">
<label>Figure 1</label>
<caption>
<p>Sequence alignment of CV_2116 to OR16_04617 generated using ClustalW2 [<xref ref-type="bibr" rid="b7-ijms-13-07354">7</xref>].</p></caption>
<graphic xlink:href="ijms-13-07354f1.gif"/></fig>
<fig id="f2-ijms-13-07354" position="float">
<label>Figure 2</label>
<caption>
<p>(<bold>A</bold>) Two dimensional <sup>1</sup>H-<sup>15</sup>N HSQC spectrum of about 0.8 mM CV_2116 in 10 mM Tris (pH 7.0), 300 mM NaCl, 10 mM DTT, 1 mM benzamidine, 0.01% NaN<sub>3</sub>, 1× inhibitor cocktail (Roche Applied Science), 95% H<sub>2</sub>O/5% D<sub>2</sub>O collected at 20 °C on a Varian Inova 600 MHz NMR spectrometer. Backbone resonance assignments are labeled with one-letter amino acid codes followed by their sequence numbers. Assigned side chain NH<sup>□</sup> resonances of Arg (aliased), and side chain NH<sub>2</sub> resonances of Asn and Gln are also indicated; (<bold>B</bold>) 1D <sup>15</sup>N <italic>T</italic><sub>1</sub> and <italic>T</italic><sub>2</sub> relaxation data for CV_2116. (Top): <sup>15</sup>N <italic>T</italic><sub>1</sub> and <italic>T</italic><sub>2</sub> values were extracted by plotting the decay of integrated <sup>1</sup>H<sup>N</sup> intensity between 8.5 and 10.5 ppm and fitting the curves with standard exponential equations. (Bottom): Plot of rotational correlation time, τ<sub>c</sub> (ns), versus protein molecular weight (kDa) for known monomeric NESG targets of ranging size (taking into account isotope enrichment as well as affinity tags in the sequence). <sup>15</sup>N <italic>T</italic><sub>1</sub>/<italic>T</italic><sub>2</sub> data for all monomeric proteins used for the τ<sub>c</sub> <italic>vs</italic>. MW plot were obtained on the same Varian Inova 600 MHz spectrometer at 298 K.</p></caption>
<graphic xlink:href="ijms-13-07354f2.gif"/></fig>
<fig id="f3-ijms-13-07354" position="float">
<label>Figure 3</label>
<caption>
<p>(<bold>A</bold>) Stereoview of the superimpositions of the 20 lowest energy structures of CV_2116. The six β-strands are colored green, α-helix colored red, and the <italic>N</italic>-terminal tail, <italic>C</italic>-terminal tail and loop regions are colored grey; (<bold>B</bold>) Rainbow-colored ribbon representation of the lowest energy CV_2116 structure from the ensemble shown in (<bold>A</bold>). All secondary structural elements are labeled; (<bold>C</bold>) Electrostatic potential surface diagram for identical orientation as in (<bold>B</bold>). A positively charged region (R31, K32, R37 and K64) and a negatively charged region (D68, D71 and D74) are denoted in the left panel, while another negatively charged region (E47, D50, D52 and E56) is denoted in the right panel.</p></caption>
<graphic xlink:href="ijms-13-07354f3.gif"/></fig>
<table-wrap id="t1-ijms-13-07354" position="float">
<label>Table 1</label>
<caption>
<p>Summary of NMR and structure statistics for CV_2116 protein from <italic>Chromobacterium violaceum</italic> <xref ref-type="table-fn" rid="tfn1-ijms-13-07354">a</xref>.</p></caption>
<table frame="hsides" rules="groups">
<tbody>
<tr>
<td colspan="2" align="center" valign="top"><bold>Conformationally-Restricting Constraints</bold> <xref ref-type="table-fn" rid="tfn2-ijms-13-07354">b</xref></td></tr>
<tr>
<td colspan="2" align="center" valign="top"><bold>Distance Constraints</bold></td></tr>
<tr>
<td colspan="2" align="left" valign="top">
<hr/></td></tr>
<tr>
<td align="left" valign="top">Intra-residue (I = j)</td>
<td align="center" valign="top">306</td></tr>
<tr>
<td align="left" valign="top">Sequential (|i–j| = 1)</td>
<td align="center" valign="top">357</td></tr>
<tr>
<td align="left" valign="top">Medium-range (1 &lt; |i–j| &lt; 5)</td>
<td align="center" valign="top">193</td></tr>
<tr>
<td align="left" valign="top">Long-range (|i–j| ≥ 5)</td>
<td align="center" valign="top">419</td></tr>
<tr>
<td align="left" valign="top">Total</td>
<td align="center" valign="top">1275</td></tr>
<tr>
<td align="left" valign="top">Distance constraints per residue</td>
<td align="center" valign="top">15.5</td></tr>
<tr>
<td colspan="2" align="left" valign="top">Hydrogen bond constraints</td></tr>
<tr>
<td align="left" valign="top">Long-range (|i–j| ≥ 5)/total</td>
<td align="center" valign="top">20/56</td></tr>
<tr>
<td align="left" valign="top">Dihedral angle constraints</td>
<td align="center" valign="top">92</td></tr>
<tr>
<td align="left" valign="top">Total number of restricting constraints</td>
<td align="center" valign="top">1423</td></tr>
<tr>
<td align="left" valign="top">Number of constraints per residue</td>
<td align="center" valign="top">5.4</td></tr>
<tr>
<td colspan="2" align="left" valign="top">
<hr/></td></tr>
<tr>
<td colspan="2" align="center" valign="top"><bold>Residue Constraint Violations</bold> <xref ref-type="table-fn" rid="tfn2-ijms-13-07354">b</xref></td></tr>
<tr>
<td colspan="2" align="center" valign="top"><bold>Average Number of Distance Violations Per Structure</bold></td></tr>
<tr>
<td colspan="2" align="left" valign="top">
<hr/></td></tr>
<tr>
<td align="left" valign="top">0.1–0.2 Å</td>
<td align="center" valign="top">4.85</td></tr>
<tr>
<td align="left" valign="top">0.2–0.5 Å</td>
<td align="center" valign="top">1.15</td></tr>
<tr>
<td align="left" valign="top">&gt;0.5 Å</td>
<td align="center" valign="top">0</td></tr>
<tr>
<td align="left" valign="top">Average RMS distance violation/constraint (Å)</td>
<td align="center" valign="top">0.01</td></tr>
<tr>
<td align="left" valign="top">Maximum distance violation (Å)</td>
<td align="center" valign="top">0.29</td></tr>
<tr>
<td colspan="2" align="left" valign="top">
<hr/></td></tr>
<tr>
<td colspan="2" align="center" valign="top"><bold>Average Number of Dihedral Angle Violations Per Structure</bold></td></tr>
<tr>
<td colspan="2" align="left" valign="top">
<hr/></td></tr>
<tr>
<td align="left" valign="top">1–10°</td>
<td align="center" valign="top">0.7</td></tr>
<tr>
<td align="left" valign="top">&gt;10°</td>
<td align="center" valign="top">0</td></tr>
<tr>
<td align="left" valign="top">Average RMS dihedral angle violation/constraint (degree)</td>
<td align="center" valign="top">0.15</td></tr>
<tr>
<td align="left" valign="top">Maximum dihedral angle violation (degree)</td>
<td align="center" valign="top">2.60</td></tr>
<tr>
<td colspan="2" align="left" valign="top">
<hr/></td></tr>
<tr>
<td colspan="2" align="center" valign="top"><bold>RMSD from Average Coordinates (Å)</bold> <xref ref-type="table-fn" rid="tfn3-ijms-13-07354">c</xref></td></tr>
<tr>
<td colspan="2" align="left" valign="top">
<hr/></td></tr>
<tr>
<td align="left" valign="top">Backbone/Heavy atoms</td>
<td align="center" valign="top">0.5/0.9</td></tr>
<tr>
<td colspan="2" align="left" valign="top">
<hr/></td></tr>
<tr>
<td colspan="2" align="center" valign="top"><bold>Ramachandran Plot Statistics</bold> <xref ref-type="table-fn" rid="tfn3-ijms-13-07354">c</xref></td></tr>
<tr>
<td colspan="2" align="left" valign="top">
<hr/></td></tr>
<tr>
<td align="left" valign="top">Most favored regions (%)</td>
<td align="center" valign="top">96.3</td></tr>
<tr>
<td align="left" valign="top">Additional allowed regions (%)</td>
<td align="center" valign="top">3.6</td></tr>
<tr>
<td align="left" valign="top">Generously allowed (%)</td>
<td align="center" valign="top">0.1</td></tr>
<tr>
<td align="left" valign="top">Disallowed regions (%)</td>
<td align="center" valign="top">0</td></tr>
<tr>
<td colspan="2" align="left" valign="top">
<hr/></td></tr>
<tr>
<td colspan="2" align="center" valign="top"><bold>Global Quality Scores (raw/</bold><bold><italic>Z</italic></bold><bold>-score)</bold> <xref ref-type="table-fn" rid="tfn2-ijms-13-07354">b</xref></td></tr>
<tr>
<td colspan="2" align="left" valign="top">
<hr/></td></tr>
<tr>
<td align="left" valign="top">Verify3D</td>
<td align="center" valign="top">0.24/−3.53</td></tr>
<tr>
<td align="left" valign="top">Prosall</td>
<td align="center" valign="top">0.30/−1.45</td></tr>
<tr>
<td align="left" valign="top">Procheck(phi-psi) <xref ref-type="table-fn" rid="tfn3-ijms-13-07354">c</xref></td>
<td align="center" valign="top">0.21/1.14</td></tr>
<tr>
<td align="left" valign="top">Procheck(all) <xref ref-type="table-fn" rid="tfn3-ijms-13-07354">c</xref></td>
<td align="center" valign="top">0.23/1.36</td></tr>
<tr>
<td align="left" valign="top">Molprobity clash</td>
<td align="center" valign="top">16.49/−1.30</td></tr>
<tr>
<td colspan="2" align="left" valign="top">
<hr/></td></tr>
<tr>
<td colspan="2" align="center" valign="top"><bold>RPF Scores</bold> <xref ref-type="table-fn" rid="tfn4-ijms-13-07354">d</xref></td></tr>
<tr>
<td colspan="2" align="left" valign="top">
<hr/></td></tr>
<tr>
<td align="left" valign="top">Recall</td>
<td align="center" valign="top">0.98</td></tr>
<tr>
<td align="left" valign="top">Precision</td>
<td align="center" valign="top">0.91</td></tr>
<tr>
<td align="left" valign="top">F-measure</td>
<td align="center" valign="top">0.94</td></tr>
<tr>
<td align="left" valign="top">DP-score</td>
<td align="center" valign="top">0.80</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn1-ijms-13-07354">
<label>a</label>
<p>Structure statistics were computed for the ensemble of 20 deposited structures;</p></fn><fn id="tfn2-ijms-13-07354">
<label>b</label>
<p>Calculated using PSVS 1.4 program. Average distance violations were calculated using the summation of r<sup>−6</sup>. Residues 1–82 were analyzed;</p></fn><fn id="tfn3-ijms-13-07354">
<label>c</label>
<p>Ordered residues ranges (with sum of phi and psi &gt; 1.8): 3–5, 8–13, 16–32, 40–45, 51–71, 79–81;</p></fn><fn id="tfn4-ijms-13-07354">
<label>d</label>
<p>PF scores reflected the goodness-of-fit of the final ensemble of structures including disordered residues to the NMR data.</p></fn></table-wrap-foot></table-wrap></sec></back></article>
