<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">toxins</journal-id>
      <journal-title>Toxins</journal-title>
      <abbrev-journal-title abbrev-type="publisher">Toxins</abbrev-journal-title>
      <abbrev-journal-title abbrev-type="pubmed">Toxins</abbrev-journal-title>
      <issn pub-type="epub">2072-6651</issn>
      <publisher>
        <publisher-name>MDPI</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.3390/toxins4111367</article-id>
      <article-id pub-id-type="publisher-id">toxins-04-01367</article-id>
      <article-categories>
        <subj-group>
          <subject>Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Short Toxin-like Proteins Abound in Cnidaria Genomes</article-title>
      </title-group>
      
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Tirosh</surname>
            <given-names>Yitshak</given-names>
          </name>
          <xref rid="af1-toxins-04-01367" ref-type="aff">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Linial</surname>
            <given-names>Itai</given-names>
          </name>
          <xref rid="af2-toxins-04-01367" ref-type="aff">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Askenazi</surname>
            <given-names>Manor</given-names>
          </name>
          <xref rid="af1-toxins-04-01367" ref-type="aff">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Linial</surname>
            <given-names>Michal</given-names>
          </name>
          <xref rid="af1-toxins-04-01367" ref-type="aff">1</xref>
          <xref rid="c1-toxins-04-01367" ref-type="corresp">*</xref>
        </contrib>
      </contrib-group>
      <aff id="af1-toxins-04-01367"><label>1 </label>Department of Biological Chemistry, Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 91904, Israel; Email: <email>yitshak.tirosh@mail.huji.ac.il</email> (Y.T.); <email>manoras@cs.huji.ac.il</email> (M.A.)</aff>
      <aff id="af2-toxins-04-01367"><label>2 </label>The Racah Institute of Physics, The Hebrew University of Jerusalem, Jerusalem 91904, Israel; Email: <email>itai.linial@mail.huji.ac.il</email> </aff>
	  <author-notes>
        <corresp id="c1-toxins-04-01367"><label>*</label> Author to whom correspondence should be addressed; Email: <email>michall@cc.huji.ac.il</email>; Tel.: +972-2-658-5425; Fax: +972-2-658-6448.</corresp>
      </author-notes>
      <pub-date pub-type="epub">
        <day>16</day>
        <month>11</month>
        <year>2012</year>
      </pub-date>
      <pub-date pub-type="collection"> <month>11</month>
        <year>2012</year>
      </pub-date>
      <volume>4</volume>
      <issue>11</issue>
      <fpage>1367</fpage>
      <lpage>1384</lpage>
      <history>
        <date date-type="received">
          <day>24</day>
          <month>09</month>
          <year>2012</year>
        </date>
        <date date-type="rev-recd">
          <day>08</day>
          <month>11</month>
          <year>2012</year>
        </date>
        <date date-type="accepted">
          <day>09</day>
          <month>11</month>
          <year>2012</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>© 2012 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
        <copyright-year>2012</copyright-year>
        <license xmlns:xlink="http://www.w3.org/1999/xlink" license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/">
          <p>This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p>
        </license>
      </permissions>
      <abstract>
        <p>Cnidaria is a rich phylum that includes thousands of marine species. In this study, we focused on Anthozoa and Hydrozoa, which are represented by the <italic>Nematostella vectensis</italic> (sea anemone) and <italic>Hydra magnipapillata</italic> genomes. We present a method for ranking the toxin-like candidates from complete proteomes of Cnidaria. Toxin-like functions were revealed using ClanTox, a statistical machine-learning predictor trained on ion channel inhibitors from venomous animals. Fundamental features that were emphasized in training ClanTox include cysteines and their spacing along the sequences. Among the 83,000 proteins derived from Cnidaria representatives, we found 170 candidates that fulfill the properties of toxin-like proteins, the vast majority of which were previously unrecognized as toxins. An additional 394 short proteins exhibit characteristics of toxin-like proteins at a moderate degree of confidence. Remarkably, only 11% of the predicted toxin-like proteins were previously classified as toxins. Based on our prediction methodology and manual annotation, we inferred functions for over 400 of these proteins. Such functions include protease inhibitors, membrane pore formation, ion channel blockers and metal-binding proteins. Many of the proteins belong to small families of paralogs. We conclude that the evolutionary expansion of toxin-like proteins in Cnidaria contributes to their fitness in the complex environment of the aquatic ecosystem.</p>
      </abstract>
      <kwd-group>
        <kwd>hydra</kwd>
        <kwd>nematostella</kwd>
        <kwd>neurotoxin</kwd>
        <kwd>protein families</kwd>
        <kwd>disulfide bonds</kwd>
        <kwd>antimicrobial peptide</kwd>
        <kwd>ion channel inhibitor</kwd>
        <kwd>ClanTox</kwd>
        <kwd>complete proteome</kwd>
        <kwd>comparative proteomics</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec sec-type="intro">
      <title>1. Introduction</title>
      <p>To date, most multicellular model organisms that have been studied come from Bilateria. A glimpse of our metazoan origin can nevertheless be seen from the recently sequenced genome of the choanoflagellate <italic>Monosiga brevicollis</italic> [<xref ref-type="bibr" rid="B1-toxins-04-01367">1</xref>]. The genomic information from Porifera (sponges) has contributed to the reconstruction of the relative evolutionary position of Metazoa with respect to unicellular fungi [<xref ref-type="bibr" rid="B2-toxins-04-01367">2</xref>]. It is now clear that a more recent branch along the evolution of Metazoa links the Cnidaria and the Bilateria. This separation is dated to 650–750 million years ago [<xref ref-type="bibr" rid="B3-toxins-04-01367">3</xref>,<xref ref-type="bibr" rid="B4-toxins-04-01367">4</xref>,<xref ref-type="bibr" rid="B5-toxins-04-01367">5</xref>], though some inconsistency remains in the positioning of Cnidaria in the phylogenetic tree of the ctenophores, bilaterians and sponges [<xref ref-type="bibr" rid="B6-toxins-04-01367">6</xref>].</p>
      <p>Cnidaria is a phylum including thousands of species that live in aquatic environments. They include the following groups: (i) Anthozoa such as sea anemones, corals and sea pens; (ii) Scyphozoa such as the jellyfish; (iii) Cubozoa such as the box jellies, and (iv) Hydrozoa, such as the Hydra [<xref ref-type="bibr" rid="B7-toxins-04-01367">7</xref>]. Cnidarians are distinguished from other phyla by their cnidocytes, which they use to capture prey. In most Cnidarians, a “nettle” has evolved for effective injection of venom into the prey. Such a device is found in jellyfish and cubozoans [<xref ref-type="bibr" rid="B8-toxins-04-01367">8</xref>]. Cnidaria feed on a variety of organisms from plankton to large animals. The survival success of Cnidarians over millions of years is linked to the evolution of their toxins, many of which have yet to be discovered.</p>
      <p>The goal of this study is the identification of toxin and toxin-like proteins (collectively termed TOLIPs) in Cnidaria. The two completed genomes that were included in this study are those of the sea anemone <italic>Nematostella vectensis</italic> [<xref ref-type="bibr" rid="B9-toxins-04-01367">9</xref>] from the Atlantic coasts and of <italic>Hydra magnipapillata</italic>. The sea anemone is a model for the underlying developmental program of the body plan [<xref ref-type="bibr" rid="B10-toxins-04-01367">10</xref>] and the Hydra is the first sequenced representative of the Hydrozoa, a class that includes the fire corals, siphonophores and hydrocorals [<xref ref-type="bibr" rid="B11-toxins-04-01367">11</xref>].</p>
      <p>Animal toxins and other short proteins share a compact, cysteine-rich scaffold. An increasing number of proteins resembling animal toxins have been identified in non-venomous contexts. These proteins often act as natural cell modulators. They include pore-forming proteins, proteases, protease inhibitors, as well as secreted proteins that resemble cell antigens and growth factors [<xref ref-type="bibr" rid="B12-toxins-04-01367">12</xref>]. Several predictors were developed for identifying toxin-related proteins from animals. However, each such predictor focuses on only one type or property, such as the conotoxin family [<xref ref-type="bibr" rid="B13-toxins-04-01367">13</xref>], peptidases [<xref ref-type="bibr" rid="B14-toxins-04-01367">14</xref>] or cysteine-rich proteins [<xref ref-type="bibr" rid="B15-toxins-04-01367">15</xref>]. A strong evolutionary relationship exists between animal toxins and ancestral cysteine cross-linked proteins [<xref ref-type="bibr" rid="B16-toxins-04-01367">16</xref>,<xref ref-type="bibr" rid="B17-toxins-04-01367">17</xref>,<xref ref-type="bibr" rid="B18-toxins-04-01367">18</xref>]. The most striking examples are proteins from rodents and humans that resemble snake α-neurotoxins and act as modulators in the brain [<xref ref-type="bibr" rid="B19-toxins-04-01367">19</xref>] and skin [<xref ref-type="bibr" rid="B20-toxins-04-01367">20</xref>].</p>
      <p>The exponential growth of raw protein sequence data has driven the field to acknowledge the need for automated, robust functional inference on a genomic scale. However, routinely used genome annotation tools often overlook the weak signal of short proteins. Furthermore, mass spectrometry (MS) methods provide only partial coverage of short proteins. The lack of transcriptomic evidence and the realization that many toxins (especially from marine animals) include non-classical post-translational modifications limit the knowledge of these short proteins. Consequently, EST collections, RNA-Seq and full-length cDNA remain the preferred sources in seeking out novel short bioactive proteins.</p>
      <p>We have developed a machine-learning-based classifier called ClanTox (CLassifier of ANimal TOXins) for ranking protein sequences according to their toxin-like properties. The short proteins that carry toxin activity and those that share toxin-like compact structures are collectively called TOLIPs. ClanTox creates a robust characterization of proteins that exhibit features of compact proteins, many of which resemble animal toxins [<xref ref-type="bibr" rid="B21-toxins-04-01367">21</xref>]. We have identified novel TOLIPs in the honeybee brain [<xref ref-type="bibr" rid="B21-toxins-04-01367">21</xref>], in viruses and in rodents [<xref ref-type="bibr" rid="B22-toxins-04-01367">22</xref>]. Recently, a TOLIP candidate expressed in the brain of the honeybee and other insects was validated as a non-coding RNA with brain-specific expression [<xref ref-type="bibr" rid="B23-toxins-04-01367">23</xref>].</p>
      <p>We applied ClanTox to the entire available Cnidaria proteome and have identified hundreds of novel candidates. We then prioritized the predicted TOLIPs in view of their key biological functions. We found 564 TOLIPs among the 17,000 short proteins from Nematostella and Hydra. The top TOLIPs (159 and 30 candidates from Nematostella and Hydra, respectively) were carefully analyzed and we were able to infer functions for most of these proteins. We conclude this analysis with a discussion of the evolutionary and functional insights achieved through the expansion of TOLIP genes in Cnidaria.</p>
    </sec>
    <sec sec-type="results">
      <title>2. Results</title>
      <sec>
        <title>2.1. The Cnidarian Short Proteome</title>
        <p>Currently, there are over 83,000 known proteins for the different branches of Cnidaria. Most of these sequences originate from the recently completed proteomes derived from the genomes of the sea anemone <italic>Nematostella vectensis</italic> and the hydrozoan <italic>Hydra magnipapillata</italic>. <xref ref-type="fig" rid="toxins-04-01367-f001">Figure 1</xref> shows the number of proteins associated with Cnidaria and the branch of the sponges. The latter are represented by <italic>Amphimedon queenslandica</italic> and will not be further discussed.</p>
        <p>Short proteins are under-represented in all organisms. We have shown that a rather small number of functions populate this subset of the proteome. Notably, many of the short proteins archived in the main databases (e.g., UniProtKB [<xref ref-type="bibr" rid="B24-toxins-04-01367">24</xref>] and NCBI Proteins) are incomplete. These databases also include fragmented sequences from incomplete mRNA sequences (<italic>i.e.</italic>, sequences that lack initiating methionines or stop codons). Only a negligible percentage is attributable to processed peptides that carry a distinct biological function. The fraction of short proteins (&lt;150 amino acids) in all eukaryotes (total 6.3 million sequences) is close to 20%. This ratio is consistent across all major branches of Metazoa (e.g., Insects 19%; Echinodermata 18%). However, the proportion of short proteins in Cnidaria is significantly higher (27.5%), even relative to Porifera (sponges, 24%). The rest of the analysis will focus exclusively on this fraction.</p>
        <p>There are two resources for the complete proteomes from <italic>Nematostella vectensis</italic> and <italic>Hydra magnipapillata</italic> that differ in their level of redundancy. UniProtKB reports a total of 33,000 proteins from Cnidaria, among which 9050 are short. The protein section from NCBI, on the other hand, appears to be a more complete (but somewhat redundant) resource. NCBI reports 82,400 Cnidarian sequences (<xref ref-type="fig" rid="toxins-04-01367-f001">Figure 1</xref>). There are 68,000 protein sequences originating from the two completely sequenced genomes of Nematostella and Hydra, 16,900 of which are short proteins (&lt;150 amino acids, 25%). We combine these sources and focus exclusively on the short proteins (13,586 and 3314 from Nematostella and Hydra, respectively) to ensure a maximal discovery rate.</p>
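        <p>As an illustration only, the length filter described above can be sketched in a few lines of Python. This is not the pipeline used in this study: the function names (parse_fasta, short_proteins, percent) are hypothetical, the input is assumed to be FASTA-formatted text, and the cutoff of 150 residues is exposed as a parameter. With the counts quoted above, percent(16900, 68000) gives the reported share of roughly 25%.</p>
        <preformat>
```python
from typing import Iterator, List, Tuple


def parse_fasta(text: str) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header = None
    chunks: List[str] = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts; emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def short_proteins(records, max_len: int = 150):
    """Keep records whose sequence is shorter than max_len residues
    (assumed cutoff; adjust if the boundary should be inclusive)."""
    return [(h, s) for h, s in records if max_len > len(s)]


def percent(part: int, whole: int) -> float:
    """Percentage of part in whole, rounded to one decimal place."""
    return round(100.0 * part / whole, 1)
```
</preformat>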
		<fig id="toxins-04-01367-f001" position="anchor">
          <label>Figure 1</label>
          <caption>
            <p>Phylogenetic tree of the metazoa. The number of protein sequences for each branch is indicated. Data are retrieved from the NCBI taxonomy database. </p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="toxins-04-01367-g001.tif"/>
        </fig>
        <fig id="toxins-04-01367-f002" position="anchor">
          <label>Figure 2</label>
          <caption>
            <p>Complete proteome annotations. The fraction of the proteins annotated as predicted, hypothetical or putative are shown for <italic>Apis mellifera</italic>, <italic>Hydra magnipapillata</italic> and <italic>Nematostella vectensis</italic>. </p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="toxins-04-01367-g002.tif"/>
        </fig>
        
      </sec>
      <sec>
        <title>2.2. Cnidarian Proteomes Are Partial and Poorly Annotated</title>
        <p>The annotation of protein sequences from Cnidaria (Nematostella and Hydra) lags behind that of the model organisms curated by the NCBI. This state of affairs is particularly extreme for the Nematostella genome, where only 1.5% of the proteome has informative protein titles. The rest is indicated as “predicted” or “hypothetical” (<xref ref-type="fig" rid="toxins-04-01367-f002">Figure 2</xref>). For the Hydra proteome, about 50% of the sequences are associated with informative annotations and the rest are marked “predicted” or “hypothetical”. Proteomes that belong to the classes Cubozoa (sea wasps) and Scyphozoa (jellyfishes) are only partially sequenced, with fewer than 200 and 1000 known ORFs, respectively.</p>
        <p>For comparison, we present the annotation coverage of the completed <italic>Apis mellifera</italic> proteome. For <italic>A. mellifera</italic> (honeybee), curation provides informative annotations for 75% of the proteome. For other species (e.g., popular model organisms), the annotation coverage of the complete proteome is higher than 75% and may reach 98% (e.g., in the case of the human proteome). Note that due to the difficulty in assigning function to short proteins, the fraction of annotated short proteins from Hydra and Nematostella is effectively negligible.</p>
      </sec>
      <sec>
        <title>2.3. Discovery of Toxin-like Proteins (TOLIPs) in Hydra</title>
        <p>While most hydrozoans are non-toxic, a few, such as the fire coral Millepora and the Portuguese Man-O-War Physalia, are highly venomous animals. A bioinformatics approach for detecting bioactive peptides with toxin-like activities was conducted [<xref ref-type="bibr" rid="B25-toxins-04-01367">25</xref>]. Surprisingly, Hydra lacks classical ion channel blockers that are found in almost all venomous organisms. However, some proteins act as Ryanodine receptor Ca<sup>2+</sup> channel blockers [<xref ref-type="bibr" rid="B26-toxins-04-01367">26</xref>]. Manual inspection reveals the complexity and richness of bioactive peptides in Hydra [<xref ref-type="bibr" rid="B25-toxins-04-01367">25</xref>]. Among them are proteins that belong to the phospholipase family PLA2, pore-forming sequences and non-classical ion channel blockers.</p>
        <fig id="toxins-04-01367-f003" position="anchor">
          <label>Figure 3</label>
          <caption>
            <p>Scheme of toxin-like proteins (TOLIPs) discovery. The three major steps in TOLIPs discovery and functional inference are shown. The schematic representation of the histogram of the ClanTox prediction for short proteins is shown. The high confidence predictions are indicated as P2 and P3.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="toxins-04-01367-g003.tif"/>
        </fig>
        
        <p><xref ref-type="fig" rid="toxins-04-01367-f003">Figure 3</xref> illustrates the schematic steps in the discovery of Cnidarian TOLIPs. The steps include the input set of short sequences, the training protocol for ClanTox, the prediction results and the semi-manual functional inference. We demonstrate the entire protocol for the Hydra short proteome, whose annotation coverage is superior to that of the Nematostella proteome (<xref ref-type="fig" rid="toxins-04-01367-f002">Figure 2</xref>).</p>
        <p>ClanTox tends to identify proteins containing multiple cysteines distributed along the entire sequence. Multiple cysteines and their spacing are the hallmark of many animal secreted ion channel inhibitors. Activating ClanTox on the 17,580 proteins from the Hydra proteome revealed 110 sequences that are positively predicted to be toxin-like (marked P1–P3). </p>
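        <p>To make the notion of cysteine number and spacing concrete, the following Python sketch computes a few simple descriptors for a single sequence. It is an illustrative assumption and does not reproduce the ClanTox feature set or its classifier: the function names and the returned keys (n_cys, cys_fraction, gaps, span_fraction) are hypothetical.</p>
        <preformat>
```python
def cysteine_positions(seq: str) -> list:
    """Return the 0-based positions of cysteine (C) residues."""
    return [i for i, aa in enumerate(seq.upper()) if aa == "C"]


def cysteine_features(seq: str) -> dict:
    """Summarize cysteine content and spacing for one protein sequence.

    Hypothetical descriptors for illustration; not the ClanTox features.
    """
    pos = cysteine_positions(seq)
    # Distances between consecutive cysteines along the sequence.
    gaps = [b - a for a, b in zip(pos, pos[1:])]
    # Number of residues from the first to the last cysteine, inclusive.
    span = (pos[-1] - pos[0] + 1) if pos else 0
    return {
        "length": len(seq),
        "n_cys": len(pos),
        "cys_fraction": len(pos) / len(seq) if seq else 0.0,
        "gaps": gaps,
        "span_fraction": span / len(seq) if seq else 0.0,
    }
```
</preformat>
        <p>A span_fraction close to 1 indicates cysteines distributed along the entire sequence, whereas a short list of identical gaps may hint at a repeated unit.</p>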
        <p><xref ref-type="fig" rid="toxins-04-01367-f004">Figure 4</xref> lists the inferred functions for the 30 highest-confidence sequences (P3 and P2, <xref ref-type="fig" rid="toxins-04-01367-f003">Figure 3</xref>) from Hydra. We note that four of the proteins are composed of tandem repeats (TRs). In such cases, ClanTox wrongly predicts these proteins as TOLIPs (<xref ref-type="fig" rid="toxins-04-01367-f003">Figure 3</xref>, stars). A protein that has one or more cysteines in its repeated unit is prone to being mistakenly characterized as a TOLIP. For the rest of the Hydra TOLIPs, evidence of their function can be obtained from a homology search for domains and structural resemblance (<xref ref-type="fig" rid="toxins-04-01367-f004">Figure 4</xref>, arrowheads).</p>
        <fig id="toxins-04-01367-f004" position="anchor">
          <label>Figure 4</label>
          <caption>
            <p>Functional inference for the predictions of TOLIPs from Hydra. TOLIPs that are listed are predicted as P2 and P3. Tandem Repeat (TR) proteins are marked by a star. All other functions are marked by colored arrowheads. XP_002156558.1 carries two functional domains.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="toxins-04-01367-g004.tif"/>
        </fig>
        <p><xref ref-type="fig" rid="toxins-04-01367-f004">Figure 4</xref> summarizes the 10 major functions associated with high-confidence predictions from Hydra. These functions include antimicrobial peptides, metal binding, protease inhibitors and domains that participate in protein interactions. Of special interest are TOLIPs that act in the Wnt signaling pathway. Reports on the Wnt pathway in Cnidaria are in accord with our previously reported finding [<xref ref-type="bibr" rid="B27-toxins-04-01367">27</xref>]. Ultimately, for about two-thirds of the predicted TOLIPs, some function can be inferred (based on Hidden Markov Model comparisons, see Experimental Section). Furthermore, for a subset of these findings, a mode of action for the toxic effect of the protein can be envisioned.</p>
        <p>A salient example in our findings is the identification as TOLIPs of short proteins (XP_002166541.1 and XP_002161753.1) that structurally resemble the porcine spasmolytic proteins (pSP). The latter appears in extracellular eukaryotic proteins that are stabilized by three disulphide bonds to form a trefoil motif [<xref ref-type="bibr" rid="B28-toxins-04-01367">28</xref>]. Possibly, the Hydra pSP-like proteins comprise an unknown receptor-binding or growth factor-like domain.</p>
      </sec>
      <sec>
        <title>2.4. Expansion of Cell Modulatory Functions among TOLIPs from Hydra</title>
        <p>Most of the Hydra TOLIPs can be considered secreted proteins with extracellular functions (<xref ref-type="fig" rid="toxins-04-01367-f004">Figure 4</xref>). For example, XP_002156558.1 (135 amino acids) is a secretory protein with two regions. The <italic>N</italic>-terminus resembles the single insulin-like growth factor binding domain protein (SIBD-1) and the <italic>C</italic>-terminus resembles a protease inhibitor of the Kunitz superfamily. An architecture that is based on this combination of domains is found in additional toxins such as the β-bungarotoxin [<xref ref-type="bibr" rid="B29-toxins-04-01367">29</xref>].</p>
        <p>β-Bungarotoxin is a heterodimeric neurotoxin consisting of a phospholipase subunit linked by a disulfide bond to a K<sup>+</sup> channel binding subunit (belonging to the Kunitz protease inhibitor superfamily). Thus, toxicity is achieved by a phospholipase that is targeted to the presynaptic membrane by way of a paired Kunitz module [<xref ref-type="bibr" rid="B30-toxins-04-01367">30</xref>]. In the case of the Hydra, we anticipate a mode in which the Kunitz protease inhibitor domain presents the SIBD-1 to produce an effective binding. Among the 3D-solved structures (from the PDB), the Hydra Kunitz domain is similar to that of several potent toxins: β-bungarotoxin (PDB: 1BUN_B), Huwentoxin-11 (PDB: 2JOT_A), Anntoxin from the tree frog <italic>Hyla annectans</italic> (PDB: 2KCR_A), the snake venom of <italic>Bungarus fasciatus</italic> (PDB: 1JC6_A) and that of the green mamba <italic>Dendroaspis angusticeps</italic> (PDB: 1DTK_A).</p>
        <p>We now focus on a predicted TOLIP that represents a short, secreted protein with a modulatory function. <xref ref-type="fig" rid="toxins-04-01367-f005">Figure 5</xref>A compares a statistical model (HMM, Hidden Markov Model) that was based on the sequence XP_002164320.1 (105 amino acids) with a library of HMMs from all 3D-solved structures that are archived in the PDB. The resulting model was based on PDB accession 3ZXC from the Central American hunting spider <italic>Cupiennius salei</italic>. This sequence is a single insulin-like growth factor binding domain protein (SIBD-1). SIBD-1 was proposed to act in the spider’s immune system. This domain appears in 10 additional remote paralogs (the closest paralog, XP_002156854.1, was scored by ClanTox at a moderate P1 confidence level).</p>
        <p>An expansion of TOLIP genes is a general trend in Hydra. <xref ref-type="fig" rid="toxins-04-01367-f005">Figure 5</xref>B shows one such instance. All paralogs of XP_002154511.1 maintain the cysteine positions (<xref ref-type="fig" rid="toxins-04-01367-f005">Figure 5</xref>B). While the signal peptide segment (green font) is less conserved, the 10 cysteines and a number of charged amino acids are fully conserved. It is likely that these conserved amino acids participate in the folding or binding properties of these proteins.</p>
        <p>Modulation of adhesion through the activation of the integrin signaling pathways was identified among the proposed TOLIPs. XP_002157505.1 (and five additional paralogs) resembles the vascular apoptosis-inducing protein (VAP) from <italic>Crotalus atrox</italic> venom (Western diamondback rattlesnake). The similarity covers the disintegrin domain. Disintegrin is a short metalloproteinase domain that appears in viper venoms and functions as a potent inhibitor of platelet aggregation and integrin-dependent cell adhesion [<xref ref-type="bibr" rid="B31-toxins-04-01367">31</xref>].</p>
		<fig id="toxins-04-01367-f005" position="anchor">
          <label>Figure 5</label>
          <caption>
            <p>Cell modulators from Hydra. (<bold>A</bold>) The sequence XP_002164320.1 is shown. The segments of the sequence that were excluded from the HHPred comparative models are colored gray. HHPred representation of SIBD domain from Hydra and a structural homologue PDB: 3ZXC_A. The domain of 3ZXC includes a Single Insulin-like Growth Factor-Binding Domain Protein (SIBD-1) from the Central American Hunting Spider <italic>Cupiennius salei</italic>; (<bold>B</bold>) Set of secreted proteins and their paralogs. The function of these proteins is unknown. However, the spacing and the number of cysteines along the sequences are conserved (marked red).</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="toxins-04-01367-g005.tif"/>
        </fig>
        
      </sec>
      <sec>
        <title>2.5. The Number of TOLIPs Is Exceptionally High among Short Proteins from Nematostella</title>
        <p><xref ref-type="table" rid="toxins-04-01367-t001">Table 1</xref> summarizes the prediction results from ClanTox for the Nematostella and Hydra proteomes. We divided the results according to the significance of the predictions: P3 (Very High), P2 (High) and P1 (Moderate). We also indicated the number of negative predictions (N). It is evident that the fraction of positively identified TOLIPs in Nematostella is exceptionally high (4-fold higher relative to Hydra). As expected, the fraction of TOLIPs among the proteins that are shorter than 100 amino acids is very high (17% for P1–P3); the fraction of TOLIPs is somewhat smaller (11%) for proteins of length 101–150. However, in both cases, the fraction appears significantly larger than the equivalent fraction in Hydra. Hydra’s TOLIPs account for 3.1% and 3.4% of the proteins of length &lt;100 and 101–150 amino acids, respectively. We therefore investigated the origin of the TOLIP expansion in the sea anemone proteome.</p>
        <table-wrap id="toxins-04-01367-t001" position="float">
          <object-id pub-id-type="pii">toxins-04-01367-t001_Table 1</object-id>
          <label>Table 1</label>
          <caption>
            <p>Results of ClanTox predictions on short proteins.</p>
          </caption>
          <table>
<thead>
              <tr align="center">
                <th valign="middle">Species</th>
                <th valign="middle">Range (aa)</th>
                <th valign="middle">P3 (Very high)</th>
                <th valign="middle">P2 (High)</th>
                <th valign="middle">P1 (Moderate)</th>
                <th valign="middle">% P2–P3 predictions </th>
                <th valign="middle">Negative predictions</th>
                <th valign="middle">Total </th>
              </tr>
            </thead>
            <tbody>
              <tr align="center">
                <td rowspan="3" valign="middle">
                  <italic>N. vectensis</italic>
                </td>
                <td align="center" valign="middle">10–100</td>
                <td align="center" valign="middle">133</td>
                <td align="center" valign="middle">253</td>
                <td align="center" valign="middle">657</td>
                <td align="center" valign="middle">6.3</td>
                <td align="center" valign="middle">5083</td>
                <td align="center" valign="middle">6126</td>
              </tr>
              <tr>
                <td align="center" valign="middle">101–150</td>
                <td align="center" valign="middle">26</td>
                <td align="center" valign="middle">122</td>
                <td align="center" valign="middle">704</td>
                <td align="center" valign="middle">1.9</td>
                <td align="center" valign="middle">6608</td>
                <td align="center" valign="middle">7460</td>
              </tr>
              <tr>
                <td align="center" valign="middle">10–150</td>
                <td align="center" valign="middle">159</td>
                <td align="center" valign="middle">375</td>
                <td align="center" valign="middle">1361</td>
                <td align="center" valign="middle">3.9</td>
                <td align="center" valign="middle">11691</td>
                <td align="center" valign="middle">13586</td>
              </tr>
              <tr align="center">
                <td rowspan="3" valign="middle">
                  <italic>H. magnipapillata</italic>
                </td>
                <td align="center" valign="middle">10–100</td>
                <td align="center" valign="middle">8</td>
                <td align="center" valign="middle">7</td>
                <td align="center" valign="middle">19</td>
                <td align="center" valign="middle">1.4</td>
                <td align="center" valign="middle">1038</td>
                <td align="center" valign="middle">1073</td>
              </tr>
              <tr>
                <td align="center" valign="middle">101–150</td>
                <td align="center" valign="middle">3</td>
                <td align="center" valign="middle">12</td>
                <td align="center" valign="middle">61</td>
                <td align="center" valign="middle">0.6</td>
                <td align="center" valign="middle">2164</td>
                <td align="center" valign="middle">2241</td>
              </tr>
              <tr>
                <td align="center" valign="middle">10–150</td>
                <td align="center" valign="middle">11</td>
                <td align="center" valign="middle">19</td>
                <td align="center" valign="middle">80</td>
                <td align="center" valign="middle">0.9</td>
                <td align="center" valign="middle">3202</td>
                <td align="center" valign="middle">3314</td>
              </tr>
            </tbody>
          </table>
		  </table-wrap>
      </sec>
      <sec>
        <title>2.6. False Detection of TOLIPs Is Associated with Tandem Repeat Sequences</title>
        <p>It has been noted that the Nematostella proteome is enriched in tandem repeats (TRs). The properties of TRs have been thoroughly studied [<xref ref-type="bibr" rid="B32-toxins-04-01367">32</xref>]. We found that TR proteins account for 25% of the most highly significant TOLIPs (P3, see Experimental Section), a substantially higher fraction than in the overall proteome (16%).</p>
        <table-wrap id="toxins-04-01367-t002" position="float">
          <object-id pub-id-type="pii">toxins-04-01367-t002_Table 2</object-id>
          <label>Table 2</label>
          <caption>
            <p>Tandem repeat (TR) proteins among the top predictions from Nematostella. Because of database redundancy, each repeat was identified in two proteins (40 proteins in total).</p>
          </caption>
          <table>
<thead>
              <tr align="center">
                <th valign="middle">Consensus error</th>
                <th valign="middle">Copy number</th>
                <th valign="middle">Period</th>
                <th valign="middle">Repeat</th>
              </tr>
            </thead>
            <tbody>
              <tr align="center">
                <td valign="middle">0.02</td>
                <td valign="middle">3.03</td>
                <td valign="middle">38</td>
                <td valign="middle">1</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.11</td>
                <td valign="middle">2.09</td>
                <td valign="middle">35</td>
                <td valign="middle">2</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.12</td>
                <td valign="middle">2</td>
                <td valign="middle">29</td>
                <td valign="middle">3</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.03</td>
                <td valign="middle">3.05</td>
                <td valign="middle">20</td>
                <td valign="middle">4</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.07</td>
                <td valign="middle">5.26</td>
                <td valign="middle">19</td>
                <td valign="middle">5</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.08</td>
                <td valign="middle">2.17</td>
                <td valign="middle">18</td>
                <td valign="middle">6</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.04</td>
                <td valign="middle">4.58</td>
                <td valign="middle">12</td>
                <td valign="middle">7</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.03</td>
                <td valign="middle">5.5</td>
                <td valign="middle">12</td>
                <td valign="middle">8</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.04</td>
                <td valign="middle">7</td>
                <td valign="middle">11</td>
                <td valign="middle">9</td>
              </tr>
              <tr align="center">
                <td valign="middle">0</td>
                <td valign="middle">9.2</td>
                <td valign="middle">10</td>
                <td valign="middle">10</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.04</td>
                <td valign="middle">7.78</td>
                <td valign="middle">9</td>
                <td valign="middle">11</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.06</td>
                <td valign="middle">8.75</td>
                <td valign="middle">8</td>
                <td valign="middle">12</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.18</td>
                <td valign="middle">5.38</td>
                <td valign="middle">8</td>
                <td valign="middle">13</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.07</td>
                <td valign="middle">13.25</td>
                <td valign="middle">8</td>
                <td valign="middle">14</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.03</td>
                <td valign="middle">8.71</td>
                <td valign="middle">7</td>
                <td valign="middle">15</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.06</td>
                <td valign="middle">15.71</td>
                <td valign="middle">7</td>
                <td valign="middle">16</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.08</td>
                <td valign="middle">9.14</td>
                <td valign="middle">7</td>
                <td valign="middle">17</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.07</td>
                <td valign="middle">8.71</td>
                <td valign="middle">7</td>
                <td valign="middle">18</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.04</td>
                <td valign="middle">11.17</td>
                <td valign="middle">6</td>
                <td valign="middle">19</td>
              </tr>
              <tr align="center">
                <td valign="middle">0.07</td>
                <td valign="middle">7.67</td>
                <td valign="middle">6</td>
                <td valign="middle">20</td>
              </tr>
            </tbody>
          </table>
		  </table-wrap>
        
        <p>The properties of the repeats, including the length of the repeated unit (period) and the copy number, are summarized in <xref ref-type="table" rid="toxins-04-01367-t002">Table 2</xref>. There are 20 types of TR units in 40 of the top 159 TOLIP predictions (Very high, P3, <xref ref-type="table" rid="toxins-04-01367-t001">Table 1</xref>).</p>
        <p>We anticipate that these 40 TR proteins are false positives and do not act as toxins or toxin-like proteins. TR proteins are prone to false identification as TOLIPs because their repeats follow a pattern that includes at least one cysteine. Importantly, the repeated segment (<italic>i.e.</italic>, number of TR units × unit length) occupies most of the protein length. Many of the TR proteins lack an initiator methionine and consist of partial sequences with no evidence for their expression (see discussion in [<xref ref-type="bibr" rid="B32-toxins-04-01367">32</xref>]).</p>
      </sec>
      <sec>
        <title>2.7. Functional Assignment of Most TOLIPs from Nematostella</title>
        <p>Among the 119 predicted TOLIPs from Nematostella (<xref ref-type="table" rid="toxins-04-01367-t001">Table 1</xref>, excluding TR proteins), 19 were already annotated as Neurotoxins. No annotations are available for the remaining 100 proteins, which are named predicted/hypothetical proteins. To assign function to these 100 proteins, we first removed the most obviously redundant proteins (<italic>i.e.</italic>, 100% identity in amino acids, identical length). This step yielded 80 non-redundant proteins (<xref ref-type="fig" rid="toxins-04-01367-f006">Figure 6</xref>). Among them, 20 were TR proteins (<xref ref-type="table" rid="toxins-04-01367-t002">Table 2</xref>) and 12 were named Neurotoxins (<xref ref-type="fig" rid="toxins-04-01367-f006">Figure 6</xref>, marked yellow).</p>
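        <p>The redundancy-removal step can be illustrated with a minimal Python sketch. It assumes that redundancy means exact, full-length sequence identity, as stated above. The function name and the toy accessions and sequences are hypothetical and are not taken from the analyzed data.</p>
        <preformat><![CDATA[
```python
from typing import Dict, List, Tuple


def group_identical_sequences(records: List[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Group accessions that share an identical amino-acid sequence.

    Two records count as redundant only if their sequences are identical
    character for character, which also implies identical length.
    Returns a mapping: sequence -> accessions sharing it, in input order.
    """
    groups: Dict[str, List[str]] = {}
    for accession, sequence in records:
        key = sequence.strip().upper()
        groups.setdefault(key, []).append(accession)
    return groups


# Hypothetical toy input: accessions and sequences are made up for illustration.
toy_records = [
    ("acc_A", "MKCLLAGCC"),
    ("acc_B", "MKCLLAGCC"),  # exact duplicate of acc_A
    ("acc_C", "MKCLLAGC"),   # one residue shorter, so kept as distinct
]

groups = group_identical_sequences(toy_records)
# Keep the first accession of every group as its representative.
non_redundant = [accessions[0] for accessions in groups.values()]
print(non_redundant)  # ['acc_A', 'acc_C']
```
]]></preformat>
        <p>Keeping the full accession list for each unique sequence preserves the link between the non-redundant set and the redundant database entries.</p>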
        <p>Each sequence was tested for its most likely 3D structure using the HHpred algorithm (see Experimental Section). From this analysis we were able to annotate an additional 8 TOLIPs based on similarities to neurotoxin structural models (<xref ref-type="fig" rid="toxins-04-01367-f006">Figure 6</xref>, marked blue; 22 redundant proteins). These proteins can be partitioned into two main classes. The major group (5 proteins, 14 redundant proteins) shares a strong similarity to the Navs fold [<xref ref-type="bibr" rid="B33-toxins-04-01367">33</xref>]. The Nav polypeptides (e.g., Nv1, <italic>N. vectensis</italic> toxin 1) inhibit the inactivation of voltage-gated sodium channels. These proteins occupy an expanded chromosomal region. Notably, changes in the expression and maturation of Nv1 transcripts are known to occur throughout the development and the life cycle of the sea anemone [<xref ref-type="bibr" rid="B33-toxins-04-01367">33</xref>].</p>
        <p>The other class of predicted neurotoxins is longer (ranging from 105 to 125 amino acids) and has homologues in a wide array of venoms that block K<sup>+</sup> channels. An example is EDO49171.1. The closest homologue of EDO49171.1 is the human EDO45628.1. The shared segment matches MMP23 (matrix metalloproteinase 23), which is evolutionarily related to the sea anemone peptide ShK. ShK is a short peptide (35 amino acids) stabilized by three disulfide bridges. Three such sequences form a paralogous group.</p>
        <p>Additional functions that dominate the Nematostella TOLIPs include ligand-cell surface modulators (including adhesion and Wnt signaling) and the Kunitz protease inhibitors (<xref ref-type="fig" rid="toxins-04-01367-f006">Figure 6</xref>, marked brown). These functions are shared with the Hydra TOLIPs (<xref ref-type="fig" rid="toxins-04-01367-f004">Figure 4</xref>).</p>
        <p>TOLIPs that resemble adhesion domains may participate in cell-cell interaction networks. Many adhesion proteins are composed of a series of EGF-like domains that also bind calcium. For example, the protein EDO26015.1 shares this domain, which is found in several calcium-binding cell adhesion regulators (modeled on PDB: 2Bo2_A). Cell interaction by calcium regulation is an attractive extension of TOLIP functionality that calls for further investigation.</p>
        <p>In a few cases we identified TOLIPs as fragments that actually belong to longer proteins (<xref ref-type="fig" rid="toxins-04-01367-f006">Figure 6</xref>, F). Such cases stem from errors in the genome annotation phase. Of the 80 non-redundant high-confidence TOLIPs, only 3% resisted functional characterization.</p>
		<fig id="toxins-04-01367-f006" position="anchor">
          <label>Figure 6</label>
          <caption>
            <p>High confidence predictions from Nematostella. A list of 80 TOLIPs that were predicted by ClanTox as P3 is shown. Major functions are indicated by the colored bar next to the cysteine pattern scheme. Neurotoxins (NTx, marked yellow) include proteins that were previously annotated as such. Predicted overlooked neurotoxins are marked blue. The other functions are colored as follows: Gray, tandem repeat (TR) proteins; Orange, extracellular regulation and ligand binding; Brown, protease inhibitors, mainly represented by the Kunitz domain; Green, homologues of specialized domains from Pfam; Light blue, calcium-modulating domains of adhesion and EGF-like; M, metalloprotein. Proteins with an exceptionally high number of paralogs are labeled with that number. F marks a fragment that apparently belongs to a longer protein; these sequences reflect mistakes in the database assignments. The redundant list includes 159 sequences. Most proteins appear in both RefSeq and GenBank and are thus redundant in the NCBI protein database. Only the non-redundant set is shown.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="toxins-04-01367-g006.tif"/>
        </fig>
        
      </sec>
    </sec>
    <sec sec-type="discussion">
      <title>3. Discussion</title>
      <p>The proteomes of Hydra and Nematostella represent the Hydrozoa and Anthozoa, respectively, which diverged &gt;540 million years ago [<xref ref-type="bibr" rid="B11-toxins-04-01367">11</xref>]. The two genomes differ in size, GC nucleotide content and number of transposable elements, among other features. We found that the spectra of functions predicted for TOLIPs in the Hydra and Nematostella proteomes overlap (compare <xref ref-type="fig" rid="toxins-04-01367-f004">Figure 4</xref> and <xref ref-type="fig" rid="toxins-04-01367-f006">Figure 6</xref>). On the other hand, the basis for the drastic difference in the number of toxin-like candidates in each of these genomes (<xref ref-type="table" rid="toxins-04-01367-t001">Table 1</xref>) is not evident. We show that 25% of the Nematostella TOLIPs are actually tandem repeat (TR) proteins. We postulate that, in addition to the organism’s unique proteome, a permissive gene annotation contributes to sequences being wrongly identified as TOLIPs.</p>
      <sec>
        <title>3.1. Lack of Knowledge Regarding the Cnidaria Secretome</title>
        <p>It is expected that toxins (and TOLIPs) have a signal peptide. However, the Cnidaria genomes are mostly unannotated, and only 11 proteins were indicated as containing a signal peptide (SwissProt-based annotation). Among the analyzed Nematostella predictions (P3, excluding TR proteins), we identified 34% as having a signal peptide using SignalP 4.0. Recall that signal peptide information was not included in the training of ClanTox. We attribute this relatively low fraction to missing segments at the <italic>N</italic>-terminus of the proteins. Indeed, only 32% of the analyzed short proteins from Nematostella contain an initiator methionine. Transcriptomic data will probably be needed to improve the completeness of the Cnidaria sequences.</p>
        <p>Most Nematostella proteins (98.5%) are unannotated (<xref ref-type="fig" rid="toxins-04-01367-f002">Figure 2</xref>); thus, the functions of their predicted TOLIPs remain elusive. Sequence searches with the predicted TOLIPs highlight homologues among marine metagenomic sequences of unknown origin. For example, sequence XP_001624064.1 (130 amino acids, P3) resembles several uncharacterized sequences from metagenomic experiments. The potential for active exchange of genetic material through viruses and pathogens of Cnidaria cannot be excluded. A genetic exchange from viruses to their metazoan hosts was demonstrated among short proteins [<xref ref-type="bibr" rid="B34-toxins-04-01367">34</xref>]. However, for a number of sequences the apparent relatedness to metagenomic sequences is clearly spurious (e.g., EDO31964.1, 130 amino acids).</p>
      </sec>
      <sec>
        <title>3.2. The Cnidaria TOLIPs—A Source for New Drugs</title>
        <p>We propose that the top Toxin-like protein predictions may lead to an expansion of known toxins, toxin-like and antibacterial proteins. Further analysis of homologs and paralogs presented in this study will lead to the identification of amino acids critical to binding and specificity. Such analysis is beyond the scope of this research. </p>
        <p>The therapeutic potential of TOLIPs has led to the development of toxin-based drugs [<xref ref-type="bibr" rid="B35-toxins-04-01367">35</xref>]. Some small peptides from the conotoxin family are already in clinical use for managing chronic pain [<xref ref-type="bibr" rid="B36-toxins-04-01367">36</xref>]. Some toxins from Nematostella act by forming pores in the targeted membranes [<xref ref-type="bibr" rid="B37-toxins-04-01367">37</xref>]. From the mesoglea of a scyphoid jellyfish (<italic>Aurelia aurita</italic>) a novel antimicrobial peptide was biochemically identified with weak similarity to ion channel blockers or defensins [<xref ref-type="bibr" rid="B38-toxins-04-01367">38</xref>]. Indeed, the Defensin-fold carries an antimicrobial activity. As such, Defensins were proposed as attractive vaccines and as potential drugs [<xref ref-type="bibr" rid="B39-toxins-04-01367">39</xref>]. A Defensin-like fold is missing in Cnidaria. Most likely, the expansion of Defensins occurred in recently evolved phylogenetic branches prior to the speciation of Chordata. </p>
      </sec>
      <sec>
        <title>3.3. Evolution Dynamics—Expansion and Deletion of TOLIP Sequences</title>
        <p>Representative genomes from Porifera (sponges) have provided a molecular explanation for the increase in gene number due to a burst of gene duplication events. This process gave rise to the evolution of new domains (<italic>i.e.</italic>, adhesion molecules, lectin, proteases) [<xref ref-type="bibr" rid="B2-toxins-04-01367">2</xref>] in Cnidaria. Our results support local expansion events of genes encoding short proteins. The ability of a duplication burst to increase functional diversity was illustrated in yeast [<xref ref-type="bibr" rid="B40-toxins-04-01367">40</xref>] and humans [<xref ref-type="bibr" rid="B41-toxins-04-01367">41</xref>].</p>
        <p>The analysis of neurotoxin (Nav1) evolution exposed extensive genomic expansion of this region [<xref ref-type="bibr" rid="B42-toxins-04-01367">42</xref>]. Gene expansion has shaped many domain families, mainly for the immune system, signaling (e.g., leucine-rich repeats) and adhesion. Several venom components evolved via convergent evolution [<xref ref-type="bibr" rid="B43-toxins-04-01367">43</xref>]. Our study confirms that the phenomena of genetic expansion and convergent evolution are not limited to vertebrates (e.g., reptiles, platypus) [<xref ref-type="bibr" rid="B44-toxins-04-01367">44</xref>] but already dominate in the Cnidaria.</p>
      </sec>
    </sec>
    <sec>
      <title>4. Experimental Section</title>
      <sec>
        <title>4.1. Data Collection</title>
        <p>Protein sequences from Cnidaria were collected from UniProtKB [<xref ref-type="bibr" rid="B24-toxins-04-01367">24</xref>] and sequences marked as “fragments” were excluded. UniProtKB was used as an annotation source for “Signal peptide” and “cell localization”. Only 1% of the Cnidaria proteins are curated and represented in the SwissProt collection (391/32,934 proteins). The proteome of <italic>Nematostella vectensis</italic> (Starlet sea anemone) includes 24,435 proteins in UniProtKB. The original data set was extracted from the <italic>N. vectensis</italic> JGI complete genome 1.0 (2007) [<xref ref-type="bibr" rid="B45-toxins-04-01367">45</xref>]. In the case of the Nematostella proteome, protein redundancy originates from accessions obtained from the RefSeq and GenBank databases. Analysis was performed on proteins shorter than 150 amino acids. The FASTA file from the NCBI protein collection [<xref ref-type="bibr" rid="B46-toxins-04-01367">46</xref>] was used as input for ClanTox prediction [<xref ref-type="bibr" rid="B47-toxins-04-01367">47</xref>].</p>
      </sec>
      <sec>
        <title>4.2. Bioinformatics Analysis Tools</title>
        <p>SignalP 4.0 was applied for prediction of signal peptides [<xref ref-type="bibr" rid="B48-toxins-04-01367">48</xref>]. Multiple sequence alignments were computed and viewed with ClustalW2 on the EBI server and with Cobalt at the NCBI, using the default parameters. HHpred was used to identify remote homologues [<xref ref-type="bibr" rid="B49-toxins-04-01367">49</xref>]. HHpred is a sensitive algorithm based on HMM-HMM comparison that proposes the most likely structure or domain family assignment. We applied HHpred to build an HMM from the query sequence and compared it with a library of HMMs representing all known 3D structures from the PDB.</p>
      </sec>
      <sec>
        <title>4.3. ClanTox Scoring</title>
        <p>The typical performance of ClanTox, as assessed by cross-validation testing, is exceptionally high, with a mean area under the receiver operating characteristic (ROC) curve (AUC) of &gt;0.99 (for details see [<xref ref-type="bibr" rid="B22-toxins-04-01367">22</xref>]). The classifier returns one of four labels: N for negative predictions and P1–P3, reflecting three levels of positive predictions for TOLIPs. The most significant set of predictions is labeled P3. The labeling P1 to P3 reflects the mean score (the higher the score, the higher the prediction confidence) and the robustness of the score [<xref ref-type="bibr" rid="B47-toxins-04-01367">47</xref>]. The robustness is calculated by running the predictor 10 independent times on different negative sets and computing the standard deviation (SD) of the prediction results. P3 comprises proteins with a mean score &gt; 0.2. The negative predictions (<italic>i.e.</italic>, predicted as non-toxin) result from proteins with a mean score &lt; −0.2. We separate the confidence of positive predictions into three levels: P3 are predictions with a mean score &gt; 0.2 or mean score &gt; 2 * SD; P2 are predictions with a mean score &gt; 0.2 or mean score between SD and 2 * SD; P1 are predictions with a mean score &gt; −0.2 or mean score &lt; SD. ClanTox is accessible as an interactive web server [<xref ref-type="bibr" rid="B50-toxins-04-01367">50</xref>].</p>
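        <p>The labeling scheme can be sketched in Python under explicit assumptions; this is not the ClanTox implementation. The sketch assumes that the two conditions given for P3 and for P2 must hold together, because read as alternatives they would overlap, and that any remaining protein with a mean score above −0.2 is labeled P1. The function name, the use of the sample SD and the handling of scores exactly at a threshold are likewise assumptions.</p>
        <preformat><![CDATA[
```python
from statistics import mean, stdev
from typing import Sequence


def confidence_label(run_scores: Sequence[float]) -> str:
    """Map per-run prediction scores to N, P1, P2 or P3.

    Assumed reading of the published thresholds (not the ClanTox code):
      P3: mean > 0.2 and mean > 2 * SD
      P2: mean > 0.2 and SD < mean <= 2 * SD
      P1: any other protein with mean > -0.2
      N : everything else
    """
    m = mean(run_scores)
    sd = stdev(run_scores)  # sample SD across runs; the estimator is an assumption
    if m > 0.2 and m > 2 * sd:
        return "P3"
    if m > 0.2 and m > sd:
        return "P2"
    if m > -0.2:
        return "P1"
    return "N"


# Hypothetical score vectors for 10 runs each (values are made up).
stable_high = [0.9] * 10        # high mean, no spread
noisy_high = [0.1, 1.1] * 5     # mean 0.6, SD about 0.53
weak = [0.0] * 10               # mean between -0.2 and 0.2
negative = [-0.5] * 10          # mean below -0.2

for scores in (stable_high, noisy_high, weak, negative):
    print(confidence_label(scores))
```
]]></preformat>
        <p>Under this reading, a protein with a high mean score but a large spread across the 10 runs is demoted from P3 to P2 or P1, which is how robustness enters the label.</p>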
      </sec>
      <sec>
        <title>4.4. Discovery of Tandem Repeats (TRs)</title>
        <p>The presence of tandem repeats (TRs) in proteins and transcripts was determined using the Xstream web tool [<xref ref-type="bibr" rid="B51-toxins-04-01367">51</xref>] with the following parameters: (i) TRs are &gt;70% identical in their sequence; (ii) The minimal length of the repeated unit is 3 amino acids; (iii) The minimal domain length (defined as the total length of the repeated units) is 10 amino acids; (iv) The repeated unit appears at least twice; (v) Each repeat unit shares &gt;80% identity to the consensus sequence; (vi) There are at most three gaps in the repeats.</p>
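        <p>The length-based criteria (ii)–(iv) can be checked directly from the period and copy number of a repeat, as in the following Python sketch. It is not part of Xstream. The class and function names are hypothetical, and the domain length is approximated as period × copy number. Criteria (i), (v) and (vi) require the aligned repeat units and are not covered here.</p>
        <preformat><![CDATA[
```python
from dataclasses import dataclass

MIN_PERIOD = 3          # (ii) minimal length of the repeated unit, in amino acids
MIN_DOMAIN_LENGTH = 10  # (iii) minimal total length of the repeated units
MIN_COPY_NUMBER = 2     # (iv) the repeated unit appears at least twice


@dataclass(frozen=True)
class RepeatSummary:
    period: int          # length of one repeated unit
    copy_number: float   # number of (possibly partial) copies

    @property
    def domain_length(self) -> float:
        # Approximation: total repeated length = unit length x number of units.
        return self.period * self.copy_number


def passes_length_criteria(repeat: RepeatSummary) -> bool:
    """Return True if the repeat satisfies criteria (ii), (iii) and (iv)."""
    return (
        repeat.period >= MIN_PERIOD
        and repeat.copy_number >= MIN_COPY_NUMBER
        and repeat.domain_length >= MIN_DOMAIN_LENGTH
    )


# Period and copy number of repeats 1, 3 and 20 as listed in Table 2.
table2_examples = [
    RepeatSummary(38, 3.03),
    RepeatSummary(29, 2),
    RepeatSummary(6, 7.67),
]
# Hypothetical repeats that fail: unit too short, and domain too short.
failing_examples = [RepeatSummary(2, 6), RepeatSummary(4, 2)]

print([passes_length_criteria(r) for r in table2_examples])   # [True, True, True]
print([passes_length_criteria(r) for r in failing_examples])  # [False, False]
```
]]></preformat>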
      </sec>
    </sec>
    <sec sec-type="conclusions">
      <title>5. Conclusions</title>
      <p>We present here a systematic analysis for predicting Cnidarian toxin-like proteins (TOLIPs). We showed that, even with poorly annotated genomes, identifying new TOLIP candidates and inferring their possible functions are feasible. Over 95% of TOLIP candidates can be confidently annotated. For many of these predictions, experimental evidence is still lacking.</p>
      <p>From a functional perspective, we identified candidates that are predicted to function as protease inhibitors, components of a membrane pore, ion channel blockers, metal binding proteins and signaling molecules. Importantly, many of the short compact neurotoxin folds exhibit similarity to adhesion domains (signaling and extracellular modulators). We postulate that the basic elements of adhesion in Cnidaria resemble toxin-like proteins. </p>
      <p>Lastly, the TOLIPs in Cnidaria belong to small families of paralogs. The identified TOLIPs from the Nematostella and Hydra genomes exposed an abundance of genes that code for short templates of venom molecules. Remarkably, cysteine-rich templates account for a rich spectrum of related functions. Gene expansion dynamics are fundamental to increasing the repertoire of functions with a broad range of specificity and potency. We conclude that the reported evolutionary expansion of toxin-like proteins contributes to fitness in the complex environment of the aquatic ecosystem.</p>
    </sec>
    
  </body>
  <back>
  <ack>
      <title>Acknowledgments</title>
      <p>This study is supported by grants from the ISF 592/07 and the BSF 2007/219. N.R. and M.A. are student fellows of the SCCB, the Sudarsky Center of Computational Biology.</p>
    </ack>
    <notes>
      <title>Conflict of Interest</title>
      <p>The authors declare no conflict of interest. </p>
    </notes>
    <ref-list>
      <title>References</title>
      <ref id="B1-toxins-04-01367">
        <label>1.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>King</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Westbrook</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Young</surname>
              <given-names>S.L.</given-names>
            </name>
            <name>
              <surname>Kuo</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Abedin</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Chapman</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Fairclough</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Hellsten</surname>
              <given-names>U.</given-names>
            </name>
            <name>
              <surname>Isogai</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Letunic</surname>
              <given-names>I.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The genome of the choanoflagellate Monosiga brevicollis and the origin of metazoans</article-title>
          <source>Nature</source>
          <year>2008</year>
          <volume>451</volume>
          <fpage>783</fpage>
          <lpage>788</lpage>
        <pub-id pub-id-type="doi">10.1038/nature06617</pub-id><pub-id pub-id-type="pmid">18273011</pub-id></citation>
      </ref>
      <ref id="B2-toxins-04-01367">
        <label>2.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Muller</surname>
              <given-names>W.E.</given-names>
            </name>
            <name>
              <surname>Schroder</surname>
              <given-names>H.C.</given-names>
            </name>
            <name>
              <surname>Skorokhod</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Bunz</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Muller</surname>
              <given-names>I.M.</given-names>
            </name>
            <name>
              <surname>Grebenjuk</surname>
              <given-names>V.A.</given-names>
            </name>
          </person-group>
          <article-title>Contribution of sponge genes to unravel the genome of the hypothetical ancestor of Metazoa (Urmetazoa)</article-title>
          <source>Gene</source>
          <year>2001</year>
          <volume>276</volume>
          <fpage>161</fpage>
          <lpage>173</lpage>
          <pub-id pub-id-type="doi">10.1016/S0378-1119(01)00669-2</pub-id>
        </citation>
      </ref>
      <ref id="B3-toxins-04-01367">
        <label>3.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Hemmrich</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Anokhin</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Zacharias</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Bosch</surname>
              <given-names>T.C.</given-names>
            </name>
          </person-group>
          <article-title>Molecular phylogenetics in Hydra, a classical model in evolutionary developmental biology</article-title>
          <source>Mol. Phylogenet. Evol.</source>
          <year>2007</year>
          <volume>44</volume>
          <fpage>281</fpage>
          <lpage>290</lpage>
          <pub-id pub-id-type="doi">10.1016/j.ympev.2006.10.031</pub-id>
        </citation>
      </ref>
      <ref id="B4-toxins-04-01367">
        <label>4.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Philippe</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Derelle</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Lopez</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Pick</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Borchiellini</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Boury-Esnault</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Vacelet</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Renard</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Houliston</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Quéinnec</surname>
              <given-names>E.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Phylogenomics revives traditional views on deep animal relationships</article-title>
          <source>Curr. Biol.</source>
          <year>2009</year>
          <volume>19</volume>
          <fpage>706</fpage>
          <lpage>712</lpage>
          <pub-id pub-id-type="doi">10.1016/j.cub.2009.02.052</pub-id>
        </citation>
      </ref>
      <ref id="B5-toxins-04-01367">
        <label>5.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Bridge</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Cunningham</surname>
              <given-names>C.W.</given-names>
            </name>
            <name>
              <surname>DeSalle</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Buss</surname>
              <given-names>L.W.</given-names>
            </name>
          </person-group>
          <article-title>Class-level relationships in the phylum Cnidaria: Molecular and morphological evidence</article-title>
          <source>Mol. Biol. Evol.</source>
          <year>1995</year>
          <volume>12</volume>
          <fpage>679</fpage>
          <lpage>689</lpage>
        <pub-id pub-id-type="pmid">7659022</pub-id></citation>
      </ref>
      <ref id="B6-toxins-04-01367">
        <label>6.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Seipel</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Schmid</surname>
              <given-names>V.</given-names>
            </name>
          </person-group>
          <article-title>Evolution of striated muscle: Jellyfish and the origin of triploblasty</article-title>
          <source>Dev. Biol.</source>
          <year>2005</year>
          <volume>282</volume>
          <fpage>14</fpage>
          <lpage>26</lpage>
          <pub-id pub-id-type="doi">10.1016/j.ydbio.2005.03.032</pub-id>
        </citation>
      </ref>
      <ref id="B7-toxins-04-01367">
        <label>7.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Evans</surname>
              <given-names>N.M.</given-names>
            </name>
            <name>
              <surname>Lindner</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Raikova</surname>
              <given-names>E.V.</given-names>
            </name>
            <name>
              <surname>Collins</surname>
              <given-names>A.G.</given-names>
            </name>
            <name>
              <surname>Cartwright</surname>
              <given-names>P.</given-names>
            </name>
          </person-group>
          <article-title>Phylogenetic placement of the enigmatic parasite, <italic>Polypodium hydriforme</italic>, within the Phylum Cnidaria</article-title>
          <source>BMC Evol. Biol.</source>
          <year>2008</year>
          <volume>8</volume>
          <fpage>139</fpage>
          <pub-id pub-id-type="doi">10.1186/1471-2148-8-139</pub-id>
          <pub-id pub-id-type="pmid">18471296</pub-id>
        </citation>
      </ref>
      <ref id="B8-toxins-04-01367">
        <label>8.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cartwright</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Nawrocki</surname>
              <given-names>A.M.</given-names>
            </name>
          </person-group>
          <article-title>Character evolution in Hydrozoa (phylum Cnidaria)</article-title>
          <source>Integr. Comp. Biol.</source>
          <year>2010</year>
          <volume>50</volume>
          <fpage>456</fpage>
          <lpage>472</lpage>
          <pub-id pub-id-type="doi">10.1093/icb/icq089</pub-id>
        </citation>
      </ref>
      <ref id="B9-toxins-04-01367">
        <label>9.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Putnam</surname>
              <given-names>N.H.</given-names>
            </name>
            <name>
              <surname>Srivastava</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Hellsten</surname>
              <given-names>U.</given-names>
            </name>
            <name>
              <surname>Dirks</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Chapman</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Salamov</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Terry</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Shapiro</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Lindquist</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Kapitonov</surname>
              <given-names>V.V.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Sea anemone genome reveals ancestral eumetazoan gene repertoire and genomic organization</article-title>
          <source>Science</source>
          <year>2007</year>
          <volume>317</volume>
          <fpage>86</fpage>
          <lpage>94</lpage>
          <pub-id pub-id-type="doi">10.1126/science.1139158</pub-id>
        </citation>
      </ref>
      <ref id="B10-toxins-04-01367">
        <label>10.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Martindale</surname>
              <given-names>M.Q.</given-names>
            </name>
            <name>
              <surname>Finnerty</surname>
              <given-names>J.R.</given-names>
            </name>
            <name>
              <surname>Henry</surname>
              <given-names>J.Q.</given-names>
            </name>
          </person-group>
          <article-title>The Radiata and the evolutionary origins of the bilaterian body plan</article-title>
          <source>Mol. Phylogenet. Evol.</source>
          <year>2002</year>
          <volume>24</volume>
          <fpage>358</fpage>
          <lpage>365</lpage>
          <pub-id pub-id-type="doi">10.1016/S1055-7903(02)00208-7</pub-id>
        </citation>
      </ref>
      <ref id="B11-toxins-04-01367">
        <label>11.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Chapman</surname>
              <given-names>J.A.</given-names>
            </name>
            <name>
              <surname>Kirkness</surname>
              <given-names>E.F.</given-names>
            </name>
            <name>
              <surname>Simakov</surname>
              <given-names>O.</given-names>
            </name>
            <name>
              <surname>Hampson</surname>
              <given-names>S.E.</given-names>
            </name>
            <name>
              <surname>Mitros</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Weinmaier</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Rattei</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Balasubramanian</surname>
              <given-names>P.G.</given-names>
            </name>
            <name>
              <surname>Borman</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Busam</surname>
              <given-names>D.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The dynamic genome of <italic>Hydra</italic></article-title>
          <source>Nature</source>
          <year>2010</year>
          <volume>464</volume>
          <fpage>592</fpage>
          <lpage>596</lpage>
          <pub-id pub-id-type="doi">10.1038/nature08830</pub-id>
        </citation>
      </ref>
      <ref id="B12-toxins-04-01367">
        <label>12.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Whittington</surname>
              <given-names>C.M.</given-names>
            </name>
            <name>
              <surname>Papenfuss</surname>
              <given-names>A.T.</given-names>
            </name>
            <name>
              <surname>Bansal</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Torres</surname>
              <given-names>A.M.</given-names>
            </name>
            <name>
              <surname>Wong</surname>
              <given-names>E.S.</given-names>
            </name>
            <name>
              <surname>Deakin</surname>
              <given-names>J.E.</given-names>
            </name>
            <name>
              <surname>Graves</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Alsop</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Schatzkamer</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Kremitzki</surname>
              <given-names>C.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Defensins and the convergent evolution of platypus and reptile venom genes</article-title>
          <source>Genome Res.</source>
          <year>2008</year>
          <volume>18</volume>
          <fpage>986</fpage>
          <lpage>994</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.7149808</pub-id>
        </citation>
      </ref>
      <ref id="B13-toxins-04-01367">
        <label>13.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Koua</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Brauer</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Laht</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Kaplinski</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Favreau</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Remm</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Lisacek</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Stocklin</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>ConoDictor: A tool for prediction of conopeptide superfamilies</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2012</year>
          <volume>40</volume>
          <fpage>W238</fpage>
          <lpage>W241</lpage>
          <pub-id pub-id-type="doi">10.1093/nar/gks337</pub-id>
        </citation>
      </ref>
      <ref id="B14-toxins-04-01367">
        <label>14.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Xu</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Yu</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Fang</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Cheng</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Qian</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Lu</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Cai</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Feng</surname>
              <given-names>K.</given-names>
            </name>
          </person-group>
          <article-title>Prediction of peptidase category based on functional domain composition</article-title>
          <source>J. Proteome Res.</source>
          <year>2008</year>
          <volume>7</volume>
          <fpage>4521</fpage>
          <lpage>4524</lpage>
          <pub-id pub-id-type="doi">10.1021/pr800292w</pub-id>
        </citation>
      </ref>
      <ref id="B15-toxins-04-01367">
        <label>15.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lenffer</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Lai</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>el Mejaber</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Khan</surname>
              <given-names>A.M.</given-names>
            </name>
            <name>
              <surname>Koh</surname>
              <given-names>J.L.</given-names>
            </name>
            <name>
              <surname>Tan</surname>
              <given-names>P.T.</given-names>
            </name>
            <name>
              <surname>Seah</surname>
              <given-names>S.H.</given-names>
            </name>
            <name>
              <surname>Brusic</surname>
              <given-names>V.</given-names>
            </name>
          </person-group>
          <article-title>CysView: Protein classification based on cysteine pairing patterns</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2004</year>
          <volume>32</volume>
          <fpage>W350</fpage>
          <lpage>W355</lpage>
          <pub-id pub-id-type="doi">10.1093/nar/gkh475</pub-id>
        </citation>
      </ref>
      <ref id="B16-toxins-04-01367">
        <label>16.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Fry</surname>
              <given-names>B.G.</given-names>
            </name>
          </person-group>
          <article-title>From genome to “venome”: Molecular origin and evolution of the snake venom proteome inferred from phylogenetic analysis of toxin sequences and related body proteins</article-title>
          <source>Genome Res.</source>
          <year>2005</year>
          <volume>15</volume>
          <fpage>403</fpage>
          <lpage>420</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.3228405</pub-id>
        </citation>
      </ref>
      <ref id="B17-toxins-04-01367">
        <label>17.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Fry</surname>
              <given-names>B.G.</given-names>
            </name>
            <name>
              <surname>Vidal</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Norman</surname>
              <given-names>J.A.</given-names>
            </name>
            <name>
              <surname>Vonk</surname>
              <given-names>F.J.</given-names>
            </name>
            <name>
              <surname>Scheib</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Ramjan</surname>
              <given-names>S.F.R.</given-names>
            </name>
            <name>
              <surname>Kuruppu</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Fung</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Hedges</surname>
              <given-names>S.B.</given-names>
            </name>
            <name>
              <surname>Richardson</surname>
              <given-names>M.K.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Early evolution of the venom system in lizards and snakes</article-title>
          <source>Nature</source>
          <year>2006</year>
          <volume>439</volume>
          <fpage>584</fpage>
          <lpage>588</lpage>
          <pub-id pub-id-type="doi">10.1038/nature04328</pub-id>
          <pub-id pub-id-type="pmid">16292255</pub-id>
        </citation>
      </ref>
      <ref id="B18-toxins-04-01367">
        <label>18.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kini</surname>
              <given-names>R.M.</given-names>
            </name>
          </person-group>
          <article-title>Molecular moulds with multiple missions: Functional sites in three-finger toxins</article-title>
          <source>Clin. Exp. Pharmacol. Physiol.</source>
          <year>2002</year>
          <volume>29</volume>
          <fpage>815</fpage>
          <lpage>822</lpage>
          <pub-id pub-id-type="doi">10.1046/j.1440-1681.2002.03725.x</pub-id>
        </citation>
      </ref>
      <ref id="B19-toxins-04-01367">
        <label>19.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Miwa</surname>
              <given-names>J.M.</given-names>
            </name>
            <name>
              <surname>Ibanez-Tallon</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Crabtree</surname>
              <given-names>G.W.</given-names>
            </name>
            <name>
              <surname>Sanchez</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Sali</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Role</surname>
              <given-names>L.W.</given-names>
            </name>
            <name>
              <surname>Heintz</surname>
              <given-names>N.</given-names>
            </name>
          </person-group>
          <article-title>lynx1, an endogenous toxin-like modulator of nicotinic acetylcholine receptors in the mammalian CNS</article-title>
          <source>Neuron</source>
          <year>1999</year>
          <volume>23</volume>
          <fpage>105</fpage>
          <lpage>114</lpage>
          <pub-id pub-id-type="doi">10.1016/S0896-6273(00)80757-6</pub-id>
        </citation>
      </ref>
      <ref id="B20-toxins-04-01367">
        <label>20.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Tjiu</surname>
              <given-names>J.W.</given-names>
            </name>
            <name>
              <surname>Lin</surname>
              <given-names>P.J.</given-names>
            </name>
            <name>
              <surname>Wu</surname>
              <given-names>W.H.</given-names>
            </name>
            <name>
              <surname>Cheng</surname>
              <given-names>Y.P.</given-names>
            </name>
            <name>
              <surname>Chiu</surname>
              <given-names>H.C.</given-names>
            </name>
            <name>
              <surname>Thong</surname>
              <given-names>H.Y.</given-names>
            </name>
            <name>
              <surname>Chiang</surname>
              <given-names>B.L.</given-names>
            </name>
            <name>
              <surname>Yang</surname>
              <given-names>W.S.</given-names>
            </name>
            <name>
              <surname>Jee</surname>
              <given-names>S.H.</given-names>
            </name>
          </person-group>
          <article-title>SLURP1 mutation-impaired T-cell activation in a family with mal de Meleda</article-title>
          <source>Br. J. Dermatol.</source>
          <year>2011</year>
          <volume>164</volume>
          <fpage>47</fpage>
          <lpage>53</lpage>
          <pub-id pub-id-type="doi">10.1111/j.1365-2133.2010.10059.x</pub-id>
        </citation>
      </ref>
      <ref id="B21-toxins-04-01367">
        <label>21.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kaplan</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Morpurgo</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Linial</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Novel families of toxin-like peptides in insects and mammals: A computational approach</article-title>
          <source>J. Mol. Biol.</source>
          <year>2007</year>
          <volume>369</volume>
          <fpage>553</fpage>
          <lpage>566</lpage>
          <pub-id pub-id-type="doi">10.1016/j.jmb.2007.02.106</pub-id>
        </citation>
      </ref>
      <ref id="B22-toxins-04-01367">
        <label>22.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Naamati</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Askenazi</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Linial</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>A predictor for toxin-like proteins exposes cell modulator candidates within viral genomes</article-title>
          <source>Bioinformatics</source>
          <year>2010</year>
          <volume>26</volume>
          <fpage>i482</fpage>
          <lpage>i488</lpage>
          <pub-id pub-id-type="doi">10.1093/bioinformatics/btq375</pub-id>
        </citation>
      </ref>
      <ref id="B23-toxins-04-01367">
        <label>23.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Tirosh</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Morpurgo</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Cohen</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Linial</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Bloch</surname>
              <given-names>G.</given-names>
            </name>
          </person-group>
          <article-title>Raalin, a transcript enriched in the honey bee brain, is a remnant of genomic rearrangement in Hymenoptera</article-title>
          <source>Insect Mol. Biol.</source>
          <year>2012</year>
          <volume>21</volume>
          <fpage>305</fpage>
          <lpage>318</lpage>
          <pub-id pub-id-type="doi">10.1111/j.1365-2583.2012.01138.x</pub-id>
        </citation>
      </ref>
      <ref id="B24-toxins-04-01367">
        <label>24.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Boutet</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Lieberherr</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Tognolli</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Schneider</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Bairoch</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>UniProtKB/Swiss-Prot</article-title>
          <source>Methods Mol. Biol.</source>
          <year>2007</year>
          <volume>406</volume>
          <fpage>89</fpage>
          <lpage>112</lpage>
          <pub-id pub-id-type="pmid">18287689</pub-id>
        </citation>
      </ref>
      <ref id="B25-toxins-04-01367">
        <label>25.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Sher</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Knebel</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Bsor</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Nesher</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Tal</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Morgenstern</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Cohen</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Fishman</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Zlotkin</surname>
              <given-names>E.</given-names>
            </name>
          </person-group>
          <article-title>Toxic polypeptides of the hydra—A bioinformatic approach to cnidarian allomones</article-title>
          <source>Toxicon</source>
          <year>2005</year>
          <volume>45</volume>
          <fpage>865</fpage>
          <lpage>879</lpage>
          <pub-id pub-id-type="doi">10.1016/j.toxicon.2005.02.004</pub-id>
        </citation>
      </ref>
      <ref id="B26-toxins-04-01367">
        <label>26.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Brown</surname>
              <given-names>R.L.</given-names>
            </name>
            <name>
              <surname>Haley</surname>
              <given-names>T.L.</given-names>
            </name>
            <name>
              <surname>West</surname>
              <given-names>K.A.</given-names>
            </name>
            <name>
              <surname>Crabb</surname>
              <given-names>J.W.</given-names>
            </name>
          </person-group>
          <article-title>Pseudechetoxin: A peptide blocker of cyclic nucleotide-gated ion channels</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>1999</year>
          <volume>96</volume>
          <fpage>754</fpage>
          <lpage>759</lpage>
          <pub-id pub-id-type="doi">10.1073/pnas.96.2.754</pub-id>
        </citation>
      </ref>
      <ref id="B27-toxins-04-01367">
        <label>27.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kusserow</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Pang</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Sturm</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Hrouda</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Lentfer</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Schmidt</surname>
              <given-names>H.A.</given-names>
            </name>
            <name>
              <surname>Technau</surname>
              <given-names>U.</given-names>
            </name>
            <name>
              <surname>von Haeseler</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Hobmayer</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Martindale</surname>
              <given-names>M.Q.</given-names>
            </name>
            <name>
              <surname>Holstein</surname>
              <given-names>T.W.</given-names>
            </name>
          </person-group>
          <article-title>Unexpected complexity of the <italic>Wnt</italic> gene family in a sea anemone</article-title>
          <source>Nature</source>
          <year>2005</year>
          <volume>433</volume>
          <fpage>156</fpage>
          <lpage>160</lpage>
          <pub-id pub-id-type="doi">10.1038/nature03158</pub-id>
        </citation>
      </ref>
      <ref id="B28-toxins-04-01367">
        <label>28.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gajhede</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Petersen</surname>
              <given-names>T.N.</given-names>
            </name>
            <name>
              <surname>Henriksen</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Petersen</surname>
              <given-names>J.F.</given-names>
            </name>
            <name>
              <surname>Dauter</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Wilson</surname>
              <given-names>K.S.</given-names>
            </name>
            <name>
              <surname>Thim</surname>
              <given-names>L.</given-names>
            </name>
          </person-group>
          <article-title>Pancreatic spasmolytic polypeptide: First three-dimensional structure of a member of the mammalian trefoil family of peptides</article-title>
          <source>Structure</source>
          <year>1993</year>
          <volume>1</volume>
          <fpage>253</fpage>
          <lpage>262</lpage>
          <pub-id pub-id-type="doi">10.1016/0969-2126(93)90014-8</pub-id>
        </citation>
      </ref>
      <ref id="B29-toxins-04-01367">
        <label>29.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Abe</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Limbrick</surname>
              <given-names>A.R.</given-names>
            </name>
            <name>
              <surname>Miledi</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>Acute muscle denervation induced by beta-bungarotoxin</article-title>
          <source>Proc. R. Soc. Lond. B</source>
          <year>1976</year>
          <volume>194</volume>
          <fpage>545</fpage>
          <lpage>553</lpage>
          <pub-id pub-id-type="doi">10.1098/rspb.1976.0093</pub-id>
        </citation>
      </ref>
      <ref id="B30-toxins-04-01367">
        <label>30.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kwong</surname>
              <given-names>P.D.</given-names>
            </name>
            <name>
              <surname>McDonald</surname>
              <given-names>N.Q.</given-names>
            </name>
            <name>
              <surname>Sigler</surname>
              <given-names>P.B.</given-names>
            </name>
            <name>
              <surname>Hendrickson</surname>
              <given-names>W.A.</given-names>
            </name>
          </person-group>
          <article-title>Structure of beta 2-bungarotoxin: Potassium channel binding by Kunitz modules and targeted phospholipase action</article-title>
          <source>Structure</source>
          <year>1995</year>
          <volume>3</volume>
          <fpage>1109</fpage>
          <lpage>1119</lpage>
          <pub-id pub-id-type="doi">10.1016/S0969-2126(01)00246-5</pub-id>
        </citation>
      </ref>
      <ref id="B31-toxins-04-01367">
        <label>31.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Calvete</surname>
              <given-names>J.J.</given-names>
            </name>
            <name>
              <surname>Marcinkiewicz</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Monleon</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Esteve</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Celda</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Juarez</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Sanz</surname>
              <given-names>L.</given-names>
            </name>
          </person-group>
          <article-title>Snake venom disintegrins: Evolution of structure and function</article-title>
          <source>Toxicon</source>
          <year>2005</year>
          <volume>45</volume>
          <fpage>1063</fpage>
          <lpage>1074</lpage>
          <pub-id pub-id-type="doi">10.1016/j.toxicon.2005.02.024</pub-id>
        </citation>
      </ref>
      <ref id="B32-toxins-04-01367">
        <label>32.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Naamati</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Fromer</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Linial</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Expansion of tandem repeats in sea anemone Nematostella vectensis proteome: A source for gene novelty?</article-title>
          <source>BMC Genomics</source>
          <year>2009</year>
          <volume>10</volume>
          <fpage>593</fpage>
          <pub-id pub-id-type="doi">10.1186/1471-2164-10-593</pub-id>
        </citation>
      </ref>
      <ref id="B33-toxins-04-01367">
        <label>33.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Moran</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Weinberger</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Reitzel</surname>
              <given-names>A.M.</given-names>
            </name>
            <name>
              <surname>Sullivan</surname>
              <given-names>J.C.</given-names>
            </name>
            <name>
              <surname>Kahn</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Gordon</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Finnerty</surname>
              <given-names>J.R.</given-names>
            </name>
            <name>
              <surname>Gurevitz</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Intron retention as a posttranscriptional regulatory mechanism of neurotoxin expression at early life stages of the starlet anemone Nematostella vectensis</article-title>
          <source>J. Mol. Biol.</source>
          <year>2008</year>
          <volume>380</volume>
          <fpage>437</fpage>
          <lpage>443</lpage>
          <pub-id pub-id-type="doi">10.1016/j.jmb.2008.05.011</pub-id>
        </citation>
      </ref>
      <ref id="B34-toxins-04-01367">
        <label>34.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Rappoport</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Linial</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Viral proteins acquired from a host converge to simplified domain architectures</article-title>
          <source>PLoS Comput. Biol.</source>
          <year>2012</year>
          <volume>8</volume>
          <fpage>e1002364</fpage>
          <pub-id pub-id-type="doi">10.1371/journal.pcbi.1002364</pub-id>
        </citation>
      </ref>
      <ref id="B35-toxins-04-01367">
        <label>35.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Craik</surname>
              <given-names>D.J.</given-names>
            </name>
            <name>
              <surname>Daly</surname>
              <given-names>N.L.</given-names>
            </name>
            <name>
              <surname>Waine</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>The cystine knot motif in toxins and implications for drug design</article-title>
          <source>Toxicon</source>
          <year>2001</year>
          <volume>39</volume>
          <fpage>43</fpage>
          <lpage>60</lpage>
          <pub-id pub-id-type="doi">10.1016/S0041-0101(00)00160-4</pub-id>
        </citation>
      </ref>
      <ref id="B36-toxins-04-01367">
        <label>36.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Armishaw</surname>
              <given-names>C.J.</given-names>
            </name>
          </person-group>
          <article-title>Synthetic alpha-conotoxin mutants as probes for studying nicotinic acetylcholine receptors and in the development of novel drug leads</article-title>
          <source>Toxins</source>
          <year>2010</year>
          <volume>2</volume>
          <fpage>1471</fpage>
          <lpage>1499</lpage>
          <pub-id pub-id-type="doi">10.3390/toxins2061471</pub-id>
        </citation>
      </ref>
      <ref id="B37-toxins-04-01367">
        <label>37.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kristan</surname>
              <given-names>K.C.</given-names>
            </name>
            <name>
              <surname>Viero</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>dalla Serra</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Macek</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Anderluh</surname>
              <given-names>G.</given-names>
            </name>
          </person-group>
          <article-title>Molecular mechanism of pore formation by actinoporins</article-title>
          <source>Toxicon</source>
          <year>2009</year>
          <volume>54</volume>
          <fpage>1125</fpage>
          <lpage>1134</lpage>
          <pub-id pub-id-type="doi">10.1016/j.toxicon.2009.02.026</pub-id>
        </citation>
      </ref>
      <ref id="B38-toxins-04-01367">
        <label>38.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ovchinnikova</surname>
              <given-names>T.V.</given-names>
            </name>
            <name>
              <surname>Balandin</surname>
              <given-names>S.V.</given-names>
            </name>
            <name>
              <surname>Aleshina</surname>
              <given-names>G.M.</given-names>
            </name>
            <name>
              <surname>Tagaev</surname>
              <given-names>A.A.</given-names>
            </name>
            <name>
              <surname>Leonova</surname>
              <given-names>Y.F.</given-names>
            </name>
            <name>
              <surname>Krasnodembsky</surname>
              <given-names>E.D.</given-names>
            </name>
            <name>
              <surname>Men’shenin</surname>
              <given-names>A.V.</given-names>
            </name>
            <name>
              <surname>Kokryakov</surname>
              <given-names>V.N.</given-names>
            </name>
          </person-group>
          <article-title>Aurelin, a novel antimicrobial peptide from jellyfish Aurelia aurita with structural features of defensins and channel-blocking toxins</article-title>
          <source>Biochem. Biophys. Res. Commun.</source>
          <year>2006</year>
          <volume>348</volume>
          <fpage>514</fpage>
          <lpage>523</lpage>
          <pub-id pub-id-type="doi">10.1016/j.bbrc.2006.07.078</pub-id>
        </citation>
      </ref>
      <ref id="B39-toxins-04-01367">
        <label>39.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Biragyn</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Defensins-non-antibiotic use for vaccine development</article-title>
          <source>Curr. Protein Pept. Sci.</source>
          <year>2005</year>
          <volume>6</volume>
          <fpage>53</fpage>
          <lpage>60</lpage>
          <pub-id pub-id-type="doi">10.2174/1389203053027601</pub-id>
        </citation>
      </ref>
      <ref id="B40-toxins-04-01367">
        <label>40.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Verstrepen</surname>
              <given-names>K.J.</given-names>
            </name>
            <name>
              <surname>Jansen</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Lewitter</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Fink</surname>
              <given-names>G.R.</given-names>
            </name>
          </person-group>
          <article-title>Intragenic tandem repeats generate functional variability</article-title>
          <source>Nat. Genet.</source>
          <year>2005</year>
          <volume>37</volume>
          <fpage>986</fpage>
          <lpage>990</lpage>
          <pub-id pub-id-type="doi">10.1038/ng1618</pub-id>
        </citation>
      </ref>
      <ref id="B41-toxins-04-01367">
        <label>41.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zhang</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Gu</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>W.H.</given-names>
            </name>
          </person-group>
          <article-title>Different evolutionary patterns between young duplicate genes in the human genome</article-title>
          <source>Genome Biol.</source>
          <year>2003</year>
          <volume>4</volume>
          <fpage>R56</fpage>
          <pub-id pub-id-type="doi">10.1186/gb-2003-4-9-r56</pub-id>
        </citation>
      </ref>
      <ref id="B42-toxins-04-01367">
        <label>42.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wanke</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Zaharenko</surname>
              <given-names>A.J.</given-names>
            </name>
            <name>
              <surname>Redaelli</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Schiavon</surname>
              <given-names>E.</given-names>
            </name>
          </person-group>
          <article-title>Actions of sea anemone type 1 neurotoxins on voltage-gated sodium channel isoforms</article-title>
          <source>Toxicon</source>
          <year>2009</year>
          <volume>54</volume>
          <fpage>1102</fpage>
          <lpage>1111</lpage>
          <pub-id pub-id-type="doi">10.1016/j.toxicon.2009.04.018</pub-id>
        </citation>
      </ref>
      <ref id="B43-toxins-04-01367">
        <label>43.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Brodie</surname>
              <given-names>E.D.</given-names>
              <suffix>III</suffix>
            </name>
          </person-group>
          <article-title>Convergent evolution: Pick your poison carefully</article-title>
          <source>Curr. Biol.</source>
          <year>2010</year>
          <volume>20</volume>
          <fpage>R152</fpage>
          <lpage>R154</lpage>
          <pub-id pub-id-type="doi">10.1016/j.cub.2009.12.029</pub-id>
        </citation>
      </ref>
      <ref id="B44-toxins-04-01367">
        <label>44.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Whittington</surname>
              <given-names>C.M.</given-names>
            </name>
            <name>
              <surname>Koh</surname>
              <given-names>J.M.</given-names>
            </name>
            <name>
              <surname>Warren</surname>
              <given-names>W.C.</given-names>
            </name>
            <name>
              <surname>Papenfuss</surname>
              <given-names>A.T.</given-names>
            </name>
            <name>
              <surname>Torres</surname>
              <given-names>A.M.</given-names>
            </name>
            <name>
              <surname>Kuchel</surname>
              <given-names>P.W.</given-names>
            </name>
            <name>
              <surname>Belov</surname>
              <given-names>K.</given-names>
            </name>
          </person-group>
          <article-title>Understanding and utilising mammalian venom via a platypus venom transcriptome</article-title>
          <source>J. Proteomics</source>
          <year>2009</year>
          <volume>72</volume>
          <fpage>155</fpage>
          <lpage>164</lpage>
          <pub-id pub-id-type="doi">10.1016/j.jprot.2008.12.004</pub-id>
        </citation>
      </ref>
      <ref id="B45-toxins-04-01367">
        <label>45.</label>
        <citation citation-type="web">
          <article-title>DOE Joint Genome Institute. Nematostella vectensis genome assembly 1.0</article-title>
          <access-date>(accessed on 2 March 2012)</access-date>
          <comment>Available online: <ext-link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://genome.jgi.doe.gov/Nemve1" ext-link-type="uri">http://genome.jgi.doe.gov/Nemve1</ext-link></comment>
        </citation>
      </ref>
      <ref id="B46-toxins-04-01367">
        <label>46.</label>
        <citation citation-type="web">
          <collab>NCBI</collab>
          <article-title>Protein database from NCBI including translations from GenBank, RefSeq, TPA, SwissProt, PIR, PRF, and PDB</article-title>
          <access-date>(accessed on 12 May 2010)</access-date>
          <comment>Available online: <ext-link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://www.ncbi.nlm.nih.gov/protein" ext-link-type="uri">http://www.ncbi.nlm.nih.gov/protein</ext-link></comment>
        </citation>
      </ref>
      <ref id="B47-toxins-04-01367">
        <label>47.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Naamati</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Askenazi</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Linial</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>ClanTox: A classifier of short animal toxins</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2009</year>
          <volume>37</volume>
          <fpage>W363</fpage>
          <lpage>W368</lpage>
          <pub-id pub-id-type="doi">10.1093/nar/gkp299</pub-id>
        </citation>
      </ref>
      <ref id="B48-toxins-04-01367">
        <label>48.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Petersen</surname>
              <given-names>T.N.</given-names>
            </name>
            <name>
              <surname>Brunak</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>von Heijne</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Nielsen</surname>
              <given-names>H.</given-names>
            </name>
          </person-group>
          <article-title>SignalP 4.0: Discriminating signal peptides from transmembrane regions</article-title>
          <source>Nat. Methods</source>
          <year>2011</year>
          <volume>8</volume>
          <fpage>785</fpage>
          <lpage>786</lpage>
          <pub-id pub-id-type="doi">10.1038/nmeth.1701</pub-id>
        </citation>
      </ref>
      <ref id="B49-toxins-04-01367">
        <label>49.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Hildebrand</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Remmert</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Biegert</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Söding</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>Fast and accurate automatic structure prediction with HHpred</article-title>
          <source>Proteins</source>
          <year>2009</year>
          <volume>77</volume>
          <fpage>128</fpage>
          <lpage>132</lpage>
          <pub-id pub-id-type="doi">10.1002/prot.22499</pub-id>
        </citation>
      </ref>
      <ref id="B50-toxins-04-01367">
        <label>50.</label>
        <citation citation-type="web">
          <collab>ClanTox</collab>
          <article-title>Predictor for Toxin-like proteins</article-title>
          <access-date>(accessed on 31 December 2008)</access-date>
          <comment>Available online: <ext-link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="www.clantox.cs.huji.ac.il" ext-link-type="uri">www.clantox.cs.huji.ac.il</ext-link></comment>
        </citation>
      </ref>
      <ref id="B51-toxins-04-01367">
        <label>51.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Newman</surname>
              <given-names>A.M.</given-names>
            </name>
            <name>
              <surname>Cooper</surname>
              <given-names>J.B.</given-names>
            </name>
          </person-group>
          <article-title>XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences</article-title>
          <source>BMC Bioinformatics</source>
          <year>2007</year>
          <volume>8</volume>
          <fpage>382</fpage>
          <pub-id pub-id-type="doi">10.1186/1471-2105-8-382</pub-id>
        </citation>
      </ref>
    </ref-list>
  </back>
</article>
