<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">genes</journal-id>
      <journal-title>Genes</journal-title>
      <abbrev-journal-title abbrev-type="publisher">Genes</abbrev-journal-title>
      <abbrev-journal-title abbrev-type="pubmed">Genes</abbrev-journal-title>
      <issn pub-type="epub">2073-4425</issn>
      <publisher>
        <publisher-name>MDPI</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.3390/genes3010138</article-id>
      <article-id pub-id-type="publisher-id">genes-03-00138</article-id>
      <article-categories>
        <subj-group>
          <subject>Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>The Genetics of Symbiotic Nitrogen Fixation: Comparative Genomics of 14 Rhizobia Strains by Resolution of Protein Clusters </article-title>
      </title-group>
      
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Black</surname>
            <given-names>Michael</given-names>
          </name>
          <xref rid="af1-genes-03-00138" ref-type="aff">1</xref>
          <xref rid="c1-genes-03-00138" ref-type="corresp">*</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Moolhuijzen</surname>
            <given-names>Paula</given-names>
          </name>
          <xref rid="af1-genes-03-00138" ref-type="aff">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Chapman</surname>
            <given-names>Brett</given-names>
          </name>
          <xref rid="af1-genes-03-00138" ref-type="aff">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Barrero</surname>
            <given-names>Roberto</given-names>
          </name>
          <xref rid="af1-genes-03-00138" ref-type="aff">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Howieson</surname>
            <given-names>John</given-names>
          </name>
          <xref rid="af2-genes-03-00138" ref-type="aff">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Hungria</surname>
            <given-names>Mariangela</given-names>
          </name>
          <xref rid="af3-genes-03-00138" ref-type="aff">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Bellgard</surname>
            <given-names>Matthew</given-names>
          </name>
          <xref rid="af1-genes-03-00138" ref-type="aff">1</xref>
          <xref rid="c1-genes-03-00138" ref-type="corresp">*</xref>
        </contrib>
      </contrib-group>
      <aff id="af1-genes-03-00138"><label>1 </label>Centre for Comparative Genomics, Murdoch University, South Street, Murdoch, Perth, WA 6150, Australia; E-Mails: <email>pmoolhuijzen@ccg.murdoch.edu.au</email> (P.M.); <email>bchapman@ccg.murdoch.edu.au</email> (B.C.); <email>rbarrero@ccg.murdoch.edu.au</email> (R.B.);</aff>
      <aff id="af2-genes-03-00138"><label>2 </label>Centre for Rhizobium Studies, Murdoch University, South Street, Murdoch, Perth, WA 6150, Australia; E-Mail: <email>j.howieson@murdoch.edu.au</email> </aff>
      <aff id="af3-genes-03-00138"><label>3 </label>Embrapa Soja, Cx. Postal 231, Londrina, Parana, 86001-970, Brazil; E-Mail: <email>hungria@cnpso.embrapa.br</email></aff>
      <author-notes>
        <corresp id="c1-genes-03-00138"><label>* </label>Author  to whom correspondence should be addressed; E-Mail: <email>mblack@ccg.murdoch.edu.au</email>; E-Mail: <email>mbellgard@ccg.murdoch.edu.au</email>.</corresp>
      </author-notes>
      <pub-date pub-type="epub">
        <day>16</day>
        <month>02</month>
        <year>2012</year>
      </pub-date>
      <pub-date pub-type="collection"><month>03</month>
        <year>2012</year>
      </pub-date>
      <volume>3</volume>
      <issue>1</issue>
      <fpage>138</fpage>
      <lpage>166</lpage>
      <history>
        <date date-type="received">
          <day>24</day>
          <month>01</month>
          <year>2012</year>
        </date>
        <date date-type="rev-recd">
          <day>10</day>
          <month>02</month>
          <year>2012</year>
        </date>
        <date date-type="accepted">
          <day>13</day>
          <month>02</month>
          <year>2012</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>© 2012 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
        <copyright-year>2012</copyright-year>
        <license xmlns:xlink="http://www.w3.org/1999/xlink" license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/">
          <p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p>
        </license>
      </permissions>
      <abstract>
        <p>The symbiotic relationship between legumes and nitrogen fixing bacteria is critical for agriculture, as it may have profound impacts on lowering costs for farmers, on land sustainability, on soil quality, and on mitigation of greenhouse gas emissions. However, despite the importance of the symbioses to the global nitrogen cycling balance, very few rhizobial genomes have been sequenced so far, although there are some ongoing efforts in sequencing elite strains. In this study, the genomes of fourteen selected strains of the order <italic>Rhizobiales</italic>, all previously fully sequenced and annotated, were compared to assess differences between the strains and to investigate the feasibility of defining a core ‘symbiome’—the essential genes required by all rhizobia for nodulation and nitrogen fixation. Comparison of these whole genomes has revealed valuable information, such as several events of lateral gene transfer, particularly in the symbiotic plasmids and genomic islands that have contributed to a better understanding of the evolution of contrasting symbioses. Unique genes were also identified, as well as omissions of symbiotic genes that were expected to be found. Protein comparisons have also allowed the identification of a variety of similarities and differences in several groups of genes, including those involved in nodulation, nitrogen fixation, production of exopolysaccharides, Type I to Type VI secretion systems, among others, and identifying some key genes that could be related to host specificity and/or a better saprophytic ability. However, while several significant differences in the type and number of proteins were observed, the evidence presented suggests no simple core symbiome exists. A more abstract systems biology concept of nitrogen fixing symbiosis may be required. The results have also highlighted that comparative genomics represents a valuable tool for capturing specificities and generalities of each genome.</p>
      </abstract>
      <kwd-group>
        <kwd>nitrogen fixation</kwd>
        <kwd>symbiosis</kwd>
        <kwd>
          <italic>Rhizobiales</italic>
        </kwd>
        <kwd>genome</kwd>
        <kwd>comparative genomics</kwd>
        <kwd>nodulation genes</kwd>
        <kwd>secretion systems</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec sec-type="intro">
      <title>1. Introduction</title>
      <p>The symbiotic relationship between legumes and nitrogen fixing bacteria is critical for agriculture, as it may have profound impacts on lowering costs for farmers, on land sustainability, on soil quality, and on mitigation of greenhouse gas emissions. The major importance of the symbioses is usually attributed to the decrease in the use of costly nitrogen based fertilizers, but the rehabilitation of infertile, environmentally stressed soils should also be highlighted. With an increasing global demand for food production, combined with the need to reduce carbon emissions, the reliance on biological nitrogen fixation as an alternative to nitrogen fertilizers is forecast to increase [<xref ref-type="bibr" rid="B1-genes-03-00138">1</xref>].</p>
      <p>Symbiotic nitrogen fixing bacteria are represented by a phylogenetically disparate class of alpha- and beta-proteobacteria—usually collectively termed rhizobia—that have achieved the function of fixing atmospheric nitrogen (N<sub>2</sub>) in symbiosis with legumes. The majority of the symbiotic speciesare represented in the alpha-proteobacteria order <italic>Rhizobiales</italic>, which, amongst many others, contain the agriculturally important nitrogen fixing genera of <italic>Rhizobium</italic>, <italic>Bradyrhizobium</italic>, <italic>Mesorhizobium</italic>, <italic>Sinorhizobium</italic> (=<italic>Ensifer</italic>) and <italic>Azorhizobium</italic>. One impediment to the broader use of rhizobia in agriculture is the production of compatible inoculants. There are substantial host, strain and environmental specificities that limit the use of potentially important legume fodder and crops as alternatives to nitrogen fertilizers [<xref ref-type="bibr" rid="B2-genes-03-00138">2</xref>,<xref ref-type="bibr" rid="B3-genes-03-00138">3</xref>]. </p>
      <p>Despite the importance of nitrogen fixation to the global nitrogen balance, very few rhizobial strains have been completely sequenced, representing less than 1% of the complete bacterial genomes available today. The scenario may slightly improve in the following years, as there are large-scale genome sequencing projects in progress. Furthermore, very few genomic studies comparing nitrogen fixing bacteria have been performed [<xref ref-type="bibr" rid="B4-genes-03-00138">4</xref>,<xref ref-type="bibr" rid="B5-genes-03-00138">5</xref>], but the results obtained have indicated that comparative genomics represents a promising tool to reveal bacterial specificities. Furthermore, advances in bioinformatics tools will reveal increasing details in the comparison of genomes.</p>
      <p>In this present study, fourteen rhizobial genomes were selected based on maximizing geographical, environmental and host range, spanning most Vavilov centers of origin [<xref ref-type="bibr" rid="B6-genes-03-00138">6</xref>]. This comparison was undertaken to attempt to establish a reference ‘symbiome’, meaning a set of genes critical to successful symbiosis and subsequent nitrogen fixation, as well as to obtain a better understanding of the strategies adopted by disparate strains to maintain their symbiotic apparatus. Different tools were evaluated in the comparison, and the results may also contribute to a further exploration in future large scale comparative genomic studies.</p>
    </sec>
    <sec sec-type="results">
      <title>2. Results and Discussion</title>
      <sec>
        <title>2.1. Key Characteristics of the Fourteen Strains of the Order <italic>Rhizobiales</italic></title>
        <p>The aim of this study was to investigate the possibility of defining a common ‘symbiome’ amongst selected nitrogen fixing strains of the order <italic>Rhizobiales</italic>. The fourteen genomes selected represent a broad range, from classic legume symbionts to <italic>Mesorhizobium</italic> sp. BNC1, a strain that has seemingly lost the ability to fix nitrogen and form symbiotic relationships. In addition, the selected organisms represent both root nodulating (e.g., <italic>Bradyrhizobium japonicum</italic> USDA 110, symbiont of soybean—<italic>Glycine max</italic>, <italic>Sinorhizobium meliloti</italic> 1021, symbiont of alfalfa—<italic>Medicago sativa</italic>)) and stem nodulating (e.g., <italic>Azorhizobium caulinodans</italic> strain ORS571, symbiont of sesbania—<italic>Sesbania rostrata</italic>) microsymbionts. All available nucleotide and protein FASTA files, as well as GENBANK files available were utilized (<xref ref-type="table" rid="genes-03-00138-t001">Table 1</xref>). The fourteen genomes sequenced and their hosts are listed in <xref ref-type="table" rid="genes-03-00138-t002">Table 2</xref>, and the strains will be referred to by their KEGG organism code.</p>
        
        <sec>
          <title>2.1.1. Azorhizobium</title>
          <p><italic>A. caulinodans</italic> (<italic>azc</italic>) is a nitrogen fixing member of the <italic>Xanthobacteraceae</italic> family. It is primarily a stem nodulator of the African legume sesbania. It is thought to have originally been a non-nitrogen fixer that developed the ability to fix N<sub>2</sub> entirely through lateral gene transfer from another unknown species [<xref ref-type="bibr" rid="B7-genes-03-00138">7</xref>]. Unlike the other thirteen strains from this study, <italic>azc</italic> can reduce di-nitrogen both in symbiosis and in the free living stage, and as a result can be grown on nitrogen free medium, a key defining property. It has a relatively small genome—5.37 MB—in comparison to other rhizobia and it is the most taxonomically distant species in this study [<xref ref-type="bibr" rid="B7-genes-03-00138">7</xref>,<xref ref-type="bibr" rid="B8-genes-03-00138">8</xref>].</p>
        </sec>
        <sec>
          <title>2.1.2. Bradyrhizobium</title>
          <p>Three <italic>Bradyrhizobium</italic> genomes have been selected, <italic>B. japonicum</italic> USDA 110 (<italic>bja</italic>), <italic>Bradyrhizobium</italic> sp. BTAi1 (<italic>bbt</italic>), and <italic>Bradyrhizobium</italic> ORS278 (<italic>bra</italic>). <italic>Bradyrhizobium</italic> is primarily distinguished from the genera <italic>Mesorhizobium, Rhizobium</italic> and <italic>Sinorhizobium</italic> by the slower growth rate (doubling time) of at least 8 h.</p>
          <p>The soybean root nodulator <italic>bja</italic> has the largest genome of this study, 9.1 MB, in a unique chromosome [<xref ref-type="bibr" rid="B9-genes-03-00138">9</xref>]. The main feature of the more recently obtained genomes of <italic>Bradyrhizobium</italic> strains BTAi1 (<italic>bbt</italic>) and ORS278 (<italic>bra</italic>) is that they are photosynthetic stem nodulators of tropical <italic>Aeschynomene</italic> species, and for the first time described as lacking common nodulation genes <italic>nodA</italic>, <italic>nodB</italic> and <italic>nodC.</italic> These genes are traditionally involved in the construction of the Nod factor backbone [<xref ref-type="bibr" rid="B10-genes-03-00138">10</xref>,<xref ref-type="bibr" rid="B11-genes-03-00138">11</xref>]. Both strains also have large genomes, similar to <italic>bja</italic> (<xref ref-type="table" rid="genes-03-00138-t002">Table 2</xref>).</p>
          <table-wrap id="genes-03-00138-t001" position="float">
          <object-id pub-id-type="pii">genes-03-00138-t001_Table 1</object-id>
          <label>Table 1</label>
          <caption>
            <p>REFSEQ identifiers for the fourteen <italic>Rhizobiales</italic> genomes.</p>
          </caption>
          <table>
            <thead>
              <tr>
                <th align="center" valign="middle"><italic>Rhizobium</italic> species</th>
                <th align="center" valign="middle">Chr</th>
                <th align="center" valign="middle">Plasmid1</th>
                <th align="center" valign="middle">Plasmid2</th>
                <th align="center" valign="middle">Plasmid3</th>
                <th align="center" valign="middle">Plasmid4</th>
                <th align="center" valign="middle">Plasmid5</th>
                <th align="center" valign="middle">Plasmid6</th>
              </tr>
            </thead>
            <tbody>
              <tr>
                <td align="left" valign="middle"><italic>Azorhizobium caulinodans</italic> ORS 571</td>
                <td align="center" valign="middle">NC_009937</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Bradyrhizobium japonicum</italic> USDA 110</td>
                <td align="center" valign="middle">NC_004463</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Bradyrhizobium</italic> sp. BTAi1</td>
                <td align="center" valign="middle">NC_009485</td>
                <td align="center" valign="middle">NC_009475</td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Bradyrhizobium</italic> sp. ORS 278</td>
                <td align="center" valign="middle">NC_009445</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">-</td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Mesorhizobium loti</italic> MAFF303099</td>
                <td align="center" valign="middle">NC_002678</td>
                <td align="center" valign="middle">NC_002679</td>
                <td align="center" valign="middle">NC_002682</td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Mesorhizobium</italic> sp. BNC1 (<italic>Chelativorans</italic> sp. BNC1)</td>
                <td align="center" valign="middle">NC_008254</td>
                <td align="center" valign="middle">NC_008242</td>
                <td align="center" valign="middle">NC_008243</td>
                <td align="center" valign="middle">NC_008244</td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Rhizobium etli</italic> CFN 42</td>
                <td align="center" valign="middle">NC_007761</td>
                <td align="center" valign="middle">NC_007762</td>
                <td align="center" valign="middle">NC_007763</td>
                <td align="center" valign="middle">NC_007764</td>
                <td align="center" valign="middle">NC_004041</td>
                <td align="center" valign="middle">NC_007765</td>
                <td align="center" valign="middle">NC_007766</td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Rhizobium etli</italic> CIAT 652</td>
                <td align="center" valign="middle">NC_010994</td>
                <td align="center" valign="middle">NC_010998</td>
                <td align="center" valign="middle">NC_010996</td>
                <td align="center" valign="middle">NC_010997</td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Rhizobium leguminosarum bv. trifolii</italic> WSM1325</td>
                <td align="center" valign="middle">NC_012850</td>
                <td align="center" valign="middle">NC_012848</td>
                <td align="center" valign="middle">NC_012858</td>
                <td align="center" valign="middle">NC_012853</td>
                <td align="center" valign="middle">NC_012852</td>
                <td align="center" valign="middle">NC_012854</td>
                <td align="center" valign="middle"> </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Rhizobium leguminosarum bv. trifolii</italic> WSM2304</td>
                <td align="center" valign="middle">NC_011369</td>
                <td align="center" valign="middle">NC_011368</td>
                <td align="center" valign="middle">NC_011366</td>
                <td align="center" valign="middle">NC_011370</td>
                <td align="center" valign="middle">NC_011371</td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Rhizobium leguminosarum bv. viciae</italic> 3841</td>
                <td align="center" valign="middle">NC_008380</td>
                <td align="center" valign="middle">NC_008382</td>
                <td align="center" valign="middle">NC_008383</td>
                <td align="center" valign="middle">NC_008379</td>
                <td align="center" valign="middle">NC_008381</td>
                <td align="center" valign="middle">NC_008384</td>
                <td align="center" valign="middle">NC_008378</td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Sinorhizobium fredii</italic> sp. NGR 234</td>
                <td align="center" valign="middle">NC_012587</td>
                <td align="center" valign="middle">NC_000914</td>
                <td align="center" valign="middle">NC_012586</td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Sinorhizobium medicae</italic> WSM419</td>
                <td align="center" valign="middle">NC_009636</td>
                <td align="center" valign="middle">NC_009620</td>
                <td align="center" valign="middle">NC_009621</td>
                <td align="center" valign="middle">NC_009622</td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Sinorhizobium meliloti</italic> 1021</td>
                <td align="center" valign="middle">NC_003047</td>
                <td align="center" valign="middle">NC_003037</td>
                <td align="center" valign="middle">NC_003078</td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
                <td align="center" valign="middle"> </td>
              </tr>
              
            </tbody>
  </table>
        </table-wrap>
        <table-wrap id="genes-03-00138-t002" position="float">
          <object-id pub-id-type="pii">genes-03-00138-t002_Table 2</object-id>
          <label>Table 2</label>
          <caption>
            <p>Summary statistics of the fourteen genomes of <italic>Rhizobiales</italic>.</p>
          </caption>
          <table rules="rows">
            <thead>
              <tr>
                <th align="center" valign="middle"><italic>Rhizobiales</italic> species</th>
                <th align="center" valign="middle">Species Code</th>
                <th align="center" valign="middle">No. of Plasmids</th>
                <th align="center" valign="middle">Total Genome Length (nucleotides)</th>
                <th align="center" valign="middle">Protein Coding Genes</th>
                <th align="center" valign="middle">tRNA genes</th>
                <th align="center" valign="middle">Pseudo genes</th>
                <th align="center" valign="middle">GC Content (%)</th>
                <th align="center" valign="middle">Proportion of Genome that is Gene Coding (%)</th>
                <th align="center" valign="middle">Host (Scientific/common name)</th>
              </tr>
            </thead>
            <tbody>
              <tr>
                <td align="left" valign="middle"><italic>Azorhizobium caulinodans</italic> ORS 571</td>
                <td align="center" valign="middle">
                  <italic>azc</italic>
                </td>
                <td align="center" valign="middle">0</td>
                <td align="center" valign="middle">5,369,772</td>
                <td align="center" valign="middle">4717</td>
                <td align="center" valign="middle">63</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">67</td>
                <td align="center" valign="middle">89</td>
                <td align="left" valign="middle">
                  <italic>Sesbania rostrata (sesbania)</italic>
                </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Bradyrhizobium japonicum</italic> USDA 110</td>
                <td align="center" valign="middle">
                  <italic>bja</italic>
                </td>
                <td align="center" valign="middle">0</td>
                <td align="center" valign="middle">9,105,828</td>
                <td align="center" valign="middle">8317</td>
                <td align="center" valign="middle">56</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">64</td>
                <td align="center" valign="middle">86</td>
                <td align="left" valign="middle">
                  <italic>Glycine max (soybean)</italic>
                </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Bradyrhizobium</italic> sp. BTAi1</td>
                <td align="center" valign="middle">
                  <italic>bbt</italic>
                </td>
                <td align="center" valign="middle">1</td>
                <td align="center" valign="middle">8,493,513</td>
                <td align="center" valign="middle">7621</td>
                <td align="center" valign="middle">70</td>
                <td align="center" valign="middle">90</td>
                <td align="center" valign="middle">64</td>
                <td align="center" valign="middle">85</td>
                <td align="left" valign="middle">
                  <italic>Aeschynomene indica (Indian joint-vetch)</italic>
                </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Bradyrhizobium</italic> sp. ORS 278</td>
                <td align="center" valign="middle">
                  <italic>bra</italic>
                </td>
                <td align="center" valign="middle">0</td>
                <td align="center" valign="middle">7,456,587</td>
                <td align="center" valign="middle">6717</td>
                <td align="center" valign="middle">66</td>
                <td align="center" valign="middle">35</td>
                <td align="center" valign="middle">65</td>
                <td align="center" valign="middle">85</td>
                <td align="left" valign="middle">
                  <italic>Aeschynomene sensitiva (sensitive joint-vetch)</italic>
                </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Mesorhizobium loti</italic> MAFF303099</td>
                <td align="center" valign="middle">
                  <italic>mlo</italic>
                </td>
                <td align="center" valign="middle">2</td>
                <td align="center" valign="middle">7,596,297</td>
                <td align="center" valign="middle">7272</td>
                <td align="center" valign="middle">57</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">62</td>
                <td align="center" valign="middle">86</td>
                <td align="left" valign="middle">
                  <italic>Lotus</italic> sp., <italic>including Lotus japonicas (trifoils, vetches)</italic>
                </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Mesorhizobium</italic> sp. BNC1</td>
                <td align="center" valign="middle">
                  <italic>mes</italic>
                </td>
                <td align="center" valign="middle">3</td>
                <td align="center" valign="middle">4,935,185</td>
                <td align="center" valign="middle">4543</td>
                <td align="center" valign="middle">68</td>
                <td align="center" valign="middle">40</td>
                <td align="center" valign="middle">61</td>
                <td align="center" valign="middle">89</td>
                <td align="left" valign="middle">
                  <italic>non-symbiotic</italic>
                </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Rhizobium etli</italic> CFN 42</td>
                <td align="center" valign="middle">
                  <italic>ret</italic>
                </td>
                <td align="center" valign="middle">6</td>
                <td align="center" valign="middle">6,530,228</td>
                <td align="center" valign="middle">5963</td>
                <td align="center" valign="middle">59</td>
                <td align="center" valign="middle">32</td>
                <td align="center" valign="middle">61</td>
                <td align="center" valign="middle">86</td>
                <td align="left" valign="middle">
                  <italic>Phaseolus vulgaris (common bean)</italic>
                </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Rhizobium etli</italic> CIAT 652</td>
                <td align="center" valign="middle">
                  <italic>rec</italic>
                </td>
                <td align="center" valign="middle">3</td>
                <td align="center" valign="middle">6,448,048</td>
                <td align="center" valign="middle">6056</td>
                <td align="center" valign="middle">60</td>
                <td align="center" valign="middle">15</td>
                <td align="center" valign="middle">61</td>
                <td align="center" valign="middle">86</td>
                <td align="left" valign="middle">
                  <italic>Phaseolus vulgaris (common bean)</italic>
                </td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Rhizobium leguminosarum bv. trifolii</italic> WSM1325</td>
                <td align="center" valign="middle">
                  <italic>rlg</italic>
                </td>
                <td align="center" valign="middle">5</td>
                <td align="center" valign="middle">7,418,122</td>
                <td align="center" valign="middle">7001</td>
                <td align="center" valign="middle">63</td>
                <td align="center" valign="middle">75</td>
                <td align="center" valign="middle">61</td>
                <td align="center" valign="middle">86</td>
                <td align="left" valign="middle"><italic>Trifolli pratense and other Mediterraneum Trifollium</italic> (clovers),</td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Rhizobium leguminosarum bv. trifolii</italic> WSM2304</td>
                <td align="center" valign="middle">
                  <italic>rlt</italic>
                </td>
                <td align="center" valign="middle">4</td>
                <td align="center" valign="middle">6,872,702</td>
                <td align="center" valign="middle">6415</td>
                <td align="center" valign="middle">65</td>
                <td align="center" valign="middle">45</td>
                <td align="center" valign="middle">61</td>
                <td align="center" valign="middle">86</td>
                <td align="left" valign="middle"><italic>Trifolium polymorphum</italic> from Uruguay (clover)</td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Rhizobium leguminosarum bv. viciae</italic> 3841</td>
                <td align="center" valign="middle">
                  <italic>rle</italic>
                </td>
                <td align="center" valign="middle">6</td>
                <td align="center" valign="middle">7,751,309</td>
                <td align="center" valign="middle">7143</td>
                <td align="center" valign="middle">61</td>
                <td align="center" valign="middle">37</td>
                <td align="center" valign="middle">61</td>
                <td align="center" valign="middle">86</td>
                <td align="left" valign="middle">Tribe Viciae –<italic>Vicia, Pisum, Lathyrus, Lens</italic> (vetchs, peas, lathyrus, lentils)</td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Sinorhizobium fredii</italic> NGR 234</td>
                <td align="center" valign="middle">
                  <italic>rhi</italic>
                </td>
                <td align="center" valign="middle">2</td>
                <td align="center" valign="middle">6,891,900</td>
                <td align="center" valign="middle">6363</td>
                <td align="center" valign="middle">70</td>
                <td align="center" valign="middle">-</td>
                <td align="center" valign="middle">63</td>
                <td align="center" valign="middle">87</td>
                <td align="left" valign="middle">112 legume species and the non-legume <italic>Parasponia</italic> (family Ulmaceae)</td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Sinorhizobium medicae</italic> WSM419</td>
                <td align="center" valign="middle">
                  <italic>smd</italic>
                </td>
                <td align="center" valign="middle">3</td>
                <td align="center" valign="middle">6,817,576</td>
                <td align="center" valign="middle">6213</td>
                <td align="center" valign="middle">63</td>
                <td align="center" valign="middle">43</td>
                <td align="center" valign="middle">61</td>
                <td align="center" valign="middle">87</td>
                <td align="left" valign="middle"><italic>Medicago</italic> spp.</td>
              </tr>
              <tr>
                <td align="left" valign="middle"><italic>Sinorhizobium meliloti</italic> 1021</td>
                <td align="center" valign="middle">
                  <italic>sme</italic>
                </td>
                <td align="center" valign="middle">2</td>
                <td align="center" valign="middle">6,691,694</td>
                <td align="center" valign="middle">6218</td>
                <td align="center" valign="middle">66</td>
                <td align="center" valign="middle">4</td>
                <td align="center" valign="middle">62</td>
                <td align="center" valign="middle">86</td>
                <td align="left" valign="middle">
                  <italic>Medicago, Melilotus, Trigonella (alfalfa)</italic>
                </td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        </sec>
        <sec>
          <title>2.1.3. Mesorhizobium</title>
          <p>As the suffix <italic>Meso</italic> suggests, species of this genus show growth rates intermediate to <italic>Bradyrhizobiu</italic><italic>m</italic> (&gt;8 h) and <italic>Rhizobium</italic>/<italic>Sinorhizobium</italic> (&lt;6 h). <italic>Mesorhizobium loti</italic> MAFF303099 (<italic>mlo</italic>) is a root nodulator of <italic>Lotus japonicum</italic>, with one chromosome and two plasmids; as in other <italic>Mesorhizobium</italic> and <italic>Bradyrhizobium</italic>, <italic>mlo</italic> has a symbiotic island within the chromosome that contains all key genes for nodulation and nitrogen fixation [<xref ref-type="bibr" rid="B12-genes-03-00138">12</xref>].</p>
          <p><italic>Mesorhizobium</italic> sp. BNC1 (<italic>mes</italic>), formerly known as <italic>Agrobacterium</italic> sp. BNC1, and alternately named <italic>Chelativorans</italic> sp. BNC1, is the functional outlier of the fourteen species, being asymbiotic. It was isolated from a mixed-culture enriched from sewage using the chelating agent EDTA as the sole carbon and nitrogen source [<xref ref-type="bibr" rid="B13-genes-03-00138">13</xref>]. For this study, <italic>mes</italic> is utilized as an outlier for the basis of “symbiome” definition and investigation.</p>
        </sec>
        <sec>
          <title>2.1.4. Rhizobium</title>
          <p>The genus <italic>Rhizobium</italic>, originally defining all bacterium with the ability to nodulate legumes [<xref ref-type="bibr" rid="B14-genes-03-00138">14</xref>], has undergone multiple redefinitions and now encompasses a variety of fast growing nitrogen fixers including some former <italic>Agrobacterium</italic> and <italic>Sinorhizobium</italic> (<italic>Ensifer)</italic> species and the genus <italic>Allorhizobium</italic>. In this study, five <italic>Rhizobium</italic> genomes are investigated: <italic>R. etli</italic> CFN 42 (<italic>ret</italic>), <italic>R. etli</italic> CIAT 652 (<italic>rec</italic>), <italic>R. leguminosarum</italic> bv. <italic>trifolii</italic> WSM1325 (<italic>rlg</italic>), <italic>R. leguminosarum</italic> bv. <italic>trifolii</italic> WSM2304 (<italic>rlt</italic>) and <italic>R. leguminosarum</italic> bv. <italic>viciae</italic> 3841 (<italic>rle</italic>). These five <italic>Rhizobium</italic> strains have different origins and hosts. The <italic>R.</italic> <italic>etli</italic> strains nodulate common bean (<italic>Phaselous vulgaris</italic> and were originally isolated in Central America) [<xref ref-type="bibr" rid="B15-genes-03-00138">15</xref>,<xref ref-type="bibr" rid="B16-genes-03-00138">16</xref>,<xref ref-type="bibr" rid="B17-genes-03-00138">17</xref>,<xref ref-type="bibr" rid="B18-genes-03-00138">18</xref>]. In contrast, the <italic>R.</italic> <italic>leguminosarum</italic> strains nodulate temperate legumes: bv. trifolii <italic>rlg</italic> and <italic>rlt</italic> are symbionts of clovers (<italic>Trifolium</italic>), although individually matched to annual (<italic>rlg</italic>) and perennial (<italic>rlt</italic>) species for N fixation [<xref ref-type="bibr" rid="B19-genes-03-00138">19</xref>,<xref ref-type="bibr" rid="B20-genes-03-00138">20</xref>], Bv. viciae <italic>rle</italic> has a broader host range, nodulating legumes within the tribe <italic>Viciae</italic>, including <italic>Pisum, Vicia, Lathyrus</italic> and <italic>Lens</italic> [<xref ref-type="bibr" rid="B21-genes-03-00138">21</xref>].</p>
        </sec>
        <sec>
          <title>2.1.5. Sinorhizobium</title>
          <p>The genus <italic>Sinorhizobium</italic> is subject of some controversy. Members of this genus were originally classified as <italic>Rhizobium</italic> (e.g., <italic>Rhizobium meliloti</italic>) and the new genus was proposed in 1988 [<xref ref-type="bibr" rid="B22-genes-03-00138">22</xref>]. More recently, the genus has been reclassified as <italic>Ensifer</italic> [<xref ref-type="bibr" rid="B23-genes-03-00138">23</xref>,<xref ref-type="bibr" rid="B24-genes-03-00138">24</xref>], but in general the new nomenclature was not broadly accepted by rhizobiologists. <italic>Sinorhizobium</italic> and <italic>Rhizobium</italic> have similar morpho-physiological properties, and only the 16S rRNA taxonomy resolves the genus as a distinct clade [<xref ref-type="bibr" rid="B25-genes-03-00138">25</xref>].</p>
          <p>Three <italic>Sinorhizobium</italic> were chosen for this study, <italic>S. meliloti</italic> 1021 (<italic>sme</italic>), <italic>S.</italic> <italic>medicae</italic> WSM419 (<italic>smd</italic>) and <italic>Sinorhizobium</italic> sp. strain NGR 234 (<italic>rhi</italic>), also denominated as <italic>S. fredii</italic> NGR 234 <italic>and Rhizobium</italic> sp. NGR 234 [<xref ref-type="bibr" rid="B26-genes-03-00138">26</xref>,<xref ref-type="bibr" rid="B27-genes-03-00138">27</xref>,<xref ref-type="bibr" rid="B28-genes-03-00138">28</xref>]. Both <italic>sme</italic> and <italic>smd</italic> nodulate temperate legumes of the genus <italic>Medicago</italic>. It has been established that <italic>smd</italic> has greater acid tolerance, and is a more effective nodulator of <italic>Medicago truncatula</italic> than <italic>sme</italic> [<xref ref-type="bibr" rid="B29-genes-03-00138">29</xref>,<xref ref-type="bibr" rid="B30-genes-03-00138">30</xref>]. It is also hypothesized that <italic>sme</italic> and <italic>smd</italic> evolved in association with hosts adapted to different edaphic conditions [<xref ref-type="bibr" rid="B31-genes-03-00138">31</xref>]. The tropical strain NGR 234 (<italic>rhi)</italic> is probably the most intriguing rhizobia isolated so far. Originally isolated in Papua New Guinea and labeled <italic>Rhizobium</italic> sp. NGR 234, it has since been found to be highly promiscuous, capable of nodulating at least 112 different legume genera, from most Vavilov centers of origin, in addition to the non-legume <italic>Parasponia</italic> [<xref ref-type="bibr" rid="B32-genes-03-00138">32</xref>,<xref ref-type="bibr" rid="B33-genes-03-00138">33</xref>,<xref ref-type="bibr" rid="B34-genes-03-00138">34</xref>]. </p>
        </sec>
      </sec>
      <sec>
        <title>2.2. Phylogeny and Taxonomy of Rhizobiales</title>
        <sec>
          <title>2.2.1. 16S rRNA Taxonomy</title>
          <p>The phylogeny and taxonomy of nitrogen fixing <italic>Rhizobiales</italic> is in a state of flux. Confusion over the status of <italic>Sinorhizobium</italic> [<xref ref-type="bibr" rid="B22-genes-03-00138">22</xref>,<xref ref-type="bibr" rid="B23-genes-03-00138">23</xref>,<xref ref-type="bibr" rid="B35-genes-03-00138">35</xref>] and the split phylogeny of <italic>Agrobacterium</italic> within the genus <italic>Rhizobium</italic> [<xref ref-type="bibr" rid="B24-genes-03-00138">24</xref>,<xref ref-type="bibr" rid="B36-genes-03-00138">36</xref>] are just two cases in point. Therefore in this genomic comparison several methods of taxonomic and genomic comparison were utilized to highlight different aspects of the phylogeny of the selected strains. </p>
          <fig id="genes-03-00138-f001" position="float">
            <label>Figure 1</label>
            <caption>
              <p>16S rRNA gene tree built with the multiple 16S rRNA genes from each of the fourteen genomes compared. Taxonomic analysis performed with MEGA5 [<xref ref-type="bibr" rid="B68-genes-03-00138">68</xref>], by constructing a bootstrapped neighbor-joining (NJ) gene tree with Jukes-Cantor substitution.</p>
            </caption>
            <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="genes-03-00138-g001.tif"/>
          </fig>
          <p>The ‘classical’ relationship between <italic>Bradyrhizobium</italic>, <italic>Rhizobium</italic>, <italic>Sinorhizobium</italic> and <italic>Mesorhizobium</italic> is clearly shown in the 16S rRNA phylogeny (<xref ref-type="fig" rid="genes-03-00138-f001">Figure 1</xref>). The four major groups are delineated from each other forming four major clusters, with <italic>Azorhizobium</italic> as a separate branch. All strains were clustered within the expected genera and as expected the <italic>Trifolium</italic> nodulators <italic>rlg</italic> and <italic>rlt</italic> were indistinguishable from each other using exclusively the 16S rRNA. </p>
          
        </sec>
        <sec>
          <title>2.2.2. Dotplot Analysis</title>
          <p>Further comparison of whole chromosomes adds valuable information to the 16S rRNA taxonomy results. A dotplot of the chromosome nucleotide sequences was compiled using an in-house application called FRECKLE (<xref ref-type="fig" rid="genes-03-00138-f002">Figure 2</xref>) (<uri>http://code.google.com/p/freckle/</uri>). Two primary blocks of significant sequence similarity are those of the <italic>Bradyrhizobium</italic> species with another larger block consisting of the <italic>Rhizobium</italic> and <italic>Sinorhizobium</italic> strains, while <italic>Azorhizobium</italic> and <italic>Mesorhizobium</italic> are outliers. The <italic>sme</italic> chromosome shows a highly repetitive structure, more than any other chromosome. Interestingly, the dotplot also displayed a strong sequence similarity between <italic>Sinorhizobium</italic> NGR 234 and <italic>S. meliloti</italic> 1021, including a shared repetitive structure. </p>
          <fig id="genes-03-00138-f002" position="float">
            <label>Figure 2</label>
            <caption>
              <p>FRECKLE DNA dotplot of the fourteen <italic>Rhizobiales</italic> chromosomes, constructed from nucleotide FASTA files using an in house script.</p>
            </caption>
            <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="genes-03-00138-g002.tif"/>
          </fig>
          <p>A dotplot of plasmids (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Figure S1</xref>) shows markedly less homology. Greater symmetries were detected in <italic>Rhizobium</italic> non-symbiotic plasmids: plasmids p42e (<italic>R. etli</italic> CFN 42), pRL11 (<italic>R. leguminosarum</italic> bv. viciae 3841), pRLG202 (<italic>R. leguminosarum</italic> bv. trifolii WSM2304) and pR132502 (<italic>R. leguminosarum</italic> bv. trifolii WSM132502) are especially homologous, suggesting higher stability. In addition, pSMED01 (<italic>S. medicae</italic> WSM419) and pSymB (<italic>S. meliloti</italic> 1021) also show symmetry. This observation has been previously explored, resulting in the proposal that these plasmids—with little or no direct involvement in the symbiosis—represent secondary chromosomes [<xref ref-type="bibr" rid="B37-genes-03-00138">37</xref>], or alternatively ‘chromids’ [<xref ref-type="bibr" rid="B38-genes-03-00138">38</xref>], as they contain essential metabolic genes, as well as tRNA and/or rRNA. </p>
          <p>In contrast, little homology was detected in the analysis of the symbiotic plasmids (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Figure S1</xref>). Symbiotically important plasmids have similar <italic>nod</italic>, <italic>nif</italic> and, in some cases, <italic>fix</italic> clusters. It is hypothesized that multiple lateral transfer events, which originally enabled the symbiotic and nitrogen fixing ability of many of these bacteria, would have also caused the overall lack of homology amongst the symbiotic plasmids. Our study has pointed out that the <italic>rhi</italic> plasmids pNGR234a and pNGR234b display lesser homology with the other plasmids, suggesting a greater number of lateral transfer events. Such ‘elastic’ genomes could partly explain the adaptation to a wide host range of <italic>rhi</italic>. </p>
          <p>The contrast between the 16S rRNA taxonomy and the two dotplots (chromosomal and plasmids) relies on the disparate relationships between the plasmids. In fact, Young <italic>et al.</italic> 2006 concluded that <italic>R. leguminosarum</italic> bv. viciae (<italic>rle</italic>) had a core genome, mostly chromosomal with a high G+C content and shared with other organisms, and an accessory genome with a lower G+C content, on plasmids and chromosomal genomic islands [<xref ref-type="bibr" rid="B21-genes-03-00138">21</xref>]. The broader whole genome taxonomy in this study suggests that this two-component genome model may prevail in other rhizobial species. The <italic>R.</italic> <italic>etli</italic> and <italic>R.</italic> <italic>leguminosarum</italic> bv. trifolii genomes species share much of their chromosome with <italic>R.</italic> <italic>leguminosarum</italic> bv. viciae, further expanding the “core—genome” definition of Young <italic>et al.</italic> 2006, and, in turn, shrinking the “accessory genome”. </p>
          <p>Most important, the low homology observed in this study in the comparison of the genomes of both closely related strains within the same species and also between species showing reasonable similarity of the 16S rRNA, and clearly showing several events of lateral gene transfer reinforces the discussion about the validity of prokaryote species definition [<xref ref-type="bibr" rid="B39-genes-03-00138">39</xref>,<xref ref-type="bibr" rid="B40-genes-03-00138">40</xref>]. </p>
        </sec>
      </sec>
      <sec>
        <title>2.3. KEGG Orthology and Protein Clustering</title>
        <sec>
          <title>2.3.1. KEGG Pathway Analysis</title>
          <p>The next stage of our genomic study was to compare KEGG genes and pathways in each species. The summary of KEGG orthologs is shown in <xref ref-type="table" rid="genes-03-00138-t003">Table 3</xref>, and the full table of KEGG pathways is included as <xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S1</xref>.</p>
          
          <p>All fourteen genomes have a large number of nitrogen, methane, sulfur, amino acid, vitamins and cofactors metabolism KEGG orthologs. This diverse suite of metabolism pathways is indicative of the ability to live in the complex rhizosphere environments, as well as to adapt to the nodule environment. However, the fact that the non-symbiotic genome of <italic>mes</italic> shows the same wide range of KEGG metabolism pathway orthologs as the symbiotic <italic>mlo</italic>, indicates the role of these genes in determining saprophytic capacity in a broad range of soil conditions. Noteworthy was the wide range of nitrogen metabolism protein orthologs detected (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S2</xref>), showing a variety of nitrogen metabolism capabilities. The largest number of orthologs in <italic>Bradyrhizobium</italic> and <italic>Sinorhizobium</italic> indicate the capacity to metabolize nitriles, nitrates, formamide, nitroalkanes, as well as related amino acids glutamine and asparagine. </p>
          <p>The genomes of <italic>Mesorhizobium</italic> and <italic>Rhizobium leguminosarum</italic> lack <italic>norC</italic>, <italic>norD</italic> and <italic>norE</italic> orthologs, protein subunits of the nitric oxide reductase complex. Interestingly, within the same species <italic>R. etli</italic>, <italic>ret</italic> but not <italic>rec</italic> contains asparagine synthetase (<italic>asnB)</italic> and <italic>nor</italic> orthologs. <italic>R. etli</italic> CFN 42 (<italic>ret</italic>) is unable to use nitrate for respiration and lacks nitrate reductase activity, as well as the <italic>nap</italic> and <italic>nar</italic> genes encoding respiratory nitrate reductase; however, the strain carries proteins closely related to denitrification enzymes, <italic>norCBQD</italic>. The functionality of <italic>nor</italic> genes in CFN 42 has been recently demonstrated, allowing the reduction of nitrite and nitric oxide [<xref ref-type="bibr" rid="B41-genes-03-00138">41</xref>]. It is worth mentioning that differences between strains within the same species made visible only by comparative genomics can help to explain metabolomic advantages, e.g., in this case of <italic>ret</italic> over <italic>rec</italic>.</p>
        </sec>
        <sec>
          <title>2.3.2. Protein Cluster Analysis</title>
          <p>A comprehensive method of comparative genome analysis is to cluster protein families with the BlastlineMCL algorithm [<xref ref-type="bibr" rid="B42-genes-03-00138">42</xref>], representing a Markov clustering of all orthologous protein groups across species. This could be considered a more complete clustering of orthologous proteins than from KEGG, as it includes all proteins annotated across all the genomes, not just the more restricted set of established KEGG orthologs. The use of this tool has been shown to be crucial in the comparison of rhizobial species, as many symbiotic and nitrogen fixing related gene pathways and families are not represented in KEGG.</p>
          <p>Cluster information resolved from BlastlineMCL analysis is available online (see <xref ref-type="sec" rid="sec3-genes-03-00138">experimental section</xref>), where users can compare any combination of genome protein family homology to search for orthologous protein groups of interest. Raw sequence data is available in FASTA format and CLUSTALW aligned format (MSF). This resource represents a vast repository of comparative genomics information on the fourteen genomes, of which only a few are discussed in this paper.</p>
          <p>A nitrogen-fixing <italic>Rhizobiales</italic> “pan-genome” of 1,126 clusters and 28,110 proteins was resolved amongst the fourteen genomes (<xref ref-type="table" rid="genes-03-00138-t004">Table 4</xref>). The clusters within this group include ABC transporters, some transposases, ribosomal RNA synthetases, DNA polymerases and other core proteins. The ABC transporters are the second most abundant family of non-hypothetical protein encoding genes found in all sequenced prokaryotic, eukaryotic, viral genomes as well as in metagenomic sequences. ABC transporters represent one of the largest superfamilies of active membrane transport proteins (MTPs), with a highly conserved ATPase domain that binds and hydrolyzes ATP, supplying energy for the uptake of a variety of nutrients and for the extrusion of drugs and metabolic wastes [<xref ref-type="bibr" rid="B43-genes-03-00138">43</xref>]. It is therefore not surprising that by far the biggest cluster found in the BlastlineMCL analysis is that of ABC type transporters, with 1,128 separate proteins. Together with the previous KEGG findings, this is indicative of ability to use a very wide range of substrates, a key property for adaption in the variety of environments, such as the numerous geographical rhizospheres represented (such as China, Papua New Guinea, Uruguay and Mexico) and the diverse hosts they nodulate.</p>
          <table-wrap id="genes-03-00138-t003" position="float">
            <object-id pub-id-type="pii">genes-03-00138-t003_Table 3</object-id>
            <label>Table 3</label>
            <caption>
              <p>Summary of KEGG pathway and protein orthologs in the fourteen <italic>Rhizobiales</italic> genomes.</p>
            </caption>
            <table>
              <thead>
                <tr>
                  <th rowspan="2" align="center" valign="middle">Relevant KEGG Pathways</th>
                  <th colspan="14" align="center" valign="middle">Number of KEGG Protein Orthologs</th>
                </tr>
                <tr style="border-top: solid thin">
                  <th align="center" valign="middle">azc</th>
                  <th align="center" valign="middle">bja</th>
                  <th align="center" valign="middle">bbt</th>
                  <th align="center" valign="middle">bra</th>
                  <th align="center" valign="middle">mlo</th>
                  <th align="center" valign="middle">mes</th>
                  <th align="center" valign="middle">ret</th>
                  <th align="center" valign="middle">rec</th>
                  <th align="center" valign="middle">rlg</th>
                  <th align="center" valign="middle">rlt</th>
                  <th align="center" valign="middle">rle</th>
                  <th align="center" valign="middle">rhi</th>
                  <th align="center" valign="middle">smd</th>
                  <th align="center" valign="middle">sme</th>
                </tr>
              </thead>
              <tbody>
                <tr>
                  <td align="left" valign="middle">1.1 Carbohydrate Metabolism</td>
                  <td align="center" valign="middle">240</td>
                  <td align="center" valign="middle">295</td>
                  <td align="center" valign="middle">295</td>
                  <td align="center" valign="middle">296</td>
                  <td align="center" valign="middle">274</td>
                  <td align="center" valign="middle">245</td>
                  <td align="center" valign="middle">249</td>
                  <td align="center" valign="middle">279</td>
                  <td align="center" valign="middle">257</td>
                  <td align="center" valign="middle">256</td>
                  <td align="center" valign="middle">299</td>
                  <td align="center" valign="middle">262</td>
                  <td align="center" valign="middle">267</td>
                  <td align="center" valign="middle">273</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">1.2 Energy Metabolism</td>
                  <td align="center" valign="middle">114</td>
                  <td align="center" valign="middle">147</td>
                  <td align="center" valign="middle">146</td>
                  <td align="center" valign="middle">145</td>
                  <td align="center" valign="middle">113</td>
                  <td align="center" valign="middle">114</td>
                  <td align="center" valign="middle">112</td>
                  <td align="center" valign="middle">111</td>
                  <td align="center" valign="middle">99</td>
                  <td align="center" valign="middle">99</td>
                  <td align="center" valign="middle">118</td>
                  <td align="center" valign="middle">111</td>
                  <td align="center" valign="middle">124</td>
                  <td align="center" valign="middle">138</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">1.3 Lipid Metabolism</td>
                  <td align="center" valign="middle">52</td>
                  <td align="center" valign="middle">63</td>
                  <td align="center" valign="middle">72</td>
                  <td align="center" valign="middle">72</td>
                  <td align="center" valign="middle">72</td>
                  <td align="center" valign="middle">53</td>
                  <td align="center" valign="middle">61</td>
                  <td align="center" valign="middle">76</td>
                  <td align="center" valign="middle">58</td>
                  <td align="center" valign="middle">57</td>
                  <td align="center" valign="middle">81</td>
                  <td align="center" valign="middle">56</td>
                  <td align="center" valign="middle">58</td>
                  <td align="center" valign="middle">71</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">1.4 Nucleotide Metabolism</td>
                  <td align="center" valign="middle">102</td>
                  <td align="center" valign="middle">96</td>
                  <td align="center" valign="middle">100</td>
                  <td align="center" valign="middle">99</td>
                  <td align="center" valign="middle">111</td>
                  <td align="center" valign="middle">105</td>
                  <td align="center" valign="middle">100</td>
                  <td align="center" valign="middle">108</td>
                  <td align="center" valign="middle">97</td>
                  <td align="center" valign="middle">99</td>
                  <td align="center" valign="middle">112</td>
                  <td align="center" valign="middle">97</td>
                  <td align="center" valign="middle">98</td>
                  <td align="center" valign="middle">114</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">1.5 Amino Acid Metabolism</td>
                  <td align="center" valign="middle">221</td>
                  <td align="center" valign="middle">249</td>
                  <td align="center" valign="middle">268</td>
                  <td align="center" valign="middle">258</td>
                  <td align="center" valign="middle">275</td>
                  <td align="center" valign="middle">224</td>
                  <td align="center" valign="middle">235</td>
                  <td align="center" valign="middle">261</td>
                  <td align="center" valign="middle">227</td>
                  <td align="center" valign="middle">241</td>
                  <td align="center" valign="middle">272</td>
                  <td align="center" valign="middle">239</td>
                  <td align="center" valign="middle">236</td>
                  <td align="center" valign="middle">253</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">1.6 Metabolism of Other Amino Acids</td>
                  <td align="center" valign="middle">50</td>
                  <td align="center" valign="middle">58</td>
                  <td align="center" valign="middle">62</td>
                  <td align="center" valign="middle">57</td>
                  <td align="center" valign="middle">60</td>
                  <td align="center" valign="middle">48</td>
                  <td align="center" valign="middle">54</td>
                  <td align="center" valign="middle">58</td>
                  <td align="center" valign="middle">54</td>
                  <td align="center" valign="middle">57</td>
                  <td align="center" valign="middle">58</td>
                  <td align="center" valign="middle">56</td>
                  <td align="center" valign="middle">54</td>
                  <td align="center" valign="middle">62</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">1.7 Glycan Biosynthesis and Metabolism</td>
                  <td align="center" valign="middle">30</td>
                  <td align="center" valign="middle">29</td>
                  <td align="center" valign="middle">31</td>
                  <td align="center" valign="middle">33</td>
                  <td align="center" valign="middle">32</td>
                  <td align="center" valign="middle">25</td>
                  <td align="center" valign="middle">33</td>
                  <td align="center" valign="middle">35</td>
                  <td align="center" valign="middle">32</td>
                  <td align="center" valign="middle">34</td>
                  <td align="center" valign="middle">36</td>
                  <td align="center" valign="middle">31</td>
                  <td align="center" valign="middle">22</td>
                  <td align="center" valign="middle">32</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">1.8 Metabolism of Cofactors and Vitamins</td>
                  <td align="center" valign="middle">117</td>
                  <td align="center" valign="middle">132</td>
                  <td align="center" valign="middle">143</td>
                  <td align="center" valign="middle">137</td>
                  <td align="center" valign="middle">131</td>
                  <td align="center" valign="middle">103</td>
                  <td align="center" valign="middle">122</td>
                  <td align="center" valign="middle">126</td>
                  <td align="center" valign="middle">114</td>
                  <td align="center" valign="middle">117</td>
                  <td align="center" valign="middle">130</td>
                  <td align="center" valign="middle">119</td>
                  <td align="center" valign="middle">117</td>
                  <td align="center" valign="middle">124</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">1.9 Biosynthesis of Polyketides and Terpenoids</td>
                  <td align="center" valign="middle">29</td>
                  <td align="center" valign="middle">29</td>
                  <td align="center" valign="middle">42</td>
                  <td align="center" valign="middle">43</td>
                  <td align="center" valign="middle">29</td>
                  <td align="center" valign="middle">27</td>
                  <td align="center" valign="middle">28</td>
                  <td align="center" valign="middle">36</td>
                  <td align="center" valign="middle">28</td>
                  <td align="center" valign="middle">27</td>
                  <td align="center" valign="middle">34</td>
                  <td align="center" valign="middle">27</td>
                  <td align="center" valign="middle">27</td>
                  <td align="center" valign="middle">34</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">1.10 Biosynthesis of Other Secondary Metabolites</td>
                  <td align="center" valign="middle">8</td>
                  <td align="center" valign="middle">21</td>
                  <td align="center" valign="middle">32</td>
                  <td align="center" valign="middle">35</td>
                  <td align="center" valign="middle">31</td>
                  <td align="center" valign="middle">20</td>
                  <td align="center" valign="middle">27</td>
                  <td align="center" valign="middle">34</td>
                  <td align="center" valign="middle">26</td>
                  <td align="center" valign="middle">26</td>
                  <td align="center" valign="middle">37</td>
                  <td align="center" valign="middle">26</td>
                  <td align="center" valign="middle">23</td>
                  <td align="center" valign="middle">28</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">1.11 Xenobiotics Biodegradation and Metabolism</td>
                  <td align="center" valign="middle">85</td>
                  <td align="center" valign="middle">134</td>
                  <td align="center" valign="middle">166</td>
                  <td align="center" valign="middle">163</td>
                  <td align="center" valign="middle">97</td>
                  <td align="center" valign="middle">72</td>
                  <td align="center" valign="middle">58</td>
                  <td align="center" valign="middle">129</td>
                  <td align="center" valign="middle">60</td>
                  <td align="center" valign="middle">59</td>
                  <td align="center" valign="middle">138</td>
                  <td align="center" valign="middle">55</td>
                  <td align="center" valign="middle">53</td>
                  <td align="center" valign="middle">101</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">2.1 Transcription</td>
                  <td align="center" valign="middle">4</td>
                  <td align="center" valign="middle">4</td>
                  <td align="center" valign="middle">4</td>
                  <td align="center" valign="middle">4</td>
                  <td align="center" valign="middle">4</td>
                  <td align="center" valign="middle">4</td>
                  <td align="center" valign="middle">4</td>
                  <td align="center" valign="middle">4</td>
                  <td align="center" valign="middle">4</td>
                  <td align="center" valign="middle">4</td>
                  <td align="center" valign="middle">4</td>
                  <td align="center" valign="middle">4</td>
                  <td align="center" valign="middle">4</td>
                  <td align="center" valign="middle">4</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">2.2 Translation</td>
                  <td align="center" valign="middle">134</td>
                  <td align="center" valign="middle">131</td>
                  <td align="center" valign="middle">128</td>
                  <td align="center" valign="middle">129</td>
                  <td align="center" valign="middle">130</td>
                  <td align="center" valign="middle">123</td>
                  <td align="center" valign="middle">128</td>
                  <td align="center" valign="middle">128</td>
                  <td align="center" valign="middle">129</td>
                  <td align="center" valign="middle">129</td>
                  <td align="center" valign="middle">127</td>
                  <td align="center" valign="middle">124</td>
                  <td align="center" valign="middle">129</td>
                  <td align="center" valign="middle">132</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">2.3 Folding, Sorting and Degradation</td>
                  <td align="center" valign="middle">40</td>
                  <td align="center" valign="middle">40</td>
                  <td align="center" valign="middle">38</td>
                  <td align="center" valign="middle">38</td>
                  <td align="center" valign="middle">36</td>
                  <td align="center" valign="middle">34</td>
                  <td align="center" valign="middle">36</td>
                  <td align="center" valign="middle">37</td>
                  <td align="center" valign="middle">38</td>
                  <td align="center" valign="middle">38</td>
                  <td align="center" valign="middle">37</td>
                  <td align="center" valign="middle">36</td>
                  <td align="center" valign="middle">38</td>
                  <td align="center" valign="middle">38</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">2.4 Replication and Repair</td>
                  <td align="center" valign="middle">69</td>
                  <td align="center" valign="middle">74</td>
                  <td align="center" valign="middle">71</td>
                  <td align="center" valign="middle">71</td>
                  <td align="center" valign="middle">70</td>
                  <td align="center" valign="middle">70</td>
                  <td align="center" valign="middle">71</td>
                  <td align="center" valign="middle">71</td>
                  <td align="center" valign="middle">73</td>
                  <td align="center" valign="middle">71</td>
                  <td align="center" valign="middle">73</td>
                  <td align="center" valign="middle">70</td>
                  <td align="center" valign="middle">71</td>
                  <td align="center" valign="middle">71</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">3.1 Membrane Transport</td>
                  <td align="center" valign="middle">119</td>
                  <td align="center" valign="middle">141</td>
                  <td align="center" valign="middle">121</td>
                  <td align="center" valign="middle">114</td>
                  <td align="center" valign="middle">172</td>
                  <td align="center" valign="middle">135</td>
                  <td align="center" valign="middle">159</td>
                  <td align="center" valign="middle">163</td>
                  <td align="center" valign="middle">138</td>
                  <td align="center" valign="middle">131</td>
                  <td align="center" valign="middle">162</td>
                  <td align="center" valign="middle">165</td>
                  <td align="center" valign="middle">148</td>
                  <td align="center" valign="middle">170</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">3.2 Signal Transduction</td>
                  <td align="center" valign="middle">57</td>
                  <td align="center" valign="middle">61</td>
                  <td align="center" valign="middle">55</td>
                  <td align="center" valign="middle">57</td>
                  <td align="center" valign="middle">50</td>
                  <td align="center" valign="middle">39</td>
                  <td align="center" valign="middle">50</td>
                  <td align="center" valign="middle">55</td>
                  <td align="center" valign="middle">48</td>
                  <td align="center" valign="middle">48</td>
                  <td align="center" valign="middle">53</td>
                  <td align="center" valign="middle">49</td>
                  <td align="center" valign="middle">46</td>
                  <td align="center" valign="middle">49</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">4.2 Cell Motility</td>
                  <td align="center" valign="middle">38</td>
                  <td align="center" valign="middle">41</td>
                  <td align="center" valign="middle">43</td>
                  <td align="center" valign="middle">43</td>
                  <td align="center" valign="middle">34</td>
                  <td align="center" valign="middle">38</td>
                  <td align="center" valign="middle">40</td>
                  <td align="center" valign="middle">41</td>
                  <td align="center" valign="middle">40</td>
                  <td align="center" valign="middle">40</td>
                  <td align="center" valign="middle">41</td>
                  <td align="center" valign="middle">40</td>
                  <td align="center" valign="middle">40</td>
                  <td align="center" valign="middle">40</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">Total KEGG Protein Orthologs</td>
                  <td align="center" valign="middle">1509</td>
                  <td align="center" valign="middle">1745</td>
                  <td align="center" valign="middle">1817</td>
                  <td align="center" valign="middle">1794</td>
                  <td align="center" valign="middle">1721</td>
                  <td align="center" valign="middle">1479</td>
                  <td align="center" valign="middle">1567</td>
                  <td align="center" valign="middle">1752</td>
                  <td align="center" valign="middle">1522</td>
                  <td align="center" valign="middle">1533</td>
                  <td align="center" valign="middle">1812</td>
                  <td align="center" valign="middle">1567</td>
                  <td align="center" valign="middle">1555</td>
                  <td align="center" valign="middle">1734</td>
                </tr>
              </tbody>
            </table>
          </table-wrap>
          <table-wrap id="genes-03-00138-t004" position="float">
            <object-id pub-id-type="pii">genes-03-00138-t004_Table 4</object-id>
            <label>Table 4</label>
            <caption>
              <p>Summary of selected MCL BLASTline protein cluster groups.</p>
            </caption>
            <table>
              <thead>
                <tr>
                  <th align="center" valign="middle">Cluster Family</th>
                  <th align="center" valign="middle">Clusters</th>
                  <th align="center" valign="middle">Proteins</th>
                  <th align="center" valign="middle">Proteins in Chromosomes</th>
                  <th align="center" valign="middle">Proteins in Plasmids</th>
                  <th align="center" valign="middle">Percent in Chromosomes</th>
                  <th align="center" valign="middle">Percent in Plasmids</th>
                </tr>
              </thead>
              <tbody>
                <tr>
                  <td align="left" valign="middle">Pan-genome (all 14)</td>
                  <td align="center" valign="middle">1126</td>
                  <td align="center" valign="middle">28110</td>
                  <td align="center" valign="middle">23686</td>
                  <td align="center" valign="middle">4424</td>
                  <td align="center" valign="middle">84.26</td>
                  <td align="center" valign="middle">15.74</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">13 Fix+ genomes</td>
                  <td align="center" valign="middle">105</td>
                  <td align="center" valign="middle">1126</td>
                  <td align="center" valign="middle">619</td>
                  <td align="center" valign="middle">507</td>
                  <td align="center" valign="middle">54.97</td>
                  <td align="center" valign="middle">45.03</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">11 NodABC+ genomes</td>
                  <td align="center" valign="middle">9</td>
                  <td align="center" valign="middle">113</td>
                  <td align="center" valign="middle">60</td>
                  <td align="center" valign="middle">53</td>
                  <td align="center" valign="middle">53.10</td>
                  <td align="center" valign="middle">46.90</td>
                </tr>
                <tr style="border-top: solid thin">
                  <td align="left" valign="middle">
                    <italic>azc+bbt+bja+bra</italic>
                  </td>
                  <td align="center" valign="middle">206</td>
                  <td align="center" valign="middle">1150</td>
                  <td align="center" valign="middle">1141</td>
                  <td align="center" valign="middle">9</td>
                  <td align="center" valign="middle">99.22</td>
                  <td align="center" valign="middle">0.78</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>bbt+bja+bra</italic>
                  </td>
                  <td align="center" valign="middle">857</td>
                  <td align="center" valign="middle">2981</td>
                  <td align="center" valign="middle">2976</td>
                  <td align="center" valign="middle">5</td>
                  <td align="center" valign="middle">99.83</td>
                  <td align="center" valign="middle">0.17</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>bbt+bra</italic>
                  </td>
                  <td align="center" valign="middle">577</td>
                  <td align="center" valign="middle">1224</td>
                  <td align="center" valign="middle">1224</td>
                  <td align="center" valign="middle">0</td>
                  <td align="center" valign="middle">100.00</td>
                  <td align="center" valign="middle">0.00</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>mlo+mes</italic>
                  </td>
                  <td align="center" valign="middle">86</td>
                  <td align="center" valign="middle">192</td>
                  <td align="center" valign="middle">167</td>
                  <td align="center" valign="middle">25</td>
                  <td align="center" valign="middle">86.98</td>
                  <td align="center" valign="middle">13.02</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>mes+mlo+rhi+ret+rec+rlg+rlt+rle+smd+sme</italic>
                  </td>
                  <td align="center" valign="middle">214</td>
                  <td align="center" valign="middle">2424</td>
                  <td align="center" valign="middle">2081</td>
                  <td align="center" valign="middle">343</td>
                  <td align="center" valign="middle">85.85</td>
                  <td align="center" valign="middle">14.15</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>mlo+rhi+ret+rec+rlg+rlt+rle+smd+sme</italic>
                  </td>
                  <td align="center" valign="middle">161</td>
                  <td align="center" valign="middle">1661</td>
                  <td align="center" valign="middle">1140</td>
                  <td align="center" valign="middle">521</td>
                  <td align="center" valign="middle">68.63</td>
                  <td align="center" valign="middle">31.37</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>rhi+ret+rec+rlg+rlt+rle+smd+sme</italic>
                  </td>
                  <td align="center" valign="middle">155</td>
                  <td align="center" valign="middle">1347</td>
                  <td align="center" valign="middle">1122</td>
                  <td align="center" valign="middle">225</td>
                  <td align="center" valign="middle">83.30</td>
                  <td align="center" valign="middle">16.70</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>rhi+ret+rec+rlg+rlt+rle</italic>
                  </td>
                  <td align="center" valign="middle">51</td>
                  <td align="center" valign="middle">347</td>
                  <td align="center" valign="middle">191</td>
                  <td align="center" valign="middle">156</td>
                  <td align="center" valign="middle">55.04</td>
                  <td align="center" valign="middle">44.96</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>rlg+rlt+rle</italic>
                  </td>
                  <td align="center" valign="middle">92</td>
                  <td align="center" valign="middle">286</td>
                  <td align="center" valign="middle">197</td>
                  <td align="center" valign="middle">89</td>
                  <td align="center" valign="middle">68.88</td>
                  <td align="center" valign="middle">31.12</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>smd+sme</italic>
                  </td>
                  <td align="center" valign="middle">253</td>
                  <td align="center" valign="middle">555</td>
                  <td align="center" valign="middle">182</td>
                  <td align="center" valign="middle">373</td>
                  <td align="center" valign="middle">32.79</td>
                  <td align="center" valign="middle">67.21</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>rhi+ret+rec</italic>
                  </td>
                  <td align="center" valign="middle">6</td>
                  <td align="center" valign="middle">21</td>
                  <td align="center" valign="middle">5</td>
                  <td align="center" valign="middle">16</td>
                  <td align="center" valign="middle">23.81</td>
                  <td align="center" valign="middle">76.19</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>rhi+smd+sme</italic>
                  </td>
                  <td align="center" valign="middle">242</td>
                  <td align="center" valign="middle">767</td>
                  <td align="center" valign="middle">476</td>
                  <td align="center" valign="middle">291</td>
                  <td align="center" valign="middle">62.06</td>
                  <td align="center" valign="middle">37.94</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>ret+rec</italic>
                  </td>
                  <td align="center" valign="middle">123</td>
                  <td align="center" valign="middle">262</td>
                  <td align="center" valign="middle">71</td>
                  <td align="center" valign="middle">191</td>
                  <td align="center" valign="middle">27.10</td>
                  <td align="center" valign="middle">72.90</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>ret+rec+rlg+rlt+rle</italic>
                  </td>
                  <td align="center" valign="middle">352</td>
                  <td align="center" valign="middle">1866</td>
                  <td align="center" valign="middle">1436</td>
                  <td align="center" valign="middle">430</td>
                  <td align="center" valign="middle">76.96</td>
                  <td align="center" valign="middle">23.04</td>
                </tr>
                <tr style="border-top: solid thin">
                  <td align="left" valign="middle">
                    <bold>Singletons</bold>
                  </td>
                  <td align="center" valign="middle"> </td>
                  <td align="center" valign="middle"> </td>
                  <td align="center" valign="middle"> </td>
                  <td align="center" valign="middle"> </td>
                  <td align="center" valign="middle"> </td>
                  <td align="center" valign="middle"> </td>
                </tr>
                <tr style="border-top: solid thin">
                  <td align="left" valign="middle">
                    <italic>azc</italic>
                  </td>
                  <td align="center" valign="middle">956</td>
                  <td align="center" valign="middle">986</td>
                  <td align="center" valign="middle">986</td>
                  <td align="center" valign="middle">0</td>
                  <td align="center" valign="middle">100.00</td>
                  <td align="center" valign="middle">0.00</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>bja</italic>
                  </td>
                  <td align="center" valign="middle">1760</td>
                  <td align="center" valign="middle">1839</td>
                  <td align="center" valign="middle">1839</td>
                  <td align="center" valign="middle">0</td>
                  <td align="center" valign="middle">100.00</td>
                  <td align="center" valign="middle">0.00</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>bbt</italic>
                  </td>
                  <td align="center" valign="middle">1051</td>
                  <td align="center" valign="middle">1107</td>
                  <td align="center" valign="middle">1005</td>
                  <td align="center" valign="middle">102</td>
                  <td align="center" valign="middle">90.79</td>
                  <td align="center" valign="middle">9.21</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>bra</italic>
                  </td>
                  <td align="center" valign="middle">959</td>
                  <td align="center" valign="middle">987</td>
                  <td align="center" valign="middle">987</td>
                  <td align="center" valign="middle">0</td>
                  <td align="center" valign="middle">100.00</td>
                  <td align="center" valign="middle">0.00</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>mlo</italic>
                  </td>
                  <td align="center" valign="middle">1706</td>
                  <td align="center" valign="middle">1809</td>
                  <td align="center" valign="middle">1613</td>
                  <td align="center" valign="middle">196</td>
                  <td align="center" valign="middle">89.17</td>
                  <td align="center" valign="middle">10.83</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>mes</italic>
                  </td>
                  <td align="center" valign="middle">857</td>
                  <td align="center" valign="middle">899</td>
                  <td align="center" valign="middle">761</td>
                  <td align="center" valign="middle">138</td>
                  <td align="center" valign="middle">84.65</td>
                  <td align="center" valign="middle">15.35</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>ret</italic>
                  </td>
                  <td align="center" valign="middle">333</td>
                  <td align="center" valign="middle">344</td>
                  <td align="center" valign="middle">196</td>
                  <td align="center" valign="middle">148</td>
                  <td align="center" valign="middle">56.98</td>
                  <td align="center" valign="middle">43.02</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>rec</italic>
                  </td>
                  <td align="center" valign="middle">563</td>
                  <td align="center" valign="middle">563</td>
                  <td align="center" valign="middle">374</td>
                  <td align="center" valign="middle">189</td>
                  <td align="center" valign="middle">66.43</td>
                  <td align="center" valign="middle">33.57</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>rlg</italic>
                  </td>
                  <td align="center" valign="middle">576</td>
                  <td align="center" valign="middle">602</td>
                  <td align="center" valign="middle">260</td>
                  <td align="center" valign="middle">342</td>
                  <td align="center" valign="middle">43.19</td>
                  <td align="center" valign="middle">56.81</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>rlt</italic>
                  </td>
                  <td align="center" valign="middle">490</td>
                  <td align="center" valign="middle">492</td>
                  <td align="center" valign="middle">286</td>
                  <td align="center" valign="middle">206</td>
                  <td align="center" valign="middle">58.13</td>
                  <td align="center" valign="middle">41.87</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>rle</italic>
                  </td>
                  <td align="center" valign="middle">542</td>
                  <td align="center" valign="middle">550</td>
                  <td align="center" valign="middle">276</td>
                  <td align="center" valign="middle">274</td>
                  <td align="center" valign="middle">50.18</td>
                  <td align="center" valign="middle">49.82</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>rhi</italic>
                  </td>
                  <td align="center" valign="middle">758</td>
                  <td align="center" valign="middle">797</td>
                  <td align="center" valign="middle">356</td>
                  <td align="center" valign="middle">441</td>
                  <td align="center" valign="middle">44.67</td>
                  <td align="center" valign="middle">55.33</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>smd</italic>
                  </td>
                  <td align="center" valign="middle">598</td>
                  <td align="center" valign="middle">639</td>
                  <td align="center" valign="middle">317</td>
                  <td align="center" valign="middle">322</td>
                  <td align="center" valign="middle">49.61</td>
                  <td align="center" valign="middle">50.39</td>
                </tr>
                <tr>
                  <td align="left" valign="middle">
                    <italic>sme</italic>
                  </td>
                  <td align="center" valign="middle">472</td>
                  <td align="center" valign="middle">498</td>
                  <td align="center" valign="middle">170</td>
                  <td align="center" valign="middle">328</td>
                  <td align="center" valign="middle">34.14</td>
                  <td align="center" valign="middle">65.86</td>
                </tr>
              </tbody>
  </table>
          </table-wrap>
          <p>The number of clusters shared amongst all thirteen nitrogen fixers was of 105, containing a total of 2,038 proteins. Many of these clusters are of <italic>nif</italic> proteins, the components of the nitrogenase complex. Nine protein clusters were shared by the eleven strains containing the <italic>nodABC</italic> genes, with a total of 113 proteins. These clusters include other <italic>nod</italic>, <italic>nol</italic> and <italic>noe</italic> gene families, protein families crucial to nodulation.</p>
          <p><xref ref-type="table" rid="genes-03-00138-t004">Table 4</xref> lists many other cluster sharing combinations. For example, 857 clusters are exclusively shared by the <italic>Bradyrhizobium</italic> species, confirming the differences between this genus and the others. In addition, the 242 clusters containing 767 proteins—60% in chromosomes—found only in <italic>rhi</italic>, <italic>smd</italic>, <italic>sme</italic> confirm <italic>rhi</italic> as a <italic>Sinorhizobium</italic> species.</p>
          <p>The number of singleton clusters (clusters containing proteins only from one species) approximately correlated with previously resolved taxonomic divisions, as well as genome size (<xref ref-type="table" rid="genes-03-00138-t004">Table 4</xref>). The largest number of singletons was found in the biggest genome, <italic>bja</italic> (1,760 clusters containing 1,839 proteins). The number of singletons amongst the various <italic>Rhizobium</italic> and <italic>Sinorhizobium</italic> was much smaller, ranging from 758 clusters containing 797 proteins in <italic>rhi</italic> to 333 clusters containing 344 proteins in <italic>rec.</italic> The most common protein family found amongst these singletons is composed by hypothetical proteins.</p>
        </sec>
      </sec>
      <sec>
        <title>2.4. The “Symbiome”- Nodulation, Secretion, Exopolysaccharide Production, Oxygen Transport and Nitrogen Fixation</title>
        <p>A precise definition of what is included in a theoretical ‘symbiome’ is still elusive. For the purposes of the current study, the core ‘symbiome’ is defined as the protein families, found in the BlastlineMCL clusters, involved in symbiosis and in nitrogen fixation. For this study this is defined as proteins for nodulation, secretion, exopolyssacharide production, oxygen transport and nitrogen fixation. The concept of a “symbiome’ spans plasmids, genetic islands, as well as the rest of the chromosome. It is loosely based on the concept of ‘core’ and ‘accessory’ genomes, where the ‘accessory’ genome has a lower GC content and usually, but not always, is composed of plasmids and/or genomic islands [<xref ref-type="bibr" rid="B21-genes-03-00138">21</xref>].</p>
        <p>In general, nodulation gene clusters have been found in close proximity to <italic>nif</italic> and <italic>fix</italic> genes. In rhizobial species carrying plasmids, nodulation <italic>(nod, nol and noe)</italic>, <italic>nif</italic> and <italic>fix</italic>, as well as many secretion related genes, are found in a symbiotic plasmid, while in species or strains without plasmids, the genes are located in laterally transferrable genomic islands, also denominated as symbiotic islands. For example, in <italic>R. leguminosarum</italic> bv. viciae 3841 most symbiotic genes are located in plasmid pRL10 [<xref ref-type="bibr" rid="B21-genes-03-00138">21</xref>], while in <italic>mlo a</italic> 500 kb genomic island carries the genes responsible for symbiosis [<xref ref-type="bibr" rid="B11-genes-03-00138">11</xref>,<xref ref-type="bibr" rid="B44-genes-03-00138">44</xref>]. However, there are exceptions, e.g., <italic>Sinorhizobium</italic> NGR 234, in which <italic>nif</italic> and <italic>nod</italic> operons are located on a plasmid, while the <italic>fix</italic> genes are on the chromosome [<xref ref-type="bibr" rid="B34-genes-03-00138">34</xref>]. Also in <italic>R. etli</italic> CFN 42, the pSym p42d contains most of the genes needed for symbiosis, but homologs for nodulation genes are found in other replicons of the genome [<xref ref-type="bibr" rid="B16-genes-03-00138">16</xref>]. These symbiotic regions of nitrogen fixing <italic>Rhizobiales</italic> genomes have been found to be largely mosaic structures that have been frequently tailored by recombination, horizontal transfer and transposition [<xref ref-type="bibr" rid="B15-genes-03-00138">15</xref>].</p>
        <p>In the thirteen symbiotic nitrogen fixing bacterial genomes compared, there is a range of symbiosis region arrangements. The simplest is in <italic>azc</italic>, which contains a small 87.1 kb symbiotic island with <italic>nod</italic> and <italic>trb</italic> operons responsible for the production of Nod factor and type IV secretion, respectively [<xref ref-type="bibr" rid="B7-genes-03-00138">7</xref>]. The <italic>nif</italic> and <italic>fix</italic> gene families are in separate regions of the chromosome. As <italic>azc</italic> is unique amongst the fourteen genomes in being able to fix nitrogen without symbiosis, it has been hypothesised that this symbiotic island has been laterally transferred to <italic>azc</italic> some time after its initial evolution [<xref ref-type="bibr" rid="B7-genes-03-00138">7</xref>].</p>
        <p>Both <italic>bja</italic> and <italic>mlo</italic> possess large symbiotic islands (~500 kb) containing all nodulation, <italic>nif</italic> and <italic>fix</italic> genes as well as some secretion related genes [<xref ref-type="bibr" rid="B9-genes-03-00138">9</xref>,<xref ref-type="bibr" rid="B44-genes-03-00138">44</xref>]. The other ten symbiotic nitrogen fixers all have symbiotic plasmids in which most of the nodulation, <italic>nif</italic>, <italic>fix</italic> and secretion genes are located [<xref ref-type="bibr" rid="B16-genes-03-00138">16</xref>,<xref ref-type="bibr" rid="B17-genes-03-00138">17</xref>,<xref ref-type="bibr" rid="B21-genes-03-00138">21</xref>,<xref ref-type="bibr" rid="B26-genes-03-00138">26</xref>,<xref ref-type="bibr" rid="B28-genes-03-00138">28</xref>,<xref ref-type="bibr" rid="B45-genes-03-00138">45</xref>].</p>
        <sec>
          <title>2.4.1. <italic>Nod</italic> Genes</title>
          <p>Induction of nodulation genes leads to the production and secretion of return signals, the Nod Factors, which are lipochitooligosaccharides of variable structure. Nod factors are essential for the <italic>Rhizobiales</italic> to trigger root hair curling, to induce the formation of nodule primordia, and to enter the root via infection threads. Purified Nod factors are sufficient to induce root hair deformations, cortical cell divisions, and on some host plants, fully grown nodule-like structures. NodD is the core signaling protein, reacting to plant flavonoids then binding to <italic>nod</italic> boxes, binding sites upstream of <italic>nod</italic> genes, typically <italic>nodA</italic> and/or <italic>nodB</italic>, triggering the expression of a <italic>nod</italic> gene cascade and thus the construction of the Nod Factor (e.g., [<xref ref-type="bibr" rid="B46-genes-03-00138">46</xref>]).</p>
          <p>The results of BlastlineMCL clustering were utilized to the extraction of all <italic>nod</italic> protein ortholog clusters (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S3</xref>). In total, twenty five different Nod protein ortholog clusters amongst the fourteen species were found. All fourteen species contained <italic>nodD</italic>, <italic>nodE</italic>, <italic>nodG</italic>, <italic>nodI</italic>, <italic>nodM</italic>, <italic>nodP</italic>, <italic>nodQ</italic>, <italic>nodV</italic> and <italic>nodW</italic>. Unsurprisingly, the non-symbiotic <italic>mes</italic> had the least number of <italic>nod</italic> genes, with ten. In a previous study, <italic>nodD</italic> and <italic>nodM</italic> have also been indicated as common genes of bacteria of the order <italic>Rhizobiales</italic>, but including both symbiotic and pathogenic stains [<xref ref-type="bibr" rid="B5-genes-03-00138">5</xref>]. </p>
          <p>One of the Nod orthologs found in all species is NodG, which is in the second largest MCL cluster. It is a cation protein exporter, involved in the secretion of the finished Nod factor into the environment. It is noteworthy that in <italic>R. tropici</italic> strain PRF 81 <italic>nodG</italic> can be promptly transcribed—after only 5 min—in the presence of host specific flavonoids [<xref ref-type="bibr" rid="B47-genes-03-00138">47</xref>]. </p>
          <p>Two <italic>nod</italic> genes were found exclusively in one strain: <italic>nodR</italic> in <italic>R. leguminosarum</italic> bv. <italic>trifolii</italic> WSM1325 and <italic>nodO</italic> in <italic>R. leguminosarum</italic> bv. <italic>viciae</italic> 3804. The function of <italic>nodR</italic> is unknown; however, it is suspected that NodR might contribute to the superior host nodulation efficiency of WSM1325, compared to WSM2304 [<xref ref-type="bibr" rid="B20-genes-03-00138">20</xref>]. In relation to NodO, it catalyzes the addition of carbamoyl to the Nod factor backbone [<xref ref-type="bibr" rid="B48-genes-03-00138">48</xref>], and it is also a calcium-binding protein that promotes infection thread development in root hairs [<xref ref-type="bibr" rid="B49-genes-03-00138">49</xref>]. Interestingly, in <italic>Rhizobium</italic> sp. BR 816 (reclassified as <italic>Sinorhizobium</italic>) the transfer of <italic>nodO</italic> can extend its host range [<xref ref-type="bibr" rid="B50-genes-03-00138">50</xref>], therefore it might be involved in the ability of <italic>rle</italic> to nodulate a wide range of species within the <italic>Viciae</italic> tribe.</p>
          <p>While all species have many core and non-core <italic>nod</italic> genes, an investigation of <italic>nod</italic> operon/gene structure in the thirteen symbiotic genomes displays a wide variety of arrangements. The <italic>Sesbania</italic> nodulator <italic>azc</italic> has a large <italic>nod</italic> operon nod<italic>ABCSUIJZ</italic>noe<italic>CHOP</italic> with <italic>nodD1</italic> located downstream [<xref ref-type="bibr" rid="B7-genes-03-00138">7</xref>]. The soybean nodulator <italic>bja</italic> also has a large <italic>nod</italic> operon <italic>nodD1-YABCSUIJ</italic> [<xref ref-type="bibr" rid="B9-genes-03-00138">9</xref>]. Surprisingly, the other two <italic>Bradyrhizobium,</italic> <italic>bbt and bra,</italic> have no <italic>nodA</italic>, <italic>nodB</italic> or <italic>nodC</italic> genes [<xref ref-type="bibr" rid="B11-genes-03-00138">11</xref>], and the signaling mechanisms with the host plant are still under investigation [<xref ref-type="bibr" rid="B10-genes-03-00138">10</xref>]. </p>
          <p>The three <italic>R. leguminosarum</italic> and the two <italic>Sinorhizobium</italic> genomes studied have similar initial <italic>nod</italic> operon structures, <italic>nodDABCIJ</italic>. In <italic>R etli</italic>, the arrangement differs with <italic>nodD</italic>, <italic>nodIJ</italic>, a 200 bp gap and then <italic>nodCB</italic>; the gene <italic>nodA</italic> is 16 kb downstream. Finally, in <italic>mlo</italic>, there is a separate <italic>nodAC</italic> and <italic>nodIJ</italic> operon, with <italic>nodD</italic> and <italic>nodB</italic> in disparate regions of the 600 kb symbiotic island. </p>
          <fig id="genes-03-00138-f003" position="float">
            <label>Figure 3</label>
            <caption>
              <p>Phylogeny of NodD proteins of the <italic>Rhizobiales</italic> genomes from this study, achieved through a neighbor joining (NJ) gene tree based on BLOSUM-62 matrix alignment of the proteins in the NodD cluster. Number in sequence label is the GENBANK protein GI number. Colored line indicates genus of each genome as coded for in <xref ref-type="supplementary-material" rid="genes-03-00138-s001">Tables S1 to S18</xref>, as well as that the protein is a confirmed NodD protein. Black lines indicate protein is only a putative protein within the LysR family. </p>
            </caption>
            <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="genes-03-00138-g003.tif"/>
          </fig>
          <p>The most intriguing organization is found in <italic>rhi</italic>, apparently a composite arrangement. It has a <italic>nodABC</italic> operon, a gap of 152 bp, following a <italic>nodIJ</italic> operon. <italic>nodD1</italic> is ~200 kb downstream from <italic>nodAB</italic> [<xref ref-type="bibr" rid="B34-genes-03-00138">34</xref>]. This arrangement of key <italic>nod</italic> genes in <italic>rhi</italic> seems to be a composite of <italic>R etli,</italic> <italic>R. leguminosarum</italic> and <italic>Sinorhizobium nod</italic> gene arrangements.</p>
          <p>The taxonomy of <italic>nod</italic> genes of the fourteen genomes was further explored by measuring the similarity of the NodD proteins within the designated MCL cluster. This was achieved through a neighbor joining (NJ) gene tree based on BLOSUM-62 matrix alignment of the proteins in that cluster (<xref ref-type="fig" rid="genes-03-00138-f003">Figure 3</xref>). The cluster itself is a broad collection of LysR transcription regulator-like proteins, and it is divided onto two broad branches representing proteins annotated as NodD (colored according to cluster table color scheme) and those annotated only as LysR or putatively LysR (black nodes). This initial division shows that <italic>bra</italic> and <italic>bbt</italic> <italic>nodD1</italic> are possibly not true NodD1 proteins but more general LysR, probably regulating the transcription of other gene systems, e.g., secretion, as both strains have no <italic>nodABC</italic> genes.</p>
          
          <p>Further exploration of the NodD tree shows that the <italic>azc</italic> is the most distant in terms of amino acid domain composition with its closest neighbor being <italic>bja</italic> NodD2. The rest of the proteins in the cluster annotated as Nod largely cluster according to previously determined taxonomic genus relationship, except for <italic>rhi</italic> NodD1, which more closely aligns with <italic>bja</italic> NodD1, rather than with its fellow <italic>Sinorhizobium</italic> species. The largest cluster consists of the NodD1 and NodD2 of <italic>sme</italic> and <italic>smd</italic> with the NodD1 of <italic>rle, rlg</italic> and <italic>rlg</italic> and NodD2 of <italic>rlt</italic>. (<xref ref-type="fig" rid="genes-03-00138-f003">Figure 3</xref>). </p>
          <p>In summary, there is a significant difference within this protein cluster between nodulation specific LysR type proteins (NodD proteins) and the other LysR transcription regulation orthologs, indicative of their biochemical roles. Also, NodD orthologs show a slightly different taxonomic relationship compared to 16S rRNA and whole genome approaches. </p>
        </sec>
        <sec>
          <title>2.4.2. Other Nodulation Genes</title>
          <p>The complexity of Nod factor synthesis and regulation is represented in the current study by two other important gene families, <italic>nol</italic> and <italic>noe</italic>. Among important proteins within this class we may cite NoeI, that catalyses the addition of -<italic>O</italic>-Me fucose to the Nod Factor backbone; NoeE, 3-O-SO3– fucose; NoeL; 3-or 4-<italic>O</italic>-acetyl fucose and NoeC, involved in the addition of arabinose to the Nod factor [<xref ref-type="bibr" rid="B48-genes-03-00138">48</xref>]. <italic>nol</italic> gene family is largely predominant in <italic>bja</italic>, but also abundant in both strains of <italic>R. etli</italic> and in S. <italic>fredii</italic> NGR 234 (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S5</xref>), and thus possibly important for the synthesis of specific Nod factors in these species. </p>
          <p>Many differences were found in <italic>noe</italic> clusters (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S4</xref>). Again, the greatest number of genes in this category was found in <italic>bja</italic>. <italic>Sinorhizobium</italic> species have only <italic>noeA</italic> and <italic>noeB,</italic> and these genes were thought to be specific for <italic>Medicago</italic> nodulation [<xref ref-type="bibr" rid="B50-genes-03-00138">50</xref>]; however, a <italic>noeA</italic> ortholog is found in the <italic>Trifolium</italic> nodulator <italic>rlg</italic> as well as the non-symbiont <italic>mes.</italic> While the functionality of these specific orthologs is not known, this suggests that noeA may not be specific to <italic>Medicago</italic>. </p>
          <p>Protein NolG is ubiquitous, found in cluster 23 and present in all strains from this study. However, it is only annotated as such in <italic>sme</italic>, with <italic>nolG-</italic>like genes annotated in <italic>smd</italic> and <italic>rec</italic>. Similar to the observations in relation to NodG, the NolG cluster contains many proteins, 130, all belonging to the very broad COG ortholog “COG0841, AcrB, cation/multidrug efflux pump”. Therefore it is possible that several orthologs represent only non-nodulation cation pumps. </p>
        </sec>
        <sec>
          <title>2.4.3. Bacterial Secretion Systems</title>
          <p>After the initial signaling between the host plant and the rhizobia, the next stage controls the progression of the bacteria into the plant and their maturation into nitrogen fixing bacteroids, and many gene families are related to these steps; among the most crucial genes in these stages are the protein secretion systems. Bacterial secretion systems have been classified into seven main groups, Types I through VII, with the fimbrial chaperone-usher pathway forming an additional group. Four of these systems, Types III, IV, V, and VI, assemble surface structures that contact target cells and deliver DNA and/or protein effectors.</p>
          <p>The secretion systems found in <italic>Rhizobiales</italic> range from Type I to Type VI, in addition to the twin-arginine transfer system (Tat). Noteworthy is that different nitrogen fixing <italic>Rhizobiales</italic> with differing host relationships contain different subsets of these systems [<xref ref-type="bibr" rid="B34-genes-03-00138">34</xref>,<xref ref-type="bibr" rid="B51-genes-03-00138">51</xref>,<xref ref-type="bibr" rid="B52-genes-03-00138">52</xref>,<xref ref-type="bibr" rid="B53-genes-03-00138">53</xref>,<xref ref-type="bibr" rid="B54-genes-03-00138">54</xref>]. Previously, it has been reported that only <italic>rhi</italic> had a suite of secretion systems spanning from Type I–V, which may strongly affect its extraordinary host range [<xref ref-type="bibr" rid="B34-genes-03-00138">34</xref>].</p>
          <p>Using BlastlineMCL protein clustering and the KEGG pathway orthology, the number of Type I, II, II, IV, V and VI secretion genes, as well as of exopolysaccharide gene families <italic>exo</italic> and <italic>pss</italic> were investigated (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S7–S14)</xref>. Interesting, the non-symbiotic <italic>mes</italic> contains many of these secretion systems, including the virulence related Type IV genes. This could be construed as additional evidence that this strain, or a recent ancestor, was once a symbiont, but lost the symbiotic ability due to the availability of alternate energy sources.</p>
        
        <sec>
          <title>2.4.3.1. Tat, Type I and II Secretion Systems</title>
          <p>Tat, Type I and Type II secretion systems are the most simple bacterial secretion systems, excreting proteins into extracellular space without the need of host contact. The twin arginine transport (Tat) system is responsible for transporting pre-folded proteins to the periplasmic space. The Tat pathway has been implicated in many bacterial cellular functions, including motility, biofilm formation, pathogenesis and symbiosis [<xref ref-type="bibr" rid="B55-genes-03-00138">55</xref>]. Protein clusters containing TatA, TatB and TatC were found in all fourteen species (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S6</xref>). For TatA, the basic taxonomic pattern was found with TatA clusters separated roughly according to genera. In contrast, TatB clusters were split between the <italic>Azorhizobium</italic>/<italic>Bradyrhizobium</italic> and the other bacteria in a separate cluster, while all TatC proteins fit in one cluster. Therefore, while every species has a Tat system, each one is at least genus specific.</p>
          <p>The Type I system is simple and comprises three proteins responsible for the transport of targeted proteins across both bacterial membranes to the extracellular space. This includes ArpD/E, TolC, and the HlyD and HlyB families. In addition, multiple PrtD and PrtE type I proteins have been isolated in <italic>rle</italic>. As shown in <xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S7</xref>, ArpD/E, HlyB and PrtDE system proteins are found in the one cluster spanning all fourteen species. Type I secretion is generally common across the fourteen <italic>Rhizobiales</italic> genomes, but with a notable absence of TolC in <italic>rec, rlg, rle</italic> and <italic>rlt</italic> and the lack of KEGG Type I secretion proteins orthologs in <italic>mes.</italic></p>
          <p>Type II secretion (<italic>gsp</italic>) was found in <italic>Bradyrhizobium, M. loti</italic> and <italic>Sinorhizobium</italic> NGR 234, and absent in the other genomes (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S9</xref>). The other Type II secretion family, <italic>sec</italic>, was found in all genomes (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S8</xref>). Interesting was the divergence in SecE (preprotein translocase subunit), with twelve different clusters; only <italic>rhi</italic>, <italic>sme</italic> and <italic>smd</italic> secE were positioned in a cluster, while each one of the other genomes occupied a unique cluster. Another contrast was the absence of SecG (preprotein translocase subunit) in <italic>rhi</italic>, <italic>sme</italic> and <italic>smd,</italic> indicating that <italic>Sinorhizobium</italic> may not contain a preprotein translocase composed of three units.</p>
        </sec>
        <sec>
          <title>2.4.3.2. Tat, Type III and IV Secretion Systems</title>
          <p>Probably the most important secretion systems in regards to the symbiotic relationship are the host contact dependant Type III and Type IV secretion systems [<xref ref-type="bibr" rid="B48-genes-03-00138">48</xref>,<xref ref-type="bibr" rid="B54-genes-03-00138">54</xref>,<xref ref-type="bibr" rid="B56-genes-03-00138">56</xref>]. Type III secretion system is responsible for producing nodulation outer proteins (Nops), some of which may be delivered into host plant cells via pili on the bacterial surface, and are also closely involved in flagella systems [<xref ref-type="bibr" rid="B57-genes-03-00138">57</xref>]. These Type III systems are present in the KEGG orthology as Ysc. Many species are annotated with different names such as <italic>fli</italic> and <italic>hrc</italic>. Amongst the fourteen genomes, Type III protein orthologs are present in <italic>bja, mlo, ret, rec</italic> and <italic>rhi</italic>, and absent in <italic>rlg, rlt, rle, smd</italic> and <italic>sme</italic> (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S10</xref>). One might assume from these results that the presence of Type III in <italic>Sinorhizobium</italic> NGR 234 could be implicated in its ability to nodulate the hosts of <italic>bja, mlo, ret</italic> and <italic>rec.</italic> </p>
          <p>In the <italic>Rhizobiales</italic>, Type IV secretion proteins are found in two broad families, F-type (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S11</xref>), which includes the <italic>virB</italic> and <italic>trb</italic> families, and the P-type <italic>cpa/tab/pli</italic> system (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S12</xref>). The <italic>virB/trb</italic> genes are tightly related to virulence and conjugal transfer, while P-type are thought to be adapted from flagella proteins. The <italic>trb</italic> system was first described in <italic>Agrobacterium</italic> Ti plasmid as coding for the key virulence system. In the genomes from this study, <italic>azc</italic>, <italic>brj</italic>, <italic>bbt</italic>, <italic>mlo</italic> and <italic>mes</italic> contain these genes. The other <italic>Rhizobiales</italic> have <italic>virB</italic> orthologs in differing clusters from most of the <italic>trb</italic>, thus opposed from the <italic>A. tumefaciens</italic> model. Nonetheless, F-type IV seems to be ubiquitous across the fourteen genomes, including <italic>mes,</italic> with many of the species having multiple paralogs of many of the <italic>virB</italic> genes. A similar situation is observed with the P-type secretion mechanism that is found all but <italic>azc</italic> genome, a further indicator of its relatively recent evolution in nodulation ability.</p>
        </sec>
        <sec>
          <title>2.4.3.3. Type V and Type VI Secretion Systems</title>
          <p>Type V (auto-transporters) secretion system possesses the simplest secretion apparatus and represents the largest family of protein translocating outer membrane porins in Gram-negative bacteria [<xref ref-type="bibr" rid="B58-genes-03-00138">58</xref>]. Paradoxically, it is not common in the rhizobial genomes studied. Type V is represented in all fourteen gemomes by three orthologs, <italic>autA</italic>, <italic>autB</italic> and <italic>autC</italic> (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S13</xref>), first isolated in <italic>rle</italic> [<xref ref-type="bibr" rid="B21-genes-03-00138">21</xref>], with <italic>mes</italic> being the only other genome under study that contained orthologs for all three Aut proteins. The existence of Type V orthologs is further evidence that <italic>mes,</italic> despite being non-symbiotic, still has virulence or infection potential. </p>
          <p>Type VI is a newly discovered system based on bacteriophage secretion systems. First described in 2006, Type VI system has been found in many bacteria, with possible orthologs in <italic>mlo</italic>, <italic>bja</italic> and <italic>rle</italic> [<xref ref-type="bibr" rid="B59-genes-03-00138">59</xref>,<xref ref-type="bibr" rid="B60-genes-03-00138">60</xref>]. A scan of the BlastlineMCL clusters, based on the previous <italic>mlo</italic>, <italic>bja</italic> and <italic>rle</italic> findings [<xref ref-type="bibr" rid="B59-genes-03-00138">59</xref>,<xref ref-type="bibr" rid="B60-genes-03-00138">60</xref>], found possible orthologs also in <italic>azc</italic> and <italic>rec</italic> (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S14</xref>). Noteworthy is the fact that <italic>rle</italic> is the only nitrogen fixer with full sets of Type V and Type VI secretion systems. As both <italic>rle</italic> and <italic>rhi</italic> nodulate members of the <italic>Viciae</italic> tribe, that might suggest some differences in the secretion and communication between microsymbionts and <italic>Viciae</italic>, and points out that the role of Type V and Type VI secretion systems represent an intriguing question to be explored.</p>
        </sec>
        </sec>
        <sec>
          <title>2.4.4. Exopolysaccharide Production</title>
          <p>Exopolysaccharide production has shown to be critical in host symbiont relationships, e.g., for the initial bacterial invasion that leads to the formation of indeterminate type of nodules on legumes [<xref ref-type="bibr" rid="B61-genes-03-00138">61</xref>]. The examination of exopolysaccharide synthesis and its role in legume infection has been mainly focused on the <italic>S. meliloti—Medicago truncatula</italic> model, with <italic>sme</italic> producing two exopolysaccharides, EPS I (succinoglycan) and EPS II (galactoglucan) [<xref ref-type="bibr" rid="B46-genes-03-00138">46</xref>]. In our comparison of genomes two exopolysaccharide synthesis families were surveyed, related to <italic>exo</italic> and <italic>pss</italic> genes; these were originally described in <italic>S. meliloti</italic> and <italic>R. leguminosarurm,</italic> respectively, and produce alternatively structured EPS I.</p>
          <p>In a few cases, the ontology of exopolysaccharide synthesis proteins in nitrogen fixing <italic>Rhizobiales</italic> seems to be overlapping with four <italic>exo</italic> and <italic>pss</italic> terms used interchangeably [<xref ref-type="bibr" rid="B21-genes-03-00138">21</xref>]. For example, <italic>exoM</italic> is clustered with <italic>pssC</italic> in the <italic>R. etli</italic> species, both coding for a glycosyltransferase. This could be either an ontology clash, that the two proteins are in the same cluster due to similar amino acid structure, or that the same single protein is involved in the two systems. </p>
          <p>Overall the clustering of <italic>exo</italic> (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S15</xref>), and <italic>pss</italic> (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S16</xref>) was similar to the phylogenetic clustering The African nodulator <italic>azc</italic> has only seven <italic>exo</italic> and three <italic>pss</italic> orthologs, compared to the twenty-three and six orthologs, respectively, found in <italic>sme.</italic> In addition, <italic>Bradyrhizobium</italic> has also few <italic>exo</italic> and <italic>pps</italic> genes, while <italic>Mesorhizobium</italic> has few <italic>pps genes.</italic> This interesting comparison could be indicative of a limited EPS I production in these microsymbionts, or that they use different strategies or sets of genes. It cannot be related to the tropical origin or the formation of determinate nodules, as other rhizobia, e.g., the tropical <italic>R. etli</italic> has several EPS genes and forms determinate nodules in common beans. Still considering the <italic>pps</italic> family, both <italic>ret</italic> and <italic>rle</italic> have twenty <italic>pss</italic> orthologs. Furthermore, EPS may also play other important roles, such as providing environmental protection, as pointed out in proteomic and transcriptomic studies with <italic>B. japonicum</italic> [<xref ref-type="bibr" rid="B62-genes-03-00138">62</xref>,<xref ref-type="bibr" rid="B63-genes-03-00138">63</xref>]. Continuing, the <italic>Trifolium</italic> nodulators have eighteen <italic>pss</italic> orthologs, missing <italic>pssH</italic> and <italic>pssI</italic>. Finally, <italic>rec</italic> has sixteen <italic>pps</italic>, missing <italic>ppsF</italic>, <italic>pssH</italic>, <italic>pssI</italic>, <italic>pssJ</italic> and <italic>pssK</italic>. In the other genomes the number of <italic>pps</italic> orthologs range from only three in <italic>azc</italic> to six in <italic>Sinorhizobium</italic>. </p>
          <p>The role and importance of EPS in the symbiosis has been long discussed and studied. However, the comparison of the genomes has shown that the complexity of the genes regulating EPS is far from being understood. Different species have adopted different sets of genes that seem more related to evolution of the core genome than of the symbiotic genes. Therefore, it can be concluded that the production of exopolysaccharide is not necessarily a feature of a ‘universal’ symbiome.</p>
        </sec>
        <sec>
          <title>2.4.5. <italic>Nif</italic> and <italic>Fix</italic> Genes</title>
          <p>The fixation of nitrogen <italic>via</italic> nitrogenase requires an anaerobic or micro-aerobic bacteroid environment and the <italic>fix</italic> gene family is involved in the regulation and metabolism of oxygen in this circumstance. The <italic>fix</italic> gene family is commonly found in three core operon structures: <italic>fixABCX</italic>, <italic>fixGHIS</italic> and <italic>fixNOPQ</italic>. The first operon is involved in the regulation of gene transcription under low oxygen concentrations. Metabolism of oxygen by the bacteroid occurs via the cytochrome <italic>cbb3</italic> oxidase complex, a membrane complex encoded by the <italic>fixNOPQ</italic> operon, that mediates electron exchange and synthesis of ATP via oxidation on the outer surface of the bacteroid membrane. Studies in <italic>bja</italic> have shown that the second operon, <italic>fixGHIS</italic>, is required for the initial construction of the <italic>cbb3</italic> complex, and like <italic>fixNOPQ</italic>, is only expressed under micro-aerobic or anaerobic conditions.</p>
          <p>BlastlineMCL clustering of the Fix proteins shows both important species distinctions as well as a possible limitation to the algorithm (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S17</xref>). All thirteen symbiotic species have the complete set of genes of the three operons; in the non-symbiont <italic>mes</italic>, <italic>fixABCX</italic> is entirely missing, as well as <italic>fixQ</italic>. </p>
          <p>Each different species, except for <italic>azc</italic>, has a unique <italic>fixS</italic> ortholog cluster, and <italic>mlo</italic> has two. However, as the FixS protein is only fifty five amino acids in size, this could also be related to a limitation of the algorithm used by BlastineMCL, adjusted to larger proteins. Another stem nodulator, <italic>bbt,</italic> has no <italic>fixQ</italic> ortholog, signifying different <italic>cbb3</italic> apparatuses from the structural models already resolved. This diversity of both FixQ and FixS proteins may suggest a diverse range of <italic>cbb3</italic> complexes for each species that could result from differences in host nodulation strategies.</p>
          <p>The <italic>nif</italic> family codes for the MoFe-dependant nitrogenase complex, the enzyme required for the catalysis for the nitrogen fixing process [<xref ref-type="bibr" rid="B64-genes-03-00138">64</xref>]. The thirteen symbionts contained, with a higher or lower similarity, the <italic>nifBHDKENX</italic> operon, which in turn was absent in the non-fixer <italic>mes</italic> (<xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S18</xref>). Therefore, along with <italic>nod</italic> and <italic>fix</italic>, this species has also lost <italic>nif</italic> genes.</p>
          <p>While a lack of <italic>nifX</italic> has been previous reported in <italic>rle</italic> [<xref ref-type="bibr" rid="B65-genes-03-00138">65</xref>], the current analysis has shown that both <italic>nifX</italic> and <italic>nifZ</italic> are also absent in <italic>R. leguminosarum</italic> <italic>rlt</italic> and <italic>rlg</italic>. This suggests the MoFe nitrogenase in these <italic>leguminosarum</italic> species may have a distinct structure from the other <italic>Rhizobiales</italic>. Also confirmed is the lack of <italic>nifQ</italic>, required for Mo-incorporation into the complex, in <italic>R</italic>. <italic>leguminosarum</italic>, <italic>S. meliloti</italic> and <italic>S. medicae</italic>, as well as in the non-symbiotic <italic>mes</italic>. Consequently, these species should use other means to incorporate Mo into the nitrogenise complex [<xref ref-type="bibr" rid="B65-genes-03-00138">65</xref>,<xref ref-type="bibr" rid="B66-genes-03-00138">66</xref>].</p>
          <p>A Nif ortholog cluster was chosen for more in depth analysis of protein alignment (<xref ref-type="fig" rid="genes-03-00138-f004">Figure 4</xref>). In this example, the NifN/K cluster was examined via neighbor joining (NJ) tree based on BLOSUM62 matrix. Both NifN and NifK (considered structural homologs in regards to their role in nitrogenase complex) split into two clear clades. Unlike much of the NodD cluster, the NifN clade is in agreement with the 16S rRNA phylogeny. Interestingly, the <italic>Bradyrhizobium</italic> and <italic>Azorhizobium</italic> NifN orthologs are genetically very distant from the other species, suggesting a different nitrogenase structure. A lower agreement with the 16S rRNA was obtained in the analysis of NifK; more specifically, NifK homologs of <italic>rhi</italic> show low similarity with other <italic>Sinorhizobium</italic> NifK homologs, showing higher similarity with <italic>mlo</italic> and with the <italic>R. leguminosarum</italic> orthologs. This is another suggestion of an important event of lateral transfer.</p>
          
        </sec>
      </sec>
      <sec>
        <title>2.5. Is it Possible to Define a Symbiome?</title>
        <p>As mentioned previously, the definition of a prokaryotic genome is rapidly changing as the number of genomes sequenced and annotated rises. The field of metagenomics has especially caused researchers to re-assess the nature of prokaryotic taxonomy and evolution, for example as a disconnected network topology rather than an exclusively cell and tree centered taxonomy [<xref ref-type="bibr" rid="B67-genes-03-00138">67</xref>]. This is arguably the case for the concept of a “symbiome”. However, the results of this study raise doubts about the definition of a universal “symbiome”: clearly, there are vastly different structures and profound differences in the set of symbiotic regions and genes spanning both chromosomes and plasmids.</p>
        <fig id="genes-03-00138-f004" position="float">
            <label>Figure 4</label>
            <caption>
              <p>Phylogeny of NifN/K proteins of the <italic>Rhizobiales</italic> genomes from this study, achieved through a neighbor joining (NJ) gene tree based on BLOSUM-62 matrix alignment of the proteins in the NifN/K cluster. Number in sequence label is the GENBANK protein GI number. Colored line indicates genus of each genome as coded for in <xref ref-type="supplementary-material" rid="genes-03-00138-s001">Tables S1 to S18</xref>. </p>
            </caption>
            <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="genes-03-00138-g004.tif"/>
          </fig>
        <p>The sharply different chromosome and plasmid symmetry taxonomies obtained in the comparison of the fourteen genomes have highlighted a disconnected network pattern caused by lateral transfer within the order <italic>Rhizobiales</italic>. While chromosome symmetry strongly correlates with genus and species designation, plasmid symmetry is much weaker, especially amongst symbiotic plasmids. This could be construed as evidence of multiple lateral transfer events and rearrangements of core symbiotic genes compared to lesser transfer on other non-symbiotic genes. A good example relies in the symbiotic plasmid of <italic>Sinorhizobium</italic> NGR 234, that lacks symmetry (e.g., considering the critical protein NodD, <xref ref-type="fig" rid="genes-03-00138-f003">Figure 3</xref>) with all the other symbiotic plasmid and genome islands analyzed. In addition, <italic>rhi</italic> has apparently non-<italic>Sinorhizobium</italic> NifK and NifN proteins (<xref ref-type="fig" rid="genes-03-00138-f004">Figure 4</xref>). As NGR 234 can nodulate a broad range of legumes, this might suggests a unique re-arrangement of nodulation genes to allow the highest symbiotic ubiquity reported so far.</p>
        <p>Protein cluster analysis has shown a much more complex pattern of correlation with host range and protein content, an example being the large array of secretion mechanisms available in the fourteen genomes. Some examples are that the genomes of <italic>bja</italic>, <italic>mlo</italic> and <italic>rhi</italic> contain the majority of Type II secretion proteins, while <italic>rle</italic> and <italic>ret</italic> do not contain any. Type III orthologs are found in <italic>rec</italic>, <italic>ret</italic> and <italic>rhi</italic>, but not in <italic>R. leguminosarum</italic> species, or in the other two <italic>Sinorhizobium</italic>. All species, except for <italic>azc</italic> and <italic>bra</italic>, contain most of the orthologs of Type IV. Intriguingly, <italic>rle</italic> is the only species with a full complement of both Type V and Type VI secretion orthologs. While NGR 234 (<italic>rhi</italic>) lacks these Type V and VI secretion proteins, as well as <italic>pss</italic> exopolysaccharide production enzymes, it can still successfully nodulate both <italic>Viciae</italic> and <italic>Trifollium</italic> species [<xref ref-type="bibr" rid="B32-genes-03-00138">32</xref>]. In addition, NGR contains a large number of Type II and III secretion orthologs, indicative of its ability to nodulate the hosts of <italic>Bradyrhizobium</italic> and <italic>Mesorhizobium</italic> [<xref ref-type="bibr" rid="B32-genes-03-00138">32</xref>,<xref ref-type="bibr" rid="B34-genes-03-00138">34</xref>]. Interesting, there was an overall lack of secretion genes in <italic>azc</italic> compared to the other species. One interpretation is that this omission could be related to the capacity of the species to fix nitrogen in free living conditions and only gaining the symbiotic ability in a later stage of its evolution [<xref ref-type="bibr" rid="B7-genes-03-00138">7</xref>]. The non-nitrogen fixing, non symbiont <italic>mes</italic> demonstrates a reverse situation, evolving from a symbiotic nitrogen fixing ancestor to a chemotroph, with the lack of a symbiotic island <italic>i.e.</italic>, no <italic>nif</italic>, <italic>nodDABC</italic> or <italic>fixABCX</italic> orthologs, but still retaining a wide range of other <italic>nod</italic>, <italic>fix</italic> as well as secretion and exopolysaccharide system orthologs typical of symbiotic nitrogen fixing organisms. Altogether, these differences among the fourteen rhizobial genomes highlight the difficulties of defining a “symbiome”.</p>
      </sec>
    </sec>
    <sec id="sec3-genes-03-00138">
      <title>3. Experimental Section</title>
      <sec>
        <title>3.1. Genomes</title>
        <p>All fourteen <italic>Rhizobiales</italic> genomes, including chromosomes and, if applicable, plasmids, were downloaded from NCBI RefSeq. These include three genome sequenced and analyzed by the Center for <italic>Rhizobium</italic> Studies with the suffix WSM (Western Australian Soil Microbiology Collection). All available nucleotide and protein FASTA files, as well as GENBANK files were utilized. See <xref ref-type="supplementary-material" rid="genes-03-00138-s001">Table S1</xref> for details.</p>
      </sec>
      <sec>
        <title>3.2. Bioinformatics Analysis</title>
        <p>Taxonomic analysis of the 16s rRNA gene sequences was completed with MEGA4 [<xref ref-type="bibr" rid="B68-genes-03-00138">68</xref>], by constructing a bootstrapped neighbor joining (NJ) gene tree using Jukes-Cantor substitution. Dotplots (<uri>http://code.google.com/p/freckle/</uri>) were constructed from the above chromosomal and plasmid nucleotide FASTA files using an in house script. The BLASTMATRIX of all protein sequences and extraction and analysis of KEGG data were performed as outlined in Bellgard <italic>et al.</italic> 2009 [<xref ref-type="bibr" rid="B69-genes-03-00138">69</xref>].</p>
        <p>Construction of phylogenetic NJ trees was based on a CLUSTALW alignment with BLOSUM62 matrix, all accomplished with JALVIEW 2.6.1 [<xref ref-type="bibr" rid="B70-genes-03-00138">70</xref>]. Further editing of trees to publication standard was completed with FigTree v1.3.1 (<uri>http://tree.bio.ed.ac.uk/software/figtree/</uri>). </p>
      </sec>
      <sec>
        <title>3.3. Protein Clustering</title>
        <p>A protein reciprocal Blast similarity search with a threshold maximum expected value 1e<sup>−20</sup> was conducted. This protein clustering used BlastlineMCL (<uri>http://www.micans.org/mcl/</uri>) program that implements the MCL cluster algorithm for graphs. The granularity of the output cluster was set with an inflation value of 2.5. A bioinformatics workflow was developed that would for each cluster, annotate the cluster based on domain conservation (NCBI CDD), produce a cluster multiple sequence alignment and phylogenic tree for web publication. The clustering analysis can be viewed at the following web site: <uri>http://ccg.murdoch.edu.au/organism/rhizobium/RhizobiumSummaries/</uri> </p>
        <p>The workflow is freely available for researchers interested in high-through put clustering and annotation by contacting <email>mbellgard@ccg.murdoch.edu.au</email> (reference IVEC and CCG).</p>
      </sec>
    </sec>
    <sec>
      <title>4. Concluding Remarks</title>
      <p>Overall, it can then be concluded from this study that protein clustering has demonstrated that there are multiple overlapping strategies for nitrogen fixing symbiosis and that perhaps the concept of a single self contained ‘symbiome’ is doubtful. Expanding this protein cluster comparison from <italic>Rhizobiales</italic> genomes to a larger range of nitrogen fixers could further resolve the symbiome conundrum. To date several species that do not belong to order <italic>Rhizobiales</italic> but establish nitrogen fixing symbioses have been discovered. These include several species of the beta-proteobacteria <italic>Burkholderia</italic> [<xref ref-type="bibr" rid="B71-genes-03-00138">71</xref>] and <italic>Cupriavidus taiwanensis</italic> [<xref ref-type="bibr" rid="B72-genes-03-00138">72</xref>] as well as the non-legume root nodule symbiosis with the Gram positive <italic>Actinomycetales</italic> genus <italic>Frankia</italic> [<xref ref-type="bibr" rid="B73-genes-03-00138">73</xref>]. Therefore, a wider exploration of several classes of bacteria would have to be considered. </p>
      <p>This wider approach is demonstrated, in part, by Pini <italic>et al.</italic> 2011 (92 alphaproteobacterial genomes) [<xref ref-type="bibr" rid="B74-genes-03-00138">74</xref>] and Amadou <italic>et al.</italic> 2008 (1 betaproteobacterial and 8 alphaproteobacterial genomes) [<xref ref-type="bibr" rid="B72-genes-03-00138">72</xref>]. These studies utilized significantly different clustering techniques and did not include all the 14 strains explored in the current study. However, both these studies share our view, that while there is a large shared gene pool amongst the plant symbionts, there are “multiple recipes” [<xref ref-type="bibr" rid="B74-genes-03-00138">74</xref>] possibly due to the different host environments [<xref ref-type="bibr" rid="B72-genes-03-00138">72</xref>]. This adds further doubt to the concept of a single self-contained “symbiome”. </p>
      <p>One must be conscious of the fact that just predicting an ortholog does not guarantee that the said ortholog is actually expressed. Therefore, despite the number of genomes available to be compared, a wider systems biology approach should also be taken into account. This would include symbiont transcriptomics, protein-protein interactions, the effect of the presence of other prokaryotic and eukaryotic species in the rhizosphere, and the genomics and transcriptomics of the plant host itself. Many pertinent studies have already been completed, such as the construction of a <italic>S. meliloti</italic> interactome [<xref ref-type="bibr" rid="B75-genes-03-00138">75</xref>], the demonstration of host dependant gene expression, based on extensive proteomic and transciptomic analysis in <italic>bja</italic> [<xref ref-type="bibr" rid="B76-genes-03-00138">76</xref>] and in <italic>azc</italic> [<xref ref-type="bibr" rid="B8-genes-03-00138">8</xref>], the role of mychorrhizal fungi in the development of nodules [<xref ref-type="bibr" rid="B77-genes-03-00138">77</xref>], and the increasingly complex role of plant nodule-specific cysteine-rich peptides on the symbiotic relationship between legume and bacterium [<xref ref-type="bibr" rid="B78-genes-03-00138">78</xref>], such as host control of <italic>R. leguminosarum</italic> bacteroid formation through symbiotic auxotrophy [<xref ref-type="bibr" rid="B79-genes-03-00138">79</xref>]. Therefore, if there were any nitrogen fixing ‘symbiome’, it would more likely to be a complex composition of molecular biological and biochemical networks from both symbiont and host in genomics, transcriptomics, proteomics, and even metabolomics. Further, comparative genomics and proteomics, as shown in this study, represents a valuable and key tool for capturing specificities and generalities of each genome, or of groups of genomes showing relevant functionalities, as is the case of nitrogen fixing symbiotic bacteria.</p>
    </sec>
  </body>
  <back>
    <ack>
      <title>Acknowledgements</title>
      <p>We gratefully acknowledge the funding received from Murdoch University Strategic Research Fund through the Crop and Plant Research Institute (CaPRI and the Centre for <italic>Rhizobium</italic> Studies (CRS).</p>
      <p>M. Hungria is also a fellow from CNPq (Conselho Nacional de Desenvolvimento Científico e Tecnológico).</p>
    </ack>
    <ref-list>
      <title>References</title>
      <ref id="B1-genes-03-00138">
        <label>1.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cheng</surname>
              <given-names>Q.</given-names>
            </name>
          </person-group>
          <article-title>Perspectives in biological nitrogen fixation research</article-title>
          <source>J. Integr. Plant Biol.</source>
          <year>2008</year>
          <volume>50</volume>
          <fpage>786</fpage>
          <lpage>798</lpage>
          <pub-id pub-id-type="doi">10.1111/j.1744-7909.2008.00700.x</pub-id>
        </citation>
      </ref>
      <ref id="B2-genes-03-00138">
        <label>2.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Hungria</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Vargas</surname>
              <given-names>M.A.</given-names>
            </name>
          </person-group>
          <article-title>Environmental factors affecting N2 fixation in gratn legumes in the tropics, with an emphasis on Brazil</article-title>
          <source>Field Crops Res.</source>
          <year>2000</year>
          <volume>65</volume>
          <fpage>151</fpage>
          <lpage>164</lpage>
          <pub-id pub-id-type="doi">10.1016/S0378-4290(99)00084-2</pub-id>
        </citation>
      </ref>
      <ref id="B3-genes-03-00138">
        <label>3.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Hungria</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Loureiro</surname>
              <given-names>M.F.</given-names>
            </name>
            <name>
              <surname>Mendes</surname>
              <given-names>I.C.</given-names>
            </name>
            <name>
              <surname>Campo</surname>
              <given-names>R.J.</given-names>
            </name>
            <name>
              <surname>Graham</surname>
              <given-names>P.H.</given-names>
            </name>
          </person-group>
          <article-title>Inoculant Preparation, Production and Application</article-title>
          <source>Nitrogen Fixation in Agriculture, Forestry, Ecology and the Environment</source>
          <person-group person-group-type="editor">
            <name>
              <surname>Werner</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Newton</surname>
              <given-names>W.E.</given-names>
            </name>
          </person-group>
          <publisher-name>Springer</publisher-name>
          <publisher-loc>Dordrecht, The Netherlands</publisher-loc>
          <year>2005</year>
          <fpage>223</fpage>
          <lpage>254</lpage>
        </citation>
      </ref>
      <ref id="B4-genes-03-00138">
        <label>4.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Guerrero</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Peralta</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Aguilar</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Diaz</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Villalobos</surname>
              <given-names>M.A.</given-names>
            </name>
            <name>
              <surname>Medrano-Soto</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Mora</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>Evolutionary, structural and functional relationships revealed by comparative analysis of syntenic genes in Rhizobiales</article-title>
          <source>BMC Evol. Biol.</source>
          <year>2005</year>
          <volume>5</volume>
          <fpage>55</fpage>
          <lpage>73</lpage>
          <pub-id pub-id-type="doi">10.1186/1471-2148-5-55</pub-id>
        </citation>
      </ref>
      <ref id="B5-genes-03-00138">
        <label>5.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Carvalho</surname>
              <given-names>F.M.</given-names>
            </name>
            <name>
              <surname>Souza</surname>
              <given-names>R.C.</given-names>
            </name>
            <name>
              <surname>Barcellos</surname>
              <given-names>F.G.</given-names>
            </name>
            <name>
              <surname>Hungria</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Vasconcelos</surname>
              <given-names>A.T.R.</given-names>
            </name>
          </person-group>
          <article-title>Genomic and evolutionary comparisons of diazotrophic and pathogenic bacteria of the order Rhizobiales</article-title>
          <source>BMC Microbiol.</source>
          <year>2010</year>
          <volume>10</volume>
          <pub-id pub-id-type="doi">10.1186/1471-2180-10-37</pub-id>
        </citation>
      </ref>
      <ref id="B6-genes-03-00138">
        <label>6.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Vavilov</surname>
              <given-names>N.I.</given-names>
            </name>
          </person-group>
          <article-title>Centers of origin of cultivated plants</article-title>
          <source>Trends Pract. Bot. Genet. Sel.</source>
          <year>1926</year>
          <volume>16</volume>
          <fpage>3</fpage>
          <lpage>248</lpage>
          <comment>(in Russian)</comment>
        </citation>
      </ref>
      <ref id="B7-genes-03-00138">
        <label>7.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lee</surname>
              <given-names>K.B.</given-names>
            </name>
            <name>
              <surname>de Backer</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Aono</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>C.T.</given-names>
            </name>
            <name>
              <surname>Suzuki</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Suzuki</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Kaneko</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Yamada</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Tabata</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Kupfer</surname>
              <given-names>D.M.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The genome of the versatile nitrogen fixer Azorhizobium caulinodans ORS571</article-title>
          <source>BMC Genomics</source>
          <year>2008</year>
          <volume>9</volume>
          <pub-id pub-id-type="doi">10.1186/1471-2164-9-271</pub-id>
        </citation>
      </ref>
      <ref id="B8-genes-03-00138">
        <label>8.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Tsukada</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Aono</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Akiba</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Lee</surname>
              <given-names>K.B.</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>C.T.</given-names>
            </name>
            <name>
              <surname>Toyazaki</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Oyaizu</surname>
              <given-names>H.</given-names>
            </name>
          </person-group>
          <article-title>Comparative genome-wide transcriptional profiling of Azorhizobium caulinodans ORS571 grown under free-living and symbiotic conditions</article-title>
          <source>Appl. Environ. Microbiol.</source>
          <year>2009</year>
          <volume>75</volume>
          <fpage>5037</fpage>
          <lpage>5046</lpage>
        <pub-id pub-id-type="doi">10.1128/AEM.00398-09</pub-id><pub-id pub-id-type="pmid">19542345</pub-id></citation>
      </ref>
      <ref id="B9-genes-03-00138">
        <label>9.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kaneko</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Nakamura</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Sato</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Minamisawa</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Uchiumi</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Sasamoto</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Watanabe</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Idesawa</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Iriguchi</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Kawashima</surname>
              <given-names>K.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Complete genomic sequence of nitrogen-fixing symbiotic bacterium Bradyrhizobium japonicum USDA110</article-title>
          <source>DNA Res.</source>
          <year>2002</year>
          <volume>9</volume>
          <fpage>189</fpage>
          <lpage>197</lpage>
          <pub-id pub-id-type="doi">10.1093/dnares/9.6.189</pub-id>
        </citation>
      </ref>
      <ref id="B10-genes-03-00138">
        <label>10.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Bonaldi</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Gourion</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Fardoux</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Hannibal</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Cartieaux</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Boursot</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Vallenet</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Chaintreuil</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Prin</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Nouwen</surname>
              <given-names>N.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Large-scale transposon mutagenesis of photosynthetic Bradyrhizobium sp. strain ORS278 reveals new genetic loci putatively important for nod-independent symbiosis with Aeschynomene indica</article-title>
          <source>Mol. Plant Microbe Interact.</source>
          <year>2010</year>
          <volume>23</volume>
          <fpage>760</fpage>
          <lpage>770</lpage>
          <pub-id pub-id-type="doi">10.1094/MPMI-23-6-0760</pub-id>
        </citation>
      </ref>
      <ref id="B11-genes-03-00138">
        <label>11.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Giraud</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Moulin</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Vallenet</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Barbe</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Cytryn</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Avarre</surname>
              <given-names>J.C.</given-names>
            </name>
            <name>
              <surname>Jaubert</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Simon</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Cartieaux</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Prin</surname>
              <given-names>Y.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Legumes symbioses: Absence of nod genes in photosynthetic bradyrhizobia</article-title>
          <source>Science</source>
          <year>2007</year>
          <volume>316</volume>
          <fpage>1307</fpage>
          <lpage>1312</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1139548</pub-id><pub-id pub-id-type="pmid">17540897</pub-id></citation>
      </ref>
      <ref id="B12-genes-03-00138">
        <label>12.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kaneko</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Nakamura</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Sato</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Asamizu</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Kato</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Sasamoto</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Watanabe</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Idesawa</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Ishikawa</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Kawashima</surname>
              <given-names>K.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Complete genome structure of the nitrogen-fixing symbiotic bacterium Mesorhizobium loti</article-title>
          <source>DNA Res.</source>
          <year>2000</year>
          <volume>7</volume>
          <fpage>331</fpage>
          <lpage>338</lpage>
          <pub-id pub-id-type="doi">10.1093/dnares/7.6.331</pub-id>
        </citation>
      </ref>
      <ref id="B13-genes-03-00138">
        <label>13.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Bohuslavek</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Payne</surname>
              <given-names>J.W.</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Bolton</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Xun</surname>
              <given-names>L.Y.</given-names>
            </name>
          </person-group>
          <article-title>Cloning, sequencing, and characterization of a gene cluster involved in EDTA degradation from the bacterium BNC1</article-title>
          <source>Appl. Environ. Microbiol.</source>
          <year>2001</year>
          <volume>67</volume>
          <fpage>688</fpage>
          <lpage>695</lpage>
          <pub-id pub-id-type="doi">10.1128/AEM.67.2.688-695.2001</pub-id>
        </citation>
      </ref>
      <ref id="B14-genes-03-00138">
        <label>14.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Baldwin</surname>
              <given-names>I.L.</given-names>
            </name>
            <name>
              <surname>Fred</surname>
              <given-names>E.B.</given-names>
            </name>
          </person-group>
          <article-title>Nomenclature of the root-nodule bacteria of the Leguminosae</article-title>
          <source>J. Bacteriol.</source>
          <year>1929</year>
          <volume>17</volume>
          <fpage>141</fpage>
          <lpage>150</lpage>
        <pub-id pub-id-type="pmid">16559355</pub-id></citation>
      </ref>
      <ref id="B15-genes-03-00138">
        <label>15.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gonzalez</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Bustos</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Ramirez-Romero</surname>
              <given-names>M.A.</given-names>
            </name>
            <name>
              <surname>Medrano-Soto</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Salgado</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Hernandez-Gonzalez</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Hernandez-Celis</surname>
              <given-names>J.C.</given-names>
            </name>
            <name>
              <surname>Quintero</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Moreno-Hagelsieb</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Girard</surname>
              <given-names>L.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The mosaic structure of the symbiotic plasmid of Rhizobium etli CFN42 and its relation to other symbiotic genome compartments</article-title>
          <source>Genome Biol.</source>
          <year>2003</year>
          <volume>4</volume>
          <pub-id pub-id-type="doi">10.1186/gb-2003-4-6-r36</pub-id>
        </citation>
      </ref>
      <ref id="B16-genes-03-00138">
        <label>16.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>González</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Santamaría</surname>
              <given-names>R.I.</given-names>
            </name>
            <name>
              <surname>Bustos</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Hernández-González</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Medrano-Soto</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Moreno-Hagelsieb</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Janga</surname>
              <given-names>S.C.</given-names>
            </name>
            <name>
              <surname>Ramírez</surname>
              <given-names>M.A.</given-names>
            </name>
            <name>
              <surname>Jiménez-Jacinto</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Collado-Vides</surname>
              <given-names>J.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The partitioned Rhizobium etli genome: Genetic and metabolic redundancy in seven interacting replicons</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2006</year>
          <volume>103</volume>
          <fpage>3834</fpage>
          <lpage>3839</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.0508502103</pub-id><pub-id pub-id-type="pmid">16505379</pub-id></citation>
      </ref>
      <ref id="B17-genes-03-00138">
        <label>17.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>González</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Acosta</surname>
              <given-names>J.L.</given-names>
            </name>
            <name>
              <surname>Santamaría</surname>
              <given-names>R.I.</given-names>
            </name>
            <name>
              <surname>Bustos</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Fernández</surname>
              <given-names>J.L.</given-names>
            </name>
            <name>
              <surname>Hernández González</surname>
              <given-names>I.L.</given-names>
            </name>
            <name>
              <surname>Díaz</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Flores</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Palacios</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Mora</surname>
              <given-names>J.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Conserved symbiotic plasmid DNA sequences in the multireplicon pangenomic structure of Rhizobium etli</article-title>
          <source>Appl. Environ. Microbiol.</source>
          <year>2010</year>
          <volume>76</volume>
          <fpage>1604</fpage>
          <lpage>1614</lpage>
        <pub-id pub-id-type="doi">10.1128/AEM.02039-09</pub-id><pub-id pub-id-type="pmid">20048063</pub-id></citation>
      </ref>
      <ref id="B18-genes-03-00138">
        <label>18.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lozano</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Hernández-González</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Bustos</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Santamaría</surname>
              <given-names>R.I.</given-names>
            </name>
            <name>
              <surname>Souza</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Young</surname>
              <given-names>J.P.W.</given-names>
            </name>
            <name>
              <surname>Dávila</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>González</surname>
              <given-names>V.</given-names>
            </name>
          </person-group>
          <article-title>Evolutionary dynamics of insertion sequences in relation to the evolutionary histories of the chromosome and symbiotic plasmid genes of Rhizobium etli populations</article-title>
          <source>Appl. Environ. Microbiol.</source>
          <year>2010</year>
          <volume>76</volume>
          <fpage>6504</fpage>
          <lpage>6513</lpage>
        <pub-id pub-id-type="doi">10.1128/AEM.01001-10</pub-id><pub-id pub-id-type="pmid">20675442</pub-id></citation>
      </ref>
      <ref id="B19-genes-03-00138">
        <label>19.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Howieson</surname>
              <given-names>J.G.</given-names>
            </name>
            <name>
              <surname>Yates</surname>
              <given-names>R.J.</given-names>
            </name>
            <name>
              <surname>Ryder</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Real</surname>
              <given-names>D.</given-names>
            </name>
          </person-group>
          <article-title>The interactions of Rhizobium leguminosarum biovar trifolii in nodulation of annual and perennial Trifolium spp. from diverse centres of origin</article-title>
          <source>Aust. J. Exp. Agric.</source>
          <year>2005</year>
          <volume>45</volume>
          <fpage>199</fpage>
          <lpage>207</lpage>
          <pub-id pub-id-type="doi">10.1071/EA03167</pub-id>
        </citation>
      </ref>
      <ref id="B20-genes-03-00138">
        <label>20.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Yates</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Howieson</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Reeve</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>O’Hara</surname>
              <given-names>G.</given-names>
            </name>
          </person-group>
          <article-title>A re-appraisal of the biology and terminology describing rhizobial strain success in nodule occupancy of legumes in agriculture</article-title>
          <source>Plant Soil</source>
          <year>2011</year>
          <volume>348</volume>
          <fpage>255</fpage>
          <lpage>267</lpage>
          <pub-id pub-id-type="doi">10.1007/s11104-011-0971-z</pub-id>
        </citation>
      </ref>
      <ref id="B21-genes-03-00138">
        <label>21.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Young</surname>
              <given-names>J.P.</given-names>
            </name>
            <name>
              <surname>Crossman</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Johnston</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Thomson</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Ghazoui</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Hull</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Wexler</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Curson</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Todd</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Poole</surname>
              <given-names>P.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The genome of Rhizobium leguminosarum has recognizable core and accessory components</article-title>
          <source>Genome Biol.</source>
          <year>2006</year>
          <volume>7</volume>
          <pub-id pub-id-type="doi">10.1186/gb-2006-7-4-r34</pub-id>
        </citation>
      </ref>
      <ref id="B22-genes-03-00138">
        <label>22.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Chen</surname>
              <given-names>W.X.</given-names>
            </name>
            <name>
              <surname>Yan</surname>
              <given-names>G.H.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>J.L.</given-names>
            </name>
          </person-group>
          <article-title>Numerical taxonomic study of fast-growing soybean rhizobia and a proposal that Rhizobium fredii be assigned to Sinorhizobium gen. nov</article-title>
          <source>Int. J. Syst. Bacteriol.</source>
          <year>1988</year>
          <volume>38</volume>
          <fpage>392</fpage>
          <lpage>397</lpage>
          <pub-id pub-id-type="doi">10.1099/00207713-38-4-392</pub-id>
        </citation>
      </ref>
      <ref id="B23-genes-03-00138">
        <label>23.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Young</surname>
              <given-names>J.M.</given-names>
            </name>
          </person-group>
          <article-title>The genus name Ensifer Casida 1982 takes priority over Sinorhizobium Chen <italic>et al.</italic> 1988, and Sinorhizobium morelense Wang <italic>et al.</italic> 2002 is a later synonym of Ensfer adhaerens Casida 1982. Is the combination ‘Sinorhizobium adhaerens’ (Casida 1982) Willems <italic>et al.</italic> 2003 legitimate? Request for an Opinion</article-title>
          <source>Int. J. Syst. Evol. Microbiol.</source>
          <year>2003</year>
          <volume>53</volume>
          <fpage>2107</fpage>
          <lpage>2110</lpage>
          <pub-id pub-id-type="doi">10.1099/ijs.0.02665-0</pub-id>
        </citation>
      </ref>
      <ref id="B24-genes-03-00138">
        <label>24.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Sawada</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Kuykendall</surname>
              <given-names>L.D.</given-names>
            </name>
            <name>
              <surname>Young</surname>
              <given-names>J.M.</given-names>
            </name>
          </person-group>
          <article-title>Changing concepts in the systematics of bacterial nitrogen-fixing legume symbionts</article-title>
          <source>J. Gen. Appl. Microbiol.</source>
          <year>2003</year>
          <volume>49</volume>
          <fpage>155</fpage>
          <lpage>179</lpage>
          <pub-id pub-id-type="doi">10.2323/jgam.49.155</pub-id>
        </citation>
      </ref>
      <ref id="B25-genes-03-00138">
        <label>25.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Graham</surname>
              <given-names>P.H.</given-names>
            </name>
          </person-group>
          <article-title>Ecology of the Root-Nodule Bacteria of Legumes</article-title>
          <source>Nitrogen-Fixing Leguminous Symbioses</source>
          <person-group person-group-type="editor">
            <name>
              <surname>Dilworth</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>James</surname>
              <given-names>E.K.</given-names>
            </name>
            <name>
              <surname>Sprent</surname>
              <given-names>J.I.</given-names>
            </name>
            <name>
              <surname>Newton</surname>
              <given-names>W.E.</given-names>
            </name>
          </person-group>
          <publisher-name>Springer</publisher-name>
          <publisher-loc>Dordrecht, The Netherlands</publisher-loc>
          <year>2007</year>
          <volume>7</volume>
          <fpage>23</fpage>
          <lpage>58</lpage>
        </citation>
      </ref>
      <ref id="B26-genes-03-00138">
        <label>26.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Capela</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Barloy-Hubler</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Gouzy</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Bothe</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Ampe</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Batut</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Boistard</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Becker</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Boutry</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Cadieu</surname>
              <given-names>E.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Analysis of the chromosome sequence of the legume symbiont Sinorhizobium meliloti strain 1021</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2001</year>
          <volume>98</volume>
          <fpage>9877</fpage>
          <lpage>9882</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.161294398</pub-id><pub-id pub-id-type="pmid">11481430</pub-id></citation>
      </ref>
      <ref id="B27-genes-03-00138">
        <label>27.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Barnett</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Fisher</surname>
              <given-names>R.F.</given-names>
            </name>
            <name>
              <surname>Jones</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Komp</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Abola</surname>
              <given-names>A.P.</given-names>
            </name>
            <name>
              <surname>Gurjal</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Hong</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Huizar</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Bowser</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Capela</surname>
              <given-names>D.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Nucleotide sequence and predicted functions of the entire Sinorhizobium meliloti pSymA megaplasmid</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2001</year>
          <volume>98</volume>
          <fpage>9883</fpage>
          <lpage>9888</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.161294798</pub-id><pub-id pub-id-type="pmid">11481432</pub-id></citation>
      </ref>
      <ref id="B28-genes-03-00138">
        <label>28.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Finan</surname>
              <given-names>T.M.</given-names>
            </name>
            <name>
              <surname>Weidner</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Wong</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Buhrmester</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Chain</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Vorholter</surname>
              <given-names>F.J.</given-names>
            </name>
            <name>
              <surname>Hernandez-lucas</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Becker</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Cowie</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Gouzy</surname>
              <given-names>J.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The complete sequence of the 1,683-kb pSymB megaplasmid from the N2-fixing endosymbiont Sinorhizobium meliloti</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2001</year>
          <volume>98</volume>
          <fpage>9889</fpage>
          <lpage>9894</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.161294698</pub-id><pub-id pub-id-type="pmid">11481431</pub-id></citation>
      </ref>
      <ref id="B29-genes-03-00138">
        <label>29.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Terpolilli</surname>
              <given-names>J.J.</given-names>
            </name>
            <name>
              <surname>Tiwari</surname>
              <given-names>R.P.</given-names>
            </name>
            <name>
              <surname>Dilworth</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>O'Hara</surname>
              <given-names>G.W.</given-names>
            </name>
            <name>
              <surname>Howieson</surname>
              <given-names>J.G.</given-names>
            </name>
          </person-group>
          <article-title>Investigating nitrogen fixation in the Medicago-Sinorhizobium symbiosis</article-title>
          <source>Biol. Nitrogen Fixat.</source>
          <year>2008</year>
          <volume>42</volume>
          <fpage>145</fpage>
          <lpage>146</lpage>
        </citation>
      </ref>
      <ref id="B30-genes-03-00138">
        <label>30.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Terpolilli</surname>
              <given-names>J.J.</given-names>
            </name>
            <name>
              <surname>O'Hara</surname>
              <given-names>G.W.</given-names>
            </name>
            <name>
              <surname>Tiwari</surname>
              <given-names>R.P.</given-names>
            </name>
            <name>
              <surname>Dilworth</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Howieson</surname>
              <given-names>J.G.</given-names>
            </name>
          </person-group>
          <article-title>The model legume Medicago truncatula A17 is poorly matched for N2 fixation with the sequenced microsymbiont Sinorhizobium meliloti 1021</article-title>
          <source>New Phytol.</source>
          <year>2008</year>
          <volume>179</volume>
          <fpage>62</fpage>
          <lpage>66</lpage>
          <pub-id pub-id-type="doi">10.1111/j.1469-8137.2008.02464.x</pub-id>
        </citation>
      </ref>
      <ref id="B31-genes-03-00138">
        <label>31.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Garau</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Reeve</surname>
              <given-names>W.G.</given-names>
            </name>
            <name>
              <surname>Brau</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Deiana</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Yates</surname>
              <given-names>R.J.</given-names>
            </name>
            <name>
              <surname>James</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Tiwari</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>O’Hara</surname>
              <given-names>G.W.</given-names>
            </name>
            <name>
              <surname>Howieson</surname>
              <given-names>J.G.</given-names>
            </name>
          </person-group>
          <article-title>The symbiotic requirements of different Medicago spp. suggest the evolution of Sinorhizobium meliloti and S-Medicae with hosts differentially adapted to soil pH</article-title>
          <source>Plant Soil</source>
          <year>2005</year>
          <volume>276</volume>
          <fpage>263</fpage>
          <lpage>277</lpage>
          <pub-id pub-id-type="doi">10.1007/s11104-005-0374-0</pub-id>
        </citation>
      </ref>
      <ref id="B32-genes-03-00138">
        <label>32.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Pueppke</surname>
              <given-names>S.G.</given-names>
            </name>
            <name>
              <surname>Broughton</surname>
              <given-names>W.J.</given-names>
            </name>
          </person-group>
          <article-title>Rhizobium sp. strain NGR234 and R. fredii USDA257 share exceptionally broad, nested host ranges</article-title>
          <source>Mol. Plant Microbe Interact.</source>
          <year>1999</year>
          <volume>12</volume>
          <fpage>293</fpage>
          <lpage>318</lpage>
          <pub-id pub-id-type="doi">10.1094/MPMI.1999.12.4.293</pub-id>
        </citation>
      </ref>
      <ref id="B33-genes-03-00138">
        <label>33.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Viprey</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Rosenthal</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Broughton</surname>
              <given-names>W.J.</given-names>
            </name>
            <name>
              <surname>Perret</surname>
              <given-names>X.</given-names>
            </name>
          </person-group>
          <article-title>Genetic snapshots of the Rhizobium species NGR234 genome</article-title>
          <source>Genome Biol.</source>
          <year>2000</year>
          <volume>1</volume>
          <fpage>1</fpage>
          <lpage>17</lpage>
        <pub-id pub-id-type="pmid">11178226</pub-id></citation>
      </ref>
      <ref id="B34-genes-03-00138">
        <label>34.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Schmeisser</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Liesegang</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Krysciak</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Bakkou</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>le Quéré</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Wollherr</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Heinemeyer</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Morgenstern</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Pommerening-Röser</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Flores</surname>
              <given-names>M.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Rhizobium sp. strain NGR234 possesses a remarkable number of secretion systems</article-title>
          <source>Appl. Environ. Microbiol.</source>
          <year>2009</year>
          <volume>75</volume>
          <fpage>4035</fpage>
          <lpage>4045</lpage>
        <pub-id pub-id-type="doi">10.1128/AEM.00515-09</pub-id><pub-id pub-id-type="pmid">19376903</pub-id></citation>
      </ref>
      <ref id="B35-genes-03-00138">
        <label>35.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Turner</surname>
              <given-names>S.L.</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>X.X.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>F.D.</given-names>
            </name>
            <name>
              <surname>Young</surname>
              <given-names>J.P.</given-names>
            </name>
          </person-group>
          <article-title>What does a bacterial genome sequence represent? Mis-assignment of MAFF 303099 to the genospecies Mesorhizobium loti</article-title>
          <source>Microbiology</source>
          <year>2002</year>
          <volume>148</volume>
          <fpage>3330</fpage>
          <lpage>3331</lpage>
        <pub-id pub-id-type="pmid">12427922</pub-id></citation>
      </ref>
      <ref id="B36-genes-03-00138">
        <label>36.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Willems</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>The taxonomy of rhizobia: An overview</article-title>
          <source>Plant and Soil</source>
          <year>2006</year>
          <volume>287</volume>
          <fpage>3</fpage>
          <lpage>14</lpage>
          <pub-id pub-id-type="doi">10.1007/s11104-006-9058-7</pub-id>
        </citation>
      </ref>
      <ref id="B37-genes-03-00138">
        <label>37.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Landeta</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Davalos</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Cevallos</surname>
              <given-names>M.A.</given-names>
            </name>
            <name>
              <surname>Geiger</surname>
              <given-names>O.</given-names>
            </name>
            <name>
              <surname>Brom</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Romero</surname>
              <given-names>D.</given-names>
            </name>
          </person-group>
          <article-title>Plasmids with a chromosome-like role in Rhizobium</article-title>
          <source>J. Bacteriol.</source>
          <year>2011</year>
          <volume>193</volume>
          <fpage>1317</fpage>
          <lpage>1326</lpage>
        <pub-id pub-id-type="doi">10.1128/JB.01184-10</pub-id><pub-id pub-id-type="pmid">21217003</pub-id></citation>
      </ref>
      <ref id="B38-genes-03-00138">
        <label>38.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Harrison</surname>
              <given-names>P.W.</given-names>
            </name>
            <name>
              <surname>Lower</surname>
              <given-names>R.P.J.</given-names>
            </name>
            <name>
              <surname>Kim</surname>
              <given-names>N.K.D.</given-names>
            </name>
            <name>
              <surname>Young</surname>
              <given-names>J.P.W.</given-names>
            </name>
          </person-group>
          <article-title>Introducing the bacterial ‘chromid’: Not a chromosome, not a plasmid</article-title>
          <source>Trends Microbiol.</source>
          <year>2010</year>
          <volume>18</volume>
          <fpage>141</fpage>
          <lpage>148</lpage>
          <pub-id pub-id-type="doi">10.1016/j.tim.2009.12.010</pub-id>
        </citation>
      </ref>
      <ref id="B39-genes-03-00138">
        <label>39.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gevers</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Cohan</surname>
              <given-names>F.M.</given-names>
            </name>
            <name>
              <surname>Lawrence</surname>
              <given-names>J.G.</given-names>
            </name>
            <name>
              <surname>Spratt</surname>
              <given-names>B.G.</given-names>
            </name>
            <name>
              <surname>Coenye</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Feil</surname>
              <given-names>E.J.</given-names>
            </name>
            <name>
              <surname>Stackebrandt</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Peer</surname>
              <given-names>Y.V.D.</given-names>
            </name>
            <name>
              <surname>Vandamme</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>THOMPSON</surname>
              <given-names>F.L.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Re-evaluating prokaryotic species</article-title>
          <source>Nat. Rev. Microbiol.</source>
          <year>2005</year>
          <volume>3</volume>
          <fpage>733</fpage>
          <lpage>739</lpage>
          <pub-id pub-id-type="doi">10.1038/nrmicro1236</pub-id>
        </citation>
      </ref>
      <ref id="B40-genes-03-00138">
        <label>40.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gogarten</surname>
              <given-names>J.P.</given-names>
            </name>
            <name>
              <surname>Townsend</surname>
              <given-names>J.P.</given-names>
            </name>
          </person-group>
          <article-title>Horizontal gene transfer, genomes innovation and evolution</article-title>
          <source>Nat. Rev. Microbiol.</source>
          <year>2005</year>
          <volume>3</volume>
          <fpage>679</fpage>
          <lpage>687</lpage>
          <pub-id pub-id-type="doi">10.1038/nrmicro1204</pub-id>
        </citation>
      </ref>
      <ref id="B41-genes-03-00138">
        <label>41.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gómez-Hernández</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Reyes-González</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Sánchez</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Mora</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Delgado</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Girard</surname>
              <given-names>L.</given-names>
            </name>
          </person-group>
          <article-title>Regulation and symbiotic role of nirK and norC expression in Rhizobium etli</article-title>
          <source>Mol. Plant Microbe Interact.</source>
          <year>2011</year>
          <volume>24</volume>
          <fpage>233</fpage>
          <lpage>235</lpage>
          <pub-id pub-id-type="doi">10.1094/MPMI-07-10-0173</pub-id>
        </citation>
      </ref>
      <ref id="B42-genes-03-00138">
        <label>42.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Enright</surname>
              <given-names>A.J.</given-names>
            </name>
            <name>
              <surname>van Dongen</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Ouzounis</surname>
              <given-names>C.A.</given-names>
            </name>
          </person-group>
          <article-title>An efficient algorithm for large-scale detection of protein families</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2002</year>
          <volume>30</volume>
          <fpage>1575</fpage>
          <lpage>1584</lpage>
          <pub-id pub-id-type="doi">10.1093/nar/30.7.1575</pub-id>
        </citation>
      </ref>
      <ref id="B43-genes-03-00138">
        <label>43.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Nicolás</surname>
              <given-names>M.F.</given-names>
            </name>
            <name>
              <surname>Barcellos</surname>
              <given-names>M.F.</given-names>
            </name>
            <name>
              <surname>Hess</surname>
              <given-names>P.N.</given-names>
            </name>
            <name>
              <surname>Hungria</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>ABC transporters in Mycoplasma hyopneumoniae and Mycoplasma synoviae: Insights into evolution and pathogenicity</article-title>
          <source>Genet. Mol. Biol.</source>
          <year>2007</year>
          <volume>30</volume>
          <fpage>202</fpage>
          <lpage>211</lpage>
          <pub-id pub-id-type="doi">10.1590/S1415-47572007000200006</pub-id>
        </citation>
      </ref>
      <ref id="B44-genes-03-00138">
        <label>44.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kaneko</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Nakamura</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Sato</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Asamizu</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Kato</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Sasamoto</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Watanabe</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Idesawa</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Ishikawa</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Kawashima</surname>
              <given-names>K.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Complete genome structure of the nitrogen-fixing symbiotic bacterium Mesorhizobium loti (supplement)</article-title>
          <source>DNA Res.</source>
          <year>2000</year>
          <volume>7</volume>
          <fpage>381</fpage>
          <lpage>406</lpage>
          <pub-id pub-id-type="doi">10.1093/dnares/7.6.381</pub-id>
        </citation>
      </ref>
      <ref id="B45-genes-03-00138">
        <label>45.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Galibert</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Finan</surname>
              <given-names>T.M.</given-names>
            </name>
            <name>
              <surname>Long</surname>
              <given-names>S.R.</given-names>
            </name>
            <name>
              <surname>Puhler</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Abola</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Ampe</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Barloy-Hubler</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Barnett</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Becker</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Boistard</surname>
              <given-names>P.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The composite genome of the legume symbiont Sinorhizobium meliloti</article-title>
          <source>Science</source>
          <year>2001</year>
          <volume>293</volume>
          <fpage>668</fpage>
          <lpage>672</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1060966</pub-id><pub-id pub-id-type="pmid">11474104</pub-id></citation>
      </ref>
      <ref id="B46-genes-03-00138">
        <label>46.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Jones</surname>
              <given-names>K.M.</given-names>
            </name>
            <name>
              <surname>Kobayashi</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Davies</surname>
              <given-names>B.W.</given-names>
            </name>
            <name>
              <surname>Taga</surname>
              <given-names>M.E.</given-names>
            </name>
            <name>
              <surname>Walker</surname>
              <given-names>G.C.</given-names>
            </name>
          </person-group>
          <article-title>How rhizobial symbionts invade plants: The Sinorhizobium–Medicago model</article-title>
          <source>Nat. Rev. Microbiol.</source>
          <year>2007</year>
          <volume>5</volume>
          <fpage>619</fpage>
          <lpage>633</lpage>
          <pub-id pub-id-type="doi">10.1038/nrmicro1705</pub-id>
        </citation>
      </ref>
      <ref id="B47-genes-03-00138">
        <label>47.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Oliveira</surname>
              <given-names>L.R.</given-names>
            </name>
            <name>
              <surname>Marcelino</surname>
              <given-names>F.C.</given-names>
            </name>
            <name>
              <surname>Barcellos</surname>
              <given-names>F.G.</given-names>
            </name>
            <name>
              <surname>Rodrigues</surname>
              <given-names>E.P.</given-names>
            </name>
            <name>
              <surname>Megías</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Hungria</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>The nodC, nodG, and glgX genes of Rhizobium tropici strain PRF 81</article-title>
          <source>Funct. Integr. Genomics</source>
          <year>2010</year>
          <volume>10</volume>
          <fpage>425</fpage>
          <lpage>431</lpage>
          <pub-id pub-id-type="doi">10.1007/s10142-009-0151-x</pub-id>
        </citation>
      </ref>
      <ref id="B48-genes-03-00138">
        <label>48.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Downie</surname>
              <given-names>J.A.</given-names>
            </name>
          </person-group>
          <article-title>The roles of extracellular proteins, polysaccharides and signals in the interactions of rhizobia with legume roots</article-title>
          <source>FEMS Microbiol. Rev.</source>
          <year>2010</year>
          <volume>34</volume>
          <fpage>150</fpage>
          <lpage>170</lpage>
          <pub-id pub-id-type="doi">10.1111/j.1574-6976.2009.00205.x</pub-id>
        </citation>
      </ref>
      <ref id="B49-genes-03-00138">
        <label>49.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Walker</surname>
              <given-names>S.A.</given-names>
            </name>
            <name>
              <surname>Viprey</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Downie</surname>
              <given-names>J.A.</given-names>
            </name>
          </person-group>
          <article-title>Dissection of nodulation signaling using pea mutants defective for calcium spiking induced by nod factors and chitin oligomers</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2000</year>
          <volume>97</volume>
          <fpage>13413</fpage>
          <lpage>13418</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.230440097</pub-id><pub-id pub-id-type="pmid">11078515</pub-id></citation>
      </ref>
      <ref id="B50-genes-03-00138">
        <label>50.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Vlassak</surname>
              <given-names>K.M.</given-names>
            </name>
            <name>
              <surname>de Wilde</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Snoeck</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Luyten</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>van Rhijn</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Vanderleyden</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>The Rhizobium sp. BR816 nodD3 gene is regulated by a transcriptional regulator of the AraC/XylS family</article-title>
          <source>Mol. Gen. Genet.</source>
          <year>1998</year>
          <volume>258</volume>
          <fpage>558</fpage>
          <lpage>561</lpage>
          <pub-id pub-id-type="doi">10.1007/s004380050768</pub-id>
        </citation>
      </ref>
      <ref id="B51-genes-03-00138">
        <label>51.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cooper</surname>
              <given-names>J.E.</given-names>
            </name>
          </person-group>
          <article-title>Multiple responses of rhizobia to flavonoids during legume root infection</article-title>
          <source>Adv. Bot. Res. Inc. Adv. Plant Pathol.</source>
          <year>2004</year>
          <volume>41</volume>
          <fpage>1</fpage>
          <lpage>62</lpage>
          <pub-id pub-id-type="doi">10.1016/S0065-2296(04)41001-5</pub-id>
        </citation>
      </ref>
      <ref id="B52-genes-03-00138">
        <label>52.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cooper</surname>
              <given-names>J.E.</given-names>
            </name>
          </person-group>
          <article-title>Early interactions between legumes and rhizobia: Disclosing complexity in a molecular dialogue</article-title>
          <source>J. Appl. Microbiol.</source>
          <year>2007</year>
          <volume>103</volume>
          <fpage>1355</fpage>
          <lpage>1365</lpage>
          <pub-id pub-id-type="doi">10.1111/j.1365-2672.2007.03366.x</pub-id>
        </citation>
      </ref>
      <ref id="B53-genes-03-00138">
        <label>53.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Fauvart</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Michiels</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>Rhizobial secreted proteins as determinants of host specificity in the rhizobium-legume symbiosis</article-title>
          <source>FEMS Microbiol. Lett.</source>
          <year>2008</year>
          <volume>285</volume>
          <fpage>1</fpage>
          <lpage>9</lpage>
          <pub-id pub-id-type="doi">10.1111/j.1574-6968.2008.01254.x</pub-id>
        </citation>
      </ref>
      <ref id="B54-genes-03-00138">
        <label>54.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Deakin</surname>
              <given-names>W.J.</given-names>
            </name>
            <name>
              <surname>Broughton</surname>
              <given-names>W.J.</given-names>
            </name>
          </person-group>
          <article-title>Symbiotic use of pathogenic strategies: Rhizobial protein secretion systems</article-title>
          <source>Nat. Rev. Microbiol.</source>
          <year>2009</year>
          <volume>7</volume>
          <fpage>312</fpage>
          <lpage>320</lpage>
        <pub-id pub-id-type="pmid">19270720</pub-id></citation>
      </ref>
      <ref id="B55-genes-03-00138">
        <label>55.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Pickering</surname>
              <given-names>B.S.</given-names>
            </name>
            <name>
              <surname>Oresnik</surname>
              <given-names>I.J.</given-names>
            </name>
          </person-group>
          <article-title>The twin arginine transport system appears to be essential for viability in Sinorhizobium meliloti</article-title>
          <source>J. Bacteriol.</source>
          <year>2010</year>
          <volume>192</volume>
          <fpage>5173</fpage>
          <lpage>5180</lpage>
          <pub-id pub-id-type="doi">10.1128/JB.00206-10</pub-id>
        </citation>
      </ref>
      <ref id="B56-genes-03-00138">
        <label>56.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Alvarez-Martinez</surname>
              <given-names>C.E.</given-names>
            </name>
            <name>
              <surname>Christie</surname>
              <given-names>P.J.</given-names>
            </name>
          </person-group>
          <article-title>Biological diversity of prokaryotic type IV secretion systems</article-title>
          <source>Microbiol. Mol. Biol. Rev.</source>
          <year>2009</year>
          <volume>73</volume>
          <fpage>775</fpage>
          <lpage>808</lpage>
          <pub-id pub-id-type="doi">10.1128/MMBR.00023-09</pub-id>
        </citation>
      </ref>
      <ref id="B57-genes-03-00138">
        <label>57.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Krishnan</surname>
              <given-names>H.B.</given-names>
            </name>
            <name>
              <surname>Lorio</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Kim</surname>
              <given-names>W.S.</given-names>
            </name>
            <name>
              <surname>Jiang</surname>
              <given-names>G.Q.</given-names>
            </name>
            <name>
              <surname>Kim</surname>
              <given-names>K.Y.</given-names>
            </name>
            <name>
              <surname>DeBoer</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Pueppke</surname>
              <given-names>S.G.</given-names>
            </name>
          </person-group>
          <article-title>Extracellular proteins involved in soybean cultivar-specific nodulation are associated with pilus-like surface appendages and exported by a type III protein secretion system in Sinorhizobium fredii USDA257</article-title>
          <source>Mol. Plant Microbe Interact.</source>
          <year>2003</year>
          <volume>16</volume>
          <fpage>617</fpage>
          <lpage>625</lpage>
          <pub-id pub-id-type="doi">10.1094/MPMI.2003.16.7.617</pub-id>
        </citation>
      </ref>
      <ref id="B58-genes-03-00138">
        <label>58.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Desvaux</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Parham</surname>
              <given-names>N.J.</given-names>
            </name>
            <name>
              <surname>Henderson</surname>
              <given-names>I.R.</given-names>
            </name>
          </person-group>
          <article-title>Type V protein secretion: Simplicity gone awry?</article-title>
          <source>Curr. Issues Mol. Biol.</source>
          <year>2004</year>
          <volume>6</volume>
          <fpage>111</fpage>
          <lpage>124</lpage>
        <pub-id pub-id-type="pmid">15119822</pub-id></citation>
      </ref>
      <ref id="B59-genes-03-00138">
        <label>59.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Schwarz</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Hood</surname>
              <given-names>R.D.</given-names>
            </name>
            <name>
              <surname>Mougous</surname>
              <given-names>J.D.</given-names>
            </name>
          </person-group>
          <article-title>What is type VI secretion doing in all those bugs?</article-title>
          <source>Trends Microbiol.</source>
          <year>2010</year>
          <volume>18</volume>
          <fpage>531</fpage>
          <lpage>537</lpage>
          <pub-id pub-id-type="doi">10.1016/j.tim.2010.09.001</pub-id>
        </citation>
      </ref>
      <ref id="B60-genes-03-00138">
        <label>60.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Bonemann</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Pietrosiuk</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Mogk</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Tubules and donuts: A type VI secretion story</article-title>
          <source>Mol. Microbiol.</source>
          <year>2010</year>
          <volume>76</volume>
          <fpage>815</fpage>
          <lpage>821</lpage>
          <pub-id pub-id-type="doi">10.1111/j.1365-2958.2010.07171.x</pub-id>
        </citation>
      </ref>
      <ref id="B61-genes-03-00138">
        <label>61.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Skorupska</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Janczarek</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Marczak</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Mazur</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Król</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>Rhizobial exopolysaccharides: Genetic control and symbiotic functions</article-title>
          <source>Microbial Cell Factories</source>
          <year>2006</year>
          <volume>5</volume>
          <pub-id pub-id-type="doi">10.1186/1475-2859-5-7</pub-id>
        </citation>
      </ref>
      <ref id="B62-genes-03-00138">
        <label>62.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cytryn</surname>
              <given-names>E.J.</given-names>
            </name>
            <name>
              <surname>Sangurdekar</surname>
              <given-names>D.P.</given-names>
            </name>
            <name>
              <surname>Streeter</surname>
              <given-names>J.G.</given-names>
            </name>
            <name>
              <surname>Franck</surname>
              <given-names>W.L.</given-names>
            </name>
            <name>
              <surname>Chang</surname>
              <given-names>W.-S.</given-names>
            </name>
            <name>
              <surname>Stacey</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Emerich</surname>
              <given-names>D.W.</given-names>
            </name>
            <name>
              <surname>Joshi</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Xu</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Sadowsky</surname>
              <given-names>M.J.</given-names>
            </name>
          </person-group>
          <article-title>Transcriptional and physiological responses of Bradyrhizobium japonicum to desiccation-induced stress</article-title>
          <source>J. Bacteriol.</source>
          <year>2007</year>
          <volume>189</volume>
          <fpage>6751</fpage>
          <lpage>6762</lpage>
        <pub-id pub-id-type="doi">10.1128/JB.00533-07</pub-id><pub-id pub-id-type="pmid">17660288</pub-id></citation>
      </ref>
      <ref id="B63-genes-03-00138">
        <label>63.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Batista</surname>
              <given-names>J.S.S.</given-names>
            </name>
            <name>
              <surname>Torres</surname>
              <given-names>A.R.</given-names>
            </name>
            <name>
              <surname>Hungria</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Towards a two-dimensional proteomic reference map of Bradyrhizobium japonicum CPAC 15: Spotlighting on “hypothetical proteins”</article-title>
          <source>Proteomics</source>
          <year>2010</year>
          <volume>10</volume>
          <fpage>3176</fpage>
          <lpage>3189</lpage>
          <pub-id pub-id-type="doi">10.1002/pmic.201000092</pub-id>
        </citation>
      </ref>
      <ref id="B64-genes-03-00138">
        <label>64.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Raymond</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Siefert</surname>
              <given-names>J.L.</given-names>
            </name>
            <name>
              <surname>Staples</surname>
              <given-names>C.R.</given-names>
            </name>
            <name>
              <surname>Blankenship</surname>
              <given-names>R.E.</given-names>
            </name>
          </person-group>
          <article-title>The natural history of nitrogen fixation</article-title>
          <source>Mol. Biol. Evol.</source>
          <year>2004</year>
          <volume>21</volume>
          <fpage>541</fpage>
          <lpage>554</lpage>
        <pub-id pub-id-type="pmid">14694078</pub-id></citation>
      </ref>
      <ref id="B65-genes-03-00138">
        <label>65.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Masson-Boivin</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Giraud</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Perret</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Batut</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>Establishing nitrogen-fixing symbiosis with legumes: How many rhizobium recipes?</article-title>
          <source>Trends Microbiol.</source>
          <year>2009</year>
          <volume>17</volume>
          <fpage>458</fpage>
          <lpage>466</lpage>
          <pub-id pub-id-type="doi">10.1016/j.tim.2009.07.004</pub-id>
        </citation>
      </ref>
      <ref id="B66-genes-03-00138">
        <label>66.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Rubio</surname>
              <given-names>L.M.</given-names>
            </name>
            <name>
              <surname>Ludden</surname>
              <given-names>P.W.</given-names>
            </name>
          </person-group>
          <article-title>Biosynthesis of the iron-molybdenum cofactor of nitrogenase</article-title>
          <source>Annu. Rev. Microbiol.</source>
          <year>2008</year>
          <volume>62</volume>
          <fpage>93</fpage>
          <lpage>111</lpage>
          <pub-id pub-id-type="doi">10.1146/annurev.micro.62.081307.162737</pub-id>
        </citation>
      </ref>
      <ref id="B67-genes-03-00138">
        <label>67.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Halary</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Leigh</surname>
              <given-names>J.W.</given-names>
            </name>
            <name>
              <surname>Cheaib</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Lopez</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Bapteste</surname>
              <given-names>E.</given-names>
            </name>
          </person-group>
          <article-title>Network analyses structure genetic diversity in independent genetic worlds</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2010</year>
          <volume>107</volume>
          <fpage>127</fpage>
          <lpage>132</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.0908978107</pub-id><pub-id pub-id-type="pmid">20007769</pub-id></citation>
      </ref>
      <ref id="B68-genes-03-00138">
        <label>68.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kumar</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Nei</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Dudley</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Tamura</surname>
              <given-names>K.</given-names>
            </name>
          </person-group>
          <article-title>MEGA: A biologist-centric software for evolutionary analysis of DNA and protein sequences</article-title>
          <source>Brief Bioinform</source>
          <year>2008</year>
          <volume>9</volume>
          <fpage>299</fpage>
          <lpage>306</lpage>
          <pub-id pub-id-type="doi">10.1093/bib/bbn017</pub-id>
        </citation>
      </ref>
      <ref id="B69-genes-03-00138">
        <label>69.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Bellgard</surname>
              <given-names>M.I.</given-names>
            </name>
            <name>
              <surname>Wanchanthuek</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>La</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Ryan</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Moolhuijzen</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Albertyn</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Shaban</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Motro</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Dunn</surname>
              <given-names>D.S.</given-names>
            </name>
            <name>
              <surname>Schibeci</surname>
              <given-names>D.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Genome sequence of the pathogenic intestinal spirochete brachyspira hyodysenteriae reveals adaptations to its lifestyle in the porcine large intestine</article-title>
          <source>Plos One</source>
          <year>2009</year>
          <volume>4</volume>
          <pub-id pub-id-type="doi">10.1371/journal.pone.0004641</pub-id>
        </citation>
      </ref>
      <ref id="B70-genes-03-00138">
        <label>70.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Waterhouse</surname>
              <given-names>A.M.</given-names>
            </name>
            <name>
              <surname>Procter</surname>
              <given-names>J.B.</given-names>
            </name>
            <name>
              <surname>Martin</surname>
              <given-names>D.M.</given-names>
            </name>
            <name>
              <surname>Clamp</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Barton</surname>
              <given-names>G.J.</given-names>
            </name>
          </person-group>
          <article-title>Jalview Version 2—A multiple sequence alignment editor and analysis workbench</article-title>
          <source>Bioinformatics</source>
          <year>2009</year>
          <volume>25</volume>
          <fpage>1189</fpage>
          <lpage>1191</lpage>
          <pub-id pub-id-type="doi">10.1093/bioinformatics/btp033</pub-id>
        </citation>
      </ref>
      <ref id="B71-genes-03-00138">
        <label>71.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Garau</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Yates</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Deiana</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Howieson</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>Novel strains of nodulating Burkholderia have a role in nitrogen fixation with papilionoid herbaceous legumes adapted to acid, infertile soils</article-title>
          <source>Soil Biol. Biochem.</source>
          <year>2009</year>
          <volume>41</volume>
          <fpage>125</fpage>
          <lpage>134</lpage>
          <pub-id pub-id-type="doi">10.1016/j.soilbio.2008.10.011</pub-id>
        </citation>
      </ref>
      <ref id="B72-genes-03-00138">
        <label>72.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Amadou</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Pascal</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Mangenot</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Glew</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Bontemps</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Capela</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Carrère</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Cruveiller</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Dossat</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Lajus</surname>
              <given-names>A.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Genome sequence of the beta-rhizobium Cupriavidus taiwanensis and comparative genomics of rhizobia</article-title>
          <source>Genome Res.</source>
          <year>2008</year>
          <volume>18</volume>
          <fpage>1472</fpage>
          <lpage>1483</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.076448.108</pub-id>
        </citation>
      </ref>
      <ref id="B73-genes-03-00138">
        <label>73.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Pokharel</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Mirza</surname>
              <given-names>B.S.</given-names>
            </name>
            <name>
              <surname>Dawson</surname>
              <given-names>J.O.</given-names>
            </name>
            <name>
              <surname>Hahn</surname>
              <given-names>D.</given-names>
            </name>
          </person-group>
          <article-title>Frankia populations in soil and root nodules of sympatrically grown Alnus taxa</article-title>
          <source>Microb. Ecol.</source>
          <year>2011</year>
          <volume>61</volume>
          <fpage>92</fpage>
          <lpage>100</lpage>
          <pub-id pub-id-type="doi">10.1007/s00248-010-9726-2</pub-id>
        </citation>
      </ref>
      <ref id="B74-genes-03-00138">
        <label>74.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Pini</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Galardini</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Bazzicalupo</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Mengoni</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Plant-bacteria association and symbiosis: Are there common genomic traits in alphaproteobacteria?</article-title>
          <source>Genes</source>
          <year>2011</year>
          <volume>2</volume>
          <fpage>1017</fpage>
          <lpage>1032</lpage>
          <pub-id pub-id-type="doi">10.3390/genes2041017</pub-id>
        </citation>
      </ref>
      <ref id="B75-genes-03-00138">
        <label>75.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Rodriguez-llorente</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Caviedes</surname>
              <given-names>M.A.</given-names>
            </name>
            <name>
              <surname>Dary</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Palomares</surname>
              <given-names>A.J.</given-names>
            </name>
          </person-group>
          <article-title>The symbiosis interactome: A computational approach reveals novel components, functional interactions and modules in Sinorhizobium meliloti</article-title>
          <source>BMC Syst. Biol.</source>
          <year>2009</year>
          <volume>3</volume>
          <fpage>1</fpage>
          <lpage>18</lpage>
          <pub-id pub-id-type="doi">10.1186/1752-0509-3-1</pub-id>
        </citation>
      </ref>
      <ref id="B76-genes-03-00138">
        <label>76.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Delmotte</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Ahrens</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Knief</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Qeli</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Koch</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>An integrated proteomics and transcriptomics reference data set provides new insights into the Bradyrhizobium japonicum bacteroid metabolism in soybean root nodules</article-title>
          <source>Proteomics</source>
          <year>2010</year>
          <volume>10</volume>
          <fpage>1391</fpage>
          <lpage>1400</lpage>
          <pub-id pub-id-type="doi">10.1002/pmic.200900710</pub-id>
        </citation>
      </ref>
      <ref id="B77-genes-03-00138">
        <label>77.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ballesteros-Almanza</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Altamirano-Hernandez</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Pena-Cabriales</surname>
              <given-names>J.J.</given-names>
            </name>
            <name>
              <surname>Santoyo</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Sanchez-Yanez</surname>
              <given-names>J.M.</given-names>
            </name>
            <name>
              <surname>Valencia-Cantero</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Macias-Rodriguez</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Lopez-Bucio</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Cardenas-Navarro</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Farias-Rodriguez</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>Effect of co-inoculation with mycorrhiza and rhizobia on the nodule trehalose content of different bean genotypes</article-title>
          <source>Open Microbiol. J.</source>
          <year>2010</year>
          <volume>4</volume>
          <fpage>83</fpage>
          <lpage>92</lpage>
          <pub-id pub-id-type="doi">10.2174/1874285801004010083</pub-id>
        </citation>
      </ref>
      <ref id="B78-genes-03-00138">
        <label>78.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Batut</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Mergaert</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Masson-Boivin</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Peptide signalling in the rhizobium-legume symbiosis</article-title>
          <source>Current Opinion in Microbiology</source>
          <year>2011</year>
          <volume>14</volume>
          <fpage>181</fpage>
          <lpage>187</lpage>
        <pub-id pub-id-type="doi">10.1016/j.mib.2010.12.010</pub-id><pub-id pub-id-type="pmid">21236724</pub-id></citation>
      </ref>
      <ref id="B79-genes-03-00138">
        <label>79.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Prell</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Bourdès</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Kumar</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Lodwig</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Hosie</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Kinghorn</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>White</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Poole</surname>
              <given-names>P.</given-names>
            </name>
          </person-group>
          <article-title>Role of symbiotic auxotrophy in the Rhizobium-legume symbioses</article-title>
          <source>Plos One</source>
          <year>2010</year>
          <volume>5</volume>
          <pub-id pub-id-type="doi">10.1371/journal.pone.0013933</pub-id>
        </citation>
      </ref>
    </ref-list>
    <app-group>
<app>
<title>Supplementary Files</title>
<supplementary-material xmlns:xlink="http://www.w3.org/1999/xlink" id="genes-03-00138-s001" xlink:href="genes-03-00138-s001.docx">
<label>Supplementary File 1</label>
<caption>
<p>DOCX-Document (DOCX, 685 KB)</p>
</caption>
</supplementary-material>
</app>
</app-group>

  </back>
</article>
