<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="rapid-communication">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Genes</journal-id>
<journal-title>Genes</journal-title>
<issn pub-type="epub">2073-4425</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/genes2041050</article-id>
<article-id pub-id-type="publisher-id">genes-02-01050</article-id>
<article-categories>
<subj-group>
<subject>Communication</subject></subj-group></article-categories>
<title-group>
<article-title>Draft Genome Sequences of <italic>Xanthomonas sacchari</italic> and Two Banana-Associated Xanthomonads Reveal Insights into the <italic>Xanthomonas</italic> Group 1 Clade</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Studholme</surname><given-names>David J.</given-names></name><xref ref-type="aff" rid="af1-genes-02-01050"><sup>1</sup></xref><xref ref-type="corresp" rid="c1-genes-02-01050"><sup>*</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Wasukira</surname><given-names>Arthur</given-names></name><xref ref-type="aff" rid="af1-genes-02-01050"><sup>1</sup></xref><xref ref-type="aff" rid="af2-genes-02-01050"><sup>2</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Paszkiewicz</surname><given-names>Konrad</given-names></name><xref ref-type="aff" rid="af1-genes-02-01050"><sup>1</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Aritua</surname><given-names>Valente</given-names></name><xref ref-type="aff" rid="af2-genes-02-01050"><sup>2</sup></xref><xref ref-type="aff" rid="af3-genes-02-01050"><sup>3</sup></xref><xref ref-type="author-notes" rid="fn1-genes-02-01050"><sup>†</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Thwaites</surname><given-names>Richard</given-names></name><xref ref-type="aff" rid="af3-genes-02-01050"><sup>3</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Smith</surname><given-names>Julian</given-names></name><xref ref-type="aff" rid="af3-genes-02-01050"><sup>3</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Grant</surname><given-names>Murray</given-names></name><xref ref-type="aff" rid="af1-genes-02-01050"><sup>1</sup></xref></contrib></contrib-group>
<aff id="af1-genes-02-01050">
<label>1</label> Department of Biosciences, University of Exeter, Geoffrey Pope Building, Stocker Road, Exeter, EX4 4QD, UK; E-Mails: <email>awasukira@gmail.com</email> (A.W.); <email>k.h.paszkiewicz@exeter.ac.uk</email> (K.P.); <email>m.r.grant@exeter.ac.uk</email> (M.G.)</aff>
<aff id="af2-genes-02-01050">
<label>2</label> National Crops Resources Research Institute (NaCRRI), P.O. Box 7084, Kampala, Uganda; E-Mail: <email>arituavalentine@yahoo.com</email></aff>
<aff id="af3-genes-02-01050">
<label>3</label> The Food and Environment Research Agency, Sand Hutton, York, YO41 1LZ, UK; E-Mails: <email>richard.thwaites@fera.gsi.gov.uk</email> (R.T.); <email>julian.smith@fera.gsi.gov.uk</email> (J.S.)</aff>
<author-notes><fn id="fn1-genes-02-01050" fn-type="present-address">
<label>†</label>
<p>Present address: Department of Plant Pathology, Citrus Research and Education Center, University of Florida, 700 Experiment Station Road, Lake Alfred 33850, FL, USA.</p></fn>
<corresp id="c1-genes-02-01050">
<label>*</label>Author to whom correspondence should be addressed; E-Mail: <email>d.j.studholme@exeter.ac.uk</email>.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2011</year></pub-date>
<pub-date pub-type="epub">
<day>02</day>
<month>12</month>
<year>2011</year></pub-date>
<volume>2</volume>
<issue>4</issue>
<fpage>1050</fpage>
<lpage>1065</lpage>
<history>
<date date-type="received">
<day>23</day>
<month>10</month>
<year>2011</year></date>
<date date-type="rev-recd">
<day>04</day>
<month>11</month>
<year>2011</year></date>
<date date-type="accepted">
<day>21</day>
<month>11</month>
<year>2011</year></date></history>
<permissions>
<copyright-statement>© 2011 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
<copyright-year>2011</copyright-year>
<license>
<p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>We present draft genome sequences for three strains of <italic>Xanthomonas</italic> species, each of which was associated with banana plants (<italic>Musa</italic> species) but is not closely related to the previously sequenced banana-pathogen <italic>Xanthomonas campestris</italic> pathovar <italic>musacearum</italic>. Strain NCPPB4393 had been deposited as <italic>Xanthomonas campestris</italic> pathovar <italic>musacearum</italic> but in fact falls within the species <italic>Xanthomonas sacchari</italic>. Strain NCPPB1132 is more distantly related to <italic>Xanthomonas sacchari</italic> whilst strain NCPPB 1131 grouped in a distinct species-level clade related to <italic>X. sacchari</italic>, along with strains from ginger, rice, cotton and sugarcane. These three newly sequenced strains share many genomic features with the previously sequenced <italic>Xanthomonas albilineans</italic>, for example possessing an unsual <italic>metE</italic> allele and lacking the Hrp type III secretion system. However, they are distinct from <italic>Xanthomonas albilineans</italic> in many respects, for example showing little evidence of genome reduction. They also lack the SPI-1 type III secretion system found in <italic>Xanthomonas albilineans</italic>. Unlike <italic>X. albilineans</italic>, all three strains possess a <italic>gum</italic> gene cluster. The data reported here provide the first genome-wide survey of non-Hrp <italic>Xanthomonas</italic> species other than <italic>Xanthomonas albilineans</italic>, which is an atypical member of this group. We hope that the availability of complete sequence data for this group of organisms is the first step towards understanding their interactions with plants and identifying potential virulence factors.</p></abstract>
<kwd-group>
<kwd>banana</kwd>
<kwd><italic>Xanthomonas</italic></kwd>
<kwd>genome</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>The genus <italic>Xanthomonas</italic> includes some 27 species of plant-associated Gram-negative bacteria. Collectively these species, and their constituent pathovars, cause disease on several hundred plant species, including many economically important crops. Phylogenetic analyses of the genus <italic>Xanthomonas</italic> consistently reveal that it comprises two distinct groups [<xref ref-type="bibr" rid="b1-genes-02-01050">1</xref>-<xref ref-type="bibr" rid="b6-genes-02-01050">6</xref>]. Young and colleagues recently proposed that Group 1 and Group 2 might represent two distinct genera [<xref ref-type="bibr" rid="b6-genes-02-01050">6</xref>].</p>
<p>Complete genome sequences are available for several members of Group 2, including <italic>X. campestris</italic> pv. <italic>campestris</italic>, <italic>X. campestris</italic> pv. <italic>raphani</italic>, <italic>X. campestris</italic> pv. <italic>vesicatoria</italic>, <italic>X. citri</italic>, <italic>X. fuscans</italic> subsp. <italic>aurantifolii, X. oryzae</italic> pv. <italic>oryzae</italic> and <italic>X. oryzae</italic> pv. <italic>oryzicola</italic> [<xref ref-type="bibr" rid="b7-genes-02-01050">7</xref>]. Draft genome sequences are available for several further members of the Group 2 <italic>Xanthomonas</italic> species [<xref ref-type="bibr" rid="b7-genes-02-01050">7</xref>]. The genomes of all of the members of Group 2 investigated so far encode a type III secretion system (T3SS), known as the Hrp T3SS, which is required for delivery of effector proteins into the plant host in order to overcome the host's defences. The name “Hrp” is derived from “hypersensitive response and pathogenicity”.</p>
<p>Of the members of <italic>Xanthomonas</italic> Group 1, the only published genome sequence [<xref ref-type="bibr" rid="b8-genes-02-01050">8</xref>] is that of <italic>X. albilineans</italic>, a xylem-limited pathogen that causes leaf scorch in sugarcane (<italic>Saccharum</italic> species). Analysis of the <italic>X. albilineans</italic> genome sequence revealed that this species displays several interesting features that are quite distinct from those of the Group 2 species. For example, <italic>X. albilineans</italic> lacks the Hrp T3SS that is universally conserved and central to pathogenicity in Group 2, but it encodes an alternative non-Hrp T3SS that shares sequence similarity to the <italic>Salmonella</italic> SPI-1 T3SS [<xref ref-type="bibr" rid="b9-genes-02-01050">9</xref>]. This raises the question of whether these features of the <italic>X. albilineans</italic> genome are also shared with the genomes of other Group 1 <italic>Xanthomonas</italic> species. Furthermore, <italic>X. albilineans</italic> appears to have undergone significant genome reduction, perhaps as a consequence of, or adaptation to, its xylem-limited lifestyle [<xref ref-type="bibr" rid="b8-genes-02-01050">8</xref>]. Therefore, it would be interesting to compare its genome with those of other members of Group 2 that do not share this highly specialized lifestyle.</p>
<p>Until recently, the only known <italic>X. albilineans</italic> virulence factor was the toxin albicidin. The complete genome sequence of <italic>X. albilineans</italic> enabled the identification of several new candidate virulence factors via screening of a transposon mutagenesis library [<xref ref-type="bibr" rid="b10-genes-02-01050">10</xref>]. Many of these were not shared with the Group 2 <italic>Xanthomonas</italic> species and may reflect the distinctiveness of the pathogenic strategies adopted by Groups 1 and 2. Therefore, it raises the question of whether these newly discovered virulence factors are also found in Group 1 species other than <italic>X. albilineans</italic>.</p>
<p>We recently sequenced the genome of an isolate of <italic>X. campestris</italic> pv. <italic>musacearum</italic>, which is the causative agent of banana <italic>Xanthomonas</italic> wilt, a disease currently causing devastation to the banana crop in East Africa [<xref ref-type="bibr" rid="b11-genes-02-01050">11</xref>] and is a member of <italic>Xanthomonas</italic> Group 2. Subsequently, we performed follow-up genome-sequencing studies on additional isolates that had been deposited in the National Collection of Plant Pathogenic Bacteria (NCPPB) as <italic>X. campestris</italic> pv. <italic>musacearum</italic>. We discovered that NCPPB4393 shared little sequence similarity with the previously sequenced isolate; rather, it showed very close sequence similarity with <italic>X. sacchari</italic>, a member of <italic>Xanthomonas</italic> Group 1. On further investigation we learned that NCPPB4393 had in fact been isolated from an insect on a diseased banana plant but that there was no evidence that the strain is actually pathogenic on banana [<xref ref-type="bibr" rid="b12-genes-02-01050">12</xref>]. We also sequenced two additional <italic>Xanthomonas</italic> strains (NCPPB1131 and NCPPB1132) that had been isolated from banana plants in Eastern and Western Samoa.</p></sec>
<sec sec-type="results|discussion">
<label>2.</label>
<title>Results and Discussion</title>
<sec>
<label>2.1.</label>
<title>Bacterial Strains</title>
<p>Bacterial strains (<xref ref-type="table" rid="t1-genes-02-01050">Table 1</xref>) were obtained from the National Collection of Plant Pathogenic Bacteria (NCPPB) in the United Kingdom. NCPPB1131 and NCPPB1132 had been deposited in 1961 by Hayward A.C. after isolation from banana plants. NCPPB4393 was one of several strains isolates deposited by one of the authors of the present study (V.A.) in 2007. It was deposited in the strain collection as <italic>X. campestris</italic> pv. <italic>musacearum.</italic> However, the results of this study suggest that it is actually a member of the species <italic>X. sacchari</italic>. It was originally isolated by Mgenzi Byabachwezi from an insect on a diseased banana plant (Muleba district, Kagera region, North Western Tanzania, on the shores of Lake Victoria). Although the insect was collected from diseased banana, sugarcane is commonly grown in that district of Tanzania and so it is possible that the insect acquired the bacterium from sugarcane. Pathogenicity of strain NCPPB4393 has not been tested.</p></sec>
<sec sec-type="methods">
<label>2.2.</label>
<title>Genome-Wide Sequence Data</title>
<p>We generated genome-wide sequence data for three strains listed in <xref ref-type="table" rid="t1-genes-02-01050">Table 1</xref> using the Illumina GA2. After removing bar-code adaptors, the sequence reads were 70 nt long. We generated 1.7 million non-paired reads for NCPPB1131. For NCPPB1132 and NCPPB4393 we generated 1.9 million and 2.1 million paired reads respectively. These Whole Genome Shotgun project data have been deposited at DDBJ/EMBL/GenBank under the accession numbers AGHY00000000, AGHZ00000000 and AGDB00000000 respectively.</p></sec>
<sec>
<label>2.3.</label>
<title>Phylogenetic Position of the Sequenced Xanthomonas Strains</title>
<p>To ascertain the phylogenetic position of the three newly sequenced strains, we generated a series of phylogenetic trees based on the nucleotide sequences of house-keeping genes. We used the same set of seven genes that were used by Pieretti and colleagues [<xref ref-type="bibr" rid="b8-genes-02-01050">8</xref>]. For four of these genes (<italic>atpD</italic>, <italic>dnaK</italic>, <italic>groEL</italic> and <italic>recA</italic>) we were able to build trees from multiple sequence alignments using the maximum likelihood method. However, for three of the genes (<italic>efp</italic>, <italic>glnA</italic> and <italic>gyrB</italic>) we were unable to build valid multiple sequence alignments because of a lack of orthologues with detectable nucleotide sequence similarity. For example, <italic>blastn</italic> searches against the NCBI non-redundant nucleotide database, using <italic>X. albilineans gyrB</italic> (XALc_0004) as the query, yielded no significant matches in <italic>Xylella</italic> species. The phylogenetic reconstruction [<xref ref-type="bibr" rid="b13-genes-02-01050">13</xref>,<xref ref-type="bibr" rid="b14-genes-02-01050">14</xref>] based on <italic>atpD</italic> is shown in <xref ref-type="fig" rid="f1-genes-02-01050">Figure 1</xref>. Phylogenetic trees based on <italic>dnaK</italic>, <italic>groEL</italic> and <italic>recA</italic> are included in the <xref rid="SD1" ref-type="supplementary-material">Supplementary Files</xref>. The trees for <italic>atpD</italic>, <italic>dnaK</italic> and <italic>groEL</italic> are all toplogically consistent with each other, though there is some variation in branch lengths and the precise position of <italic>X. albilineans</italic> within Group 1 is less well resolved in the <italic>atpD</italic> tree than in the others. However, analysis of the <italic>recA</italic> sequences yielded a different branching pattern in which <italic>X. albilineans</italic> falls within the <italic>Xylella fastidiosa</italic> lineage rather than within the <italic>Xanthomonas</italic> Group 1.</p>
<p>Our phylogenetic analyses clearly and consistently indicated that all three strains fell within the phylogenetic range of the genus <italic>Xanthomonas</italic>. Specifically, all three strains are more closely related to <italic>X. albilineans</italic> than to the Group 2 <italic>Xanthomonas</italic> species and therefore likely belong to Group 1.</p>
<p>We note that in our analyses based on three out of four house-keeping genes, the genus <italic>Xanthomonas</italic> comprises a single monophyletic clade, distinct from the related genera <italic>Stenotrophomonas</italic> and <italic>Xylella</italic>. This is consistent with previous studies [<xref ref-type="bibr" rid="b7-genes-02-01050">7</xref>,<xref ref-type="bibr" rid="b15-genes-02-01050">15</xref>] but contradicts recent claims that <italic>X. albilineans</italic> and <italic>Xylella fastidiosa</italic> form a monophyletic clade distinct from the Group 2 <italic>Xanthomonas</italic> species [<xref ref-type="bibr" rid="b8-genes-02-01050">8</xref>,<xref ref-type="bibr" rid="b10-genes-02-01050">10</xref>]. On the other hand, our analyses based on the <italic>recA</italic> gene were consistent with Pieretti's hypothesis that <italic>X. albilineans</italic> and <italic>Xylella fastidiosa</italic> form a monophyletic clade distinct from the Group 2. The incongruity between <italic>atpD</italic>, <italic>groEL</italic> and <italic>dnaK</italic> on the one hand and <italic>recA</italic> on the other implies that there has been recombination and that not all of these house-keeping genes truly reflect the vertical descent of the core genome. The most parsimonious explanation is that <italic>recA</italic> has undergone horizontal transfer in either the <italic>Xylella</italic> lineage or in the <italic>X. albilineans</italic> lineage. The reasons for discrepancy between our phylogenetic reconstructions for <italic>atpD</italic>, <italic>groEL</italic> and <italic>dnaK</italic> compared with that of Pieretti and colleagues [<xref ref-type="bibr" rid="b8-genes-02-01050">8</xref>] are two-fold. First, the analysis presented by Pieretti is a composite of genes displaying at least two distinct phylogenetic histories. Second, Pieretti's analysis [<xref ref-type="bibr" rid="b8-genes-02-01050">8</xref>] appears to be partly based on alignments of non-orthologous gene sequences (e.g., their <italic>gyrB</italic> sequences are not orthologous between <italic>Xylella</italic> and <italic>Xanthomonas</italic> species).</p>
<p>The results of our analysis are also inconsistent with those of Sharma and Patil [<xref ref-type="bibr" rid="b16-genes-02-01050">16</xref>]. In their study [<xref ref-type="bibr" rid="b16-genes-02-01050">16</xref>] they present a phylogenetic tree in which <italic>X. albilineans</italic> is the outgroup and <italic>Stenotrophomonas</italic> species fall within a monophyletic group along with the other <italic>Xanthomonas</italic> species. However, this discrepancy is explained by their misplacing the root of their tree. Sharma and Patil [<xref ref-type="bibr" rid="b16-genes-02-01050">16</xref>] do not explain how they chose the position of the root in their tree; they simply assume that <italic>X. albilineans</italic> is the outgroup without offering any justification for this. On the other hand, we used a phylogenetically distinct species (<italic>P. aeruginosa</italic>) as the outgroup in order to root our tree. If the tree of Sharma and Patil [<xref ref-type="bibr" rid="b16-genes-02-01050">16</xref>] is re-rooted with <italic>Stenotrophomonas</italic> species as the outgroup, then their tree is topologically congruent with ours. Note that Sharma and Patil do not include <italic>Xylella fastidiosa</italic> in their analysis.</p>
<p>To more precisely resolve the newly sequenced strains' positions within <italic>Xanthomonas</italic> Group 1, we performed phylogenetic analyses based on the gyrase B (<italic>gyrB</italic>) gene (<xref ref-type="fig" rid="f2-genes-02-01050">Figure 2</xref>); partial sequences of <italic>gyrB</italic> are available from two studies [<xref ref-type="bibr" rid="b5-genes-02-01050">5</xref>,<xref ref-type="bibr" rid="b6-genes-02-01050">6</xref>] including many more <italic>Xanthomonas</italic> strains than those for which there are fully sequenced genomes. The sequences that we used are given in the <xref rid="SD1" ref-type="supplementary-material">Supplementary Files</xref>. We found that the <italic>gyrB</italic> sequence from NCPPB4393 was identical to those from strains of <italic>X. sacchari</italic>. Previously, <italic>X. sacchari</italic> was described as comprising strains isolated from diseased sugarcane [<xref ref-type="bibr" rid="b17-genes-02-01050">17</xref>]. Therefore, the description may need to be modified to include strains isolated from insects.</p>
<p>Strain NCPPB1132 is also closely related to <italic>X. sacchari</italic> but is more divergent than NCPPB4393 (<xref ref-type="fig" rid="f2-genes-02-01050">Figure 2</xref>). It falls within a clade that included <italic>X. sacchari sensu strictu</italic> as well as <italic>X.</italic> “<italic>campestris</italic>” pv. <italic>cannae</italic> (from canna lilly, a relative of banana) and several unnamed strains isolated from sugarcane (NCPPB888 and NCPPB917), foxtail millet (NCPPB3174) and arrow leaf elephant ear (NCPPB3570).</p>
<p>Strain NCPPB1131 is more closely related to <italic>X. albilineans</italic> than to <italic>X. sacchari</italic>, but shows closest affinity with NCPPB2250 (<xref ref-type="fig" rid="f2-genes-02-01050">Figure 2</xref>), which was originally isolated from ginger, a relative of banana. Strain NCPPB1131 is also similarly closely related to NCPPB3949, isolated from rice and erroneously deposited as <italic>X. oryzae</italic> pv. <italic>oryzicola</italic>. Parkinson and colleagues [<xref ref-type="bibr" rid="b6-genes-02-01050">6</xref>] describe a species-level clade (Slc 7) that included NCPPB1131, NCPPB3949 and strains isolated from ginger, cotton and sugarcane.</p></sec>
<sec>
<label>2.4.</label>
<title>Comparison of the Three Genomes <italic>Versus</italic> X. albilineans</title>
<p>The total sizes of the genome assemblies were 3.8 Mb for NCPPB1131, 4.7 Mb for NCPPB1132 and 4.9 Mb for NCPPB4393. These can be used as estimates of genome size, but may be inaccurate since we have not closed the gaps in the draft assembly. The size for NCPPB1131 should be treated with particular caution, since the use of non-paired sequence reads yielded a very fragmented assembly. Contiguity of the assemblies can be represented by N<sub>50</sub> scaffold lengths which were 1.1 Kb (NCPPB1131), 4.8 Kb (NCPPB1132) and 51.5 Kb (NCPPB4393). The numbers of scaffolds in each assembly were 4,158 (NCPPB1131), 1,652 (NCPPB1132) and 259 (NCPPB4393). Nevertheless, these estimates are congruent with sizes of previously sequenced <italic>Xanthomonas</italic> genomes, which range from 4.8 to 5.3 Mb for Group 2, whilst <italic>X. albilineans</italic> has a genome of less than 3.8 Mb.</p>
<p>We aligned the three genome assemblies against the genome sequence of <italic>X. albilineans</italic> species. We also aligned the reads, without performing <italic>de novo</italic> assembly. <xref ref-type="fig" rid="f3-genes-02-01050">Figure 3</xref> illustrates a genome-wide overview of these alignments. It should be noted that the sequencing depths obtained for the three strains ensures complete coverage over the entire breadth of the genomes. This means that by examining alignments of sequence reads against a reference sequence, independently from any <italic>de novo</italic> assembly, we can confidently determine the presence or absence of genes. The depths of coverage of each genome, as determined by depths of alignments of raw reads against assemblies, were 20× (NCPPB1131), 67× (NCPPB1132) and 72× (NCPPB4393).</p>
<p>Clearly, a significant proportion of the <italic>X. albilineans</italic> genome is not conserved (at the nucleotide sequence level) in the three newly sequenced Group 1 strains (<xref ref-type="fig" rid="f3-genes-02-01050">Figure 3</xref>). Prominent amongst the non-conserved regions is the gene cluster encoding the <italic>X. albilineans</italic> SPI-1 T3SS (positions 1,703,391-1,730,688). We could find no evidence for any non-flagellar T3SS in any of the three strains. All three strains also lack the albicidin biosynthesis cluster at positions 1,740,869-1,788,517 and so they likely do not produce albicidin.</p>
<p>Pieretti and colleagues [<xref ref-type="bibr" rid="b8-genes-02-01050">8</xref>] observed that the genomes of two xylem-limited pathogens, <italic>X. albilineans</italic> and <italic>Xylella fastidiosa</italic>, both share an unusual allele of the <italic>metE</italic> gene, which encodes 5-methyltetrahydropterolyl-triglutamate-homocysteine methyltransferase, an enzyme required for methionine biosynthesis. They infer that the ancestor of <italic>X. albilineans</italic> and <italic>X. fastidiosa</italic> lost <italic>metE</italic>, while the rest of the xanthomonads retained it, and then this ancestor gained a new allele of <italic>metE</italic> by horizontal transfer. We reject this interpretation, since the last common ancestor of <italic>X. albilineans</italic> and <italic>Xylella fastidiosa</italic> was also the ancestor of the other xanthomonads, including <italic>Stenotrophomonas</italic> and all <italic>Xanthomonas</italic> species (<xref ref-type="fig" rid="f1-genes-02-01050">Figure 1</xref>). We found that NCPPB1131, NCPPB1132 and NCPPB4393 all contain a <italic>metE</italic> gene that most closely resembles (at least 90% amino acid sequence identity) that of <italic>X. albilineans</italic> rather than those of Group 2 <italic>Xanthomonas</italic> species. This suggests that the <italic>X. albilineans metE</italic> occurs widely in the Group 1 <italic>Xanthomonas</italic> species and is not restricted only to xylem-limited species. The incongruence between the phylogeny of <italic>metE</italic> genes and the core house-keeping genes indicates that <italic>metE</italic> has been replaced independently in at least two distinct lineages during the evolution of the xanthomonads.</p></sec>
<sec>
<label>2.5.</label>
<title>Genome Reduction</title>
<p>Genome reduction has occurred independently in two separate lineages of xanthomonads that are specialized for a xylem-limited lifestyle. That is, <italic>Xylella fastidiosa</italic> and <italic>X. albilineans</italic> have independently converged on a xylem-limited lifestyle, each having evolved from a different ancestor with a larger genome. The only other xylem-limited bacterial species for which complete genome sequence data are available is <italic>Leifsonia xyli</italic>. Interestingly, <italic>L. xyli</italic> also appears to have a reduced genome, its chromosome being 2.6 megabases long in contrast to its non-xylem-associated relative <italic>Clavibacter michiganensis</italic>, which has a 3.3-megabase chromosome [<xref ref-type="bibr" rid="b18-genes-02-01050">18</xref>]. Thus, genome reduction is associated with at least three distinct lineages of xylem-limited bacteria, suggesting that a stripped-down genome may be adaptive for this relatively stable environment. Complete genome data are not yet available for the other well-known xylem-limited species, <italic>Ralstonia syzygii</italic> [<xref ref-type="bibr" rid="b19-genes-02-01050">19</xref>], so we cannot yet be sure that this is a universal phenomenon.</p>
<p>Since the only sequenced member of <italic>Xanthomonas</italic> Group 1 is a specialized xylem-limited pathogen with a reduced genome, this raises the question of whether other members of Group 1 show similar evidence of genome reduction. Our results reveal that <italic>X. albilineans</italic> has undergone significantly more reduction than NCPPB1131, NCPPB1132 and NCPPB4393. We aligned all four available genomes from Group 1 against the reference sequence of <italic>X. campestris</italic> pv. <italic>vesicatoria</italic> (<italic>Xcv</italic>) 85-10 (RefSeq: NC_007508), a member of Group 2 (<xref ref-type="fig" rid="f4-genes-02-01050">Figure 4</xref>). We found that only 13.82% of the <italic>Xcv</italic> chromosome was conserved in <italic>X. albilineans</italic>. Significantly larger portions of the <italic>Xcv</italic> were conserved in NCPPB1131, NCPPB1132 and NCPPB4393 (21.69%, 26.85% and 28.44% respectively). This is consistent with <italic>X. albilineans</italic> having lost more of the <italic>Xanthomonas</italic> genome than have the other three strains.</p>
<p><italic>X. albilineans</italic> produces the toxin albicidin, which is a DNA-gyrase inhibitor [<xref ref-type="bibr" rid="b20-genes-02-01050">20</xref>]. In addition to its action on host-plant chloroplasts, it is also deleterious to most bacteria. Pieretti and colleagues [<xref ref-type="bibr" rid="b8-genes-02-01050">8</xref>] propose that albicidin played a key role in the erosion of the <italic>X. albilineans</italic> genome. Specifically, they propose that exposure to sub-lethal doses of intracellular albicidin induced recombination and mutagenesis. They note that <italic>X. albilineans</italic> has, in addition to its albicidin transporter (AlbF), an unusual DNA gyrase, containing a 43-amino-acid insertion, that confers resistance to albicidin [<xref ref-type="bibr" rid="b21-genes-02-01050">21</xref>]. They go on to propose that “<italic>genome erosion induced by albicidin was likely arrested by the evolution of the albicidin-resistant DNA gyrase</italic>” [<xref ref-type="bibr" rid="b8-genes-02-01050">8</xref>]. Interesting though it is, there is no evidence to support this conjecture. Nor is there any need to invoke such a special mechanism, since genome reduction is a common phenomenon, seen in many parasitic organisms that do not produce albicidin-like antibiotics. Furthermore, sequence insertions similar to that in <italic>X. albilineans</italic> gyrase A are not uncommon among the gamma-Proteobacteria (see <xref rid="SD1" ref-type="supplementary-material">Supplementary Files</xref>). Therefore, it is quite plausible that albicidin-insensitivity is an ancient and/or common trait that was already present in <italic>X. albilineans</italic> before it acquired the ability to produce albicidin.</p>
<p>Examination of our genome-wide sequence data revealed that the Group 1 strains NCPPB1131, NCPPB1132 and NCPPB4393 lack the gene cluster required for albicidin biosynthesis, yet all three strains encode a gyrase A resembling that of <italic>X. albilineans</italic>, including the 43 amino acid insertion that is supposed to be responsible for albicidin resistance. These strains do have other non-ribosomal peptide synthesis genes, so we cannot exclude the possibility that the 43 amino acid insertion confers resistance to some other unknown toxin. Nevertheless, the presence of this insert clearly does not correlate with the incidence of genome reduction. Aside from any involvement in genome reduction, albicidin-like antibiotics probably have had profound effects on the evolution and ecology of xylem-limited parasites. For example, the genome sequence of <italic>L. xyli</italic> encodes a close homologue of AlbF that may allow it to colonise the host simultaneously with <italic>X. albilineans</italic> [<xref ref-type="bibr" rid="b22-genes-02-01050">22</xref>].</p></sec>
<sec>
<label>2.6.</label>
<title>Novel Genes in Xanthomonas Group 1</title>
<p>Until the present study, <italic>X. albilineans</italic> was the only member of Group 1 for which genome-wide sequence data were available. However, <italic>X. albilineans</italic> is an atypical example of the group, since it has undergone significant genome reduction associated with adaptation to life within the xylem. Therefore, we examined the genome sequence data from NCPPB1131, NCPPB1132 and NCPPB4393 for genomic regions that are absent from <italic>X. albilineans</italic> GPE PC73. Comprehensive lists of these regions are provided in the <xref rid="SD1" ref-type="supplementary-material">Supplementary Files</xref> and include heavy metal resistance proteins, haemolysins, haemagglutinins, citrate synthase, persistence factor HipA, sigma54-dependent activators, type-IV pili, drug efflux pumps, vancomycin B-type resistance protein VanW, xylanase, chitinase, beta-lactamase, restriction modification systems and many others.</p>
<p>There are, of course, some differences among the genomes of the Group I <italic>Xanthomonas</italic> strains. The genomes of NCPPB1132 and NCPPB4393 are alignable with each other over 88.64% and 84.33% of their respective lengths. That is, 12–16% of their genomes are variable and presumably subject to relatively recent horizontal transfer. They share 96.85% nucleotide sequence identity over the alignable conserved core portion of their genomes. Similarly they share 94.47% and 94.45% identity with NCPPB1131.</p>
<p>The genomes of NCPPB1131, NCPPB1132 and NCPPB4393 each encode a protein with 64% amino acid sequence identity to AvrXca (also known as AvrXccA1), a protein that confers avirulence on <italic>Arabidopsis</italic> in <italic>X. campestris</italic> pv. <italic>raphani</italic> [<xref ref-type="bibr" rid="b23-genes-02-01050">23</xref>]. Based on this avirulence activity, it has been speculated that AvrXca might be an effector. However, all three strains lack a T3SS, so it seems unlikely that it is secreted via the T3SS. Interestingly, at least two homologues of AvrXca are effectors secreted by the type-II secretion system (T2SS) [<xref ref-type="bibr" rid="b24-genes-02-01050">24</xref>-<xref ref-type="bibr" rid="b26-genes-02-01050">26</xref>], suggesting that AvrXca might also be a T2SS-dependent effector.</p>
<p>Each of the three newly sequenced genomes encode two cellulose-degrading enzymes that are absent from <italic>X. albilineans</italic>. Specifically, these include an endoglucanase that shares 61% amino acid sequence identity with XCC0028 from <italic>X. campestris</italic> pv. <italic>campestris</italic> ATCC 33913 and a cellulase (glycosyl hydrolase family 5) that shares 68% identity with XCV0358 from <italic>X. campestris</italic> pv. <italic>vesicatoria</italic> 85-10. Both enzymes are widely distributed among the Group 2 <italic>Xanthomonas</italic> species as well as in the three newly sequenced Group 1 strains, but absent from <italic>X. albilineans.</italic> It is possible that they play a role in degrading plant cell walls.</p>
<p>The <italic>gum</italic> genes, found in <italic>Xylella fastidiosa</italic> and all studied Group 2 <italic>Xanthomonas</italic> species, play a role in producing extracellular polysaccharides and forming biofilms and are implicated in pathogenicity. However, no <italic>gum</italic> genes have been found in <italic>X. albilineans</italic> [<xref ref-type="bibr" rid="b8-genes-02-01050">8</xref>]. We found homologues of these (<italic>gumBGCEKDHIMJL</italic>) conserved in all three newly sequenced strains. Therefore, it appears that the <italic>gum</italic> cluster was probably present in the common ancestor of <italic>Xylella</italic>, <italic>Stenotrophomonas</italic> and <italic>Xanthomonas</italic> and was subsequently lost by <italic>X. albilineans</italic> and <italic>Stenotrophomonas</italic> but retained in <italic>X. sacchari</italic> and the other Group 1 <italic>Xanthomonas</italic> species.</p></sec></sec>
<sec>
<label>3.</label>
<title>Experimental Section</title>
<p>Bacterial strains were obtained from the National Collection of Plant Pathogenic Bacteria (NCPPB) at FERA. Sequence alignments were performed using MAFFT [<xref ref-type="bibr" rid="b27-genes-02-01050">27</xref>], BWA [<xref ref-type="bibr" rid="b28-genes-02-01050">28</xref>], BLAST [<xref ref-type="bibr" rid="b29-genes-02-01050">29</xref>] and MUMMER [<xref ref-type="bibr" rid="b30-genes-02-01050">30</xref>]. DNA preparation and genome sequencing using the Illumina GA2 were performed as previously described [<xref ref-type="bibr" rid="b11-genes-02-01050">11</xref>]. We used CGview [<xref ref-type="bibr" rid="b31-genes-02-01050">31</xref>] to visualize whole-genome alignments and used MEGA5 for phylogenetic analyses. Literature references for previously sequenced bacterial genomes used in this study are listed cited in [<xref ref-type="bibr" rid="b10-genes-02-01050">10</xref>,<xref ref-type="bibr" rid="b11-genes-02-01050">11</xref>]. <italic>De novo</italic> assembly of Illumina sequence reads was performed using Velvet 1.1.03 [<xref ref-type="bibr" rid="b32-genes-02-01050">32</xref>]. We discarded any sequence reads that contained one or more ‘N’ prior to assembly. We used the following values for the hash-length parameter: 25 for NCPPB1131, 41 for NCPPB1132 and 49 for NCPPB4493. The coverage cut-off parameter was set to 4 in all three assemblies. For NCPPB1132 and NCPPB4393 read-pairs, we used Velvet's scaffolding step. We did not perform scaffolding on the NCPPB1131 data as the reads were not paired (<italic>i.e</italic>., we used single-end sequencing). The RAST server [<xref ref-type="bibr" rid="b33-genes-02-01050">33</xref>] was used for automated annotation of draft assemblies.</p>
<p>Note that in the genome-wide alignments (<xref ref-type="fig" rid="f3-genes-02-01050">Figures 3</xref> and <xref ref-type="fig" rid="f4-genes-02-01050">4</xref>), the pattern of coverage by aligned raw reads does not exactly coincide with the coverage by aligned contigs/scaffolds. This inconsistency is inevitable since the two alignment methods use different criteria for assigning a match. The BWA alignment tool tolerates mismatches so long as the edit distance does not exceed two between the raw read and the reference genome sequence. On the other hand, BLAST uses an E-value threshold (1e-10 in this case) as the criterion for whether to accept a match. Furthermore, the process of assembly leads to the correction of errors by consensus of multiple sequence reads.</p></sec>
<sec sec-type="conclusions">
<label>4.</label>
<title>Conclusions</title>
<p>The ability to survive on banana plants has evolved more than once within the genus <italic>Xanthomonas</italic>, with strains isolated from banana falling within both major phylogenetic lineages: Group 1 (NCPPB1131 and NCPPB1132) and Group 2 (<italic>X. campestris</italic> pv. <italic>musacearum</italic>). Clearly their strategies are different. <italic>Xanthomonas campestris</italic> pv. <italic>musacearum</italic> encodes an apparently intact Hrp T3SS and a suite of effectors that it presumably uses to overcome the host's defences. On the other hand, the Group 1 strains, related to <italic>X. sacchari</italic> and <italic>X. albilineans</italic>, lack the T3SS and must use some other strategy to avoid triggering host defences. In the case of <italic>X. albilineans</italic>, the strategy appears to be one of stealth, where the pathogen restricts its colonization to the dead xylem. As a result, it has undergone substantial genome reduction, shedding genes unnecessary for this restricted niche. Very little information is available about the endophytic lifestyles of <italic>X. sacchari</italic> and other related members of <italic>Xanthomonas</italic> Group 1 and there is no evidence that they are limited to the xylem. Certainly the lack of genome reduction would be consistent with having to survive in more diverse conditions. The example strain that we sequenced here, NCPPB4393, apparently spent at least part of its life cycle associated with insects. We hope that the availability of complete sequence data for this group of organisms is the first step towards understanding their interactions with plants and identifying potential virulence factors.</p></sec>
<sec sec-type="supplementary-material">
<title>Supplementary Material</title>
<supplementary-material id="SD1" content-type="local-data">
<media mimetype="application" mime-subtype="pdf" xlink:href="genes-02-01050.pdf"/></supplementary-material></sec></body>
<back>
<sec sec-type="display-objects">
<title>Figures and Tables</title>
<fig id="f1-genes-02-01050" position="float">
<label>Figure 1.</label>
<caption>
<p>Molecular phylogenetic anaylsis of <italic>atpD</italic> gene of newly sequenced xanthomonads by Maximum Likelihood method. The evolutionary history was inferred by using the Maximum Likelihood method based on the Tamura-Nei model [<xref ref-type="bibr" rid="b13-genes-02-01050">13</xref>]. The bootstrap consensus tree inferred from 500 replicates is taken to represent the evolutionary history of the taxa analyzed. Branches corresponding to partitions reproduced in less than 50% bootstrap replicates are collapsed. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (500 replicates) are shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically as follows. When the number of common sites was &lt;100 or less than one fourth of the total number of sites, the maximum parsimony method was used; otherwise BIONJ method with MCL distance matrix was used. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 15 nucleotide sequences. Codon positions included were 1st + 2nd + 3rd + Noncoding. All positions containing gaps and missing data were eliminated. There were a total of 1,373 positions in the final dataset. Evolutionary analyses were conducted in MEGA5 [<xref ref-type="bibr" rid="b14-genes-02-01050">14</xref>]. The newly sequenced bacterial strains are indicated with black circles. For each nucleotide sequence, RefSeq accession numbers and coordinates are given in parentheses. The newly sequenced strains are indicated by black circles (●).</p></caption>
<graphic xlink:href="genes-02-01050f1.gif"/></fig>
<fig id="f2-genes-02-01050" position="float">
<label>Figure 2.</label>
<caption>
<p>Phylogenetic positions of the three newly sequenced strains within <italic>Xanthomonas</italic> Group 1. The figure shows the evolutionary history of the <italic>gyrB</italic> gene as inferred by using the Maximum Likelihood method based on the Tamura-Nei model [<xref ref-type="bibr" rid="b13-genes-02-01050">13</xref>]. The tree with the highest log likelihood (−7358.2201) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically as follows. When the number of common sites was &lt;100 or less than one fourth of the total number of sites, the maximum parsimony method was used; otherwise BIONJ method with MCL distance matrix was used. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 219 nucleotide sequences, taken from the studies by Young and colleagues [<xref ref-type="bibr" rid="b5-genes-02-01050">5</xref>] and Parkinson and colleagues [<xref ref-type="bibr" rid="b6-genes-02-01050">6</xref>] as well as from the three newly sequenced strains. GenBank accession numbers are indicated for the sequences. However, for clarity, only the sub-tree corresponding to Group 1 is shown. The full length of each GenBank sequence entry was used for all of the Young [<xref ref-type="bibr" rid="b5-genes-02-01050">5</xref>] and Parkinson [<xref ref-type="bibr" rid="b6-genes-02-01050">6</xref>] sequences. For the sequences taken from our data, the coordinates of the subsequence are given in the parentheses, following the GenBank accession. All positions containing gaps and missing data were eliminated. There were a total of 517 positions in the final dataset. Evolutionary analyses were conducted in MEGA5 [<xref ref-type="bibr" rid="b14-genes-02-01050">14</xref>]. The newly sequenced strains are indicated by black circles (●).</p></caption>
<graphic xlink:href="genes-02-01050f2.gif"/></fig>
<fig id="f3-genes-02-01050" position="float">
<label>Figure 3.</label>
<caption>
<p>Alignment of the three <italic>Xanthomonas</italic> Group 1 new genome sequences against the chromosome of <italic>X. albilineans</italic>. The blue and red inner track represents annotated genes. The next three black tracks represent depth of coverage by Illumina sequence reads for NCPPB1131, NCPPB4393 and NCPPB1131 respectively. The four colored outer rings indicate sequence similarity to the genome assemblies of NCPPB1131 (olive), NCPPB4393 (yellow) and NCPPB1131 (grey) respectively.</p></caption>
<graphic xlink:href="genes-02-01050f3.gif"/></fig>
<fig id="f4-genes-02-01050" position="float">
<label>Figure 4.</label>
<caption>
<p>Alignment of the four <italic>Xanthomonas</italic> Group 1 genome sequences against the chromosome of <italic>Xanthomonas campestris</italic> pv. <italic>vesicatoria</italic>. The blue and red inner track represents annotated genes. The next three black tracks represent depth of coverage by Illumina sequence reads for NCPPB1131, NCPPB4393 and NCPPB1131 respectively. The four colored outer rings indicate sequence similarity to the genome assemblies of NCPPB1131 (olive), NCPPB4393 (yellow), NCPPB1131 (grey) and <italic>X. albilineans</italic> (orange) respectively.</p></caption>
<graphic xlink:href="genes-02-01050f4.gif"/></fig>
<table-wrap id="t1-genes-02-01050" position="float">
<label>Table 1.</label>
<caption>
<p>Bacterial strains sequenced in this study.</p></caption>
<table frame="box" rules="all">
<thead>
<tr>
<th align="left" valign="top"><bold>Strain</bold></th>
<th align="left" valign="top"><bold>Host plant</bold></th>
<th align="left" valign="top"><bold>Country</bold></th>
<th align="left" valign="top"><bold>Year</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">NCPPB1131</td>
<td align="left" valign="top"><italic>Musa paradisiaca</italic></td>
<td align="left" valign="top">American (Eastern) Samoa</td>
<td align="left" valign="top">1961</td></tr>
<tr>
<td align="left" valign="top">NCPPB1132</td>
<td align="left" valign="top"><italic>Musa canksii</italic> var. <italic>samoensis</italic></td>
<td align="left" valign="top">Western Samoa</td>
<td align="left" valign="top">1961</td></tr>
<tr>
<td align="left" valign="middle">NCPPB4393</td>
<td align="left" valign="middle"><italic>Musa</italic> species Isolated from insect on diseased plant</td>
<td align="left" valign="middle">Tanzania</td>
<td align="left" valign="middle">2007</td></tr></tbody></table></table-wrap>
<table-wrap id="t2" position="float">
<caption>
<p>Supplementary files.</p></caption>
<table frame="box" rules="all">
<thead>
<tr>
<th align="center" valign="top"><bold>File Name</bold></th>
<th align="center" valign="top"><bold>File Type</bold></th>
<th align="center" valign="top"><bold>Description</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="middle">GyraseA_alignment.pdf</td>
<td align="left" valign="middle">PDF (Adobe Acrobat)</td>
<td align="left" valign="middle">Multiple sequence alignment of partial GyrA proteins from <italic>Xanthomonas albilineans</italic> and other gamma-Proteobacteria</td></tr>
<tr>
<td align="left" valign="middle">phylogenetic_trees.pdf</td>
<td align="left" valign="middle">PDF (Adobe Acrobat)</td>
<td align="left" valign="middle">Phylogenetic analysis of fully sequenced members of the genera <italic>Xanthomonas, Xylella</italic> and <italic>Steonotrophomonas.</italic> Maximum likelihood trees are based on nucleotide sequences of the house-keeping genes <italic>dnaK</italic>, <italic>groEL</italic> and <italic>recA</italic></td></tr>
<tr>
<td align="left" valign="middle">NCPPB1131-sequences- not-in-X_albilineans.html</td>
<td align="left" valign="middle">HTML (web browser)</td>
<td align="left" valign="middle">A list of regions in the genome of NCPPB1131 that share no detectable nucleotide sequence similarity with <italic>X. albilineans</italic></td></tr>
<tr>
<td align="left" valign="middle">NCPPB1132-sequences-not-in-X_albilineans.html</td>
<td align="left" valign="middle">HTML (web browser)</td>
<td align="left" valign="middle">A list of regions in the genome of NCPPB1132 that share no detectable nucleotide sequence similarity with <italic>X. albilineans</italic></td></tr>
<tr>
<td align="left" valign="middle">NCPPB4393-sequences-not-in-X_albilineans.html</td>
<td align="left" valign="middle">HTML (web browser)</td>
<td align="left" valign="middle">A list of regions in the genome of NCPPB4393 that share no detectable nucleotide sequence similarity with <italic>X. albilineans</italic></td></tr>
<tr>
<td align="left" valign="middle">NCPPB1131-RAST.xls</td>
<td align="left" valign="middle">Microsoft Excel spreadsheet</td>
<td align="left" valign="middle">Automated annotation results for NCPPB1131 draft genome sequence generated by the RAST server</td></tr>
<tr>
<td align="left" valign="middle">NCPPB1132-RAST.xls</td>
<td align="left" valign="middle">Microsoft Excel spreadsheet</td>
<td align="left" valign="middle">Automated annotation results for NCPPB1132 draft genome sequence generated by the RAST server</td></tr>
<tr>
<td align="left" valign="middle">NCPPB4393-RAST.xls</td>
<td align="left" valign="middle">Microsoft Excel spreadsheet</td>
<td align="left" valign="middle">Automated annotation results for NCPPB4393 draft genome sequence generated by the RAST server</td></tr>
<tr>
<td align="left" valign="middle">NCPPB1131_genes_comparison.pdf</td>
<td align="left" valign="middle">PDF (Adobe Acrobat)</td>
<td align="left" valign="middle">A list of predicted genes in NCPPB1131 that are absent from NCPPB1132 and/or NCPPB4393 as judged by BWA alignments of raw Illumina sequence reads against the NCPPB1131 genome. Genomic coordinates are given on GenBank accession numbers</td></tr>
<tr>
<td align="left" valign="middle">NCPPB1132_genes_comparison.pdf</td>
<td align="left" valign="middle">PDF (Adobe Acrobat)</td>
<td align="left" valign="middle">A list of predicted genes in NCPPB1132 that are absent from NCPPB1131 and/or NCPPB4393 as judged by BWA alignments of raw Illumina sequence reads against the NCPPB1132 genome. Genomic coordinates are given on GenBank accession numbers</td></tr>
<tr>
<td align="left" valign="middle">NCPPB4393_genes_comparison.pdf</td>
<td align="left" valign="middle">PDF (Adobe Acrobat)</td>
<td align="left" valign="middle">A list of predicted genes in NCPPB4393 that are absent from NCPPB1131 and/or NCPPB1132 as judged by BWA alignments of raw Illumina sequence reads against the NCPPB4393 genome. Genomic coordinates are given on GenBank accession numbers</td></tr>
<tr>
<td align="left" valign="middle">atpD_aligned_fasta.txt</td>
<td align="left" valign="middle">Gapped FastA / plain text</td>
<td align="left" valign="middle">Multiple sequence alignment of <italic>atpD</italic> used to generate the phylogenetic tree in <xref ref-type="fig" rid="f1-genes-02-01050">Figure 1</xref> in the main text</td></tr>
<tr>
<td align="left" valign="middle">gyrB_aligned_fasta.txt</td>
<td align="left" valign="middle">Gapped FastA / plain text</td>
<td align="left" valign="middle">Multiple sequence alignment of <italic>gyrB</italic> used to generate the phylogenetic tree in <xref ref-type="fig" rid="f1-genes-02-01050">Figure 1</xref> in the main text</td></tr></tbody></table></table-wrap></sec>
<ack>
<p>This study was supported by the National Agriculture Research Organisation, Uganda. The authors wish to thank Karen Moore and Alex Moorhouse for their invaluable technical assistance with sequencing. We thank Mgenzi Byabachwezi for isolation of strain NCPPB4393 and Neil Parkinson for some helpful comments on an earlier draft of the manuscript.</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-genes-02-01050"><label>1.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hauben</surname><given-names>L.</given-names></name><name><surname>Vauterin</surname><given-names>L.</given-names></name><name><surname>Swings</surname><given-names>J.</given-names></name><name><surname>Moore</surname><given-names>E.R.</given-names></name></person-group><article-title>Comparison of 16S ribosomal DNA sequences of all <italic>Xanthomonas</italic> species</article-title><source>Int. J. Syst. Bacteriol.</source><year>1997</year><volume>47</volume><fpage>328</fpage><lpage>335</lpage><pub-id pub-id-type="doi">10.1099/00207713-47-2-328</pub-id><pub-id pub-id-type="pmid">9103617</pub-id></citation></ref>
<ref id="b2-genes-02-01050"><label>2.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonçalves</surname><given-names>E.R.</given-names></name><name><surname>Rosato</surname><given-names>Y.B.</given-names></name></person-group><article-title>Phylogenetic analysis of <italic>Xanthomonas</italic> species based upon 16S-23S rDNA intergenic spacer sequences</article-title><source>Int. J. Syst. Evol. Microbiol.</source><year>2002</year><volume>52</volume><fpage>355</fpage><lpage>361</lpage><pub-id pub-id-type="pmid">11931143</pub-id></citation></ref>
<ref id="b3-genes-02-01050"><label>3.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rademaker</surname><given-names>J.L.</given-names></name><name><surname>Louws</surname><given-names>F.J.</given-names></name><name><surname>Schultz</surname><given-names>M.H.</given-names></name><name><surname>Rossbach</surname><given-names>U.</given-names></name><name><surname>Vauterin</surname><given-names>L.</given-names></name><name><surname>Swings</surname><given-names>J.</given-names></name><name><surname>de Bruijn</surname><given-names>F.J.</given-names></name></person-group><article-title>A comprehensive species to strain taxonomic framework for <italic>Xanthomonas</italic></article-title><source>Phytopathology</source><year>2005</year><volume>95</volume><fpage>1098</fpage><lpage>1111</lpage><pub-id pub-id-type="doi">10.1094/PHYTO-95-1098</pub-id><pub-id pub-id-type="pmid">18943308</pub-id></citation></ref>
<ref id="b4-genes-02-01050"><label>4.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parkinson</surname><given-names>N.</given-names></name><name><surname>Aritua</surname><given-names>V.</given-names></name><name><surname>Heeney</surname><given-names>J.</given-names></name><name><surname>Cowie</surname><given-names>C.</given-names></name><name><surname>Bew</surname><given-names>J.</given-names></name><name><surname>Stead</surname><given-names>D.</given-names></name></person-group><article-title>Phylogenetic analysis of <italic>Xanthomonas</italic> species by comparison of partial gyrase B gene sequences</article-title><source>Int. J. Syst. Evol. Microbiol.</source><year>2007</year><volume>57</volume><fpage>2881</fpage><lpage>2887</lpage><pub-id pub-id-type="doi">10.1099/ijs.0.65220-0</pub-id><pub-id pub-id-type="pmid">18048743</pub-id></citation></ref>
<ref id="b5-genes-02-01050"><label>5.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Young</surname><given-names>J.M.</given-names></name><name><surname>Park</surname><given-names>D.C.</given-names></name><name><surname>Shearman</surname><given-names>H.M.</given-names></name><name><surname>Fargier</surname><given-names>E.</given-names></name></person-group><article-title>A multilocus sequence analysis of the genus <italic>Xanthomonas</italic></article-title><source>Syst. Appl. Microbiol.</source><year>2008</year><volume>31</volume><fpage>366</fpage><lpage>377</lpage><pub-id pub-id-type="doi">10.1016/j.syapm.2008.06.004</pub-id><pub-id pub-id-type="pmid">18783906</pub-id></citation></ref>
<ref id="b6-genes-02-01050"><label>6.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parkinson</surname><given-names>N.</given-names></name><name><surname>Cowie</surname><given-names>C.</given-names></name><name><surname>Heeney</surname><given-names>J.</given-names></name><name><surname>Stead</surname><given-names>D.</given-names></name></person-group><article-title>Phylogenetic structure of <italic>Xanthomonas</italic> determined by comparison of <italic>gyrB</italic> sequences</article-title><source>Int. J. Syst. Evol. Microbiol.</source><year>2009</year><volume>59</volume><fpage>264</fpage><lpage>274</lpage><pub-id pub-id-type="doi">10.1099/ijs.0.65825-0</pub-id><pub-id pub-id-type="pmid">19196764</pub-id></citation></ref>
<ref id="b7-genes-02-01050"><label>7.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ryan</surname><given-names>R.P.</given-names></name><name><surname>Vorhölter</surname><given-names>F.J.</given-names></name><name><surname>Potnis</surname><given-names>N.</given-names></name><name><surname>Jones</surname><given-names>J.B.</given-names></name><name><surname>van Sluys</surname><given-names>M.A.</given-names></name><name><surname>Bogdanove</surname><given-names>A.J.</given-names></name><name><surname>Dow</surname><given-names>J.M.</given-names></name></person-group><article-title>Pathogenomics of <italic>Xanthomonas</italic>: Understanding bacterium-plant interactions</article-title><source>Nat. Rev. Microbiol.</source><year>2011</year><volume>9</volume><fpage>344</fpage><lpage>355</lpage><pub-id pub-id-type="doi">10.1038/nrmicro2558</pub-id><pub-id pub-id-type="pmid">21478901</pub-id></citation></ref>
<ref id="b8-genes-02-01050"><label>8.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pieretti</surname><given-names>I.</given-names></name><name><surname>Royer</surname><given-names>M.</given-names></name><name><surname>Barbe</surname><given-names>V.</given-names></name><name><surname>Carrere</surname><given-names>S.</given-names></name><name><surname>Koebnik</surname><given-names>R.</given-names></name><name><surname>Cociancich</surname><given-names>S.</given-names></name><name><surname>Couloux</surname><given-names>A.</given-names></name><name><surname>Darrasse</surname><given-names>A.</given-names></name><name><surname>Gouzy</surname><given-names>J.</given-names></name><name><surname>Jacques</surname><given-names>M.A.</given-names></name><etal/></person-group><article-title>The complete genome sequence of <italic>Xanthomonas albilineans</italic> provides new insights into the reductive genome evolution of the xylem-limited <italic>Xanthomonadaceae</italic></article-title><source>BMC Genomics</source><year>2009</year><volume>10</volume><fpage>616</fpage><pub-id pub-id-type="doi">10.1186/1471-2164-10-616</pub-id><pub-id pub-id-type="pmid">20017926</pub-id></citation></ref>
<ref id="b9-genes-02-01050"><label>9.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marguerettaz</surname><given-names>M.</given-names></name><name><surname>Pieretti</surname><given-names>I.</given-names></name><name><surname>Gayral</surname><given-names>P.</given-names></name><name><surname>Puig</surname><given-names>J.</given-names></name><name><surname>Brin</surname><given-names>C.</given-names></name><name><surname>Cociancich</surname><given-names>S.</given-names></name><name><surname>Poussier</surname><given-names>S.</given-names></name><name><surname>Rott</surname><given-names>P.</given-names></name><name><surname>Royer</surname><given-names>M.</given-names></name></person-group><article-title>Genomic and evolutionary features of the SPI-1 type III secretion system that is present in <italic>Xanthomonas albilineans</italic> but is not essential for xylem colonization and symptom development of sugarcane leaf scald</article-title><source>Mol. Plant Microbe Interact.</source><year>2011</year><volume>24</volume><fpage>246</fpage><lpage>259</lpage><pub-id pub-id-type="doi">10.1094/MPMI-08-10-0188</pub-id><pub-id pub-id-type="pmid">20955079</pub-id></citation></ref>
<ref id="b10-genes-02-01050"><label>10.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rott</surname><given-names>P.</given-names></name><name><surname>Fleites</surname><given-names>L.</given-names></name><name><surname>Marlow</surname><given-names>G.</given-names></name><name><surname>Royer</surname><given-names>M.</given-names></name><name><surname>Gabriel</surname><given-names>D.W.</given-names></name></person-group><article-title>Identification of new candidate pathogenicity factors in the xylem-invading pathogen <italic>Xanthomonas albilineans</italic> by transposon mutagenesis</article-title><source>Mol. Plant Microbe Interact.</source><year>2011</year><volume>24</volume><fpage>594</fpage><lpage>605</lpage><pub-id pub-id-type="doi">10.1094/MPMI-07-10-0156</pub-id><pub-id pub-id-type="pmid">21190440</pub-id></citation></ref>
<ref id="b11-genes-02-01050"><label>11.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Studholme</surname><given-names>D.J.</given-names></name><name><surname>Kemen</surname><given-names>E.</given-names></name><name><surname>MacLean</surname><given-names>D.</given-names></name><name><surname>Schornack</surname><given-names>S.</given-names></name><name><surname>Aritua</surname><given-names>V.</given-names></name><name><surname>Thwaites</surname><given-names>R.</given-names></name><name><surname>Grant</surname><given-names>M.</given-names></name><name><surname>Smith</surname><given-names>J.</given-names></name><name><surname>Jones</surname><given-names>J.D.</given-names></name></person-group><article-title>Genome-wide sequencing data reveals virulence factors implicated in banana <italic>Xanthomonas</italic> wilt</article-title><source>FEMS Microbiol. Lett.</source><year>2010</year><volume>310</volume><fpage>182</fpage><lpage>192</lpage><pub-id pub-id-type="doi">10.1111/j.1574-6968.2010.02065.x</pub-id><pub-id pub-id-type="pmid">20695894</pub-id></citation></ref>
<ref id="b12-genes-02-01050"><label>12.</label><note><p>Byachwezi, M. Belgian Development Agency, Dar Es Salaam, Tanzania. Personal communication, 2011.</p></note></ref>
<ref id="b13-genes-02-01050"><label>13.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tamura</surname><given-names>K.</given-names></name><name><surname>Nei</surname><given-names>M.</given-names></name></person-group><article-title>Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees</article-title><source>Mol. Biol. Evol.</source><year>1993</year><volume>10</volume><fpage>512</fpage><lpage>526</lpage><pub-id pub-id-type="pmid">8336541</pub-id></citation></ref>
<ref id="b14-genes-02-01050"><label>14.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tamura</surname><given-names>K.</given-names></name><name><surname>Peterson</surname><given-names>D.</given-names></name><name><surname>Peterson</surname><given-names>N.</given-names></name><name><surname>Stecher</surname><given-names>G.</given-names></name><name><surname>Nei</surname><given-names>M.</given-names></name><name><surname>Kumar</surname><given-names>S.</given-names></name></person-group><article-title>MEGA5: Molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods</article-title><source>Mol. Biol. Evol.</source><year>2011</year><volume>28</volume><fpage>2731</fpage><lpage>2739</lpage><pub-id pub-id-type="doi">10.1093/molbev/msr121</pub-id><pub-id pub-id-type="pmid">21546353</pub-id></citation></ref>
<ref id="b15-genes-02-01050"><label>15.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname><given-names>H.</given-names></name><name><surname>Patil</surname><given-names>P.</given-names></name><name><surname>van Sluys</surname><given-names>M.A.</given-names></name><name><surname>White</surname><given-names>F.F.</given-names></name><name><surname>Ryan</surname><given-names>R.P.</given-names></name><name><surname>Dow</surname><given-names>J.M.</given-names></name><name><surname>Rabinowicz</surname><given-names>P.</given-names></name><name><surname>Salzberg</surname><given-names>S.L.</given-names></name><name><surname>Leach</surname><given-names>J.E.</given-names></name><name><surname>Sonti</surname><given-names>R.</given-names></name><etal/></person-group><article-title>Acquisition and evolution of plant pathogenesis-associated gene clusters and candidate determinants of tissue-specificity in <italic>Xanthomonas.</italic></article-title><source>PLoS ONE</source><year>2008</year><volume>3</volume><fpage>e3828</fpage><pub-id pub-id-type="doi">10.1371/journal.pone.0003828</pub-id><pub-id pub-id-type="pmid">19043590</pub-id></citation></ref>
<ref id="b16-genes-02-01050"><label>16.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharma</surname><given-names>V.</given-names></name><name><surname>Patil</surname><given-names>P.B.</given-names></name></person-group><article-title>Resolving the phylogenetic and taxonomic relationship of <italic>Xanthomonas</italic> and <italic>Stenotrophomonas</italic> strains using complete <italic>rpoB</italic> gene sequence</article-title><source>PLoS Curr.</source><year>2011</year><volume>3</volume><fpage>RRN1239</fpage><pub-id pub-id-type="doi">10.1371/currents.RRN1239</pub-id><pub-id pub-id-type="pmid">21686310</pub-id></citation></ref>
<ref id="b17-genes-02-01050"><label>17.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vauterin</surname><given-names>L.</given-names></name><name><surname>Hoste</surname><given-names>B.</given-names></name><name><surname>Kersters</surname><given-names>K.</given-names></name><name><surname>Swings</surname><given-names>J.</given-names></name></person-group><article-title>Reclassification of <italic>Xanthomonas</italic></article-title><source>Int. J. Syst. Bacteriol.</source><year>1995</year><volume>45</volume><fpage>472</fpage><lpage>489</lpage><pub-id pub-id-type="doi">10.1099/00207713-45-3-472</pub-id></citation></ref>
<ref id="b18-genes-02-01050"><label>18.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gartemann</surname><given-names>K.H.</given-names></name><name><surname>Abt</surname><given-names>B.</given-names></name><name><surname>Bekel</surname><given-names>T.</given-names></name><name><surname>Burger</surname><given-names>A.</given-names></name><name><surname>Engemann</surname><given-names>J.</given-names></name><name><surname>Flügel</surname><given-names>M.</given-names></name><name><surname>Gaigalat</surname><given-names>L.</given-names></name><name><surname>Goesmann</surname><given-names>A.</given-names></name><name><surname>Gräfen</surname><given-names>I.</given-names></name><name><surname>Kalinowski</surname><given-names>J.</given-names></name><etal/></person-group><article-title>The genome sequence of the tomato-pathogenic actinomycete <italic>Clavibacter michiganensis</italic> subsp. <italic>michiganensis</italic> NCPPB382 reveals a large island involved in pathogenicity</article-title><source>J. Bacteriol.</source><year>2008</year><volume>190</volume><fpage>2138</fpage><lpage>2149</lpage><pub-id pub-id-type="doi">10.1128/JB.01595-07</pub-id><pub-id pub-id-type="pmid">18192381</pub-id></citation></ref>
<ref id="b19-genes-02-01050"><label>19.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Purcell</surname><given-names>A.H.</given-names></name><name><surname>Hopkins</surname><given-names>D.L.</given-names></name></person-group><article-title>Fastidious xylem-limited bacterial plant pathogens</article-title><source>Annu. Rev. Phytopathol.</source><year>1996</year><volume>34</volume><fpage>131</fpage><lpage>151</lpage><pub-id pub-id-type="doi">10.1146/annurev.phyto.34.1.131</pub-id><pub-id pub-id-type="pmid">15012538</pub-id></citation></ref>
<ref id="b20-genes-02-01050"><label>20.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Birch</surname><given-names>R.G.</given-names></name><name><surname>Patil</surname><given-names>S.S.</given-names></name></person-group><article-title>Preliminary characterization of an antibiotic produced by <italic>Xanthomonas albilineans</italic> which inhibits DNA synthesis in <italic>Escherichia coli</italic></article-title><source>J. Gen. Microbiol.</source><year>1985</year><volume>131</volume><fpage>1069</fpage><lpage>1075</lpage><pub-id pub-id-type="pmid">2410547</pub-id></citation></ref>
<ref id="b21-genes-02-01050"><label>21.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hashimi</surname><given-names>S.M.</given-names></name><name><surname>Huang</surname><given-names>G.</given-names></name><name><surname>Maxwell</surname><given-names>A.</given-names></name><name><surname>Birch</surname><given-names>R.G.</given-names></name></person-group><article-title>DNA gyrase from the albicidin producer <italic>Xanthomonas albilineans</italic> has multiple-antibiotic-resistance and unusual enzymatic properties</article-title><source>Antimicrob. Agents Chemother.</source><year>2008</year><volume>52</volume><fpage>1382</fpage><lpage>1390</lpage><pub-id pub-id-type="doi">10.1128/AAC.01551-07</pub-id><pub-id pub-id-type="pmid">18268084</pub-id></citation></ref>
<ref id="b22-genes-02-01050"><label>22.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Monteiro-Vitorello</surname><given-names>C.B.</given-names></name><name><surname>Camargo</surname><given-names>L.E.</given-names></name><name><surname>van Sluys</surname><given-names>M.A.</given-names></name><name><surname>Kitajima</surname><given-names>J.P.</given-names></name><name><surname>Truffi</surname><given-names>D.</given-names></name><name><surname>do Amaral</surname><given-names>A.M.</given-names></name><name><surname>Harakava</surname><given-names>R.</given-names></name><name><surname>de Oliveira</surname><given-names>J.C.</given-names></name><name><surname>Wood</surname><given-names>D.</given-names></name><name><surname>de Oliveira</surname><given-names>M.C.</given-names></name><etal/></person-group><article-title>The genome sequence of the gram-positive sugarcane pathogen <italic>Leifsonia xyli</italic> subsp. <italic>xyli</italic></article-title><source>Mol. Plant Microbe Interact.</source><year>2004</year><volume>17</volume><fpage>827</fpage><lpage>836</lpage><pub-id pub-id-type="doi">10.1094/MPMI.2004.17.8.827</pub-id><pub-id pub-id-type="pmid">15305603</pub-id></citation></ref>
<ref id="b23-genes-02-01050"><label>23.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parker</surname><given-names>J.E.</given-names></name><name><surname>Barber</surname><given-names>C.E.</given-names></name><name><surname>Fan</surname><given-names>M.J.</given-names></name><name><surname>Daniels</surname><given-names>M.J.</given-names></name></person-group><article-title>Interaction of <italic>Xanthomonas campestris</italic> with Arabidopsis thaliana: Characterization of a gene from <italic>X. c</italic>. pv. <italic>raphani</italic> that confers avirulence to most <italic>A. thaliana</italic> accessions</article-title><source>Mol. Plant Microbe Interact.</source><year>1993</year><volume>6</volume><fpage>216</fpage><lpage>224</lpage><pub-id pub-id-type="doi">10.1094/MPMI-6-216</pub-id><pub-id pub-id-type="pmid">8471795</pub-id></citation></ref>
<ref id="b24-genes-02-01050"><label>24.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Corbett</surname><given-names>M.</given-names></name><name><surname>Virtue</surname><given-names>S.</given-names></name><name><surname>Bell</surname><given-names>K.</given-names></name><name><surname>Birch</surname><given-names>P.</given-names></name><name><surname>Burr</surname><given-names>T.</given-names></name><name><surname>Hyman</surname><given-names>L.</given-names></name><name><surname>Lilley</surname><given-names>K.</given-names></name><name><surname>Poock</surname><given-names>S.</given-names></name><name><surname>Toth</surname><given-names>I.</given-names></name><name><surname>Salmond</surname><given-names>G.</given-names></name></person-group><article-title>Identification of a new quorum-sensing-controlled virulence factor in <italic>Erwinia carotovora</italic> subsp. <italic>atroseptica</italic> secreted via the type II targeting pathway</article-title><source>Mol. Plant Microbe Interact.</source><year>2005</year><volume>18</volume><fpage>334</fpage><lpage>342</lpage><pub-id pub-id-type="doi">10.1094/MPMI-18-0334</pub-id><pub-id pub-id-type="pmid">15828685</pub-id></citation></ref>
<ref id="b25-genes-02-01050"><label>25.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kazemi-Pour</surname><given-names>N.</given-names></name><name><surname>Condemine</surname><given-names>G.</given-names></name><name><surname>Hugouvieux-Cotte-Pattat</surname><given-names>N.</given-names></name></person-group><article-title>The secretome of the plant pathogenic bacterium <italic>Erwinia chrysanthemi</italic></article-title><source>Proteomics</source><year>2004</year><volume>4</volume><fpage>3177</fpage><lpage>3186</lpage><pub-id pub-id-type="doi">10.1002/pmic.200300814</pub-id><pub-id pub-id-type="pmid">15378709</pub-id></citation></ref>
<ref id="b26-genes-02-01050"><label>26.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jha</surname><given-names>G.</given-names></name><name><surname>Rajeshwari</surname><given-names>R.</given-names></name><name><surname>Sonti</surname><given-names>R.V.</given-names></name></person-group><article-title>Bacterial type two secretion system secreted proteins: Double-edged swords for plant pathogens</article-title><source>Mol. Plant Microbe Interact.</source><year>2005</year><volume>18</volume><fpage>891</fpage><lpage>898</lpage><pub-id pub-id-type="doi">10.1094/MPMI-18-0891</pub-id><pub-id pub-id-type="pmid">16167759</pub-id></citation></ref>
<ref id="b27-genes-02-01050"><label>27.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Katoh</surname><given-names>K.</given-names></name><name><surname>Asimenos</surname><given-names>G.</given-names></name><name><surname>Toh</surname><given-names>H.</given-names></name></person-group><article-title>Multiple alignment of DNA sequences with MAFFT</article-title><source>Methods Mol. Biol.</source><year>2009</year><volume>537</volume><fpage>39</fpage><lpage>64</lpage><pub-id pub-id-type="pmid">19378139</pub-id></citation></ref>
<ref id="b28-genes-02-01050"><label>28.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>H.</given-names></name><name><surname>Durbin</surname><given-names>R.</given-names></name></person-group><article-title>Fast and accurate short read alignment with Burrows-Wheeler transform</article-title><source>Bioinformatics</source><year>2009</year><volume>25</volume><fpage>1754</fpage><lpage>1760</lpage><pub-id pub-id-type="doi">10.1093/bioinformatics/btp324</pub-id><pub-id pub-id-type="pmid">19451168</pub-id></citation></ref>
<ref id="b29-genes-02-01050"><label>29.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altschul</surname><given-names>S.F.</given-names></name><name><surname>Gish</surname><given-names>W.</given-names></name><name><surname>Miller</surname><given-names>W.</given-names></name><name><surname>Myers</surname><given-names>E.W.</given-names></name><name><surname>Lipman</surname><given-names>D.J.</given-names></name></person-group><article-title>Basic local alignment search tool</article-title><source>J. Mol. Biol.</source><year>1990</year><volume>215</volume><fpage>403</fpage><lpage>410</lpage><pub-id pub-id-type="pmid">2231712</pub-id></citation></ref>
<ref id="b30-genes-02-01050"><label>30.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Delcher</surname><given-names>A.L.</given-names></name><name><surname>Phillippy</surname><given-names>A.</given-names></name><name><surname>Carlton</surname><given-names>J.</given-names></name><name><surname>Salzberg</surname><given-names>S.L.</given-names></name></person-group><article-title>Fast algorithms for large-scale genome alignment and comparison</article-title><source>Nucleic Acids Res.</source><year>2002</year><volume>30</volume><fpage>2478</fpage><lpage>2483</lpage><pub-id pub-id-type="doi">10.1093/nar/30.11.2478</pub-id><pub-id pub-id-type="pmid">12034836</pub-id></citation></ref>
<ref id="b31-genes-02-01050"><label>31.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stothard</surname><given-names>P.</given-names></name><name><surname>Wishart</surname><given-names>D.S.</given-names></name></person-group><article-title>Circular genome visualization and exploration using CGView</article-title><source>Bioinformatics</source><year>2005</year><volume>21</volume><fpage>537</fpage><lpage>539</lpage><pub-id pub-id-type="doi">10.1093/bioinformatics/bti054</pub-id><pub-id pub-id-type="pmid">15479716</pub-id></citation></ref>
<ref id="b32-genes-02-01050"><label>32.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zerbino</surname><given-names>D.R.</given-names></name><name><surname>Birney</surname><given-names>E.</given-names></name></person-group><article-title>Velvet: Algorithms for <italic>de novo</italic> short read assembly using de Bruijn graphs</article-title><source>Genome Res.</source><year>2008</year><volume>18</volume><fpage>821</fpage><lpage>829</lpage><pub-id pub-id-type="doi">10.1101/gr.074492.107</pub-id><pub-id pub-id-type="pmid">18349386</pub-id></citation></ref>
<ref id="b33-genes-02-01050"><label>33.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aziz</surname><given-names>R.K.</given-names></name><name><surname>Bartels</surname><given-names>D.</given-names></name><name><surname>Best</surname><given-names>A.A.</given-names></name><name><surname>DeJongh</surname><given-names>M.</given-names></name><name><surname>Disz</surname><given-names>T.</given-names></name><name><surname>Edwards</surname><given-names>R.A.</given-names></name><name><surname>Formsma</surname><given-names>K.</given-names></name><name><surname>Gerdes</surname><given-names>S.</given-names></name><name><surname>Glass</surname><given-names>E.M.</given-names></name><name><surname>Kubal</surname><given-names>M.</given-names></name><etal/></person-group><article-title>The RAST Server: Rapid annotations using subsystems technology</article-title><source>BMC Genomics</source><year>2008</year><volume>9</volume><fpage>75</fpage><pub-id pub-id-type="doi">10.1186/1471-2164-9-75</pub-id><pub-id pub-id-type="pmid">18261238</pub-id></citation></ref></ref-list></back></article>
