<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="review-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">genes</journal-id>
      <journal-title>Genes</journal-title>
      <abbrev-journal-title abbrev-type="publisher">Genes</abbrev-journal-title>
      <abbrev-journal-title abbrev-type="pubmed">Genes</abbrev-journal-title>
      <issn pub-type="epub">2073-4425</issn>
      <publisher>
        <publisher-name>MDPI</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.3390/genes3040634</article-id>
      <article-id pub-id-type="publisher-id">genes-03-00634</article-id>
      <article-categories>
        <subj-group>
          <subject>Review</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Factors Behind Junk DNA in Bacteria</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Gil</surname>
            <given-names>Rosario</given-names>
          </name>
          <xref rid="af1-genes-03-00634" ref-type="aff">1</xref>
          <xref rid="c1-genes-03-00634" ref-type="corresp">*</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Latorre</surname>
            <given-names>Amparo</given-names>
          </name>
          <xref rid="af1-genes-03-00634" ref-type="aff">1</xref>
          <xref rid="af2-genes-03-00634" ref-type="aff">2</xref>
        </contrib>
      </contrib-group>
      <aff id="af1-genes-03-00634"><label>1 </label>Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, Apartado Postal 22085, 46071 València, Spain; Departament de Genètica, Universitat de València, Dr. Moliner, 50, 46100 Burjassot (València), Spain</aff>
      <aff id="af2-genes-03-00634"><label>2 </label>Área de Genómica y Salud, Centro Superior de Investigación en Salud Pública (CSISP), Avenida de Cataluña 21, 46020 Valencia, Spain; E-Mail: <email>amparo.latorre@uv.es</email></aff>
      <author-notes>
        <corresp id="c1-genes-03-00634"><label>*</label> Author to whom correspondence should be addressed; E-Mail: <email>rosario.gil@uv.es</email>; Tel.: +34-96-35-43824; Fax: +34-96-35-43670.</corresp>
      </author-notes>
      <pub-date pub-type="epub">
        <day>12</day>
        <month>10</month>
        <year>2012</year>
      </pub-date>
      <pub-date pub-type="collection"><month>12</month>
        <year>2012</year>
      </pub-date>
      <volume>3</volume>
      <issue>4</issue>
      <fpage>634</fpage>
      <lpage>650</lpage>
      <history>
        <date date-type="received">
          <day>25</day>
          <month>07</month>
          <year>2012</year>
        </date>
        <date date-type="rev-recd">
          <day>11</day>
          <month>09</month>
          <year>2012</year>
        </date>
        <date date-type="accepted">
          <day>25</day>
          <month>09</month>
          <year>2012</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>© 2012 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
        <copyright-year>2012</copyright-year>
        <license xmlns:xlink="http://www.w3.org/1999/xlink" license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/">
          <p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p>
        </license>
      </permissions>
      <abstract>
        <p>Although bacterial genomes have been traditionally viewed as being very compact, with relatively low amounts of repetitive and non-coding DNA, this view has dramatically changed in recent years. The increase of available complete bacterial genomes has revealed that many species present abundant repetitive DNA (<italic>i.e.</italic>, insertion sequences, prophages or paralogous genes) and that many of these sequences are not functional but can have evolutionary consequences as concerns the adaptation to specialized host-related ecological niches. Comparative genomics analyses with close relatives that live in non-specialized environments reveal the nature and fate of this bacterial junk DNA. In addition, the number of insertion sequences and pseudogenes, as well as the size of the intergenic regions, can be used as markers of the evolutionary stage of a genome.</p>
      </abstract>
      <kwd-group>
        <kwd>junk DNA</kwd>
        <kwd>pseudogenes</kwd>
        <kwd>intergenic regions (IGR)</kwd>
        <kwd>insertion sequences (IS)</kwd>
        <kwd>genome degradation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec sec-type="intro">
      <title>1. Introduction</title>
      <sec>
        <title>1.1. Genome Size and Junk DNA</title>
        <p>The availability of an increasing number of complete genome sequences has allowed researchers to extract general rules about their shape and dynamics, as well as to analyze the molecular and evolutionary forces involved. Eukaryotes have a wide range of genome sizes, from the 6.5 Mb of the pneumonia-causing fungi <italic>Pneumocystis carinii</italic> f. sp. <italic>muris</italic> to the 133 Gb of the marbled lungfish <italic>Protopterus aethiopicus</italic> [<xref ref-type="bibr" rid="B1-genes-03-00634">1</xref>]. However, it has been demonstrated that theincrease of genome size is not correlated with an increase in complexity or gene number, as bigger eukaryotic genomes tend to increase the amount of non-coding DNA (ncDNA) and contain large amounts of transposable elements [<xref ref-type="bibr" rid="B2-genes-03-00634">2</xref>]. The high percentage of ncDNA present in higher eukaryotes genomes was already noticed in the 1970s, when it was considered as “junk DNA” [<xref ref-type="bibr" rid="B3-genes-03-00634">3</xref>]. However, it has become obvious that many of these non-coding sequences are fully functional. Therefore, it would be more appropriate to refer to junk DNA only when we are talking about DNA that has “little obvious function” and “contributes little or nothing to the fitness of the organism” [<xref ref-type="bibr" rid="B4-genes-03-00634">4</xref>]. This would include inter- or intragenic regions that have no known function, pseudogenes, repeated sequences and selfish DNA such as transposons and viral elements.</p>
        <p>On the other side, prokaryotic genomes are in general smaller and less variable than eukaryotic ones, ranging from less than 200 kb to more than 11 Mb (<uri>http://www.genomesonline.org</uri>). The size distribution in the different bacterial taxonomic groups shows that species with large and small genomes coexist in the different lineages and important size variation can even be observed among strains of the same species. <italic>Escherichia coli</italic> genomes can range from 4.6 to 5.7 Mb; <italic>Procholoroccus marinus</italic>, from 1.6 to 2.7 Mb; <italic>Pseudomonas fluorescens</italic>, from 6.3 to 7.1 Mb, or <italic>Buchnera aphidicola,</italic> from 0.4 to 0.7 Mb. All these examples illustrate that genome size is highly variable in prokaryotes and that, contrary to eukaryotes, it may change drastically even within a short divergence time. Variations in genome size are mostly related to the bacteria lifestyle, although different mechanisms seem to be acting in free-living and host-dependent bacteria [<xref ref-type="bibr" rid="B5-genes-03-00634">5</xref>]. Regarding gene content, and also contrary to eukaryotes, bacteria and archaea genomes tend to be very compact, with genes occupying around 90% of the genome for most of the species [<xref ref-type="bibr" rid="B6-genes-03-00634">6</xref>]. Intergenic regions (IGRs) are usually short but variable in size, with mean values as small as 3 bp in <italic>Pelagibacter ubique</italic>, 85 bp in <italic>E. coli</italic>, or 151 bp in <italic>Yersinia pestis</italic> [<xref ref-type="bibr" rid="B7-genes-03-00634">7</xref>]. Therefore, in general, genome size shows a strong positive correlation with gene number, although exceptions to this compactness are accumulating in recent years. Many sequenced bacterial genomes present a lower gene density than average due to the accumulation of different types of sequences that, similarly to eukaryotic genomes, can be considered “junk DNA”. Some organisms possess a large quantity of pseudogenes, as it is the case of <italic>Mycobacterium leprae</italic> [<xref ref-type="bibr" rid="B8-genes-03-00634">8</xref>], <italic>Serratia symbiotica</italic> SAp [<xref ref-type="bibr" rid="B9-genes-03-00634">9</xref>] or <italic>Sodalis glossinidius</italic> [<xref ref-type="bibr" rid="B10-genes-03-00634">10</xref>,<xref ref-type="bibr" rid="B11-genes-03-00634">11</xref>], while others, such as <italic>S. symbiotica</italic> SCs [<xref ref-type="bibr" rid="B12-genes-03-00634">12</xref>] or <italic>Rickettsia prowazekii</italic> [<xref ref-type="bibr" rid="B13-genes-03-00634">13</xref>], show long IGRs. There are also genomes where genetic parasites, such as transposable elements and bacteriophages, are highly present, as it is the case of SOPE [<xref ref-type="bibr" rid="B14-genes-03-00634">14</xref>]. In this review we will not take into consideration bacteriophages as junk DNA because it has been shown that genes of a phage origin have been co-opted by different bacteria for their own profit. Thus, bacteriophages are known to carry key virulence factors for pathogenic bacteria [<xref ref-type="bibr" rid="B15-genes-03-00634">15</xref>] and, although their roles in symbiotic bacteria are less understood, Oliver <italic>et al.</italic> (2009) [<xref ref-type="bibr" rid="B16-genes-03-00634">16</xref>] suggested that a phage-encoded toxin is responsible for the protecting role of <italic>Hamiltonella defensa</italic>, a facultative symbiont of the pea aphid <italic>Acyrtosiphon pisum,</italic> against the attack of parasitoids. Furthermore, though several studies are trying to manage this problem [<xref ref-type="bibr" rid="B17-genes-03-00634">17</xref>,<xref ref-type="bibr" rid="B18-genes-03-00634">18</xref>,<xref ref-type="bibr" rid="B19-genes-03-00634">19</xref>], the present standards of annotation for genomes available in the databases make difficult to assess when a putative functional gene has a phage origin or is just a non-functional remnant of an ancestral phage gene [<xref ref-type="bibr" rid="B20-genes-03-00634">20</xref>].</p>
      </sec>
      <sec>
        <title>1.2. Bacterial Endosymbionts as a Model</title>
        <p>Our group has been involved for more than a decade in the study of the genomes of endosymbionts, bacteria that live inside specialized eukaryotic cells. Many bacteria live in obligate association with eukaryotes (see [<xref ref-type="bibr" rid="B21-genes-03-00634">21</xref>] and [<xref ref-type="bibr" rid="B22-genes-03-00634">22</xref>] for complete reviews on the subject). Based on the effect of the bacterium on its host, these relationships are classified as mutualistic, commensal or parasitic but, from the bacterial point of view, the basic requirements to successfully maintain this kind of association are the same. They need to overcome the physical, cellular and molecular barriers presented by the host to achieve survival, proliferation and the infection of a new host. Therefore, the evolutionary processes suffered by both pathogens and mutualists are very similar. The rich and stable environment provided by the host cells leads to dramatic changes in the bacterial genome composition due to a relaxation of the selective forces on genes that become non-essential in the new environment, alongside an increase in the effects of genetic drift caused by bottlenecking over bacterial generations. These changes include a gradual reduction in genome size as well as variation in the coding capacity and gene density, together with changes in the number of genes and pseudogenes, and in the presence of insertion sequences (IS). Although endosymbiotic bacteria that keep a long term relationship with their hosts have extremely reduced and (usually) compact genomes, without traces of phage remnants, sequences acquired through horizontal gene transfer (HGT) and ISs and with limited amounts of pseudogenes, newly acquired endosymbionts still retain many of these features in their genomes, probably reflecting an intermediate step before the massive genome reduction starts. To better understand the molecular and evolutionary bases of endosymbiosis, the genomes of bacteria in different stages of the symbiotic integration process have been analyzed. These genomes and their comparison with those from their free-living relatives can be also useful in understanding the fate of bacterial junk-DNA. Or, on the other hand, the analysis of junk DNA can be used as a marker of the integration process of endosymbiotic bacteria.</p>
      </sec>
    </sec>
    <sec>
      <title>2. The Impact of Pseudogenes in Bacterial Genomes</title>
      <p>Pseudogenes are inactivated copies of known genes, which present an erosion or disruption either of their reading frames or their regulatory regions. They are usually detected by comparative analysis with their functional orthologs in close relative species. The alignment of such orthologous genes allows the detection of mutations, insertions/deletions (indels), frameshifts, premature stop codons and insertion of transposable elements, all of which cause the appearing of a truncated protein, or the inactivation or loss of essential functional domains. However, it is not always easy to assess the functional status of each annotated coding region within a genome for several reasons. First of all, orthology cannot be detected above a threshold of divergence between sequences. Second, not all frameshifts, indels, stop mutations and rearrangements will inactivate a gene and there is a wide length variation of homologous proteins within families. An additional problem is that there is no clear definition of what has to be considered a pseudogene and there are inconsistencies in the methods used in different studies, so that it is not always possible to compare the pseudogene content from different genomes. In some studies, any shortened open reading frame (ORF) is annotated as a pseudogene [<xref ref-type="bibr" rid="B23-genes-03-00634">23</xref>], while in others considerably shortened homologs are annotated as genes if they keep complete domains specifying some function [<xref ref-type="bibr" rid="B24-genes-03-00634">24</xref>]. Lerat and Ochman (2005) [<xref ref-type="bibr" rid="B25-genes-03-00634">25</xref>] considered a pseudogene only when more of 20% of the length of the primary sequence of the encoded protein was lost, while some other studies excluded from this category impairments in homology matches within a “cutoff” region at either end, considering that slightly shorter alignments can reflect functional protein [<xref ref-type="bibr" rid="B26-genes-03-00634">26</xref>]. The most widely used method to identify non-functional genes involves the comparison of nucleotide substitution rates at synonymous (<italic>K</italic><sub>s</sub>) and non-synonymous sites (<italic>K</italic><sub>a</sub>), using the <italic>K</italic><sub>a</sub>/<italic>K</italic><sub>s</sub> test [<xref ref-type="bibr" rid="B27-genes-03-00634">27</xref>]. Regions without functional constraints, such as pseudogenes, are expected to have <italic>K</italic><sub>a</sub>/<italic>K</italic><sub>s</sub> ratios not significantly different from one. However, comparative analyses can only detect pseudogenes in genes that have orthologs in other available genomes, and only if the inactivation is due to truncations or disruptions of the original ORF, which represents only a fraction of the potentially inactivated genes. Genes that have been inactivated by missense mutations or changes in regulatory regions will also remain undetected by these methods. </p>
      <p>At the beginning of the Genomics era, pseudogenes were thought to be unusual in bacteria. This idea has completely changed now that it is possible to perform comparative genomics on different strains of the same species, on closely related species with different lifestyles or on those living in different environments. In fact, when the <italic>E. coli</italic> MG1665 genome was first reported, only one pseudogene was annotated among its 4288 coding regions, but recent studies revealed that this genome contains about 100 genes that retain less than 20% of the annotated sequence in other <italic>E. coli</italic> strains, some of which (at least) are likely to be non-functional [<xref ref-type="bibr" rid="B28-genes-03-00634">28</xref>]. Nevertheless, pseudogenes were early noticed in significant amounts in some bacterial pathogens such as <italic>R. prowazekii</italic> [<xref ref-type="bibr" rid="B13-genes-03-00634">13</xref>] or <italic>M. leprae</italic> [<xref ref-type="bibr" rid="B8-genes-03-00634">8</xref>], the latter being one of the most extreme cases known to date, with 1614 protein-coding genes and 1133 annotated pseudogenes, 40% of its 3.2-Mb genome. Since then, several comprehensive studies have been performed trying to extract general rules that explain the dynamics of pseudogenes within bacterial genomes. </p>
      <p>Liu <italic>et al.</italic> (2004) [<xref ref-type="bibr" rid="B26-genes-03-00634">26</xref>] analyzed 64 prokaryote genomes and defined a conservative set of conditions to detect their proportion of pseudogenes. In this analysis, they do not include hypothetical or putative proteins, because a large proportion of them could be over-annotated. However, in order to maximize the efficiency of pseudogene finding, they searched for remnants of ancient genes in the regions that had been considered as IGRs in the original annotations. Using this approach, they found that pseudogenes are pervasive in prokaryotes, accounting for 1 to 5% of most analyzed genomes. They didn’t find a clear correlation between percentage of pseudogenes and lifestyle: taking aside the extreme case of <italic>M. leprae</italic>, the pseudogene fraction of the analyzed archaea, non-pathogenic and pathogenic bacteria were fairly similar (3.6, 3.9 and 3.3% respectively). However, and as mentioned in the introduction, it has been shown later on that some bacteria that have recently adopted an obligate host-dependent lifestyle (either mutualistic or parasitic) present higher amounts of pseudogenes, probably related with a rapid change to the new environments in which some previously encoded functions are no longer needed. The rapid adaptation to new environments also explains the situation of <italic>M. leprae</italic> and its differences with its close relative <italic>M. tuberculosis</italic> [<xref ref-type="bibr" rid="B29-genes-03-00634">29</xref>] in that, with 3959 protein-coding genes, has only six identified pseudogenes. A comparative analysis of both genomes suggested that the pseudogenes in <italic>M. leprae</italic> have degenerated by gene-by-gene inactivation mostly after the divergence of these two clades [<xref ref-type="bibr" rid="B30-genes-03-00634">30</xref>]. Additionally, genomes from host-dependent symbionts that are suffering a genome reductive syndrome are known to contain many pseudogenes in regions that were previously annotated as IGRs. Thus, when the genome of <italic>R. prowazekii</italic>, the a-proteobacteria that causes typhus was sequenced [<xref ref-type="bibr" rid="B13-genes-03-00634">13</xref>], it was found that a considerable portion of its 1.1-Mb genome (22.9%) is covered by ncDNA and pseudogenes. A comparative analysis of this genome with the genome of three representative species of the genus <italic>Rickettsia</italic> showed that most of the IGRs were, in fact, remnants of ancestral genes with a high degree of degradation [<xref ref-type="bibr" rid="B24-genes-03-00634">24</xref>,<xref ref-type="bibr" rid="B31-genes-03-00634">31</xref>]. Even in smaller genomes, such as those from four different strains of <italic>B. aphidicola</italic> (the bacterial endosymbiont of aphids), longer IGRs in one strain contain the remnants of lost genes in other lineages [<xref ref-type="bibr" rid="B32-genes-03-00634">32</xref>]. </p>
      <p>As it would be expected, the pseudogenes identified by Liu <italic>et al.</italic> (2004) [<xref ref-type="bibr" rid="B26-genes-03-00634">26</xref>] cluster mainly in specific families related with environmental responses (such as specific nutrient transporters or processing, and related with antigenic variation). Other disabled genes correspond to hypothetical and unknown proteins, belong to transposable elements and bacteriophages, or are remnants of horizontally-transferred genes. Later on, Lerat and Ochman (2005) [<xref ref-type="bibr" rid="B25-genes-03-00634">25</xref>] determined the pseudogene content of 11 available complete genomes from four bacterial genera (<italic>Staphylococcus, Streptococcus, Yersinia</italic> and <italic>Vibrio</italic>), each of which include at least one human pathogen, which were supposed to accumulate pseudogenes compared with their free-living relatives. In addition to the gene families identified in the previous study [<xref ref-type="bibr" rid="B26-genes-03-00634">26</xref>], in obligate host-associated bacteria several broadly distributed genes involved in nucleotide processing, repair or replication appear as pseudogenes. The loss of genes needed for DNA repair and recombination are also among the first losses detected in bacteria that have recently acquired an obligate endosymbiotic lifestyle [<xref ref-type="bibr" rid="B33-genes-03-00634">33</xref>], thus contributing to the accumulation of pseudogenes in these species [<xref ref-type="bibr" rid="B34-genes-03-00634">34</xref>]. </p>
    </sec>
    <sec>
      <title>3. IS Elements Shaping Bacterial Genomes</title>
      <p>Mobile genetic elements, such as phages, plasmids and transposable elements are widespread in prokaryotes, where they can represent a significant proportion of their DNA and play important roles in shaping their genomes. The simplest and most abundant transposable elements in bacteria are ISs. Although several classification schemes have been proposed, the one defined by Mahillon and Chandler (1998) [<xref ref-type="bibr" rid="B35-genes-03-00634">35</xref>] has become the most widely used. It is the format used by ISfinder (<uri>www-is.biotoul.fr</uri>) [<xref ref-type="bibr" rid="B36-genes-03-00634">36</xref>], a repository of ISs isolated from bacteria and archaea that also provides extensive background information, as well as an updated and comprehensive classification in IS families and subfamilies, and a proposal for a coherent nomenclature. These small elements are very compact and consist on a short DNA sequence, usually between 0.6 and 2.5 kb in length, able to translocate within and among replicons. A typical IS element only codes for proteins involved in its transposition, flanked by short inverted repeats (IR) of around 10 to 40 bp. Usually, their transposition generates a small duplication (2 to 14 bp) of the target DNA flanking the insertion point. Based on similarities in their transposases and IRs, and the length of their target site sequence, ISs can be grouped into 20 major families. IS elements are important factors involved in genetic variability, since they can promote genomic rearrangements, and generate mutations by their insertion within genes or regulatory sequences. This capacity to generate genome damage is probably one of the reasons why transposition is usually strongly regulated and maintained at a low level in free-living bacteria [<xref ref-type="bibr" rid="B37-genes-03-00634">37</xref>]. In fact, most bacterial genomes contain only a few copies of a limited number of IS types [<xref ref-type="bibr" rid="B38-genes-03-00634">38</xref>]. However, in bacteria that have recently adopted an intracellular lifestyle, there is a reduction in the selective pressure to keep the ISs regulated, and a massive expansion of those elements can take place (see below). </p>
      <p>As it happens with the pseudogenes, the quality of IS annotation in sequenced genomes is highly heterogeneous due to the different annotation methods and nomenclature used by different researchers, and the diverse interests that drive the functional analysis of a given genome. In fact, it is not uncommon that annotation focuses only on the potential protein-coding sequences included in the elements, but ignores their IR boundaries and the directed repeats generated by their insertions, as well as the vestiges of ancestral ISs [<xref ref-type="bibr" rid="B36-genes-03-00634">36</xref>]. Nevertheless, several global surveys of available bacterial genomes have been performed in recent years, allowing conclusions to be obtained regarding the distribution, dynamics and evolution of these elements.</p>
      <p>Many different hypothesis have been proposed to explain the abundance and variability of IS elements among prokaryote genomes. Wagner <italic>et al.</italic> (2007) [<xref ref-type="bibr" rid="B38-genes-03-00634">38</xref>] used IScan, a free open source package, to examine the more than 2000 IS elements present in 438 completely sequenced bacterial genomes in the most comprehensive analysis to date. They found a high homogeneity pattern of IS families across vast taxonomic scales, consistent with previous and more limited works [<xref ref-type="bibr" rid="B39-genes-03-00634">39</xref>,<xref ref-type="bibr" rid="B40-genes-03-00634">40</xref>,<xref ref-type="bibr" rid="B41-genes-03-00634">41</xref>]. Such high-sequence homogeneity could be explained by the rapid spreading of ISs within a genome, in addition to other genetic mechanisms such as gene conversion. In the evolutionary scenario proposed by Wagner <italic>et al.</italic>, after an IS enters a genome, its copy number expands rapidly through transposition. Consequently, there is a low degree of diversity among the different copies of an IS present in a given genome. Eventually, due to the deleterious effect of their accumulation, IS elements tend to disappear and may become extinct from the lineage. However, later on, it may be reintroduced through HGT. The impact of HGT in the spreading of these sequences is revealed by the presence of closely related IS elements of the same family in non-closely-related genomes. However, HGT appears to be necessary but not sufficient for the presence of ISs, since their abundance within a genome does not depend on the level at which genomes are invaded by the elements [<xref ref-type="bibr" rid="B42-genes-03-00634">42</xref>].</p>
      <p>Touchon and Rocha (2007) [<xref ref-type="bibr" rid="B42-genes-03-00634">42</xref>] performed a comprehensive reannotation and analysis of the putative functional ISs present in 262 prokaryote genomes in order to test for IS family specificity, the influence of host genome size, pathogenicity, or human association in the IS abundance or density. They found that IS distribution in prokaryotic genomes strongly correlates only with genome size, probably due to a decrease on the density of highly deleterious insertion sites with genome size. They hypothesized that IS abundance is mostly determined by selection and, therefore, the effective population sizes of the microorganism would strongly influence IS abundance. Thus, the high increase in the amount of ISs that has been detected in some bacterial pathogens, even though they have smaller genomes, would be the consequence of a recent reduction of effective population sizes, not of their parasitic lifestyle. However, it must be taken into account that it is their host-dependent lifestyle that determines that many genes become redundant or superfluous, which also diminishes the effectiveness of natural selection, because there are more sites that can be substrate of IS transposition without affecting the fitness of the microorganism.</p>
      <p>On a more recent study, Newton and Bordenstein (2011) [<xref ref-type="bibr" rid="B43-genes-03-00634">43</xref>] analyzed 384 bacterial genomes on the light of their phylogenetic relationships, genome sizes and ecology, in order to test whether there is a correlation between any of these factors and the mobile elements density (which in this study included phages, plasmids and transposable elements). They found that the density of mobile DNA elements only correlates with bacterial ecology. ISs account for the majority of mobile DNA elements in nearly half of the analyzed genomes, and there is a significant variation on the amount of them depending on the bacterial lifestyles (free-living, facultative intracellular and obligate intracellular bacteria), and also between horizontally and vertically transmitted obligate intracellular bacteria. The increase in the amount of IS elements in bacteria that have recently evolved as specialized pathogens or acquired an intracellular lifestyle had already been noticed in previous studies [<xref ref-type="bibr" rid="B44-genes-03-00634">44</xref>]. Since these bacteria sequestered inside eukaryotic cells can not exchange material with other bacteria through HGT, the massive presence of ISs must be due to an increase in the replicative transposition of elements that were resident at the onset of the obligate symbiosis [<xref ref-type="bibr" rid="B45-genes-03-00634">45</xref>], when many genes that had become non-essential can accumulate ISs without a detrimental effect. In addition, the high abundance of IS elements is also an important source of chromosomal rearrangements suffered by these genomes at this point [<xref ref-type="bibr" rid="B46-genes-03-00634">46</xref>]. Some studies emphasize that the persistence of IS elements in bacterial genomes can not be only explained by the balance of their propagation as selfish elements and the control of the host genome to avoid deleterious effects, but also due to their ability to promote adaptive evolution of the host genomes by generating beneficial mutations that increase the fitness of the host [<xref ref-type="bibr" rid="B47-genes-03-00634">47</xref>]. However, such benefits do not seem to be acting on extremely reduced genomes with a large evolutionary history with their hosts, since IS elements have been completely lost in their genomes. Their absence in the latter stages of the endosymbiosis can mostly explain the extreme chromosomal stasis observed in the genomes of the different <italic>B. aphidicola</italic> strains that have been sequenced [<xref ref-type="bibr" rid="B48-genes-03-00634">48</xref>].</p>
    </sec>
    <sec>
      <title>4. Junk DNA as a Marker of the Symbiotic Integration Process</title>
      <p>In recent years, new information has been obtained by the study of bacterial endosymbionts in different stages of their relationship with their respective hosts. Several cases are especially useful to illustrate the dynamics of junk DNA in the genomes of bacteria that acquire a specialized lifestyle and will be analyzed in detail in this section and summarized in <xref ref-type="table" rid="genes-03-00634-t001">Table 1</xref>. </p>
      <sec>
        <title>4.1. Serratia symbiotica, the Missing Link from Free-Living to Obligate Mutualism</title>
        <p><italic>S. symbiotica</italic> is a g-proteobacterium that appears in symbiotic association with different aphid species. In most cases, as for the pea aphid <italic>A. pisum,</italic> it appears as a facultative symbiont (strain SAp) [<xref ref-type="bibr" rid="B9-genes-03-00634">9</xref>]. However, in the case of the cedar aphid <italic>Cinara cedri</italic>, <italic>S. symbiotica</italic> SCc has established a permanent and obligate consortium with <italic>B. aphidicola</italic> [<xref ref-type="bibr" rid="B12-genes-03-00634">12</xref>], the primary endosymbiont of aphids. Therefore, S<italic>. symbiotica</italic> SCc is present in all cedar aphid populations and both bacteria maintain a metabolic complementation for the provision of some essential nutrients, such as tryptophan, as revealed by functional analyses of their genomes. It is worth noticing that <italic>B. aphidicola</italic> BCc from <italic>C. cedri</italic> possess the smallest genome among all the sequenced <italic>Buchnera</italic> strains [<xref ref-type="bibr" rid="B52-genes-03-00634">52</xref>].</p>
      <table-wrap id="genes-03-00634-t001" position="float">
        <object-id pub-id-type="pii">genes-03-00634-t001_Table 1</object-id>
        <label>Table 1</label>
        <caption>
          <p>Relevant genomic features of selected bacteria with different lifestyles</p>
        </caption>
        <table rules="all" style="border: solid thin">
          <thead>
            <tr>
              <th align="center" valign="middle">Species</th>
              <th align="center" valign="middle">Lifestyle</th>
              <th align="center" valign="middle">Genome Size (kb)</th>
              <th align="center" valign="middle">CDS</th>
              <th align="center" valign="middle">Pseudogenes</th>
              <th align="center" valign="middle">IGR Mean Size (bp)</th>
              <th align="center" valign="middle">% Coding Density </th>
              <th align="center" valign="middle">Presence of ISs (%)</th>
              <th align="center" valign="middle">Data Source</th>
            </tr>
          </thead>
          <tbody>
            <tr>
              <td align="left" valign="middle"><italic>M. leprae</italic> TN</td>
              <td align="left" valign="middle">human parasite</td>
              <td align="center" valign="middle">3,268</td>
              <td align="center" valign="middle">1614</td>
              <td align="center" valign="middle">1293</td>
              <td align="center" valign="middle">ND</td>
              <td align="center" valign="middle">ND</td>
              <td align="center" valign="middle">Yes </td>
              <td align="center" valign="middle">[<xref ref-type="bibr" rid="B30-genes-03-00634">30</xref>,<xref ref-type="bibr" rid="B49-genes-03-00634">49</xref>]</td>
            </tr>
            <tr>
              <td align="left" valign="middle"><italic>M. tuberculosis</italic> H37Rv</td>
              <td align="left" valign="middle">human parasite</td>
              <td align="center" valign="middle">4,411</td>
              <td align="center" valign="middle">4006</td>
              <td align="center" valign="middle">6*</td>
              <td align="center" valign="middle">ND</td>
              <td align="center" valign="middle">ND</td>
              <td align="center" valign="middle">1.5</td>
              <td align="center" valign="middle">[<xref ref-type="bibr" rid="B29-genes-03-00634">29</xref>,<xref ref-type="bibr" rid="B44-genes-03-00634">44</xref>,<xref ref-type="bibr" rid="B50-genes-03-00634">50</xref>]</td>
            </tr>
            <tr>
              <td align="left" valign="middle"><italic>S. symbiotica</italic> SAp</td>
              <td align="left" valign="middle">pea aphid facultative symbiont</td>
              <td align="center" valign="middle">2,789</td>
              <td align="center" valign="middle">2098</td>
              <td align="center" valign="middle">550</td>
              <td align="center" valign="middle">204.3</td>
              <td align="center" valign="middle">60.9</td>
              <td align="center" valign="middle">Yes </td>
              <td align="center" valign="middle">[<xref ref-type="bibr" rid="B9-genes-03-00634">9</xref>,<xref ref-type="bibr" rid="B12-genes-03-00634">12</xref>]</td>
            </tr>
            <tr>
              <td align="left" valign="middle"><italic>S. symbiotica</italic> SCc</td>
              <td align="left" valign="middle">cedar aphid facultative symbiont</td>
              <td align="center" valign="middle">1,763</td>
              <td align="center" valign="middle">672</td>
              <td align="center" valign="middle">58</td>
              <td align="center" valign="middle">1672.01</td>
              <td align="center" valign="middle">38.7</td>
              <td align="center" valign="middle">No</td>
              <td align="center" valign="middle">[<xref ref-type="bibr" rid="B12-genes-03-00634">12</xref>]</td>
            </tr>
            <tr>
              <td align="left" valign="middle">
                <italic>S. proteamaculans</italic>
              </td>
              <td align="left" valign="middle">free-living</td>
              <td align="center" valign="middle">5,496</td>
              <td align="center" valign="middle">4942</td>
              <td align="center" valign="middle">12</td>
              <td align="center" valign="middle">165.67</td>
              <td align="center" valign="middle">87.1</td>
              <td align="center" valign="middle">Yes </td>
              <td align="center" valign="middle">[<xref ref-type="bibr" rid="B12-genes-03-00634">12</xref>]</td>
            </tr>
            <tr>
              <td align="left" valign="middle"><italic>B. aphidicola</italic> BAp</td>
              <td align="left" valign="middle">pea aphid obligate endosymbiont</td>
              <td align="center" valign="middle">656</td>
              <td align="center" valign="middle">574</td>
              <td align="center" valign="middle">1</td>
              <td align="center" valign="middle">126.9</td>
              <td align="center" valign="middle">86.7</td>
              <td align="center" valign="middle">No</td>
              <td align="center" valign="middle">[<xref ref-type="bibr" rid="B12-genes-03-00634">12</xref>,<xref ref-type="bibr" rid="B51-genes-03-00634">51</xref>]</td>
            </tr>
            <tr>
              <td align="left" valign="middle"><italic>B. aphidicola</italic> BCc</td>
              <td align="left" valign="middle">cedar aphid obligate endosymbiont</td>
              <td align="center" valign="middle">422</td>
              <td align="center" valign="middle">362</td>
              <td align="center" valign="middle">3</td>
              <td align="center" valign="middle">135.8</td>
              <td align="center" valign="middle">90.0</td>
              <td align="center" valign="middle">No</td>
              <td align="center" valign="middle">[<xref ref-type="bibr" rid="B52-genes-03-00634">52</xref>]</td>
            </tr>
            <tr>
              <td align="left" valign="middle"><italic>S. glossinidius</italic> str. "morsitans"</td>
              <td align="left" valign="middle">tse-tse fly facultative symbiont</td>
              <td align="center" valign="middle">4,293</td>
              <td align="center" valign="middle">2516</td>
              <td align="center" valign="middle">1501</td>
              <td align="center" valign="middle">ND</td>
              <td align="center" valign="middle">50.9</td>
              <td align="center" valign="middle">2.72</td>
              <td align="center" valign="middle">[<xref ref-type="bibr" rid="B10-genes-03-00634">10</xref>,<xref ref-type="bibr" rid="B11-genes-03-00634">11</xref>]</td>
            </tr>
            <tr>
              <td align="left" valign="middle"><italic>W. pipientis w</italic>Mel</td>
              <td align="left" valign="middle"><italic>D. melanogaster</italic> reproductive parasite</td>
              <td align="center" valign="middle">1,268</td>
              <td align="center" valign="middle">1270</td>
              <td align="center" valign="middle">94</td>
              <td align="center" valign="middle">ND</td>
              <td align="center" valign="middle">80</td>
              <td align="center" valign="middle">7.7</td>
              <td align="center" valign="middle">[<xref ref-type="bibr" rid="B44-genes-03-00634">44</xref>,<xref ref-type="bibr" rid="B53-genes-03-00634">53</xref>]</td>
            </tr>
            <tr>
              <td align="left" valign="middle"><italic>W. pipientis w</italic>Ri</td>
              <td align="left" valign="middle"><italic>D. simulans</italic> reproductive parasite</td>
              <td align="center" valign="middle">1,445</td>
              <td align="center" valign="middle">1150</td>
              <td align="center" valign="middle">114</td>
              <td align="center" valign="middle">ND</td>
              <td align="center" valign="middle">ND</td>
              <td align="center" valign="middle">10</td>
              <td align="center" valign="middle">[<xref ref-type="bibr" rid="B54-genes-03-00634">54</xref>]</td>
            </tr>
            <tr>
              <td align="left" valign="middle"><italic>W. pipientis w</italic>Pip</td>
              <td align="left" valign="middle"><italic>C. pipiens</italic> reproductive parasite</td>
              <td align="center" valign="middle">1,482</td>
              <td align="center" valign="middle">1386</td>
              <td align="center" valign="middle">97</td>
              <td align="center" valign="middle">ND</td>
              <td align="center" valign="middle">82</td>
              <td align="center" valign="middle">Yes</td>
              <td align="center" valign="middle">[<xref ref-type="bibr" rid="B55-genes-03-00634">55</xref>]</td>
            </tr>
            <tr>
              <td align="left" valign="middle"><italic>W. pipientis w</italic>Bm</td>
              <td align="left" valign="middle"><italic>B. malayi</italic> obligate endosymbiont</td>
              <td align="center" valign="middle">1,080</td>
              <td align="center" valign="middle">806</td>
              <td align="center" valign="middle">98</td>
              <td align="center" valign="middle">ND</td>
              <td align="center" valign="middle">67</td>
              <td align="center" valign="middle">5.4**</td>
              <td align="center" valign="middle">[<xref ref-type="bibr" rid="B56-genes-03-00634">56</xref>]</td>
            </tr>
            <tr>
              <td align="left" valign="middle"><italic>R. prowazekii</italic> str. Madrid E</td>
              <td align="left" valign="middle">human parasite</td>
              <td align="center" valign="middle">1,112</td>
              <td align="center" valign="middle">835</td>
              <td align="center" valign="middle">12</td>
              <td align="center" valign="middle">ND</td>
              <td align="center" valign="middle">76</td>
              <td align="center" valign="middle">0.3</td>
              <td align="center" valign="middle">[<xref ref-type="bibr" rid="B13-genes-03-00634">13</xref>]</td>
            </tr>
          </tbody>
        </table>
		<table-wrap-foot>
		<fn>
        <p>* Excluding IS elements. ** Various repeats including ISs. ND: non-determined</p>
		</fn>
		</table-wrap-foot>
      </table-wrap>
        <p>The genome comparison of both obligate and facultative <italic>S. symbiotica</italic> strains and other free-living <italic>Serratia</italic> reveals the gradual changes in gene content at different stages in the transition from free-living to endosymbiosis. <italic>S. symbiotica</italic> SCc has a moderately reduced genome (36.8%), compared to <italic>S. symbiotica</italic> SAp, and a 67.7% reduction compared to free-living <italic>Serratia</italic> such as <italic>S. proteamaculans</italic>. However, the data indicating genome decay are more extreme when the coding capacity is compared: the obligate strain SCc presents only 672 protein-coding genes and 58 pseudogenes, whereas the facultative strain SAp has 2098 genes and 550 pseudogenes. Thus, the overall coding density of strain SCc is 38.7%, the lowest among insect endosymbionts described so far. As a consequence, the IGRs are very long in this strain, with an average length of 1,672 bp, the highest among all endosymbionts analyzed. No traces of homology have been found in these IGRs compared to coding regions of other sequenced bacterial genomes. This ncDNA must represent ancient pseudogenes that are being gradually eroded until their total (or almost total) disappearance, as it has happened in ancient obligate bacterial endosymbionts [<xref ref-type="bibr" rid="B57-genes-03-00634">57</xref>]. Altogether, these data indicate that this genome is in the last steps of genomic degradation but previous to that of long-term endosymbionts such as <italic>B. aphidicola</italic>. In fact, if we substitute the size of the IGRs in <italic>S. symbiotica</italic> SCc for the size of these regions in <italic>B. aphidicola</italic> BCc (135.8 bp on average), the chromosomal length would be 771,075 bp, in the range of other obligate endosymbionts. In addition, no IS sequences remain in this strain, whereas they have been described in SAp as well as in free-living <italic>Serratia</italic>. An analysis of the genome synteny between both symbionts revealed that an important number of rearrangements have occurred in both <italic>S. symbiotica</italic> lineages: This fact is in accordance with the possibility of the presence of active mobile elements in their ancestor at the onset of the symbiosis, which are currently unidentifiable in the <italic>S. symbiotica</italic> SCc genome [<xref ref-type="bibr" rid="B12-genes-03-00634">12</xref>]. </p>
      </sec>
      <sec>
        <title>4.2. Wolbachia pipiensis, from Reproductive Parasite to Intracellular Mutualist</title>
        <p>One of the cases most extensively studied correspond to different strains of <italic>W. pipiensis</italic>, an a-proteobacterium that has established parasitic and mutualistic symbiosis with different animal hosts [<xref ref-type="bibr" rid="B58-genes-03-00634">58</xref>]. Most <italic>W. pipientis</italic> strains are reproductive parasites of insects. The complete genome of the strains that parasite <italic>Drosophila melanogaster</italic> (<italic>w</italic>Mel) [<xref ref-type="bibr" rid="B53-genes-03-00634">53</xref>], <italic>D. simulans</italic> (<italic>w</italic>Ri) [<xref ref-type="bibr" rid="B54-genes-03-00634">54</xref>], and mosquitoes from the <italic>Culex pipiens</italic> group(<italic>w</italic>Pip) [<xref ref-type="bibr" rid="B55-genes-03-00634">55</xref>,<xref ref-type="bibr" rid="B59-genes-03-00634">59</xref>] are available. Additionally, <italic>W. pipiensis w</italic>Bm was identified as the obligate mutualistic symbiont of <italic>Brugia malayi</italic>, a human filarial parasitic nematode, and its genome was also sequenced [<xref ref-type="bibr" rid="B56-genes-03-00634">56</xref>]. All these bacteria have streamlined genomes compared with other free-living a-proteobacteria, as expected for host-associated microorganism, being more extreme in the case of the mutualistic strain <italic>w</italic>Bm. However, all of them also contain high levels of repetitive DNA and mobile elements. When the genome of the strain <italic>w</italic>Mel was determined [<xref ref-type="bibr" rid="B53-genes-03-00634">53</xref>], it was found that the mobile DNA identified didn’t have homologues in other a-proteobacteria but have been found in other <italic>Wolbachia</italic> strains. Therefore, the authors proposed that it was acquired short after the separation of the <italic>Wolbachia</italic> and <italic>Rickettsia</italic> lineages but before the radiation of the <italic>Wolbachia</italic> group. Another study revealed that most of the differences in genome size between <italic>w</italic>Mel with <italic>w</italic>Pip come from repetitive and mobile elements (not only ISs, but also prophages), and there is almost no overlapping between both genomes, which might be due to a recent invasion after the separation of both lineages or to different losses after such event [<xref ref-type="bibr" rid="B55-genes-03-00634">55</xref>]. Although many of the mobile elements seem to be defective, they must have played a key role in shaping the evolution of this genus, and are probably responsible for most of the genome rearrangements found among strains and compared with free-living relatives, similar to what has happened in <italic>S. symbiotica</italic> [<xref ref-type="bibr" rid="B12-genes-03-00634">12</xref>]. The comparison with <italic>W. pipiensis w</italic>Bm revealed that it also shares some repetitive elements with <italic>w</italic>Mel, although they are considerably less abundant in the mutualistic symbiont (5.4%) [<xref ref-type="bibr" rid="B56-genes-03-00634">56</xref>]. Additionally, the <italic>w</italic>Bm genome has an extremely low density of predicted functional genes, similar to what has been found in <italic>R. prowazekii</italic> [<xref ref-type="bibr" rid="B13-genes-03-00634">13</xref>] and <italic>M. leprae</italic> [<xref ref-type="bibr" rid="B8-genes-03-00634">8</xref>]. As in these other genomes, the main reason for such low-coding density is the presence of a considerable number of pseudogenes (which, even so, were underestimated at that time, since many putative ORFs correspond to fragmented former genes).</p>
      </sec>
      <sec>
        <title>4.3. The Sodalis-like Group of Symbionts, at the Transition Point from Facultative to Obligate Symbionts</title>
        <p>Phylogenetic analysis demonstrated the close relationship between <italic>S. glossinidius</italic>, secondary symbiont of the tse-tse fly and the primary endosymbiont of grain weevils of the genus <italic>Sitophilus</italic>, as it is the case of SOPE and SZPE, endosymbionts of rice and maize weevils, <italic>S. oryzae</italic> and <italic>S. zeamays</italic>, respectively. <italic>Sitophilus</italic> symbionts and <italic>S. glossinidius</italic> provide a good model for the study of the evolutionary transition from facultative to obligate-mutualism, because the divergence of these lineages from a common ancestor has produced two different symbiotic outcomes. The genome sequencing and reannotation of <italic>S. glossinidius</italic>, showed that about a third of it is composed of inactivated genes in different degrees of disintegration, and it also contains a limited amount of IS elements of five different types, representing 2,72% of the genome [<xref ref-type="bibr" rid="B10-genes-03-00634">10</xref>,<xref ref-type="bibr" rid="B11-genes-03-00634">11</xref>]. Among the 1501 identified pseudogenes, only 18 were originated by the insertion of an IS element. This big amount of pesudogenes confirms the very recent symbiotic association with their insect host [<xref ref-type="bibr" rid="B60-genes-03-00634">60</xref>,<xref ref-type="bibr" rid="B61-genes-03-00634">61</xref>], while it is clear that ISs have not been determinant in the degeneration of the genome. </p>
        <p>Although the genome sequence of SOPE is not yet available, preliminary studies showed that this genome has been massively invaded by IS elements, which have been estimated to represent more than 20% of the genome [<xref ref-type="bibr" rid="B14-genes-03-00634">14</xref>]. This is the most extreme case of IS abundance in any known bacterial genome. The presence of a Sodalis-like g-proteobacteria in cereal weevil lineages has been explained by a replacement of the ancestral endosymbiont <italic>Nardonella</italic> (present in the rest of the members of the family to which the rice weevil belongs) less than 25 million years ago [<xref ref-type="bibr" rid="B62-genes-03-00634">62</xref>,<xref ref-type="bibr" rid="B63-genes-03-00634">63</xref>], thus being a very young obligate mutualist. Therefore, it seems that after the establishment of the obligate symbiosis, the evolutionary path taken by SOPE differs from the observed in <italic>S. glossinidius</italic>, with a massive proliferation of at least four types of IS elements. Only two of these elements are shared with <italic>S. glossinidius,</italic> which as in the case of <italic>Wolbachia</italic>, indicates that they were present before the split of the two lineages, and the other two must have either been acquired by HGT in the ancestor of SOPE, or lost in <italic>S. glossinidius</italic>. The importance of the ISs in the process of pseudogenization can not be completely estimated until the full genome is available. Although it is known that many genes are interrupted by ISs, it is first necessary to determine if the interruption occurred on genes that were already pseudogenized in the common ancestor of <italic>Sodalis</italic> and SOPE, which seems to be the case. Preliminary studies performed in our group indicate that many genes inactivated by IS insertions encode hypothetical proteins or proteins of unknown function, as well as transposases and bacteriophage related proteins [<xref ref-type="bibr" rid="B14-genes-03-00634">14</xref>] and that a high proportion of them appear already as pseudogenes in <italic>Sodalis</italic> (unpublished results). In any case, it is clear that ISs must be involved in genomic recombination events leading to the loss of the region between two elements. Thus, similar to what is found in SOPE, ISs must be a key factor in the genome degradation that occurs in the first stages of integration to intracellular life even though the random pseudogenization of genes that have become non-essential started before their proliferation.</p>
      </sec>
    </sec>
    <sec>
      <title>5. Concluding Remarks: Dynamics of Junk-DNA during the Evolutionary Reduction Process</title>
      <p>Comparative studies have revealed that bacterial genomes are under selective pressures that have a deep impact on their shape and gene content, including a previously unexpected degree of diversity in gene repertoires within and between species. In addition to a set of genes that are shared by all members of a monophyletic group, there are a considerable number of genes and other functional features that can be highly specific depending on the environmental context. The evolution of these gene repertoires can be explained by processes of gene gains through HGT and duplications, and gene loses, through pseudogenization and gene excision [<xref ref-type="bibr" rid="B28-genes-03-00634">28</xref>]. Since it is well known that HGT has a great impact on prokaryotes [<xref ref-type="bibr" rid="B64-genes-03-00634">64</xref>], reductive evolutionary processes due to gene degradation and elimination must also occur very frequently in order to maintain compact genomes [<xref ref-type="bibr" rid="B31-genes-03-00634">31</xref>,<xref ref-type="bibr" rid="B65-genes-03-00634">65</xref>]. Pseudogenes must be quickly removed from the genomes, as it can be deduced from the small proportion of them those that are present simultaneously in different strains. When mutations occur in genes that are no longer needed as an adaptation to particular living conditions, the inactivated genes can remain in the genome for some time and they gradually erode in a random manner until they are completely removed [<xref ref-type="bibr" rid="B57-genes-03-00634">57</xref>,<xref ref-type="bibr" rid="B66-genes-03-00634">66</xref>], because within bacteria (as well as within several eukaryotes) there is a mutational bias toward deletions over insertions [<xref ref-type="bibr" rid="B31-genes-03-00634">31</xref>,<xref ref-type="bibr" rid="B67-genes-03-00634">67</xref>,<xref ref-type="bibr" rid="B68-genes-03-00634">68</xref>,<xref ref-type="bibr" rid="B69-genes-03-00634">69</xref>]. </p>
      <p>The dynamics of degradation and elimination of junk sequences has been studied in intracellular specialists by comparative genomics with their free-living relatives (<xref ref-type="fig" rid="genes-03-00634-f001">Figure 1</xref>). Free-living bacteria have larger genomes with a moderate content of repeated sequences and self-propagating DNA, such as transposons, bacteriophage, a moderate amount of pseudogenes (usually between 1% and 5%) and relatively short IGRs. In the transition from free-living to host-dependent lifestyles, junk DNA increases their content, leading some times to an expansion of genome size parallel to a decrease in gene-coding capacity. The accumulation of pseudogenes in different degrees of decay (sometimes leading to long IGRs in which no traces of former pseudogenes can be detected except based on genome synteny studies), as well as the accumulation and losses of IS elements in the genomes of these bacteria can be explained by genetic and population factors. At the beginning of the association, the new rich, protected and stable niche provided by the host makes unnecessary or redundant some gene functions. Due to a decreased efficiency of the purifying selection, they can rapidly accumulate slightly deleterious mutations on genes that have become non-essential. The inefficiency of the purifying selection also explains the increase in the amount of IS elements present in these genomes at the onset of symbiosis, which in some cases seems to be accompanied by an increase in the transposition rate and the concomitant production of pseudogenes created by IS insertions. At the same time, the random genetic drift increases due to a drastic reduction of the bacterial effective population size when they are transmitted from one host to another. The uncontrolled proliferation of IS elements can lead to the loss of large stretches of genomic DNA by unequal recombination when they appear in direct orientation. This was one of the predicted effects in the model for the reductive syndrome of endosymbiont genomes, which takes place in two stages: first, a massive reduction and a second stage of pseudogenization and progressive disappearing of non-functional sequences by gene-by-gene erosion and deletion, leading to the highly compact and streamlined genomes of symbionts with a long established obligatory intracellular relationship with their hosts. IS elements also suffer the accumulation of mutations that render them inactive for transposition, thus becoming real junk DNA. The process occurs in a random manner so that some genomes can accumulate dramatic amounts of ISs (as in the case of SOPE) or lose most of them at the end of the first stage. The possibility for rapid genome degradation by means of deletion events is unlikely in the latter genomes and therefore, as it is the case of <italic>S. symbiotica</italic> SCc, they enter the second step of gene-by-gene degeneration while still retaining great amounts of pseudogenes which, with evolutionary time, will appear as long IGRs. </p>
      <fig id="genes-03-00634-f001" position="float">
        <label>Figure 1</label>
        <caption>
          <p>Dynamics of gain and loss of junk DNA in bacteria that establishes a host-dependent lifestyle. Details are given in the text. Orange arrows: essential genes; pink arrows: non-essential genes; white arrows: pseudogenes; blue and green boxes ISs that are active (dark colour) or inactive (light colour); light purple boxes: IGRs. </p>
        </caption>
        <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="genes-03-00634-g001.tif"/>
      </fig>
    </sec>
  </body>
  <back>
    <ack>
      <title>Acknowledgments</title>
      <p>Financial support was provided by Grant BFU2009-12895-C02-01/BMC (Ministerio de Educación y Ciencia, Spain) to A. Latorre and Prometeo/2009/092 (Conselleria d’Educació, Generalitat Valenciana, Spain). </p>
    </ack>
    <ref-list>
      <title>References and Notes</title>
      <ref id="B1-genes-03-00634">
        <label>1.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gregory</surname>
              <given-names>T.R.</given-names>
            </name>
            <name>
              <surname>Nicol</surname>
              <given-names>J.A.</given-names>
            </name>
            <name>
              <surname>Tamm</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Kullman</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Kullman</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Leitch</surname>
              <given-names>I.J.</given-names>
            </name>
            <name>
              <surname>Murray</surname>
              <given-names>B.G.</given-names>
            </name>
            <name>
              <surname>Kapraun</surname>
              <given-names>D.F.</given-names>
            </name>
            <name>
              <surname>Greilhuber</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Bennett</surname>
              <given-names>M.D.</given-names>
            </name>
          </person-group>
          <article-title>Eukaryotic genome size databases</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2007</year>
          <volume>35</volume>
          <fpage>D332</fpage>
          <lpage>D338</lpage>
        <pub-id pub-id-type="doi">10.1093/nar/gkl828</pub-id><pub-id pub-id-type="pmid">17090588</pub-id></citation>
      </ref>
      <ref id="B2-genes-03-00634">
        <label>2.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gregory</surname>
              <given-names>T.R.</given-names>
            </name>
          </person-group>
          <article-title>Synergy between sequence and size in large-scale genomics</article-title>
          <source>Nat. Rev. Genet.</source>
          <year>2005</year>
          <volume>6</volume>
          <fpage>699</fpage>
          <lpage>708</lpage>
          <pub-id pub-id-type="doi">10.1038/nrg1674</pub-id>
        </citation>
      </ref>
      <ref id="B3-genes-03-00634">
        <label>3.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ohno</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>So much "Junk" DNA in our genome</article-title>
          <source>Brookhaven Symp. Biol.</source>
          <year>1972</year>
          <volume>23</volume>
          <fpage>366</fpage>
          <lpage>370</lpage>
        <pub-id pub-id-type="pmid">5065367</pub-id></citation>
      </ref>
      <ref id="B4-genes-03-00634">
        <label>4.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Orgel</surname>
              <given-names>L.E.</given-names>
            </name>
            <name>
              <surname>Crick</surname>
              <given-names>F.H.</given-names>
            </name>
          </person-group>
          <article-title>Selfish DNA: The ultimate parasite</article-title>
          <source>Nature</source>
          <year>1980</year>
          <volume>284</volume>
          <fpage>604</fpage>
          <lpage>607</lpage>
          <pub-id pub-id-type="doi">10.1038/284604a0</pub-id>
        </citation>
      </ref>
      <ref id="B5-genes-03-00634">
        <label>5.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Delaye</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Gil</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Pereto</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Latorre</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Moya</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Life with a few genes: A survey on naturally evolved reduced genomes</article-title>
          <source>Open Evol. J.</source>
          <year>2010</year>
          <volume>4</volume>
          <fpage>12</fpage>
          <lpage>22</lpage>
          <pub-id pub-id-type="doi">10.2174/1874404401004010012</pub-id>
        </citation>
      </ref>
      <ref id="B6-genes-03-00634">
        <label>6.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Achaz</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Coissac</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Netter</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Rocha</surname>
              <given-names>E.P.</given-names>
            </name>
          </person-group>
          <article-title>Associations between inverted repeats and the structural evolution of bacterial genomes</article-title>
          <source>Genetics</source>
          <year>2003</year>
          <volume>164</volume>
          <fpage>1279</fpage>
          <lpage>1289</lpage>
        <pub-id pub-id-type="pmid">12930739</pub-id></citation>
      </ref>
      <ref id="B7-genes-03-00634">
        <label>7.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Giovannoni</surname>
              <given-names>S.J.</given-names>
            </name>
            <name>
              <surname>Tripp</surname>
              <given-names>H.J.</given-names>
            </name>
            <name>
              <surname>Givan</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Podar</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Vergin</surname>
              <given-names>K.L.</given-names>
            </name>
            <name>
              <surname>Baptista</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Bibbs</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Eads</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Richardson</surname>
              <given-names>T.H.</given-names>
            </name>
            <name>
              <surname>Noordewier</surname>
              <given-names>M.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Genome streamlining in a cosmopolitan oceanic bacterium</article-title>
          <source>Science</source>
          <year>2005</year>
          <volume>309</volume>
          <fpage>1242</fpage>
          <lpage>1245</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1114057</pub-id><pub-id pub-id-type="pmid">16109880</pub-id></citation>
      </ref>
      <ref id="B8-genes-03-00634">
        <label>8.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cole</surname>
              <given-names>S.T.</given-names>
            </name>
            <name>
              <surname>Eiglmeier</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Parkhill</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>James</surname>
              <given-names>K.D.</given-names>
            </name>
            <name>
              <surname>Thomson</surname>
              <given-names>N.R.</given-names>
            </name>
            <name>
              <surname>Wheeler</surname>
              <given-names>P.R.</given-names>
            </name>
            <name>
              <surname>Honore</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Garnier</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Churcher</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Harris</surname>
              <given-names>D.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Massive gene decay in the leprosy bacillus</article-title>
          <source>Nature</source>
          <year>2001</year>
          <volume>409</volume>
          <fpage>1007</fpage>
          <lpage>1011</lpage>
          <pub-id pub-id-type="doi">10.1038/35059006</pub-id>
        </citation>
      </ref>
      <ref id="B9-genes-03-00634">
        <label>9.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Burke</surname>
              <given-names>G.R.</given-names>
            </name>
            <name>
              <surname>Moran</surname>
              <given-names>N.A.</given-names>
            </name>
          </person-group>
          <article-title>Massive genomic decay in <italic>Serratia symbiotica</italic>, a recently evolved symbiont of aphids</article-title>
          <source>Genome Biol. Evol.</source>
          <year>2011</year>
          <volume>3</volume>
          <fpage>195</fpage>
          <lpage>208</lpage>
          <pub-id pub-id-type="doi">10.1093/gbe/evr002</pub-id>
        </citation>
      </ref>
      <ref id="B10-genes-03-00634">
        <label>10.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Toh</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Weiss</surname>
              <given-names>B.L.</given-names>
            </name>
            <name>
              <surname>Perkin</surname>
              <given-names>S.A.</given-names>
            </name>
            <name>
              <surname>Yamashita</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Oshima</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Hattori</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Aksoy</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>Massive genome erosion and functional adaptations provide insights into the symbiotic lifestyle of <italic>Sodalis glossinidius</italic> in the tsetse host</article-title>
          <source>Genome Res.</source>
          <year>2006</year>
          <volume>16</volume>
          <fpage>149</fpage>
          <lpage>156</lpage>
        <pub-id pub-id-type="pmid">16365377</pub-id></citation>
      </ref>
      <ref id="B11-genes-03-00634">
        <label>11.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Belda</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Moya</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Bentley</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Silva</surname>
              <given-names>F.J.</given-names>
            </name>
          </person-group>
          <article-title>Mobile genetic element proliferation and gene inactivation impact over the genome structure and metabolic capabilities of <italic>Sodalis glossinidius</italic>, the secondary endosymbiont of tsetse flies</article-title>
          <source>BMC Genomics</source>
          <year>2010</year>
          <volume>11</volume>
          <fpage>449</fpage>
          <pub-id pub-id-type="doi">10.1186/1471-2164-11-449</pub-id>
        </citation>
      </ref>
      <ref id="B12-genes-03-00634">
        <label>12.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lamelas</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Gosalbes</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Manzano-Marin</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Pereto</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Moya</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Latorre</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title><italic>Serratia symbiotica</italic> from the aphid <italic>Cinara cedri</italic>: A missing link from facultative to obligate insect endosymbiont</article-title>
          <source>PLoS Genet.</source>
          <year>2011</year>
          <volume>7</volume>
          <fpage>e1002357</fpage>
          <pub-id pub-id-type="doi">10.1371/journal.pgen.1002357</pub-id>
        </citation>
      </ref>
      <ref id="B13-genes-03-00634">
        <label>13.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Andersson</surname>
              <given-names>S.G.</given-names>
            </name>
            <name>
              <surname>Zomorodipour</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Andersson</surname>
              <given-names>J.O.</given-names>
            </name>
            <name>
              <surname>Sicheritz-Ponten</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Alsmark</surname>
              <given-names>U.C.</given-names>
            </name>
            <name>
              <surname>Podowski</surname>
              <given-names>R.M.</given-names>
            </name>
            <name>
              <surname>Naslund</surname>
              <given-names>A.K.</given-names>
            </name>
            <name>
              <surname>Eriksson</surname>
              <given-names>A.S.</given-names>
            </name>
            <name>
              <surname>Winkler</surname>
              <given-names>H.H.</given-names>
            </name>
            <name>
              <surname>Kurland</surname>
              <given-names>C.G.</given-names>
            </name>
          </person-group>
          <article-title>The genome sequence of <italic>Rickettsia prowazekii</italic> and the origin of mitochondria</article-title>
          <source>Nature</source>
          <year>1998</year>
          <volume>396</volume>
          <fpage>133</fpage>
          <lpage>140</lpage>
          <pub-id pub-id-type="doi">10.1038/24094</pub-id>
        </citation>
      </ref>
      <ref id="B14-genes-03-00634">
        <label>14.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gil</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Belda</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Gosalbes</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Delaye</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Vallier</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Vincent-Monegat</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Heddi</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Silva</surname>
              <given-names>F.J.</given-names>
            </name>
            <name>
              <surname>Moya</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Latorre</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Massive presence of insertion sequences in the genome of SOPE, the primary endosymbiont of the rice weevil <italic>Sitophilus oryzae</italic></article-title>
          <source>Int. Microbiol.</source>
          <year>2008</year>
          <volume>11</volume>
          <fpage>41</fpage>
          <lpage>48</lpage>
        <pub-id pub-id-type="pmid">18683631</pub-id></citation>
      </ref>
      <ref id="B15-genes-03-00634">
        <label>15.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Brussow</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Canchaya</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Hardt</surname>
              <given-names>W.D.</given-names>
            </name>
          </person-group>
          <article-title>Phages and the evolution of bacterial pathogens: From genomic rearrangements to lysogenic conversion</article-title>
          <source>Microbiol. Mol. Biol. Rev.</source>
          <year>2004</year>
          <volume>68</volume>
          <fpage>560</fpage>
          <lpage>602</lpage>
          <pub-id pub-id-type="doi">10.1128/MMBR.68.3.560-602.2004</pub-id>
        </citation>
      </ref>
      <ref id="B16-genes-03-00634">
        <label>16.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Oliver</surname>
              <given-names>K.M.</given-names>
            </name>
            <name>
              <surname>Degnan</surname>
              <given-names>P.H.</given-names>
            </name>
            <name>
              <surname>Hunter</surname>
              <given-names>M.S.</given-names>
            </name>
            <name>
              <surname>Moran</surname>
              <given-names>N.A.</given-names>
            </name>
          </person-group>
          <article-title>Bacteriophages encode factors required for protection in a symbiotic mutualism</article-title>
          <source>Science</source>
          <year>2009</year>
          <volume>325</volume>
          <fpage>992</fpage>
          <lpage>994</lpage>
          <pub-id pub-id-type="doi">10.1126/science.1174463</pub-id>
        </citation>
      </ref>
      <ref id="B17-genes-03-00634">
        <label>17.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lima-Mendez</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Van Helden</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Toussaint</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Leplae</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>Prophinder: A computational tool for prophage prediction in prokaryotic genomes</article-title>
          <source>Bioinformatics</source>
          <year>2008</year>
          <volume>24</volume>
          <fpage>863</fpage>
          <lpage>865</lpage>
          <pub-id pub-id-type="doi">10.1093/bioinformatics/btn043</pub-id>
        </citation>
      </ref>
      <ref id="B18-genes-03-00634">
        <label>18.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zhou</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Liang</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Lynch</surname>
              <given-names>K.H.</given-names>
            </name>
            <name>
              <surname>Dennis</surname>
              <given-names>J.J.</given-names>
            </name>
            <name>
              <surname>Wishart</surname>
              <given-names>D.S.</given-names>
            </name>
          </person-group>
          <article-title>PHAST: A fast phage search tool</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2011</year>
          <volume>39</volume>
          <fpage>W347</fpage>
          <lpage>W352</lpage>
          <pub-id pub-id-type="doi">10.1093/nar/gkr485</pub-id>
        </citation>
      </ref>
      <ref id="B19-genes-03-00634">
        <label>19.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Akhter</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Aziz</surname>
              <given-names>R.K.</given-names>
            </name>
            <name>
              <surname>Edwards</surname>
              <given-names>R.A.</given-names>
            </name>
          </person-group>
          <article-title><italic>PhiSpy</italic>: A novel algorithm for finding prophages in bacterial genomes that combines similarity- and composition-based strategies</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2012</year>
          <pub-id pub-id-type="doi">10.1093/nar/gks406</pub-id>
        </citation>
      </ref>
      <ref id="B20-genes-03-00634">
        <label>20.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Canchaya</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Proux</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Fournous</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Bruttin</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Brussow</surname>
              <given-names>H.</given-names>
            </name>
          </person-group>
          <article-title>Prophage genomics</article-title>
          <source>Microbiol. Mol. Biol. Rev.</source>
          <year>2003</year>
          <volume>67</volume>
          <fpage>238</fpage>
          <lpage>276</lpage>
          <pub-id pub-id-type="doi">10.1128/MMBR.67.2.238-276.2003</pub-id>
        </citation>
      </ref>
      <ref id="B21-genes-03-00634">
        <label>21.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Moya</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Pereto</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Gil</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Latorre</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Learning how to live together: Genomic insights into prokaryote-animal symbioses</article-title>
          <source>Nat. Rev. Genet.</source>
          <year>2008</year>
          <volume>9</volume>
          <fpage>218</fpage>
          <lpage>229</lpage>
          <pub-id pub-id-type="doi">10.1038/nrg2319</pub-id>
        </citation>
      </ref>
      <ref id="B22-genes-03-00634">
        <label>22.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Moran</surname>
              <given-names>N.A.</given-names>
            </name>
            <name>
              <surname>McCutcheon</surname>
              <given-names>J.P.</given-names>
            </name>
            <name>
              <surname>Nakabachi</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Genomics and evolution of heritable bacterial symbionts</article-title>
          <source>Annu. Rev. Genet.</source>
          <year>2008</year>
          <volume>42</volume>
          <fpage>165</fpage>
          <lpage>190</lpage>
          <pub-id pub-id-type="doi">10.1146/annurev.genet.41.110306.130119</pub-id>
        </citation>
      </ref>
      <ref id="B23-genes-03-00634">
        <label>23.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Chain</surname>
              <given-names>P.S.</given-names>
            </name>
            <name>
              <surname>Carniel</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Larimer</surname>
              <given-names>F.W.</given-names>
            </name>
            <name>
              <surname>Lamerdin</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Stoutland</surname>
              <given-names>P.O.</given-names>
            </name>
            <name>
              <surname>Regala</surname>
              <given-names>W.M.</given-names>
            </name>
            <name>
              <surname>Georgescu</surname>
              <given-names>A.M.</given-names>
            </name>
            <name>
              <surname>Vergez</surname>
              <given-names>L.M.</given-names>
            </name>
            <name>
              <surname>Land</surname>
              <given-names>M.L.</given-names>
            </name>
            <name>
              <surname>Motin</surname>
              <given-names>V.L.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Insights into the evolution of <italic>Yersinia pestis</italic> through whole-genome comparison with <italic>Yersinia. pseudotuberculosis</italic></article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2004</year>
          <volume>101</volume>
          <fpage>13826</fpage>
          <lpage>13831</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.0404012101</pub-id><pub-id pub-id-type="pmid">15358858</pub-id></citation>
      </ref>
      <ref id="B24-genes-03-00634">
        <label>24.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ogata</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Audic</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Renesto-Audiffren</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Fournier</surname>
              <given-names>P.E.</given-names>
            </name>
            <name>
              <surname>Barbe</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Samson</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Roux</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Cossart</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Weissenbach</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Claverie</surname>
              <given-names>J.M.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Mechanisms of evolution in <italic>Rickettsia conorii</italic> and <italic>R. prowazekii</italic></article-title>
          <source>Science</source>
          <year>2001</year>
          <volume>293</volume>
          <fpage>2093</fpage>
          <lpage>2098</lpage>
          <pub-id pub-id-type="doi">10.1126/science.1061471</pub-id>
        </citation>
      </ref>
      <ref id="B25-genes-03-00634">
        <label>25.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lerat</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Ochman</surname>
              <given-names>H.</given-names>
            </name>
          </person-group>
          <article-title>Recognizing the pseudogenes in bacterial genomes</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2005</year>
          <volume>33</volume>
          <fpage>3125</fpage>
          <lpage>3132</lpage>
          <pub-id pub-id-type="doi">10.1093/nar/gki631</pub-id>
        </citation>
      </ref>
      <ref id="B26-genes-03-00634">
        <label>26.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Liu</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Harrison</surname>
              <given-names>P.M.</given-names>
            </name>
            <name>
              <surname>Kunin</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Gerstein</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Comprehensive analysis of pseudogenes in prokaryotes: Widespread gene decay and failure of putative horizontally transferred genes</article-title>
          <source>Genome Biol.</source>
          <year>2004</year>
          <volume>5</volume>
          <fpage>R64</fpage>
          <pub-id pub-id-type="doi">10.1186/gb-2004-5-9-r64</pub-id>
        </citation>
      </ref>
      <ref id="B27-genes-03-00634">
        <label>27.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Nei</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Kumar</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <source>Molecular Evolution and Phylogenetics</source>
          <publisher-name>Oxford University Press</publisher-name>
          <publisher-loc>New York, NY, USA</publisher-loc>
          <year>2000</year>
        </citation>
      </ref>
      <ref id="B28-genes-03-00634">
        <label>28.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Abby</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Daubin</surname>
              <given-names>V.</given-names>
            </name>
          </person-group>
          <article-title>Comparative genomics and the evolution of prokaryotes</article-title>
          <source>Trends Microbiol.</source>
          <year>2007</year>
          <volume>15</volume>
          <fpage>135</fpage>
          <lpage>141</lpage>
          <pub-id pub-id-type="doi">10.1016/j.tim.2007.01.007</pub-id>
        </citation>
      </ref>
      <ref id="B29-genes-03-00634">
        <label>29.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cole</surname>
              <given-names>S.T.</given-names>
            </name>
            <name>
              <surname>Brosch</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Parkhill</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Garnier</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Churcher</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Harris</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Gordon</surname>
              <given-names>S.V.</given-names>
            </name>
            <name>
              <surname>Eiglmeier</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Gas</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Barry</surname>
              <given-names>C.E.</given-names>
              <suffix>3rd</suffix>
            </name>
            <etal/>
          </person-group>
          <article-title>Deciphering the biology of <italic>Mycobacterium tuberculosis</italic> from the complete genome sequence</article-title>
          <source>Nature</source>
          <year>1998</year>
          <volume>393</volume>
          <fpage>537</fpage>
          <lpage>544</lpage>
          <pub-id pub-id-type="doi">10.1038/31159</pub-id>
        </citation>
      </ref>
      <ref id="B30-genes-03-00634">
        <label>30.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gomez-Valero</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Rocha</surname>
              <given-names>E.P.</given-names>
            </name>
            <name>
              <surname>Latorre</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Silva</surname>
              <given-names>F.J.</given-names>
            </name>
          </person-group>
          <article-title>Reconstructing the ancestor of <italic>Mycobacterium leprae</italic>: The dynamics of gene loss and genome reduction</article-title>
          <source>Genome Res.</source>
          <year>2007</year>
          <volume>17</volume>
          <fpage>1178</fpage>
          <lpage>1185</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.6360207</pub-id>
        </citation>
      </ref>
      <ref id="B31-genes-03-00634">
        <label>31.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Andersson</surname>
              <given-names>J.O.</given-names>
            </name>
            <name>
              <surname>Andersson</surname>
              <given-names>S.G.</given-names>
            </name>
          </person-group>
          <article-title>Pseudogenes, junk DNA, and the dynamics of <italic>Rickettsia</italic> genomes</article-title>
          <source>Mol. Biol. Evol.</source>
          <year>2001</year>
          <volume>18</volume>
          <fpage>829</fpage>
          <lpage>839</lpage>
          <pub-id pub-id-type="doi">10.1093/oxfordjournals.molbev.a003864</pub-id>
        </citation>
      </ref>
      <ref id="B32-genes-03-00634">
        <label>32.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Perez-Brocal</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Latorre</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Gil</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Moya</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Comparative analysis of two genomic regions among four strains of <italic>Buchnera aphidicola</italic>, primary endosymbiont of aphids</article-title>
          <source>Gene</source>
          <year>2005</year>
          <volume>345</volume>
          <fpage>73</fpage>
          <lpage>80</lpage>
          <pub-id pub-id-type="doi">10.1016/j.gene.2004.11.021</pub-id>
        </citation>
      </ref>
      <ref id="B33-genes-03-00634">
        <label>33.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Dale</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Moran</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Ochman</surname>
              <given-names>H.</given-names>
            </name>
          </person-group>
          <article-title>Loss of DNA recombinational repair enzymes in the initial stages of genome degeneration</article-title>
          <source>Mol. Biol. Evol.</source>
          <year>2003</year>
          <volume>20</volume>
          <fpage>1188</fpage>
          <lpage>1194</lpage>
          <pub-id pub-id-type="doi">10.1093/molbev/msg138</pub-id>
        </citation>
      </ref>
      <ref id="B34-genes-03-00634">
        <label>34.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Gil</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Latorre</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Moya</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Evolution of prokaryote-animal symbiosis from a genomics perspective</article-title>
          <source>(Endo) symbiotic Methanogenic Archaea</source>
          <person-group person-group-type="editor">
            <name>
              <surname>Hackstein</surname>
              <given-names>J.H.P.</given-names>
            </name>
          </person-group>
          <publisher-name>Springer-Verlag</publisher-name>
          <publisher-loc>Berlin, Germany</publisher-loc>
          <year>2010</year>
          <fpage>207</fpage>
          <lpage>233</lpage>
        </citation>
      </ref>
      <ref id="B35-genes-03-00634">
        <label>35.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Mahillon</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Chandler</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Insertion sequences</article-title>
          <source>Microbiol. Mol. Biol. Rev.</source>
          <year>1998</year>
          <volume>62</volume>
          <fpage>725</fpage>
          <lpage>774</lpage>
        <pub-id pub-id-type="pmid">9729608</pub-id></citation>
      </ref>
      <ref id="B36-genes-03-00634">
        <label>36.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Siguier</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Perochon</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Lestrade</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Mahillon</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Chandler</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>ISfinder: The reference centre for bacterial insertion sequences</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2006</year>
          <volume>34</volume>
          <fpage>D32</fpage>
          <lpage>36</lpage>
          <pub-id pub-id-type="doi">10.1093/nar/gkj014</pub-id>
        </citation>
      </ref>
      <ref id="B37-genes-03-00634">
        <label>37.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Nagy</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Chandler</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Regulation of transposition in bacteria</article-title>
          <source>Res. Microbiol.</source>
          <year>2004</year>
          <volume>155</volume>
          <fpage>387</fpage>
          <lpage>398</lpage>
          <pub-id pub-id-type="doi">10.1016/j.resmic.2004.01.008</pub-id>
        </citation>
      </ref>
      <ref id="B38-genes-03-00634">
        <label>38.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wagner</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Lewis</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Bichsel</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>A survey of bacterial insertion sequences using IScan</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2007</year>
          <volume>35</volume>
          <fpage>5284</fpage>
          <lpage>5293</lpage>
          <pub-id pub-id-type="doi">10.1093/nar/gkm597</pub-id>
        </citation>
      </ref>
      <ref id="B39-genes-03-00634">
        <label>39.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Sawyer</surname>
              <given-names>S.A.</given-names>
            </name>
            <name>
              <surname>Dykhuizen</surname>
              <given-names>D.E.</given-names>
            </name>
            <name>
              <surname>DuBose</surname>
              <given-names>R.F.</given-names>
            </name>
            <name>
              <surname>Green</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Mutangadura-Mhlanga</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Wolczyk</surname>
              <given-names>D.F.</given-names>
            </name>
            <name>
              <surname>Hartl</surname>
              <given-names>D.L.</given-names>
            </name>
          </person-group>
          <article-title>Distribution and abundance of insertion sequences among natural isolates of <italic>Escherichia coli</italic></article-title>
          <source>Genetics</source>
          <year>1987</year>
          <volume>115</volume>
          <fpage>51</fpage>
          <lpage>63</lpage>
        <pub-id pub-id-type="pmid">3030884</pub-id></citation>
      </ref>
      <ref id="B40-genes-03-00634">
        <label>40.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lawrence</surname>
              <given-names>J.G.</given-names>
            </name>
            <name>
              <surname>Ochman</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Hartl</surname>
              <given-names>D.L.</given-names>
            </name>
          </person-group>
          <article-title>The evolution of insertion sequences within enteric bacteria</article-title>
          <source>Genetics</source>
          <year>1992</year>
          <volume>131</volume>
          <fpage>9</fpage>
          <lpage>20</lpage>
        <pub-id pub-id-type="pmid">1317318</pub-id></citation>
      </ref>
      <ref id="B41-genes-03-00634">
        <label>41.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wagner</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Periodic extinctions of transposable elements in bacterial lineages: Evidence from intragenomic variation in multiple genomes</article-title>
          <source>Mol. Biol. Evol.</source>
          <year>2006</year>
          <volume>23</volume>
          <fpage>723</fpage>
          <lpage>733</lpage>
          <pub-id pub-id-type="doi">10.1093/molbev/msj085</pub-id>
        </citation>
      </ref>
      <ref id="B42-genes-03-00634">
        <label>42.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Touchon</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Rocha</surname>
              <given-names>E.P.</given-names>
            </name>
          </person-group>
          <article-title>Causes of insertion sequences abundance in prokaryotic genomes</article-title>
          <source>Mol. Biol. Evol.</source>
          <year>2007</year>
          <volume>24</volume>
          <fpage>969</fpage>
          <lpage>981</lpage>
          <pub-id pub-id-type="doi">10.1093/molbev/msm014</pub-id>
        </citation>
      </ref>
      <ref id="B43-genes-03-00634">
        <label>43.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Newton</surname>
              <given-names>I.L.</given-names>
            </name>
            <name>
              <surname>Bordenstein</surname>
              <given-names>S.R.</given-names>
            </name>
          </person-group>
          <article-title>Correlations between bacterial ecology and mobile DNA</article-title>
          <source>Curr. Microbiol. 62</source>
          <year>2010</year>
          <fpage>198</fpage>
          <lpage>208</lpage>
        </citation>
      </ref>
      <ref id="B44-genes-03-00634">
        <label>44.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Bordenstein</surname>
              <given-names>S.R.</given-names>
            </name>
            <name>
              <surname>Reznikoff</surname>
              <given-names>W.S.</given-names>
            </name>
          </person-group>
          <article-title>Mobile DNA in obligate intracellular bacteria</article-title>
          <source>Nat. Rev. Microbiol.</source>
          <year>2005</year>
          <volume>3</volume>
          <fpage>688</fpage>
          <lpage>699</lpage>
          <pub-id pub-id-type="doi">10.1038/nrmicro1233</pub-id>
        </citation>
      </ref>
      <ref id="B45-genes-03-00634">
        <label>45.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Dale</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Moran</surname>
              <given-names>N.A.</given-names>
            </name>
          </person-group>
          <article-title>Molecular interactions between bacterial symbionts and their hosts</article-title>
          <source>Cell.</source>
          <year>2006</year>
          <volume>126</volume>
          <fpage>453</fpage>
          <lpage>465</lpage>
          <pub-id pub-id-type="doi">10.1016/j.cell.2006.07.014</pub-id>
        </citation>
      </ref>
      <ref id="B46-genes-03-00634">
        <label>46.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Moran</surname>
              <given-names>N.A.</given-names>
            </name>
            <name>
              <surname>Plague</surname>
              <given-names>G.R.</given-names>
            </name>
          </person-group>
          <article-title>Genomic changes following host restriction in bacteria</article-title>
          <source>Curr. Opin. Genet. Dev.</source>
          <year>2004</year>
          <volume>14</volume>
          <fpage>627</fpage>
          <lpage>633</lpage>
          <pub-id pub-id-type="doi">10.1016/j.gde.2004.09.003</pub-id>
        </citation>
      </ref>
      <ref id="B47-genes-03-00634">
        <label>47.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Schneider</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Lenski</surname>
              <given-names>R.E.</given-names>
            </name>
          </person-group>
          <article-title>Dynamics of insertion sequence elements during experimental evolution of bacteria</article-title>
          <source>Res. Microbiol.</source>
          <year>2004</year>
          <volume>155</volume>
          <fpage>319</fpage>
          <lpage>327</lpage>
          <pub-id pub-id-type="doi">10.1016/j.resmic.2003.12.008</pub-id>
        </citation>
      </ref>
      <ref id="B48-genes-03-00634">
        <label>48.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Latorre</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Gil</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Silva</surname>
              <given-names>F.J.</given-names>
            </name>
            <name>
              <surname>Moya</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Chromosomal stasis <italic>versus</italic> plasmid plasticity in aphid endosymbiont <italic>Buchnera aphidicola</italic></article-title>
          <source>Heredity</source>
          <year>2005</year>
          <volume>95</volume>
          <fpage>339</fpage>
          <lpage>347</lpage>
          <pub-id pub-id-type="doi">10.1038/sj.hdy.6800716</pub-id>
        </citation>
      </ref>
      <ref id="B49-genes-03-00634">
        <label>49.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Singh</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Cole</surname>
              <given-names>S.T.</given-names>
            </name>
          </person-group>
          <article-title><italic>Mycobacterium leprae</italic>: Genes, pseudogenes and genetic diversity</article-title>
          <source>Future Microbiol. 6</source>
          <year>2010</year>
          <fpage>57</fpage>
          <lpage>71</lpage>
        </citation>
      </ref>
      <ref id="B50-genes-03-00634">
        <label>50.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Camus</surname>
              <given-names>J.C.</given-names>
            </name>
            <name>
              <surname>Pryor</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Medigue</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Cole</surname>
              <given-names>S.T.</given-names>
            </name>
          </person-group>
          <article-title>Re-annotation of the genome sequence of <italic>Mycobacterium tuberculosis</italic> H37Rv</article-title>
          <source>Microbiol.</source>
          <year>2002</year>
          <volume>148</volume>
          <fpage>2967</fpage>
          <lpage>2973</lpage>
        </citation>
      </ref>
      <ref id="B51-genes-03-00634">
        <label>51.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Shigenobu</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Watanabe</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Hattori</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Sakaki</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Ishikawa</surname>
              <given-names>H.</given-names>
            </name>
          </person-group>
          <article-title>Genome sequence of the endocellular bacterial symbiont of aphids <italic>Buchnera</italic> sp. APS</article-title>
          <source>Nature</source>
          <year>2000</year>
          <volume>407</volume>
          <fpage>81</fpage>
          <lpage>86</lpage>
          <pub-id pub-id-type="doi">10.1038/35024074</pub-id>
        </citation>
      </ref>
      <ref id="B52-genes-03-00634">
        <label>52.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Perez-Brocal</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Gil</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Ramos</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Lamelas</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Postigo</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Michelena</surname>
              <given-names>J.M.</given-names>
            </name>
            <name>
              <surname>Silva</surname>
              <given-names>F.J.</given-names>
            </name>
            <name>
              <surname>Moya</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Latorre</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>A small microbial genome: The end of a long symbiotic relationship?</article-title>
          <source>Science</source>
          <year>2006</year>
          <volume>314</volume>
          <fpage>312</fpage>
          <lpage>313</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1130441</pub-id><pub-id pub-id-type="pmid">17038625</pub-id></citation>
      </ref>
      <ref id="B53-genes-03-00634">
        <label>53.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wu</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Sun</surname>
              <given-names>L.V.</given-names>
            </name>
            <name>
              <surname>Vamathevan</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Riegler</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Deboy</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Brownlie</surname>
              <given-names>J.C.</given-names>
            </name>
            <name>
              <surname>McGraw</surname>
              <given-names>E.A.</given-names>
            </name>
            <name>
              <surname>Martin</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Esser</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Ahmadinejad</surname>
              <given-names>N.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Phylogenomics of the reproductive parasite <italic>Wolbachia. pipientis w</italic>Mel: A streamlined genome overrun by mobile genetic elements</article-title>
          <source>PLoS Biol.</source>
          <year>2004</year>
          <volume>2</volume>
          <fpage>E69</fpage>
          <pub-id pub-id-type="doi">10.1371/journal.pbio.0020069</pub-id>
        </citation>
      </ref>
      <ref id="B54-genes-03-00634">
        <label>54.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Klasson</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Westberg</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Sapountzis</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Naslund</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Lutnaes</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Darby</surname>
              <given-names>A.C.</given-names>
            </name>
            <name>
              <surname>Veneti</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Chen</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Braig</surname>
              <given-names>H.R.</given-names>
            </name>
            <name>
              <surname>Garrett</surname>
              <given-names>R.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The mosaic genome structure of the <italic>Wolbachia. w</italic>Ri strain infecting <italic>Drosophila simulans</italic></article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2009</year>
          <volume>106</volume>
          <fpage>5725</fpage>
          <lpage>5730</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.0810753106</pub-id><pub-id pub-id-type="pmid">19307581</pub-id></citation>
      </ref>
      <ref id="B55-genes-03-00634">
        <label>55.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Klasson</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Walker</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Sebaihia</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Sanders</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Quail</surname>
              <given-names>M.A.</given-names>
            </name>
            <name>
              <surname>Lord</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Sanders</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Earl</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>O'Neill</surname>
              <given-names>S.L.</given-names>
            </name>
            <name>
              <surname>Thomson</surname>
              <given-names>N.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Genome evolution of <italic>Wolbachia</italic> strain <italic>w</italic>Pip from the <italic>Culex pipiens</italic> group</article-title>
          <source>Mol. Biol. Evol.</source>
          <year>2008</year>
          <volume>25</volume>
          <fpage>1877</fpage>
          <lpage>1887</lpage>
          <pub-id pub-id-type="doi">10.1093/molbev/msn133</pub-id>
        </citation>
      </ref>
      <ref id="B56-genes-03-00634">
        <label>56.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Foster</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Ganatra</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Kamal</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Ware</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Makarova</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Ivanova</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Bhattacharyya</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Kapatral</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Kumar</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Posfai</surname>
              <given-names>J.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The <italic>Wolbachia</italic> genome of <italic>Brugia malayi</italic>: Endosymbiont evolution within a human pathogenic nematode</article-title>
          <source>PLoS Biol.</source>
          <year>2005</year>
          <volume>3</volume>
          <fpage>e121</fpage>
          <pub-id pub-id-type="doi">10.1371/journal.pbio.0030121</pub-id>
        </citation>
      </ref>
      <ref id="B57-genes-03-00634">
        <label>57.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Silva</surname>
              <given-names>F.J.</given-names>
            </name>
            <name>
              <surname>Latorre</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Moya</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Genome size reduction through multiple events of gene disintegration in <italic>Buchnera</italic> APS</article-title>
          <source>Trends Genet.</source>
          <year>2001</year>
          <volume>17</volume>
          <fpage>615</fpage>
          <lpage>618</lpage>
          <pub-id pub-id-type="doi">10.1016/S0168-9525(01)02483-0</pub-id>
        </citation>
      </ref>
      <ref id="B58-genes-03-00634">
        <label>58.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lo</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Paraskevopoulos</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Bourtzis</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>O'Neill</surname>
              <given-names>S.L.</given-names>
            </name>
            <name>
              <surname>Werren</surname>
              <given-names>J.H.</given-names>
            </name>
            <name>
              <surname>Bordenstein</surname>
              <given-names>S.R.</given-names>
            </name>
            <name>
              <surname>Bandi</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Taxonomic status of the intracellular bacterium <italic>Wolbachia pipientis</italic></article-title>
          <source>Int. J. Syst. Evol. Microbiol.</source>
          <year>2007</year>
          <volume>57</volume>
          <fpage>654</fpage>
          <lpage>657</lpage>
          <pub-id pub-id-type="doi">10.1099/ijs.0.64515-0</pub-id>
        </citation>
      </ref>
      <ref id="B59-genes-03-00634">
        <label>59.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Salzberg</surname>
              <given-names>S.L.</given-names>
            </name>
            <name>
              <surname>Puiu</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Sommer</surname>
              <given-names>D.D.</given-names>
            </name>
            <name>
              <surname>Nene</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Lee</surname>
              <given-names>N.H.</given-names>
            </name>
          </person-group>
          <article-title>Genome sequence of the <italic>Wolbachia</italic> endosymbiont of <italic>Culex quinquefasciatus</italic> JHB</article-title>
          <source>J. Bacteriol.</source>
          <year>2009</year>
          <volume>191</volume>
          <fpage>1725</fpage>
          <pub-id pub-id-type="doi">10.1128/JB.01731-08</pub-id>
        </citation>
      </ref>
      <ref id="B60-genes-03-00634">
        <label>60.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Heddi</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Charles</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Khatchadourian</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Bonnot</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Nardon</surname>
              <given-names>P.</given-names>
            </name>
          </person-group>
          <article-title>Molecular characterization of the principal symbiotic bacteria of the weevil <italic>Sitophilus oryzae</italic>: A peculiar G+C content of an endocytobiotic DNA</article-title>
          <source>J. Mol. Evol.</source>
          <year>1998</year>
          <volume>47</volume>
          <fpage>52</fpage>
          <lpage>61</lpage>
          <pub-id pub-id-type="doi">10.1007/PL00006362</pub-id>
        </citation>
      </ref>
      <ref id="B61-genes-03-00634">
        <label>61.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Dale</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Plague</surname>
              <given-names>G.R.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Ochman</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Moran</surname>
              <given-names>N.A.</given-names>
            </name>
          </person-group>
          <article-title>Type III secretion systems and the evolution of mutualistic endosymbiosis</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2002</year>
          <volume>99</volume>
          <fpage>12397</fpage>
          <lpage>12402</lpage>
          <pub-id pub-id-type="doi">10.1073/pnas.182213299</pub-id>
        </citation>
      </ref>
      <ref id="B62-genes-03-00634">
        <label>62.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lefevre</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Charles</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Vallier</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Delobel</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Farrell</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Heddi</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Endosymbiont phylogenesis in the Dryophthoridae weevils: Evidence for bacterial replacement</article-title>
          <source>Mol. Biol. Evol.</source>
          <year>2004</year>
          <volume>21</volume>
          <fpage>965</fpage>
          <lpage>973</lpage>
          <pub-id pub-id-type="doi">10.1093/molbev/msh063</pub-id>
        </citation>
      </ref>
      <ref id="B63-genes-03-00634">
        <label>63.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Conord</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Despres</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Vallier</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Balmand</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Miquel</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Zundel</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Lemperiere</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Heddi</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Long-term evolutionary stability of bacterial endosymbiosis in Curculionoidea: Additional evidence of symbiont replacement in the Dryophthoridae family</article-title>
          <source>Mol. Biol. Evol.</source>
          <year>2008</year>
          <fpage>859</fpage>
          <lpage>868</lpage>
        </citation>
      </ref>
      <ref id="B64-genes-03-00634">
        <label>64.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Koonin</surname>
              <given-names>E.V.</given-names>
            </name>
            <name>
              <surname>Makarova</surname>
              <given-names>K.S.</given-names>
            </name>
            <name>
              <surname>Aravind</surname>
              <given-names>L.</given-names>
            </name>
          </person-group>
          <article-title>Horizontal gene transfer in prokaryotes: Quantification and classification</article-title>
          <source>Annu. Rev. Microbiol.</source>
          <year>2001</year>
          <volume>55</volume>
          <fpage>709</fpage>
          <lpage>742</lpage>
          <pub-id pub-id-type="doi">10.1146/annurev.micro.55.1.709</pub-id>
        </citation>
      </ref>
      <ref id="B65-genes-03-00634">
        <label>65.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lawrence</surname>
              <given-names>J.G.</given-names>
            </name>
            <name>
              <surname>Hendrix</surname>
              <given-names>R.W.</given-names>
            </name>
            <name>
              <surname>Casjens</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>Where are the pseudogenes in bacterial genomes?</article-title>
          <source>Trends Microbiol.</source>
          <year>2001</year>
          <volume>9</volume>
          <fpage>535</fpage>
          <lpage>540</lpage>
          <pub-id pub-id-type="doi">10.1016/S0966-842X(01)02198-9</pub-id>
        </citation>
      </ref>
      <ref id="B66-genes-03-00634">
        <label>66.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ochman</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Davalos</surname>
              <given-names>L.M.</given-names>
            </name>
          </person-group>
          <article-title>The nature and dynamics of bacterial genomes</article-title>
          <source>Science</source>
          <year>2006</year>
          <volume>311</volume>
          <fpage>1730</fpage>
          <lpage>1733</lpage>
          <pub-id pub-id-type="doi">10.1126/science.1119966</pub-id>
        </citation>
      </ref>
      <ref id="B67-genes-03-00634">
        <label>67.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Mira</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Ochman</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Moran</surname>
              <given-names>N.A.</given-names>
            </name>
          </person-group>
          <article-title>Deletional bias and the evolution of bacterial genomes</article-title>
          <source>Trends Genet.</source>
          <year>2001</year>
          <volume>17</volume>
          <fpage>589</fpage>
          <lpage>596</lpage>
          <pub-id pub-id-type="doi">10.1016/S0168-9525(01)02447-7</pub-id>
        </citation>
      </ref>
      <ref id="B68-genes-03-00634">
        <label>68.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Andersson</surname>
              <given-names>J.O.</given-names>
            </name>
            <name>
              <surname>Andersson</surname>
              <given-names>S.G.</given-names>
            </name>
          </person-group>
          <article-title>Insights into the evolutionary process of genome degradation</article-title>
          <source>Curr. Opin. Genet. Dev.</source>
          <year>1999</year>
          <volume>9</volume>
          <fpage>664</fpage>
          <lpage>671</lpage>
          <pub-id pub-id-type="doi">10.1016/S0959-437X(99)00024-6</pub-id>
        </citation>
      </ref>
      <ref id="B69-genes-03-00634">
        <label>69.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Petrov</surname>
              <given-names>D.A.</given-names>
            </name>
            <name>
              <surname>Hartl</surname>
              <given-names>D.L.</given-names>
            </name>
          </person-group>
          <article-title>Pseudogene evolution and natural selection for a compact genome</article-title>
          <source>J. Hered.</source>
          <year>2000</year>
          <volume>91</volume>
          <fpage>221</fpage>
          <lpage>227</lpage>
          <pub-id pub-id-type="doi">10.1093/jhered/91.3.221</pub-id>
        </citation>
      </ref>
    </ref-list>
  </back>
</article>
