<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="review-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">genes</journal-id>
      <journal-title>Genes</journal-title>
      <abbrev-journal-title abbrev-type="publisher">Genes</abbrev-journal-title>
      <abbrev-journal-title abbrev-type="pubmed">Genes</abbrev-journal-title>
      <issn pub-type="epub">2073-4425</issn>
      <publisher>
        <publisher-name>MDPI</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.3390/genes3030461</article-id>
      <article-id pub-id-type="publisher-id">genes-03-00461</article-id>
      <article-categories>
        <subj-group>
          <subject>Review</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Beyond Junk-Variable Tandem Repeats as Facilitators of Rapid Evolution of Regulatory and Coding Sequences</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Gemayel</surname>
            <given-names>Rita</given-names>
          </name>
          <xref rid="af1-genes-03-00461" ref-type="aff">1</xref>
          <xref rid="af2-genes-03-00461" ref-type="aff">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Cho</surname>
            <given-names>Janice</given-names>
          </name>
          <xref rid="af1-genes-03-00461" ref-type="aff">1</xref>
          <xref rid="af2-genes-03-00461" ref-type="aff">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Boeynaems</surname>
            <given-names>Steven</given-names>
          </name>
          <xref rid="af1-genes-03-00461" ref-type="aff">1</xref>
          <xref rid="af2-genes-03-00461" ref-type="aff">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Verstrepen</surname>
            <given-names>Kevin J.</given-names>
          </name>
          <xref rid="af1-genes-03-00461" ref-type="aff">1</xref>
          <xref rid="af2-genes-03-00461" ref-type="aff">2</xref>
          <xref rid="c1-genes-03-00461" ref-type="corresp">*</xref>
        </contrib>
      </contrib-group>
      <aff id="af1-genes-03-00461"><label>1 </label>Laboratory for Systems Biology, VIB, Gaston Geenslaan 1, B-3001 Heverlee, Belgium; <email>rita.gemayel@biw.vib-kuleuven.be</email> (R.G.); <email>jacho@students.pitzer.edu</email> (J.C.); <email>steven.boeynaems@student.kuleuven.be</email> (S.B.)</aff>
      <aff id="af2-genes-03-00461"><label>2 </label>Laboratory for Genetics and Genomics, Centre of Microbial and Plant Genetics (CMPG), KULeuven, Gaston Geenslaan 1, B-3001 Heverlee, Belgium</aff>
      <author-notes>
        <corresp id="c1-genes-03-00461"><label>*</label> Author to whom correspondence should be addressed; E-Mail: <email>Kevin.Verstrepen@biw.vib-kuleuven.be</email>; Tel./Fax: +32-0-16-75-1390.</corresp>
      </author-notes>
      <pub-date pub-type="epub">
        <day>26</day>
        <month>07</month>
        <year>2012</year>
      </pub-date>
      <pub-date pub-type="collection"><month>09</month>
        <year>2012</year>
      </pub-date>
      <volume>3</volume>
      <issue>3</issue>
      <fpage>461</fpage>
      <lpage>480</lpage>
      <history>
        <date date-type="received">
          <day>25</day>
          <month>06</month>
          <year>2012</year>
        </date>
        <date date-type="rev-recd">
          <day>19</day>
          <month>07</month>
          <year>2012</year>
        </date>
        <date date-type="accepted">
          <day>21</day>
          <month>07</month>
          <year>2012</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>© 2012 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
        <copyright-year>2012</copyright-year>
        <license xmlns:xlink="http://www.w3.org/1999/xlink" license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/">
          <p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p>
        </license>
      </permissions>
      <abstract>
        <p>Copy Number Variations (CNVs) and Single Nucleotide Polymorphisms (SNPs) have been the major focus of most large-scale comparative genomics studies to date. Here, we discuss a third, largely ignored, type of genetic variation, namely changes in tandem repeat number. Historically, tandem repeats have been designated as non functional “junk” DNA, mostly as a result of their highly unstable nature. With the exception of tandem repeats involved in human neurodegenerative diseases, repeat variation was often believed to be neutral with no phenotypic consequences. Recent studies, however, have shown that as many as 10% to 20% of coding and regulatory sequences in eukaryotes contain an unstable repeat tract. Contrary to initial suggestions, tandem repeat variation can have useful phenotypic consequences. Examples include rapid variation in microbial cell surface, tuning of internal molecular clocks in flies and the dynamic morphological plasticity in mammals. As such, tandem repeats can be useful functional elements that facilitate evolvability and rapid adaptation.</p>
      </abstract>
      <kwd-group>
        <kwd>microsatellites</kwd>
        <kwd>repeats</kwd>
        <kwd>evolvability</kwd>
        <kwd>phenotype</kwd>
        <kwd>polyglutamine</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec sec-type="intro">
      <title>1. Introduction</title>
      <sec>
        <title>1.1. Tandem Repeats—Definitions and Characteristics</title>
        <p>The scientific community has previously dismissed repetitive regions as nonfunctional, “junk” DNA. However, recent evidence with the assistance of whole genome sequencing illuminates the significant role repeats might play in genomes. In the 1960s, scientists identified these repetitive elements as the explanation for the negative correlation between an organism’s phenotypic complexity and its genome size [<xref ref-type="bibr" rid="B1-genes-03-00461">1</xref>]. For instance, repeats constitute almost 46% of the entire human genome and prokaryotic genomes contain roughly 10% repetitive regions, a significant amount considering their small sizes [<xref ref-type="bibr" rid="B2-genes-03-00461">2</xref>]. </p>
        <p>Two categories of repetitive regions exist—interspersed repeats and tandem repeats (TRs). Interspersed repeats, the more predominant type of repeat, are remnants of transposons dispersed throughout the genome. Such elements are responsible for the diverse array of genome sizes amongst various species [<xref ref-type="bibr" rid="B3-genes-03-00461">3</xref>]. On the other hand, TRs are repetitive DNA, which exist directly adjacent, or in tandem, to one another (<xref ref-type="fig" rid="genes-03-00461-f001">Figure 1</xref>). TRs are often referred to as satellite DNA because they were first identified as sequences constituting the second or “satellite” band after subjection through density-gradient centrifugation [<xref ref-type="bibr" rid="B4-genes-03-00461">4</xref>].</p>
        <p>On the basis of unit length (unit = repeated sequence of DNA—see <xref ref-type="fig" rid="genes-03-00461-f001">Figure 1</xref>), TRs are further divided into two subcategories—microsatellites and minisatellites. Microsatellites, or simple sequence repeats (SSRs), are short TRs with unit length between one to ten nucleotides. Minisatellites are TRs with unit length larger than ten nucleotides. For the purposes of this review, we limit our discussion to tandem repeats.</p>
      </sec>
      <sec>
        <title>1.2. Instability of Repeats</title>
        <p>Tandem repeats are evolutionarily pertinent due to their instability; they mutate at rates between 10<sup>−3</sup> and 10<sup>−6</sup> per cellular generation (<italic>i.e.</italic>, 1 to 10 orders of magnitude greater than point mutations) [<xref ref-type="bibr" rid="B5-genes-03-00461">5</xref>]. Repeat polymorphisms usually occur from the addition or deletion of repeat units, rather than nucleotide substitutions. For instance, in a CTGA tract, most mutations occur by the addition or deletion of an entire CTGA unit as opposed to rare cases in which only a part of the repeat unit is altered (e.g., deletion of two nucleotides GA) (<xref ref-type="fig" rid="genes-03-00461-f001">Figure 1</xref>).</p>
        <p>Two major models have been proposed to explain TR expansions and contraction: strand-slippage replication and recombination. Strand-slippage replication (slipped-strand mispairing or DNA slippage) is a DNA replication error by which mispairing occurs between the template and nascent strands. As such, the template strand can loop out, causing contraction; the nascent strand can also loop out, leading to repeat expansion. Recombination events, such as unequal crossing over and gene conversion may additionally lead to contractions and expansions of TR sequences [<xref ref-type="bibr" rid="B5-genes-03-00461">5</xref>,<xref ref-type="bibr" rid="B6-genes-03-00461">6</xref>].</p>
        <p>Various studies suggest alternative explanations for mutation events. Some studies implicate that strand-slippage replication dominates micro- and minisatellite instability [<xref ref-type="bibr" rid="B7-genes-03-00461">7</xref>]; others indicate that strand-slippage replication generally associates with microsatellite instability whereas recombination effects dominate minisatellite instability [<xref ref-type="bibr" rid="B8-genes-03-00461">8</xref>]. However, the precise mechanism by which repeat mutations occur remains unclear.</p>
        <fig id="genes-03-00461-f001" position="float">
          <label>Figure 1</label>
          <caption>
            <p>Main definitions and characteristics of tandem repeats (TRs). TRs are unstable due to frequent changes in the number of repeat units. TRs with short unit length are also called microsatellites and those with long units are called minisatellites.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="genes-03-00461-g001.tif"/>
        </fig>
      </sec>
      <sec>
        <title>1.3. Factors Influencing Repeat Instability</title>
        <p>The frequency of repeat mutation depends on repeat tract: the longer and purer the repeat, the higher the mutation frequency whereas the shorter and less pure the repeat, the lower the mutation frequency. Even a few impurities drastically reduce mutability of repeat regions [<xref ref-type="bibr" rid="B9-genes-03-00461">9</xref>], indicating that impure repeats demonstrate far greater stability than pure repeats. One study showcases that even a single variant repeat in an otherwise pure poly GT tract increases its stability fivefold [<xref ref-type="bibr" rid="B10-genes-03-00461">10</xref>]. According to Legendre <italic>et al.</italic> 2007, the number of repeat units is the most important determinant of TR stability, above purity and repeat unit length. Base composition also affects repeat stability: polyC and polyG tracts are more unstable than polyA and polyT tracts [<xref ref-type="bibr" rid="B11-genes-03-00461">11</xref>].</p>
        <p>Cellular processes may additionally influence TR mutation rates. It has been shown that increased transcription rates can enhance repeat instability; in yeast, transcription of a poly GT tract destabilizes the repeat region four-to-ninefold with higher levels of transcription causing an additional destabilizing effect of two- to three-fold [<xref ref-type="bibr" rid="B12-genes-03-00461">12</xref>].</p>
        <p>External factors may also significantly influence repeat stability. Classic examples include CT repeat contraction upon fungal infection [<xref ref-type="bibr" rid="B13-genes-03-00461">13</xref>] and repeats in bacteria that undergo frequent adaptive, environmentally stimulated mutations [<xref ref-type="bibr" rid="B14-genes-03-00461">14</xref>]. Likewise, inhibition of Hsp90 (a protein generally involved in responding to severe environmental changes) increases CAG repeat contraction by tenfold, while not affecting the rate of point mutations, illustrating how environmental stresses may trigger TR mutations [<xref ref-type="bibr" rid="B15-genes-03-00461">15</xref>].</p>
      </sec>
      <sec>
        <title>1.4. In Search of Repeats</title>
        <p>Whole genome sequencing has enlightened researchers on the prevalence of repetitive regions in the genome. A comprehensive survey of the various software packages and algorithms for TR detection has been previously published [<xref ref-type="bibr" rid="B16-genes-03-00461">16</xref>]. Certain obstacles remain in our quest to understand more about TRs. One significant issue arises due to the tedious nature of TR identification. Despite the availability of numerous algorithms for identifying TRs (<italic>i.e.</italic>, Tandem Repeat Finder, Sputnik, TROLL) which define certain requirements for TRs based on the number of repeated units, purity of repeats, as well as length of repeat region, no definite guideline exists and algorithms vary from one to another. Therefore, it proves to be quite difficult in assigning a proper cutoff score and an overestimation of TR regions may ensue without the enforcement of stringent guidelines for TR identification.</p>
        <p>Certain measures may be taken to prevent such scenarios (e.g., the shuffle method). Multiple, randomly shuffled sequences of similar size and composition as the sequence of interest are selected and subjected to a TR-finding program. These sequences are searched for TRs using various cutoff scores. The average number of TRs identified in these scrambled sequences then serves as a value of background noise produced by chance occurrence of TRs (for more details, see [<xref ref-type="bibr" rid="B17-genes-03-00461">17</xref>]. A sequence mining tool such as SERV identifies TRs in various DNA sequences and predicts a “variability score” of each repeat based on its purity, unit size, and number of repeat units [<xref ref-type="bibr" rid="B9-genes-03-00461">9</xref>]. These procedures allow for proper identification of significant, variable TRs rather than insignificant “background noise”.</p>
      </sec>
      <sec>
        <title>1.5. Location of Repeats in Genomes: Coding and Regulatory Regions</title>
        <p>Though repeats are overabundant in gene deserts, whole genome sequencing capabilities provide concrete evidence for the prevalence of repeats in coding regions as well. This is exemplified by the human genome; 17% of genes contain repeats in open reading frames and such values are comparable in other species as well [<xref ref-type="bibr" rid="B17-genes-03-00461">17</xref>,<xref ref-type="bibr" rid="B18-genes-03-00461">18</xref>] Microsatellites (&lt;10 nt) are specifically enriched in regulatory genes that encode for transcription factors, DNA-RNA binding proteins, and chromatin modifiers [<xref ref-type="bibr" rid="B19-genes-03-00461">19</xref>]; minisatellites (&gt;10) are prevalent in genes encoding extracellular or cell-surface genes [<xref ref-type="bibr" rid="B5-genes-03-00461">5</xref>]. </p>
        <p>Unlike repeats in noncoding regions, the high frequency of tri- and hexanucleotide repeats (units of 3 or 6 nucleotides) found in coding regions suggests a selective pressure against frameshift mutations [<xref ref-type="bibr" rid="B20-genes-03-00461">20</xref>]. For instance, of the 1 in 20 human genes that were found to contain TR polymorphisms, only approximately 1% of those genes exhibited frameshift mutations [<xref ref-type="bibr" rid="B21-genes-03-00461">21</xref>]. Exceptions apply to prokaryotes in which frameshift mutations act as a beneficial mechanism (to be further discussed later). </p>
        <p>In coding regions, repeats exhibit biases for specific amino acids. Glutamine, arginine, glutamate, alanine, and serine residues largely dominate eukaryotic genomes [<xref ref-type="bibr" rid="B22-genes-03-00461">22</xref>] and [<xref ref-type="bibr" rid="B23-genes-03-00461">23</xref>] whereas serine, glycine, alanine, and proline residues dominate prokaryotic genomes [<xref ref-type="bibr" rid="B24-genes-03-00461">24</xref>]. Interestingly enough, hydrophobic residues such as isoleucine, methionine, and tryptophan residues are generally absent, indicating that preferences for certain residues are not random but stem from the functional role of repeats.</p>
        <p>In many instances, repeat mutations, especially in gene deserts, have neutral effects and remain undetected. Cases like these may have contributed to the previous notion that repeats were nonfunctional junk or selfish DNA [<xref ref-type="bibr" rid="B25-genes-03-00461">25</xref>,<xref ref-type="bibr" rid="B26-genes-03-00461">26</xref>,<xref ref-type="bibr" rid="B27-genes-03-00461">27</xref>]. However, repeat instability in coding regions, promoters, and other regulatory regions may eventually proceed to alter gene function. In this review, we highlight the studies that implicate variable TRs with changes in gene function, starting by TRs located within regulatory regions (promoters and other regulatory sequences) and ending with studies on intragenic (<italic>i.e.</italic>, coding) repeats.</p>
      </sec>
    </sec>
    <sec>
      <title>2. Repeats in Non-Coding Regions</title>
      <p>Phenotypic evolution has been mostly attributed to changes in gene expression and regulation due to variation in (non-coding) <italic>cis</italic>-regulatory elements [<xref ref-type="bibr" rid="B28-genes-03-00461">28</xref>,<xref ref-type="bibr" rid="B29-genes-03-00461">29</xref>]. Upon analysis of the human genome, Rockman and Wray, found a higher heterozygosity in <italic>cis</italic>-regulatory sites than in coding sequences in part, due to the high prevalence of TRs in regulatory regions, particularly the AC dinucleotide repeat [<xref ref-type="bibr" rid="B30-genes-03-00461">30</xref>]. The authors suggest that these repeats contribute significantly to variable gene expression in humans and are thus sources of rapidly evolving, heritable phenotypic variation. A similar analysis in the yeast <italic>Saccharomyces cerevisiae</italic> indicates that about 25% of all promoters contain TRs. Variation in these repeats is associated with variable gene expression within and between yeast species [<xref ref-type="bibr" rid="B31-genes-03-00461">31</xref>] .</p>
      <p>Additional studies uncover a link between variable TRs in promoters and changes in gene expression and/or function. This suggests that instead of being neutral sequences without a functional value, TRs may actually be significant contributors to functional variation in gene expression. In the following section, we report the studies that reveal a link between variable TRs and changes in gene expression. We focus particularly on studies which reveal the mechanism through which repeats in non-coding regions affect downstream gene expression. A summary of numerous correlation studies, in which gene expression variation has only been associated with variable promoter TRs, can be found in the following reference [<xref ref-type="bibr" rid="B17-genes-03-00461">17</xref>].</p>
    </sec>
    <sec>
      <title>3. Promoter Evolution Through Tandem Repeat Variation</title>
      <p>The high prevalence of TRs in regulatory regions, in many organisms, suggests that repeat variation might be a common, evolutionarily conserved, mechanism regulating gene expression. A large number of studies report correlations between TRs and changes in gene expression [<xref ref-type="bibr" rid="B17-genes-03-00461">17</xref>]. Other studies, in addition, provide a solid mechanistic explanation as to how repeat variation leads to changes in downstream gene expression. These studies will be discussed in greater detail in this section. </p>
      <sec>
        <title>3.1. Tandemly Repeated Transcription Factor Binding Sites</title>
        <p>Variable TRs in promoters can affect gene expression by altering the number of transcription factor binding sites. Frequent TR variability of repeat-containing transcription factor binding sites results in changes in the number of binding sites. An example of such a mechanism was first uncovered in pathogenic bacterial species. In <italic>Neisseria meningitidis</italic>, changes in the number of TAAA repeats located in the <italic>nadA</italic> promoter induces variable expression of the NadA adhesin protein. This highly frequent switching of phenotypic outcome, also referred to as phase variation, is mediated by the loss or gain of binding sites for the regulatory protein IHF [<xref ref-type="bibr" rid="B32-genes-03-00461">32</xref>]. A second, repeat-dependent, level of control on the same <italic>nadA</italic> promoter is discussed in the following paragraph. Rapid phenotypic switching mediated by intragenic (<italic>i.e.</italic>, coding) repeats will be discussed in a later section of this review. </p>
        <p>Changes in transcription factor binding sites as a result of unstable TRs have also been described in eukaryotes. A variable TCC repeat in the promoter region of the human epidermal growth factor receptor (a proto-oncogene) induces changes in gene expression by altering the number of binding sites for the transcriptional regulator Sp1 [<xref ref-type="bibr" rid="B33-genes-03-00461">33</xref>]. A change of one unit in a 61-bp repeat of the human reduced folate carrier gene promoter induces a significant change in gene expression. This repeat is polymorphic in the human population and binds the transcription factors Sp1 and AP2 <italic>in vitro</italic> [<xref ref-type="bibr" rid="B34-genes-03-00461">34</xref>]. Though these display a few examples of the role of TRs in transcription factor binding sites, a screen of polymorphic TRs located in a 10-kb region upstream of the transcription start sites in the human genome reveals that there are many more TRs that bind regulatory proteins and thus act as <italic>cis</italic>-regulatory elements [<xref ref-type="bibr" rid="B35-genes-03-00461">35</xref>].</p>
      </sec>
      <sec>
        <title>3.2. Variable Repeats Induce Variable Spacing Between Functional Promoter Elements</title>
        <p>Variable TRs in promoters not only affect the number of transcription factor binding sites, but can also induce changes in spacing between critical promoter elements (<xref ref-type="fig" rid="genes-03-00461-f002">Figure 2</xref>). In the <italic>nadA</italic> promoter of <italic>N. meningitidis</italic> (see above), changes in the number of TAAA repeats also induce changes in spacing between distal and proximal binding sites (flanking the TAAA repeat) for the NadR repressor protein [<xref ref-type="bibr" rid="B36-genes-03-00461">36</xref>]. Additionally, the transcript level of the <italic>N. meningitidis</italic> membrane protein PorA correlates with the number of guanidine residues within the -10 and -35 spacer region in the upstream promoter. A mere loss or gain of one nucleotide in this polyGuanidine TR leads to significant changes in PorA transcription [<xref ref-type="bibr" rid="B37-genes-03-00461">37</xref>]. Similarly, in <italic>Haemophilus influenzae</italic>, promoter TRs modulate the expression of genes involved in the formation of adhesive appendages or fimbriae. Variation in a TA dinucleotide repeat in the bidirectional promoter of two genes <italic>hifA</italic> and <italic>hifB</italic> alters the critical spacing between the -10 and -35 consensus sequence, thus affecting the bidirectional transcription initiation of both genes [<xref ref-type="bibr" rid="B38-genes-03-00461">38</xref>]. Several phase variable outer membrane proteins, whose expression is attributed to similar promoter TRs, have also been described in <italic>Bordetella pertussis</italic> [<xref ref-type="bibr" rid="B39-genes-03-00461">39</xref>], and <italic>Mycoplasma hyorhinis</italic> [<xref ref-type="bibr" rid="B40-genes-03-00461">40</xref>].</p>
        <fig id="genes-03-00461-f002" position="float">
          <label>Figure 2</label>
          <caption>
            <p>Phase variation mediated by variable TRs. Variable TRs control phase variation to allow pathogenic bacteria to rapidly interchange between phenotypes. (<bold>a</bold>) Variable tandem repeats in promoter regions control gene expression. A certain repeat number allows for RNA polymerase binding in promoter regions enabling proper gene expression (confers the ON phenotype) whereas TR truncation disables RNA polymerase binding leading to blocked expression (confers the OFF phenotype); (<bold>b</bold>) Variable TRs located in coding regions produce either a fully-functioning protein (confers the ON phenotype) or truncated, protein (confers the OFF phenotype).</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="genes-03-00461-g002.tif"/>
        </fig>
        <p>Genome-wide sequencing suggests there are many more pathogenic bacteria, than the ones mentioned above, in which tandem repeat-mediated phase variation is operable. In fact, more than 30 <italic>N. meningitidis</italic> genes are expected to be phase variable as a result of unstable TRs [<xref ref-type="bibr" rid="B41-genes-03-00461">41</xref>]. </p>
        <p>It seems that this form of gene expression regulation (changes in spacing between functional promoter elements) is common among several pathogenic bacterial species. This rapid phenotypic switching, results in a heterogeneous bacterial population in which some individuals are better suited to environmental changes (e.g., attack by the immune system) and consequently have better chances of survival.</p>
      </sec>
      <sec>
        <title>3.3. Dynamic Chromatin Structure and Nucleosome Positioning</title>
        <p>DNA is intricately organized in the nucleus and packaged into structures called nucleosomes, in which DNA is wrapped around an octamer of four core histones, to form chromatin. Chromatin structure is highly dynamic and has large implications in the regulation of gene expression, especially since histone complexes affect accessibility of transcription factors to DNA. </p>
        <p>Several studies indicate a possible role for repetitive amino acid tracts in mediating effects on chromatin structure. A number of authors demonstrate the importance of polyA and polyT tracts as constituents of promoter regions that inhibit nucleosome formation [<xref ref-type="bibr" rid="B42-genes-03-00461">42</xref>,<xref ref-type="bibr" rid="B43-genes-03-00461">43</xref>,<xref ref-type="bibr" rid="B44-genes-03-00461">44</xref>]. Additionally, Vinces <italic>et al</italic>. illustrates TR enrichment in nucleosome-free regions of yeast promoter regions and supports this finding with preliminary observations in human promoters (<xref ref-type="fig" rid="genes-03-00461-f003">Figure 3</xref>). Mutants were made in repeat-containing promoters of several yeast genes and found to affect gene expression and nucleosome positioning. This study unveils that TRs behave as nucleosome-inhibiting sequences that allow for an open chromatin structure [<xref ref-type="bibr" rid="B31-genes-03-00461">31</xref>]. The authors also find that approximately 25% of yeast genes contain TRs, which has strong implications for the importance of TRs as crucial components in genomes of higher-order eukaryotes that regulate DNA structure and facilitate DNA melting. The following section further explores the role of TRs in the regulation of DNA structure.</p>
        <fig id="genes-03-00461-f003" position="float">
          <label>Figure 3</label>
          <caption>
            <p>TRs mediate dynamic chromatin structure. (<bold>a</bold>) TR enrichment in promoter regions is nucleosome-inhibiting and allows for an open chromatin structure; (<bold>b</bold>) Nucleosome-dense regions lack TRs in promoter regions of the genome.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="genes-03-00461-g003.tif"/>
        </fig>
      </sec>
      <sec>
        <title>3.4. Intrinsic Properties of Tandem Repeats Affect DNA Structure</title>
        <p>Though DNA is normally found as B-DNA (right-handed helical conformation), in rare cases, it may exist as Z-DNA (left-handed helical conformation). This conformation is drastically different from the normal B-DNA, that it might block access of transcription factors to DNA. Several studies suggest a role for TRs as elements that assist in Z-DNA formation. Alternating purines and pyrimidines, which favor the Z conformation, are found as TR sequences in the genome. Authors Naylor and Clark find evidence for a CA repeat lying upstream of the rat prolactin gene that enables Z-DNA formation. Additionally, this repeat inhibits gene transcription by potentially deterring transcriptional efficacy of RNA polymerase II [<xref ref-type="bibr" rid="B45-genes-03-00461">45</xref>]. Specific Z-DNA-binding proteins also influence Z-DNA gene expression as exemplified by the effect of double stranded RNA adenosine deaminase (ADAR1) proteins that bind promoter Z-DNA to activate gene expression [<xref ref-type="bibr" rid="B46-genes-03-00461">46</xref>].The structure of DNA greatly influences gene expression and these studies allude to the possible importance of TRs for DNA structure.</p>
      </sec>
    </sec>
    <sec>
      <title>4. Repeats in Introns and Untranslated Regions</title>
      <p>Functional roles for TRs present in introns and untranslated regions have been mostly uncovered by studies investigating the pathogenic mechanisms involved in neurodegenerative diseases. These studies provide insight into how variable intergenic TRs can act as modulators of gene expression. In this section, we focus on such studies and highlight the mechanism through which variable TRs in introns and untranslated regions can lead to changes in gene expression. </p>
      <sec>
        <title>4.1. Recruitment of RNA Binding Proteins</title>
        <p>Tandem repeats in introns and untranslated regions can influence gene expression by modulating the activity of RNA-binding proteins. The mechanistic role of TRs in recruiting RNA-binding proteins has been proposed to explain the overlapping clinical features of two genetically unrelated neurological diseases. Expansion of a CTG repeat in the 3' UTR of the <italic>DMPK</italic> gene causes myotonic dystrophy type 1 (DM1) and the expansion of a CCTG repeat in intron 1 of the <italic>ZNF9</italic> gene causes myotonic dystrophy type 2 (DM2). The expanded RNAs adopting a stable hairpin structure, bind to and alter the activity of RNA processing proteins CUG-BP1 and MNBL [<xref ref-type="bibr" rid="B47-genes-03-00461">47</xref>]. This leads to aberrant splicing of numerous transcripts (<italic>i.e.</italic>, downstream targets) involved in different cellular processes. </p>
      </sec>
      <sec>
        <title>4.2. Variable mRNA Splicing Patterns</title>
        <p>A study investigating the link between expression of the Bromodomain-Containing Gene (<italic>BRD2</italic>) and epilepsy in humans uncovered the role of variable intronic repeats in regulating gene expression through alternative mRNA splicing pattern. In intron 2 of the <italic>BRD2</italic> gene, a polymorphic GT repeat is strongly associated with epilepsy. Gradual shortening from 13 GT repeats to 1 GT repeat results in a higher proportion of (non-productive) alternatively spliced <italic>BRD2</italic> transcripts over normal transcripts. These alternative mRNAs produce non-functional BRD2 proteins [<xref ref-type="bibr" rid="B48-genes-03-00461">48</xref>]. Similarly, a change from 11 CA repeats to 13 repeats in an intron of the cystic fibrosis transmembrane regulator results in higher levels of a non-functional transcript (missing exon 9) [<xref ref-type="bibr" rid="B49-genes-03-00461">49</xref>]. These examples illustrate how intronic dinucleotide repeats alter splicing patterns, ultimately affecting translation and protein synthesis (<xref ref-type="fig" rid="genes-03-00461-f004">Figure 4</xref>). </p>
        <fig id="genes-03-00461-f004" position="float">
          <label>Figure 4</label>
          <caption>
            <p>Variable intronic TRs regulate mRNA splicing. (<bold>a</bold>) A variable TR in intron 2 in the pre-mRNA transcript enables proper splicing to form a fully-functioning, normal mRNA transcript; (<bold>b</bold>) Shortening of intron 2 repeat results in an alternatively spliced (non-productive) transcript missing exon 2, which does not produce any functional protein (from reference [<xref ref-type="bibr" rid="B48-genes-03-00461">48</xref>]). </p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="genes-03-00461-g004.tif"/>
        </fig>
      </sec>
      <sec>
        <title>4.3. How Can Variable Tandem Repeats in Regulatory Regions Promote Evolvability?</title>
        <p>The studies described in this section indicate that variable TRs in regulatory regions can strongly impact gene expression and/or function through various mechanisms. The high prevalence of these repeats in diverse genomes suggest that this type of gene expression regulation (<italic>i.e.</italic>, through variation in TR number) might be a common and conserved regulatory mechanism. The central question that remains is how can TR variability promote evolvability? And what are the effects on an organismal level? </p>
        <p>The studies on bacterial phase variation clearly demonstrate how repeat variation (in promoters and coding regions—see further) enable phenotypic evolution, ultimately impacting the population; rapid changes in TRs create phenotypic variants within a population, leading to fast adaptation (e.g., to the immune system) and increased survival. </p>
        <p>In eukaryotes, a recent study provides experimental evidence on the role of TRs as drivers of rapid evolution. Vinces <italic>et al.</italic> reports that yeast promoters containing TRs have evolved faster since their expression patterns are more divergent between and within yeast strains compared to promoters without repeats [<xref ref-type="bibr" rid="B31-genes-03-00461">31</xref>]. Promoter-containing repeats fused to a fluorescent marker to facilitate selection for higher gene expression provide experimental evidence for repeat-driven rapid evolution after a few rounds of selection: repeat-containing promoters yield variants with higher expression that are specifically linked to changes in promoter repeat number; repeat-less promoters do not show any changes in gene expression during the selection process [<xref ref-type="bibr" rid="B31-genes-03-00461">31</xref>]. This study firmly establishes that several promoter repeats generate expression variability at high rates, allowing for rapid evolution of gene expression.</p>
        <p>In complex organisms, such studies are difficult to undertake, though some reports indicate that this might be the case for particular genes. The promoter of the <italic>MMP3</italic> gene contains a variable polyT tract that is rapidly evolving in the primate lineage. In humans, variation in this repeat results in variable expression of the <italic>MMP3</italic> gene, associated with risk of heart disease: the 5T allele results in higher <italic>MMP3</italic> expression and is associated with myocardial infarction and aneurysm, whereas the 6T allele drives a lower <italic>MMP3</italic> expression and is associated with coronary artery disease [<xref ref-type="bibr" rid="B50-genes-03-00461">50</xref>]. In European populations, the 5T allele seems to be under positive selection in comparison to other parts of the world. This highly variable repeat seems to drive the rapid evolution of the <italic>MMP3</italic> gene expression and based on its associated phenotype in humans, it clearly exhibits a functional value.</p>
      </sec>
    </sec>
    <sec>
      <title>5. Repeats in Coding Regions</title>
      <p>Tandem repeats in coding regions are mostly known for their association with a number of human neurodegenerative diseases. Yet their abundance in functional regions of the genome and their enrichment in certain gene categories, suggest that repeats in coding sequences might play a beneficial role. In this section, we review studies revealing functionally beneficial roles of long (minisatellite) and short (microsatellite) coding repeats.</p>
      <sec>
        <title>5.1. Minisatellites—Functional Variability in Cell Surface Proteins</title>
        <p>Minisatellites are enriched in cell surface genes and conserved through evolution from yeast to humans [<xref ref-type="bibr" rid="B9-genes-03-00461">9</xref>]. Various studies indicate that polymorphisms in these minisatellites in fact confer functional variability. </p>
        <p>The <italic>S. cerevisiae FLO1</italic> gene encodes a cell-surface adhesin that mediates cell-cell and cell-surface adhesion. The gene contains a repeat tract consisting of tandemly repeated units of approximately 100 nt, which are variable across different yeast strains [<xref ref-type="bibr" rid="B5-genes-03-00461">5</xref>]. Phenotypic tests with isogenic strains differing only in their <italic>FLO1</italic> repeat length illustrate that the intensity of the adhesion directly correlates with the length of the repeat (<italic>i.e.</italic>, cells with longer <italic>FLO1</italic> repeats adhere better to each other) [<xref ref-type="bibr" rid="B5-genes-03-00461">5</xref>] (<xref ref-type="fig" rid="genes-03-00461-f005">Figure 5</xref>).</p>
        <fig id="genes-03-00461-f005" position="float">
          <label>Figure 5</label>
          <caption>
            <p>Variable coding TRs modulate cell to cell adhesion. The Flo1 protein binds cell wall polysaccharides (orange and blue hexagons) through its <italic>N</italic>-terminal domain (green) thus linking two yeast cells. Long <italic>FLO1</italic> repeats result in a longer Flo1 protein capable of stronger cell wall binding.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="genes-03-00461-g005.tif"/>
        </fig>
        <p><italic>Flor</italic> yeast strains form buoyant biofilms on the air/liquid interface, allowing them to evade oxygen-deprived environments. Fidalgo <italic>et al</italic>. report that the extension of a minisatellite in <italic>FLO11</italic>, another adhesin-coding gene, contributes to biofilm formation. The repeat tract consists of serine/threonine-rich units (~100 nt), prone to <italic>O</italic>-glycosylation. The addition of covalently linked sugar groups renders the adhesins, and subsequently the cell wall, hydrophobic. Thus, an increased number of repeats enhances buoyancy of yeast strains. Additionally, transformation of <italic>S. cerevisiae</italic> strains, that generally sediment, with <italic>flor FLO11</italic> repeat tract and promoter enable yeast cells to form buoyant biofilms [<xref ref-type="bibr" rid="B51-genes-03-00461">51</xref>].</p>
        <p>Similar observations have also been made in the human pathogen <italic>Candida albicans</italic>. The <italic>ALS</italic> gene family encodes adhesins similar in structure to the baker’s yeast flocculins that mediate biofilm formation. The length of the central repeat domain correlates with adhesion propensity (<italic>i.e.</italic>, longer repeats generate more adhesive cells) [<xref ref-type="bibr" rid="B52-genes-03-00461">52</xref>]. Recombination events between the different <italic>ALS</italic> genes can generate cell-surface variability required to evade the host’s immune response [<xref ref-type="bibr" rid="B53-genes-03-00461">53</xref>]. </p>
      </sec>
      <sec>
        <title>5.2. Microsatellites—Beyond Diseases: A Greater Purpose for Coding Repeats</title>
        <p>Trinucleotide repeats, in particular polyglutamine (polyQ) repeats, are involved in a number of human neurodegenerative diseases (e.g., Huntington’s disease, fragile X syndrome, spinocerebellar ataxia), with numerous studies illustrating the conferral of such diseases upon repeat expansions exceeding a particular threshold. Though TRs involved in human diseases will not be discussed extensively here, it is important to note that severity and time of onset of these neurodegenerative diseases depend on the repeat expansion: longer repeats correlate with earlier disease onset and more severe symptoms (for a review on the role of repeats in human diseases, refer to [<xref ref-type="bibr" rid="B47-genes-03-00461">47</xref>]).</p>
        <p>Considering the little we know, such devastating effects of TR mutations have often overshadowed our knowledge of TRs. However, the predominance of repetitive DNA sequences in functional parts of the genomes, despite such a high propensity for mutations and apparent low information content, hint at a greater role for TRs. In fact, though the genetic code is degenerate, amino acid repeats tend to be coded by pure DNA repeats, illustrating the conservation of repetitive regions on the DNA level as well as the protein level. These unstable repeats, might be tolerated in some coding sequences because they hold the potential for the acquisition of a beneficial novel/improved function (e.g., novel protein-protein interactions,). TRs may not merely be “junk DNA” as was previously assumed. We will highlight key studies that provide experimental evidence for this hypothesis in the following section.</p>
      </sec>
      <sec>
        <title>5.3. Repeats as Molecular Switches in Bacteria</title>
        <p>Intragenic TRs can also mediate phase variation (see also section on repeats in non-coding regions). In the bacterium, <italic>Neisseria gonorrhoeae</italic>, repeat variations in the P. II family of cell surface genes allow for evasion of the host immune system [<xref ref-type="bibr" rid="B54-genes-03-00461">54</xref>]. Changes in the CTCTT pentameric microsatellite repeat in the coding region of the membrane signal peptide of cell surface genes, can either lead to correct or incorrect translation of proteins [<xref ref-type="bibr" rid="B55-genes-03-00461">55</xref>]. This switching between an ON and OFF state of the gene generates phenotypic variability in the population, providing individuals able to evade the host’s immune response (<xref ref-type="fig" rid="genes-03-00461-f002">Figure 2</xref>).</p>
        <p>Similar occurrences are seen in the <italic>lic1</italic> gene of <italic>Haemophilus influenzae</italic> (enables the addition of phosphorylcholine moieties to cell membrane lipopolysaccharides (LPS)), which contains a variable intragenic CAAT repeat. Formation of a complete, functioning Lic1 protein or a truncated mutant depends on variation of an intragenic CAAT repeat. Differences in LPA structures promote different bacterial variants enabling survival of the bacterium in the host [<xref ref-type="bibr" rid="B56-genes-03-00461">56</xref>]. </p>
        <p>Phase variable genes are not merely limited to cell-surface genes; recent evidence suggests that regulatory genes, such as those encoding DNA-methyltransferases (<italic>mod</italic> genes), are also phase variable. The m<italic>odA</italic> gene, in <italic>Neisseria</italic> and <italic>Haemophilus</italic> species, contains either an AGTC or AGCC repeat whose mutations directly alter the rate of phase variation [<xref ref-type="bibr" rid="B57-genes-03-00461">57</xref>]. The <italic>modA</italic> gene contains two potential start codons depending on repeat number but only the one that yields the longest transcript gives rise to a functional protein. Variation in <italic>ModA</italic> repeat number thus has consequences on the expression of its downstream targets that include outer membrane proteins, iron transporters and heat-shock proteins [<xref ref-type="bibr" rid="B57-genes-03-00461">57</xref>]. For instance, the OFF state of the <italic>modA</italic> gene leads to biofilm thickness and better survival rates of <italic>N. gonorrhoeae</italic> cells in human cervical epithelial cells [<xref ref-type="bibr" rid="B58-genes-03-00461">58</xref>]. The ability to engage in phase variation may provide a means to increase population fitness and facilitate rapid adaptation to a hostile environment.</p>
        <p>Sequencing reveals the substantial presence of TRs in prokaryotic genomes, which suggests that rather than such an incidence being happenstance, some TRs are positively selected for rather than against [<xref ref-type="bibr" rid="B59-genes-03-00461">59</xref>]. Particularly in pathogenic bacteria, repeats in coding sequences are more favored at the <italic>N</italic>-terminus and depleted in the middle of the coding sequence [<xref ref-type="bibr" rid="B60-genes-03-00461">60</xref>]. This might increase the probability of frameshift mutations that facilitate phase variation. Thus, TRs may be useful elements that mediate phenotypic change enabling organisms to rapidly adapt to a changing environment. Ultimately, understanding phase variations in these prominent human pathogens may unveil new insights in combating significant health problems.</p>
      </sec>
      <sec>
        <title>5.4. Significance of Polyglutamine Repeats</title>
        <p>In humans, polyglutamine (polyQ) repeats have strong implications for neurodegenerative diseases [<xref ref-type="bibr" rid="B61-genes-03-00461">61</xref>]. In <italic>Drosophila melanogaster</italic>, polyQ repeats constitute one of the three most frequently found amino acid runs; the same can be said for the yeast genome [<xref ref-type="bibr" rid="B62-genes-03-00461">62</xref>]. Overall, polyQ tracts are abundant in the eukaryotic genome [<xref ref-type="bibr" rid="B24-genes-03-00461">24</xref>,<xref ref-type="bibr" rid="B62-genes-03-00461">62</xref>] and repeats encoding these residues are largely variable. </p>
        <p>Here we showcase several studies regarding the possible functional role of polyQ repeats, ranging from its involvement in morphological variation to protein regulation. </p>
        <sec>
          <title>5.4.1. Polyglutamine Repeats Regulate Circadian Clocks</title>
          <p>Organisms possess circadian clocks, or innate biological timing mechanisms that synchronize various cellular processes with the external environment. The typical circadian cycle, also referred to as a period, occurs over a span of 24 h. The circadian rhythm, cued by environmental factors (e.g., light intensity and temperature), is crucial to an organism’s survival and well-being; maximizing ideal circadian patterns may confer a fitness advantage.</p>
          <p>In the species <italic>Neurospora crassa</italic>, intragenic CAG repeats control expression of the transcription factor white collar-1 (WC-1), responsible for expression of an important regulator of the circadian rhythm [<xref ref-type="bibr" rid="B63-genes-03-00461">63</xref>]. The variability of the repeats amongst various strains of <italic>N. crassa</italic> leads to differences in circadian periods; TR frequencies exhibit a latitudinal cline: longer repeats (corresponding to shorter circadian rhythms) are commonly seen amongst strains collected from lower latitudes [<xref ref-type="bibr" rid="B64-genes-03-00461">64</xref>].</p>
          <p>Tandem repeat variation has also been linked to differences in bird migratory patterns [<xref ref-type="bibr" rid="B65-genes-03-00461">65</xref>]. The avian <italic>CLOCK</italic> gene, a regulator of circadian patterns, contains a variable polyQ repeat in its <italic>C</italic>-terminus. Statistical correlation studies of the <italic>CLOCK</italic> gene unveil that TR variation differs between two bird species, the migratory bluethroat and the non-migratory blue tit; TR is more unstable in the blue tits population [<xref ref-type="bibr" rid="B65-genes-03-00461">65</xref>]. These observations suggest that variable TRs in the CLOCK gene may provide a mechanism for tuning internal clock to environmental changes in the nonmigratory blue tits. </p>
          <p>Such studies additionally support the beneficial role of unstable repeats, especially in diversifying gene pools, which may allow for better adaptation of organisms.</p>
        </sec>
        <sec>
          <title>5.4.2. Polyglutamine Repeats and Morphological Variation</title>
          <p>Higher-order eukaryotes unveil another possible role for repeats. In these organisms, TRs are enriched in genes involved in determining body morphology, suggesting that unstable repeats underlie phenotypic diversity [<xref ref-type="bibr" rid="B9-genes-03-00461">9</xref>].</p>
          <p>A fascinating study attributed morphological variation, occurring over short evolutionary timescales, to variations in TRs. The great diversity of morphology among dog breeds cannot merely be adequately explained by point mutations; the paper by Fondon and Garner provides compelling evidence associating variable TRs with changes in dog skull morphology occurring over a span of approximately fifty years [<xref ref-type="bibr" rid="B66-genes-03-00461">66</xref>]. In comparing genomic data alongside morphological data, variable TRs exist amongst different dog breeds, with two genes in particular showing quite striking results—<italic>Alx-4</italic> and <italic>Runx-2</italic>. Though the <italic>Alx-4</italic> gene does not contain a pure polyQ repeat but rather a PQ tract, a 51-nt deletion of this region is found in dogs with an additional rear claw (<italic>i.e.</italic>, bilateral rear polydactyly, a signature feature of the Great Pyrenees breed). The <italic>Runx-2</italic> gene, a transcription factor controlling osteoblast development, contains two polymorphic repeats—a polyQ repeat and a polyalanine repeat. Here, the ratio of glutamine to alanine repeats correlates with the degree of dorsoventral nose bend and midface length in dog breeds [<xref ref-type="bibr" rid="B66-genes-03-00461">66</xref>]. A further study of the <italic>Runx-2</italic> repeat region by Sears <italic>et al</italic>. suggests that the ratio of the two repeats correlates to onset of transcriptional activation [<xref ref-type="bibr" rid="B67-genes-03-00461">67</xref>]. The role of repeats in transcriptional regulation will be highlighted in the following paragraphs.</p>
          <p>The morphological diversification of dog breeds over approximately 50 years, an extremely short time span from an evolutionary standpoint, is the poignant aspect of this study by Fondon and Garner. Even despite strong selection against genetic diversity as a result of domestication and inbreeding, TR involvement in controlling development and body morphology suggests that variable TR may play a critical role in the rapid evolution of dog skeletal morphology.</p>
        </sec>
        <sec>
          <title>5.4.3. Polyglutamine Repeats and Gene Transcription</title>
          <p>A further exploration of the <italic>Runx-2</italic> repeats measured expression of its target and showed that mRNA of the <italic>Runx-2</italic> target increased with longer <italic>Runx-2</italic> repeats [<xref ref-type="bibr" rid="B67-genes-03-00461">67</xref>]. Another study, utilized CAG repeat fusions to the GAL4 transcription factor to determine changes in gene expression [<xref ref-type="bibr" rid="B68-genes-03-00461">68</xref>]. Gerber <italic>et al</italic>. provide evidence that increasing repeat number <italic>in vitro</italic> and <italic>in vivo</italic> leads to increased rates of mRNA transcription [<xref ref-type="bibr" rid="B68-genes-03-00461">68</xref>]. Though these studies themselves do not delve into the mechanistic aspect of increased expression, the direct effect of repeat length on gene expression sheds light on possible mechanistic effects. The following section illustrates the involvement of TR domains as sites for protein-protein interaction. </p>
        </sec>
        <sec>
          <title>5.4.4. Polyglutamine Repeats in Protein Interactions</title>
          <p>The expansion of polyQ tracts in the transcription factor <italic>human TATA-binding protein</italic> (<italic>TBP</italic>) gene has been shown to induce the neurodegenerative disease spinocerebellar ataxia (SCA17) [<xref ref-type="bibr" rid="B61-genes-03-00461">61</xref>]. Previous studies have shed light on the role of repeat expansions in conferring onset of disease, though the exact mechanism by which the polyQ repeat of TBP affects transcription has largely confounded researchers. However, Friedman and colleagues demonstrate the mechanistic role of the polyQ repeats in the transcriptional regulation of TBP [<xref ref-type="bibr" rid="B69-genes-03-00461">69</xref>]. TBP dimerization has previously been shown to prevent unregulated gene expression and modulate its own degradation [<xref ref-type="bibr" rid="B70-genes-03-00461">70</xref>]. Friedman <italic>et al</italic>. demonstrate that polyQ expansion inversely relates to TBP dimerization—the longer the repeat, the lower the frequency of TBP dimers (<xref ref-type="fig" rid="genes-03-00461-f006">Figure 6</xref>). This indicates that the polyQ repeat tract may modulate dimerization of the protein. Additionally, TR mutations affect TBP interaction with other transcriptional factors. For example, TR expansion leads to downregulation of HSPB1 (HSP27), which encodes a small heat shock protein critical for neuronal differentiation and survival in humans. TR expansion also leads to enhanced interactions with transcription factors (e.g., the general transcription factor IIB (TFIIB)), not seen in the wild-type. This study is a clear example of a mechanistic role for variable polyQ repeats on gene function and its associated phenotypes. More studies of this type are needed to fully elucidate the role(s) of variable intragenic repeats on normal gene function and regulation. </p>
          <fig id="genes-03-00461-f006" position="float">
            <label>Figure 6</label>
            <caption>
              <p>Variable glutamine (Q) TRs modulate protein dimerization. (<bold>a</bold>) Normal repeat length (ex<italic>.</italic> 13Q) of protein-interacting regions of the transcription factor TATA-binding protein allow for proper dimerization, initiating normal gene expression. (<bold>b</bold>) Expansion of polyQ repeats (ex. 71Q) inhibits dimerization causing irregular gene expression [<xref ref-type="bibr" rid="B69-genes-03-00461">69</xref>]. </p>
            </caption>
            <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="genes-03-00461-g006.tif"/>
          </fig>
        </sec>
      </sec>
    </sec>
    <sec sec-type="conclusions">
      <title>6. Conclusions</title>
      <p>Large scale genomic studies mainly focus on Single Nucleotide Polymorphisms (SNPs) and Copy Number Variations (CNVs) as the major sources of phenotypic variation. Here, we describe studies demonstrating how tandem repeat variation can influence phenotypes and, can be considered as a third source of genetic variation.</p>
      <p>From the evidence gleaned so far, (with the exception of repeat expansions in human diseases) TR variations do not result in deleterious changes rather in small incremental phenotypic changes (e.g., shorter or longer circadian period, lower or higher gene expression). This quantitative phenotypic variation is commonly believed to depend on complex interactions between multiple genes (<italic>i.e.</italic>, quantitative trait loci). However, the examples described here, show that many TRs can confer quantitative changes in phenotypes through changes in one genetic locus (coding or regulatory region). In some other cases (e.g., phase variation in pathogenic bacteria), variable TRs do function as binary switches (ON/OFF switching of cell-surface proteins in bacteria).</p>
      <p>The formation of TRs in genomes may be a random process (resulting from polymerase slippage) [<xref ref-type="bibr" rid="B71-genes-03-00461">71</xref>], but once formed, some repeats might in fact be selected for. Several lines of evidence hint at such a possibility: Dinucleotide repeats, known to affect gene expression, are preferentially present proximally to transcription start sites in the human genome [<xref ref-type="bibr" rid="B72-genes-03-00461">72</xref>]. Some dinucleotide TRs in introns and UTRs are highly conserved (in unit sequence but not in number) in mammalian genomes [<xref ref-type="bibr" rid="B73-genes-03-00461">73</xref>]. These repeats may, therefore, play a conserved role, particularly since they are predominantly found in genes involved in embryonic and nervous system development [<xref ref-type="bibr" rid="B74-genes-03-00461">74</xref>]. The occurrence of trinucleotide repeats in six plant species was found at a 2-fold higher frequency in coding regions suggesting a selection for certain stretches of amino acids [<xref ref-type="bibr" rid="B75-genes-03-00461">75</xref>]. A high proportion of dinucleotide repeats has also been uncovered in the genome of multiple strains of the HIV-1 virus [<xref ref-type="bibr" rid="B76-genes-03-00461">76</xref>]. These examples suggest that some variable TRs are useful for genomes as they might provide a certain degree of evolutionary flexibility while other parts of the genome remain stable and unchanged [<xref ref-type="bibr" rid="B77-genes-03-00461">77</xref>]. The central question that returns is: Does selection favor repeats because they are variable, or is the variability merely a consequence of selection favoring some other aspect of repeat structure or function? A clear answer to that question is not possible from what we know so far, but some evidence seems to favor the possibility that variability is what makes repeats selectively advantageous. Most amino acid repeats are encoded by runs of the same DNA repeat (pure codons) which are more unstable than repeats of synonymous codons (<italic>i.e.</italic>, different DNA sequence but coding for the same amino acid) [<xref ref-type="bibr" rid="B5-genes-03-00461">5</xref>].</p>
      <p>The appearance of TRs in genes could lead to the acquisition of a novel function and their retention could signal an evolutionary relevant path. This seems to be the case for the huntingtin protein, where the acquisition of a regulatory activity controlling neural adhesion in complex mammalian nervous systems, coincides with the appearance of glutamine repeats. Evolutionary distant heterologues of huntingtin, lacking the repeat region, fail to substitute the activity of the repeat-containing mammalian protein [<xref ref-type="bibr" rid="B78-genes-03-00461">78</xref>].</p>
      <p>Taken together, tandem repeats are an underestimated, understudied source of genetic variation which, by means of their instabilities, may facilitate a rapid evolution of genes and their associated phenotypes.</p>
    </sec>
  </body>
  <back>
    <ack>
      <title>Acknowledgements</title>
      <p>Research in the lab of KJV is supported by Human Frontier Science Program HFSP RGY79/2007, ERC Young Investigator Grant 241426, the EMBO YIP program, VIB, K.U.Leuven, IWT, the FWO-Odysseus program and the AB InBev Baillet-Latour foundation. JC acknowledges a Fulbright fellowship.</p>
    </ack>
    <ref-list>
      <title>References</title>
      <ref id="B1-genes-03-00461">
        <label>1.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Hartl</surname>
              <given-names>D.L.</given-names>
            </name>
          </person-group>
          <article-title>Molecular melodies in high and low C</article-title>
          <source>Nat. Rev. Genet.</source>
          <year>2000</year>
          <volume>1</volume>
          <fpage>145</fpage>
          <lpage>149</lpage>
          <pub-id pub-id-type="doi">10.1038/35038580</pub-id>
        </citation>
      </ref>
      <ref id="B2-genes-03-00461">
        <label>2.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>van Belkum</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Scherer</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>van Alphen</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Verbrugh</surname>
              <given-names>H.</given-names>
            </name>
          </person-group>
          <article-title>Short-sequence DNA repeats in prokaryotic genomes</article-title>
          <source>Microbiol. Mol. Biol. Rev.</source>
          <year>1998</year>
          <volume>62</volume>
          <fpage>275</fpage>
          <lpage>293</lpage>
        <pub-id pub-id-type="pmid">9618442</pub-id></citation>
      </ref>
      <ref id="B3-genes-03-00461">
        <label>3.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Feschotte</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Keswani</surname>
              <given-names>U.</given-names>
            </name>
            <name>
              <surname>Ranganathan</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Guibotsy</surname>
              <given-names>M.L.</given-names>
            </name>
            <name>
              <surname>Levine</surname>
              <given-names>D.</given-names>
            </name>
          </person-group>
          <article-title>Exploring repetitive DNA landscapes using REPCLASS, a tool that automates the classification of transposable elements in eukaryotic genomes</article-title>
          <source>Genome Biol. Evol.</source>
          <year>2009</year>
          <volume>1</volume>
          <fpage>205</fpage>
          <lpage>220</lpage>
        <pub-id pub-id-type="pmid">20333191</pub-id></citation>
      </ref>
      <ref id="B4-genes-03-00461">
        <label>4.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kit</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>Equilibrium sedimentation in density gradients of DNA preparations from animal tissues</article-title>
          <source>J. Mol. Biol.</source>
          <year>1961</year>
          <volume>3</volume>
          <fpage>711</fpage>
          <lpage>716</lpage>
          <pub-id pub-id-type="doi">10.1016/S0022-2836(61)80075-2</pub-id>
        </citation>
      </ref>
      <ref id="B5-genes-03-00461">
        <label>5.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Verstrepen</surname>
              <given-names>K.J.</given-names>
            </name>
            <name>
              <surname>Jansen</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Lewitter</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Fink</surname>
              <given-names>G.R.</given-names>
            </name>
          </person-group>
          <article-title>Intragenic tandem repeats generate functional variability</article-title>
          <source>Nat. Genet.</source>
          <year>2005</year>
          <volume>37</volume>
          <fpage>986</fpage>
          <lpage>990</lpage>
        <pub-id pub-id-type="doi">10.1038/ng1618</pub-id><pub-id pub-id-type="pmid">16086015</pub-id></citation>
      </ref>
      <ref id="B6-genes-03-00461">
        <label>6.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Paques</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Leung</surname>
              <given-names>W.Y.</given-names>
            </name>
            <name>
              <surname>Haber</surname>
              <given-names>J.E.</given-names>
            </name>
          </person-group>
          <article-title>Expansions and contractions in a tandem repeat induced by double-strand break repair</article-title>
          <source>Mol. Cell. Biol.</source>
          <year>1998</year>
          <volume>18</volume>
          <fpage>2045</fpage>
          <lpage>2054</lpage>
        <pub-id pub-id-type="pmid">9528777</pub-id></citation>
      </ref>
      <ref id="B7-genes-03-00461">
        <label>7.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kokoska</surname>
              <given-names>R.J.</given-names>
            </name>
            <name>
              <surname>Stefanovic</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Tran</surname>
              <given-names>H.T.</given-names>
            </name>
            <name>
              <surname>Resnick</surname>
              <given-names>M.A.</given-names>
            </name>
            <name>
              <surname>Gordenin</surname>
              <given-names>D.A.</given-names>
            </name>
            <name>
              <surname>Petes</surname>
              <given-names>T.D.</given-names>
            </name>
          </person-group>
          <article-title>Destabilization of yeast micro- and minisatellite DNA sequences by mutations affecting a nuclease involved in Okazaki fragment processing (rad27) and DNA polymerase delta (pol3-t)</article-title>
          <source>Mol. Cell. Biol.</source>
          <year>1998</year>
          <volume>18</volume>
          <fpage>2779</fpage>
          <lpage>2788</lpage>
        <pub-id pub-id-type="pmid">9566897</pub-id></citation>
      </ref>
      <ref id="B8-genes-03-00461">
        <label>8.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Richard</surname>
              <given-names>G.F.</given-names>
            </name>
            <name>
              <surname>Paques</surname>
              <given-names>F.</given-names>
            </name>
          </person-group>
          <article-title>Mini- and microsatellite expansions: The recombination connection</article-title>
          <source>EMBO Rep.</source>
          <year>2000</year>
          <volume>1</volume>
          <fpage>122</fpage>
          <lpage>126</lpage>
          <pub-id pub-id-type="doi">10.1093/embo-reports/kvd031</pub-id>
        </citation>
      </ref>
      <ref id="B9-genes-03-00461">
        <label>9.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Legendre</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Pochet</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Pak</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Verstrepen</surname>
              <given-names>K.J.</given-names>
            </name>
          </person-group>
          <article-title>Sequence-based estimation of minisatellite and microsatellite repeat variability</article-title>
          <source>Genome Res.</source>
          <year>2007</year>
          <volume>17</volume>
          <fpage>1787</fpage>
          <lpage>1796</lpage>
        <pub-id pub-id-type="doi">10.1101/gr.6554007</pub-id><pub-id pub-id-type="pmid">17978285</pub-id></citation>
      </ref>
      <ref id="B10-genes-03-00461">
        <label>10.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Petes</surname>
              <given-names>T.D.</given-names>
            </name>
            <name>
              <surname>Greenwell</surname>
              <given-names>P.W.</given-names>
            </name>
            <name>
              <surname>Dominska</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Stabilization of microsatellite sequences by variant repeats in the yeast Saccharomyces cerevisiae</article-title>
          <source>Genetics</source>
          <year>1997</year>
          <volume>146</volume>
          <fpage>491</fpage>
          <lpage>498</lpage>
        <pub-id pub-id-type="pmid">9178000</pub-id></citation>
      </ref>
      <ref id="B11-genes-03-00461">
        <label>11.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gragg</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Harfe</surname>
              <given-names>B.D.</given-names>
            </name>
            <name>
              <surname>Jinks-Robertson</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>Base composition of mononucleotide runs affects DNA polymerase slippage and removal of frameshift intermediates by mismatch repair in Saccharomyces cerevisiae</article-title>
          <source>Mol. Cell. Biol.</source>
          <year>2002</year>
          <volume>22</volume>
          <fpage>8756</fpage>
          <lpage>8762</lpage>
        <pub-id pub-id-type="doi">10.1128/MCB.22.24.8756-8762.2002</pub-id><pub-id pub-id-type="pmid">12446792</pub-id></citation>
      </ref>
      <ref id="B12-genes-03-00461">
        <label>12.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wierdl</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Greene</surname>
              <given-names>C.N.</given-names>
            </name>
            <name>
              <surname>Datta</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Jinks-Robertson</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Petes</surname>
              <given-names>T.D.</given-names>
            </name>
          </person-group>
          <article-title>Destabilization of simple repetitive DNA sequences by transcription in yeast</article-title>
          <source>Genetics</source>
          <year>1996</year>
          <volume>143</volume>
          <fpage>713</fpage>
          <lpage>721</lpage>
        <pub-id pub-id-type="pmid">8725221</pub-id></citation>
      </ref>
      <ref id="B13-genes-03-00461">
        <label>13.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Schmidt</surname>
              <given-names>A.L.</given-names>
            </name>
            <name>
              <surname>Mitter</surname>
              <given-names>V.</given-names>
            </name>
          </person-group>
          <article-title>Microsatellite mutation directed by an external stimulus</article-title>
          <source>Mutat. Res. Fundam. Mol. Mech. Mutagen.</source>
          <year>2004</year>
          <volume>568</volume>
          <fpage>233</fpage>
          <lpage>243</lpage>
          <pub-id pub-id-type="doi">10.1016/j.mrfmmm.2004.09.003</pub-id>
        </citation>
      </ref>
      <ref id="B14-genes-03-00461">
        <label>14.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Rosenberg</surname>
              <given-names>S.M.</given-names>
            </name>
          </person-group>
          <article-title>Evolving responsively: Adaptive mutation</article-title>
          <source>Nat. Rev. Genet.</source>
          <year>2001</year>
          <volume>2</volume>
          <fpage>504</fpage>
          <lpage>515</lpage>
          <pub-id pub-id-type="doi">10.1038/35080556</pub-id>
        </citation>
      </ref>
      <ref id="B15-genes-03-00461">
        <label>15.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Mittelman</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Sykoudis</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Hersh</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Lin</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Wilson</surname>
              <given-names>J.H.</given-names>
            </name>
          </person-group>
          <article-title>Hsp90 modulates CAG repeat instability in human cells</article-title>
          <source>Cell Stress Chaperones</source>
          <year>2010</year>
          <volume>15</volume>
          <fpage>753</fpage>
          <lpage>759</lpage>
          <pub-id pub-id-type="doi">10.1007/s12192-010-0191-0</pub-id>
        </citation>
      </ref>
      <ref id="B16-genes-03-00461">
        <label>16.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Merkel</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Gemmell</surname>
              <given-names>N.</given-names>
            </name>
          </person-group>
          <article-title>Detecting short tandem repeats from genome data: Opening the software black box</article-title>
          <source>Brief Bioinform.</source>
          <year>2008</year>
          <volume>9</volume>
          <fpage>355</fpage>
          <lpage>366</lpage>
          <pub-id pub-id-type="doi">10.1093/bib/bbn028</pub-id>
        </citation>
      </ref>
      <ref id="B17-genes-03-00461">
        <label>17.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gemayel</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Vinces</surname>
              <given-names>M.D.</given-names>
            </name>
            <name>
              <surname>Legendre</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Verstrepen</surname>
              <given-names>K.J.</given-names>
            </name>
          </person-group>
          <article-title>Variable Tandem Repeats Accelerate Evolution of Coding and Regulatory Sequences</article-title>
          <source>Annu. Rev. Genet.</source>
          <year>2010</year>
          <volume>44</volume>
          <fpage>445</fpage>
          <lpage>477</lpage>
          <pub-id pub-id-type="doi">10.1146/annurev-genet-072610-155046</pub-id>
        </citation>
      </ref>
      <ref id="B18-genes-03-00461">
        <label>18.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Jansen</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Gemayel</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Verstrepen</surname>
              <given-names>K.J.</given-names>
            </name>
          </person-group>
          <article-title>Unstable microsatellite repeats facilitate rapid evolution of coding and regulatory sequences</article-title>
          <source>Genome Dyn.</source>
          <year>2012</year>
          <volume>7</volume>
          <fpage>108</fpage>
          <lpage>125</lpage>
          <pub-id pub-id-type="doi">10.1159/000337121</pub-id>
        </citation>
      </ref>
      <ref id="B19-genes-03-00461">
        <label>19.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Young</surname>
              <given-names>E.T.</given-names>
            </name>
            <name>
              <surname>Sloan</surname>
              <given-names>J.S.</given-names>
            </name>
            <name>
              <surname>van Riper</surname>
              <given-names>K.</given-names>
            </name>
          </person-group>
          <article-title>Trinucleotide repeats are clustered in regulatory genes in Saccharomyces cerevisiae</article-title>
          <source>Genetics</source>
          <year>2000</year>
          <volume>154</volume>
          <fpage>1053</fpage>
          <lpage>1068</lpage>
        <pub-id pub-id-type="pmid">10757753</pub-id></citation>
      </ref>
      <ref id="B20-genes-03-00461">
        <label>20.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Metzgar</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Bytof</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Wills</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Selection against frameshift mutations limits microsatellite expansion in coding DNA</article-title>
          <source>Genome Res.</source>
          <year>2000</year>
          <volume>10</volume>
          <fpage>72</fpage>
          <lpage>80</lpage>
        <pub-id pub-id-type="pmid">10645952</pub-id></citation>
      </ref>
      <ref id="B21-genes-03-00461">
        <label>21.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>O'Dushlaine</surname>
              <given-names>C.T.</given-names>
            </name>
            <name>
              <surname>Edwards</surname>
              <given-names>R.J.</given-names>
            </name>
            <name>
              <surname>Park</surname>
              <given-names>S.D.</given-names>
            </name>
            <name>
              <surname>Shields</surname>
              <given-names>D.C.</given-names>
            </name>
          </person-group>
          <article-title>Tandem repeat copy-number variation in protein-coding regions of human genes</article-title>
          <source>Genome Biol.</source>
          <year>2005</year>
          <volume>6</volume>
          <fpage>12</fpage>
        </citation>
      </ref>
      <ref id="B22-genes-03-00461">
        <label>22.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Mularoni</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Ledda</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Toll-Riera</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Alba</surname>
              <given-names>M.M.</given-names>
            </name>
          </person-group>
          <article-title>Natural selection drives the accumulation of amino acid tandem repeats in human proteins</article-title>
          <source>Genome Res.</source>
          <year>2010</year>
          <volume>20</volume>
          <fpage>745</fpage>
          <lpage>754</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.101261.109</pub-id>
        </citation>
      </ref>
      <ref id="B23-genes-03-00461">
        <label>23.</label>
        <citation citation-type="other">
          <person-group person-group-type="author">
            <name>
              <surname>Legendre</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Verstrepen</surname>
              <given-names>K.J.</given-names>
            </name>
          </person-group>
          <publisher-name>Harvard University</publisher-name>
          <publisher-loc>Cambridge, MA, USA</publisher-loc>
          <year>2007</year>
          <comment>Unpublished work</comment>
        </citation>
      </ref>
      <ref id="B24-genes-03-00461">
        <label>24.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Faux</surname>
              <given-names>N.G.</given-names>
            </name>
            <name>
              <surname>Bottomley</surname>
              <given-names>S.P.</given-names>
            </name>
            <name>
              <surname>Lesk</surname>
              <given-names>A.M.</given-names>
            </name>
            <name>
              <surname>Irving</surname>
              <given-names>J.A.</given-names>
            </name>
            <name>
              <surname>Morrison</surname>
              <given-names>J.R.</given-names>
            </name>
            <name>
              <surname>de la Banda</surname>
              <given-names>M.C.</given-names>
            </name>
            <name>
              <surname>Whisstock</surname>
              <given-names>J.C.</given-names>
            </name>
          </person-group>
          <article-title>Functional insights from the distribution and role of homopeptide repeat-containing proteins</article-title>
          <source>Genome Res.</source>
          <year>2005</year>
          <volume>15</volume>
          <fpage>537</fpage>
          <lpage>551</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.3096505</pub-id>
        </citation>
      </ref>
      <ref id="B25-genes-03-00461">
        <label>25.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Doolittle</surname>
              <given-names>W.F.</given-names>
            </name>
            <name>
              <surname>Sapienza</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Selfish genes, the phenotype paradigm and genome evolution</article-title>
          <source>Nature</source>
          <year>1980</year>
          <volume>284</volume>
          <fpage>601</fpage>
          <lpage>603</lpage>
          <pub-id pub-id-type="doi">10.1038/284601a0</pub-id>
        </citation>
      </ref>
      <ref id="B26-genes-03-00461">
        <label>26.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ohno</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>So much "junk" DNA in our genome</article-title>
          <source>Brookhaven Symp. Biol.</source>
          <year>1972</year>
          <volume>23</volume>
          <fpage>366</fpage>
          <lpage>370</lpage>
        <pub-id pub-id-type="pmid">5065367</pub-id></citation>
      </ref>
      <ref id="B27-genes-03-00461">
        <label>27.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Orgel</surname>
              <given-names>L.E.</given-names>
            </name>
            <name>
              <surname>Crick</surname>
              <given-names>F.H.</given-names>
            </name>
          </person-group>
          <article-title>Selfish DNA: the ultimate parasite</article-title>
          <source>Nature</source>
          <year>1980</year>
          <volume>284</volume>
          <fpage>604</fpage>
          <lpage>607</lpage>
          <pub-id pub-id-type="doi">10.1038/284604a0</pub-id>
        </citation>
      </ref>
      <ref id="B28-genes-03-00461">
        <label>28.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>King</surname>
              <given-names>M.C.</given-names>
            </name>
            <name>
              <surname>Wilson</surname>
              <given-names>A.C.</given-names>
            </name>
          </person-group>
          <article-title>Evolution at two levels in humans and chimpanzees</article-title>
          <source>Science</source>
          <year>1975</year>
          <volume>188</volume>
          <fpage>107</fpage>
          <lpage>116</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1090005</pub-id><pub-id pub-id-type="pmid">1090005</pub-id></citation>
      </ref>
      <ref id="B29-genes-03-00461">
        <label>29.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wray</surname>
              <given-names>G.A.</given-names>
            </name>
          </person-group>
          <article-title>The evolutionary significance of cis-regulatory mutations</article-title>
          <source>Nat. Rev. Genet.</source>
          <year>2007</year>
          <volume>8</volume>
          <fpage>206</fpage>
          <lpage>216</lpage>
          <pub-id pub-id-type="doi">10.1038/nrg2063</pub-id>
        </citation>
      </ref>
      <ref id="B30-genes-03-00461">
        <label>30.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Rockman</surname>
              <given-names>M.V.</given-names>
            </name>
            <name>
              <surname>Wray</surname>
              <given-names>G.A.</given-names>
            </name>
          </person-group>
          <article-title>Abundant raw material for cis-regulatory evolution in humans</article-title>
          <source>Mol. Biol. Evol.</source>
          <year>2002</year>
          <volume>19</volume>
          <fpage>1991</fpage>
          <lpage>2004</lpage>
          <pub-id pub-id-type="doi">10.1093/oxfordjournals.molbev.a004023</pub-id>
        </citation>
      </ref>
      <ref id="B31-genes-03-00461">
        <label>31.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Vinces</surname>
              <given-names>M.D.</given-names>
            </name>
            <name>
              <surname>Legendre</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Caldara</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Hagihara</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Verstrepen</surname>
              <given-names>K.J.</given-names>
            </name>
          </person-group>
          <article-title>Unstable tandem repeats in promoters confer transcriptional evolvability</article-title>
          <source>Science</source>
          <year>2009</year>
          <volume>324</volume>
          <fpage>1213</fpage>
          <lpage>1216</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1170097</pub-id><pub-id pub-id-type="pmid">19478187</pub-id></citation>
      </ref>
      <ref id="B32-genes-03-00461">
        <label>32.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Martin</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Makepeace</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Hill</surname>
              <given-names>S.A.</given-names>
            </name>
            <name>
              <surname>Hood</surname>
              <given-names>D.W.</given-names>
            </name>
            <name>
              <surname>Moxon</surname>
              <given-names>E.R.</given-names>
            </name>
          </person-group>
          <article-title>Microsatellite instability regulates transcription factor binding and gene expression</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2005</year>
          <volume>102</volume>
          <fpage>3800</fpage>
          <lpage>3804</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.0406805102</pub-id><pub-id pub-id-type="pmid">15728391</pub-id></citation>
      </ref>
      <ref id="B33-genes-03-00461">
        <label>33.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Johnson</surname>
              <given-names>A.C.</given-names>
            </name>
            <name>
              <surname>Jinno</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Merlino</surname>
              <given-names>G.T.</given-names>
            </name>
          </person-group>
          <article-title>Modulation of epidermal growth factor receptor proto-oncogene transcription by a promoter site sensitive to S1 nuclease</article-title>
          <source>Mol. Cell. Biol.</source>
          <year>1988</year>
          <volume>8</volume>
          <fpage>4174</fpage>
          <lpage>4184</lpage>
        <pub-id pub-id-type="pmid">2847030</pub-id></citation>
      </ref>
      <ref id="B34-genes-03-00461">
        <label>34.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Whetstine</surname>
              <given-names>J.R.</given-names>
            </name>
            <name>
              <surname>Witt</surname>
              <given-names>T.L.</given-names>
            </name>
            <name>
              <surname>Matherly</surname>
              <given-names>L.H.</given-names>
            </name>
          </person-group>
          <article-title>The human reduced folate carrier gene is regulated by the AP2 and Sp1 transcription factor families and a functional 61-base pair polymorphism</article-title>
          <source>J. Biol. Chem.</source>
          <year>2002</year>
          <volume>277</volume>
          <fpage>43873</fpage>
          <lpage>43880</lpage>
        <pub-id pub-id-type="doi">10.1074/jbc.M208296200</pub-id><pub-id pub-id-type="pmid">12228234</pub-id></citation>
      </ref>
      <ref id="B35-genes-03-00461">
        <label>35.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Iglesias</surname>
              <given-names>A.R.</given-names>
            </name>
            <name>
              <surname>Kindlund</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Tammi</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Wadelius</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Some microsatellites may act as novel polymorphic cis-regulatory elements through transcription factor binding</article-title>
          <source>Gene</source>
          <year>2004</year>
          <volume>341</volume>
          <fpage>149</fpage>
          <lpage>165</lpage>
          <pub-id pub-id-type="doi">10.1016/j.gene.2004.06.035</pub-id>
        </citation>
      </ref>
      <ref id="B36-genes-03-00461">
        <label>36.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Metruccio</surname>
              <given-names>M.M.E.</given-names>
            </name>
            <name>
              <surname>Pigozzi</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Roncarati</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Scorza</surname>
              <given-names>F.B.</given-names>
            </name>
            <name>
              <surname>Norais</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Hill</surname>
              <given-names>S.A.</given-names>
            </name>
            <name>
              <surname>Scarlato</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Delany</surname>
              <given-names>I.</given-names>
            </name>
          </person-group>
          <article-title>A Novel Phase Variation Mechanism in the Meningococcus Driven by a Ligand-Responsive Repressor and Differential Spacing of Distal Promoter Elements</article-title>
          <source>PLoS Pathog.</source>
          <year>2009</year>
          <volume>5</volume>
          <fpage>e1000710</fpage>
        <pub-id pub-id-type="doi">10.1371/journal.ppat.1000710</pub-id><pub-id pub-id-type="pmid">20041170</pub-id></citation>
      </ref>
      <ref id="B37-genes-03-00461">
        <label>37.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>van der Ende</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Hopman</surname>
              <given-names>C.T.</given-names>
            </name>
            <name>
              <surname>Zaat</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Essink</surname>
              <given-names>B.B.</given-names>
            </name>
            <name>
              <surname>Berkhout</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Dankert</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>Variable expression of class 1 outer membrane protein in Neisseria meningitidis is caused by variation in the spacing between the -10 and -35 regions of the promoter</article-title>
          <source>J. Bacteriol.</source>
          <year>1995</year>
          <volume>177</volume>
          <fpage>2475</fpage>
          <lpage>2480</lpage>
        <pub-id pub-id-type="pmid">7730280</pub-id></citation>
      </ref>
      <ref id="B38-genes-03-00461">
        <label>38.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>van Ham</surname>
              <given-names>S.M.</given-names>
            </name>
            <name>
              <surname>van Alphen</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Mooi</surname>
              <given-names>F.R.</given-names>
            </name>
            <name>
              <surname>van Putten</surname>
              <given-names>J.P.</given-names>
            </name>
          </person-group>
          <article-title>Phase variation of H. influenzae fimbriae: Transcriptional control of two divergent genes through a variable combined promoter region</article-title>
          <source>Cell</source>
          <year>1993</year>
          <volume>73</volume>
          <fpage>1187</fpage>
          <lpage>1196</lpage>
          <pub-id pub-id-type="doi">10.1016/0092-8674(93)90647-9</pub-id>
        </citation>
      </ref>
      <ref id="B39-genes-03-00461">
        <label>39.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Willems</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Paul</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>van der Heide</surname>
              <given-names>H.G.</given-names>
            </name>
            <name>
              <surname>ter Avest</surname>
              <given-names>A.R.</given-names>
            </name>
            <name>
              <surname>Mooi</surname>
              <given-names>F.R.</given-names>
            </name>
          </person-group>
          <article-title>Fimbrial phase variation in Bordetella pertussis: A novel mechanism for transcriptional regulation</article-title>
          <source>EMBO J.</source>
          <year>1990</year>
          <volume>9</volume>
          <fpage>2803</fpage>
          <lpage>2809</lpage>
        <pub-id pub-id-type="pmid">1975238</pub-id></citation>
      </ref>
      <ref id="B40-genes-03-00461">
        <label>40.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Yogev</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Rosengarten</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Watson-McKown</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Wise</surname>
              <given-names>K.S.</given-names>
            </name>
          </person-group>
          <article-title>Molecular basis of Mycoplasma surface antigenic variation: A novel set of divergent genes undergo spontaneous mutation of periodic coding regions and 5' regulatory sequences</article-title>
          <source>EMBO J.</source>
          <year>1991</year>
          <volume>10</volume>
          <fpage>4069</fpage>
          <lpage>4079</lpage>
        <pub-id pub-id-type="pmid">1721868</pub-id></citation>
      </ref>
      <ref id="B41-genes-03-00461">
        <label>41.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Saunders</surname>
              <given-names>N.J.</given-names>
            </name>
            <name>
              <surname>Jeffries</surname>
              <given-names>A.C.</given-names>
            </name>
            <name>
              <surname>Peden</surname>
              <given-names>P.F.</given-names>
            </name>
            <name>
              <surname>Hood</surname>
              <given-names>D.W.</given-names>
            </name>
            <name>
              <surname>Tettelin</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Rappuoli</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Moxon</surname>
              <given-names>E.R.</given-names>
            </name>
          </person-group>
          <article-title>Repeat-associated phase variable genes in the complete genome sequence of Neisseria meningitidis strain MC58</article-title>
          <source>Mol. Microbiol.</source>
          <year>2000</year>
          <volume>37</volume>
          <fpage>207</fpage>
          <lpage>215</lpage>
          <pub-id pub-id-type="doi">10.1046/j.1365-2958.2000.02000.x</pub-id>
        </citation>
      </ref>
      <ref id="B42-genes-03-00461">
        <label>42.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Iyer</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Struhl</surname>
              <given-names>K.</given-names>
            </name>
          </person-group>
          <article-title>Poly(dA:dT), a ubiquitous promoter element that stimulates transcription via its intrinsic DNA structure</article-title>
          <source>EMBO J.</source>
          <year>1995</year>
          <volume>14</volume>
          <fpage>2570</fpage>
          <lpage>2579</lpage>
        <pub-id pub-id-type="pmid">7781610</pub-id></citation>
      </ref>
      <ref id="B43-genes-03-00461">
        <label>43.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kaplan</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Moore</surname>
              <given-names>I.K.</given-names>
            </name>
            <name>
              <surname>Fondufe-Mittendorf</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Gossett</surname>
              <given-names>A.J.</given-names>
            </name>
            <name>
              <surname>Tillo</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Field</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>LeProust</surname>
              <given-names>E.M.</given-names>
            </name>
            <name>
              <surname>Hughes</surname>
              <given-names>T.R.</given-names>
            </name>
            <name>
              <surname>Lieb</surname>
              <given-names>J.D.</given-names>
            </name>
            <name>
              <surname>Widom</surname>
              <given-names>J.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The DNA-encoded nucleosome organization of a eukaryotic genome</article-title>
          <source>Nature</source>
          <year>2009</year>
          <volume>458</volume>
          <fpage>362</fpage>
          <lpage>366</lpage>
        <pub-id pub-id-type="doi">10.1038/nature07667</pub-id><pub-id pub-id-type="pmid">19092803</pub-id></citation>
      </ref>
      <ref id="B44-genes-03-00461">
        <label>44.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Yuan</surname>
              <given-names>G.C.</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>Y.J.</given-names>
            </name>
            <name>
              <surname>Dion</surname>
              <given-names>M.F.</given-names>
            </name>
            <name>
              <surname>Slack</surname>
              <given-names>M.D.</given-names>
            </name>
            <name>
              <surname>Wu</surname>
              <given-names>L.F.</given-names>
            </name>
            <name>
              <surname>Altschuler</surname>
              <given-names>S.J.</given-names>
            </name>
            <name>
              <surname>Rando</surname>
              <given-names>O.J.</given-names>
            </name>
          </person-group>
          <article-title>Genome-scale identification of nucleosome positions in <italic>S. cerevisiae</italic></article-title>
          <source>Science</source>
          <year>2005</year>
          <volume>309</volume>
          <fpage>626</fpage>
          <lpage>630</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1112178</pub-id><pub-id pub-id-type="pmid">15961632</pub-id></citation>
      </ref>
      <ref id="B45-genes-03-00461">
        <label>45.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Naylor</surname>
              <given-names>L.H.</given-names>
            </name>
            <name>
              <surname>Clark</surname>
              <given-names>E.M.</given-names>
            </name>
          </person-group>
          <article-title>d(CA)n sequences upstream of the rat prolactin gene form Z-DNA and inhibit gene transcription</article-title>
          <source>Nucleic Acids Res.</source>
          <year>1990</year>
          <volume>18</volume>
          <fpage>1595</fpage>
          <lpage>1601</lpage>
          <pub-id pub-id-type="doi">10.1093/nar/18.6.1595</pub-id>
        </citation>
      </ref>
      <ref id="B46-genes-03-00461">
        <label>46.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Oh</surname>
              <given-names>D.B.</given-names>
            </name>
            <name>
              <surname>Kim</surname>
              <given-names>Y.G.</given-names>
            </name>
            <name>
              <surname>Rich</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Z-DNA-binding proteins can act as potent effectors of gene expression <italic>in vivo</italic></article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2002</year>
          <volume>99</volume>
          <fpage>16666</fpage>
          <lpage>16671</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.262672699</pub-id><pub-id pub-id-type="pmid">12486233</pub-id></citation>
      </ref>
      <ref id="B47-genes-03-00461">
        <label>47.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gatchel</surname>
              <given-names>J.R.</given-names>
            </name>
            <name>
              <surname>Zoghbi</surname>
              <given-names>H.Y.</given-names>
            </name>
          </person-group>
          <article-title>Diseases of unstable repeat expansion: Mechanisms and common principles</article-title>
          <source>Nat. Rev. Genet.</source>
          <year>2005</year>
          <volume>6</volume>
          <fpage>743</fpage>
          <lpage>755</lpage>
          <pub-id pub-id-type="doi">10.1038/nrg1691</pub-id>
        </citation>
      </ref>
      <ref id="B48-genes-03-00461">
        <label>48.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Shang</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Cui</surname>
              <given-names>Q.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Beseler</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Greenberg</surname>
              <given-names>D.A.</given-names>
            </name>
            <name>
              <surname>Wolgemuth</surname>
              <given-names>D.J.</given-names>
            </name>
          </person-group>
          <article-title>The Bromodomain-Containing Gene BRD2 Is Regulated at Transcription, Splicing, and Translation Levels</article-title>
          <source>J. Cell. Biochem.</source>
          <year>2011</year>
          <volume>112</volume>
          <fpage>2784</fpage>
          <lpage>2793</lpage>
        <pub-id pub-id-type="doi">10.1002/jcb.23192</pub-id><pub-id pub-id-type="pmid">21608014</pub-id></citation>
      </ref>
      <ref id="B49-genes-03-00461">
        <label>49.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Pagani</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Buratti</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Stuani</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Romano</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Zuccato</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Niksic</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Giglio</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Faraguna</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Baralle</surname>
              <given-names>F.E.</given-names>
            </name>
          </person-group>
          <article-title>Splicing factors induce cystic fibrosis transmembrane regulator exon 9 skipping through a nonevolutionary conserved intronic element</article-title>
          <source>J. Biol. Chem.</source>
          <year>2000</year>
          <volume>275</volume>
          <fpage>21041</fpage>
          <lpage>21047</lpage>
        <pub-id pub-id-type="doi">10.1074/jbc.M910165199</pub-id><pub-id pub-id-type="pmid">10766763</pub-id></citation>
      </ref>
      <ref id="B50-genes-03-00461">
        <label>50.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Rockman</surname>
              <given-names>M.V.</given-names>
            </name>
            <name>
              <surname>Hahn</surname>
              <given-names>M.W.</given-names>
            </name>
            <name>
              <surname>Soranzo</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Loisel</surname>
              <given-names>D.A.</given-names>
            </name>
            <name>
              <surname>Goldstein</surname>
              <given-names>D.B.</given-names>
            </name>
            <name>
              <surname>Wray</surname>
              <given-names>G.A.</given-names>
            </name>
          </person-group>
          <article-title>Positive selection on MMP3 regulation has shaped heart disease risk</article-title>
          <source>Curr. Biol.</source>
          <year>2004</year>
          <volume>14</volume>
          <fpage>1531</fpage>
          <lpage>1539</lpage>
        <pub-id pub-id-type="doi">10.1016/j.cub.2004.08.051</pub-id><pub-id pub-id-type="pmid">15341739</pub-id></citation>
      </ref>
      <ref id="B51-genes-03-00461">
        <label>51.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Fidalgo</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Barrales</surname>
              <given-names>R.R.</given-names>
            </name>
            <name>
              <surname>Ibeas</surname>
              <given-names>J.I.</given-names>
            </name>
            <name>
              <surname>Jimenez</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>Adaptive evolution by mutations in the FLO11 gene</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2006</year>
          <volume>103</volume>
          <fpage>11228</fpage>
          <lpage>11233</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.0601713103</pub-id><pub-id pub-id-type="pmid">16844788</pub-id></citation>
      </ref>
      <ref id="B52-genes-03-00461">
        <label>52.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Hoyer</surname>
              <given-names>L.L.</given-names>
            </name>
          </person-group>
          <article-title>The ALS gene family of Candida albicans</article-title>
          <source>Trends Microbiol.</source>
          <year>2001</year>
          <volume>9</volume>
          <fpage>176</fpage>
          <lpage>180</lpage>
          <pub-id pub-id-type="doi">10.1016/S0966-842X(01)01984-9</pub-id>
        </citation>
      </ref>
      <ref id="B53-genes-03-00461">
        <label>53.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Verstrepen</surname>
              <given-names>K.J.</given-names>
            </name>
            <name>
              <surname>Reynolds</surname>
              <given-names>T.B.</given-names>
            </name>
            <name>
              <surname>Fink</surname>
              <given-names>G.R.</given-names>
            </name>
          </person-group>
          <article-title>Origins of variation in the fungal cell surface</article-title>
          <source>Nat. Rev. Microbiol.</source>
          <year>2004</year>
          <volume>2</volume>
          <fpage>533</fpage>
          <lpage>540</lpage>
          <pub-id pub-id-type="doi">10.1038/nrmicro927</pub-id>
        </citation>
      </ref>
      <ref id="B54-genes-03-00461">
        <label>54.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kita</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Katsui</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Emoto</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Sawaki</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Oku</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Nishikawa</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Hamuro</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Kashiba</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>Virulence of transparent and opaque colony types of Neisseria gonorrhoeae for the genital tract of mice</article-title>
          <source>J. Med. Microbiol.</source>
          <year>1991</year>
          <volume>34</volume>
          <fpage>355</fpage>
          <lpage>362</lpage>
          <pub-id pub-id-type="doi">10.1099/00222615-34-6-355</pub-id>
        </citation>
      </ref>
      <ref id="B55-genes-03-00461">
        <label>55.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Stern</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Brown</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Nickel</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Meyer</surname>
              <given-names>T.F.</given-names>
            </name>
          </person-group>
          <article-title>Opacity genes in Neisseria gonorrhoeae: Control of phase and antigenic variation</article-title>
          <source>Cell</source>
          <year>1986</year>
          <volume>47</volume>
          <fpage>61</fpage>
          <lpage>71</lpage>
          <pub-id pub-id-type="doi">10.1016/0092-8674(86)90366-1</pub-id>
        </citation>
      </ref>
      <ref id="B56-genes-03-00461">
        <label>56.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Weiser</surname>
              <given-names>J.N.</given-names>
            </name>
            <name>
              <surname>Love</surname>
              <given-names>J.M.</given-names>
            </name>
            <name>
              <surname>Moxon</surname>
              <given-names>E.R.</given-names>
            </name>
          </person-group>
          <article-title>The molecular mechanism of phase variation of H. influenzae lipopolysaccharide</article-title>
          <source>Cell</source>
          <year>1989</year>
          <volume>59</volume>
          <fpage>657</fpage>
          <lpage>665</lpage>
          <pub-id pub-id-type="doi">10.1016/0092-8674(89)90011-1</pub-id>
        </citation>
      </ref>
      <ref id="B57-genes-03-00461">
        <label>57.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>de Bolle</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Bayliss</surname>
              <given-names>C.D.</given-names>
            </name>
            <name>
              <surname>Field</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>van de Ven</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Saunders</surname>
              <given-names>N.J.</given-names>
            </name>
            <name>
              <surname>Hood</surname>
              <given-names>D.W.</given-names>
            </name>
            <name>
              <surname>Moxon</surname>
              <given-names>E.R.</given-names>
            </name>
          </person-group>
          <article-title>The length of a tetranucleotide repeat tract in Haemophilus influenzae determines the phase variation rate of a gene with homology to type III DNA methyltransferases</article-title>
          <source>Mol. Microbiol.</source>
          <year>2000</year>
          <volume>35</volume>
          <fpage>211</fpage>
          <lpage>222</lpage>
          <pub-id pub-id-type="doi">10.1046/j.1365-2958.2000.01701.x</pub-id>
        </citation>
      </ref>
      <ref id="B58-genes-03-00461">
        <label>58.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Srikhanta</surname>
              <given-names>Y.N.</given-names>
            </name>
            <name>
              <surname>Dowideit</surname>
              <given-names>S.J.</given-names>
            </name>
            <name>
              <surname>Edwards</surname>
              <given-names>J.L.</given-names>
            </name>
            <name>
              <surname>Falsetta</surname>
              <given-names>M.L.</given-names>
            </name>
            <name>
              <surname>Wu</surname>
              <given-names>H.J.</given-names>
            </name>
            <name>
              <surname>Harrison</surname>
              <given-names>O.B.</given-names>
            </name>
            <name>
              <surname>Fox</surname>
              <given-names>K.L.</given-names>
            </name>
            <name>
              <surname>Seib</surname>
              <given-names>K.L.</given-names>
            </name>
            <name>
              <surname>Maguire</surname>
              <given-names>T.L.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>A.H.J.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Phasevarions Mediate Random Switching of Gene Expression in Pathogenic Neisseria</article-title>
          <source>PLoS Pathog.</source>
          <year>2009</year>
          <volume>5</volume>
          <fpage>e1000400</fpage>
        <pub-id pub-id-type="doi">10.1371/journal.ppat.1000400</pub-id><pub-id pub-id-type="pmid">19390608</pub-id></citation>
      </ref>
      <ref id="B59-genes-03-00461">
        <label>59.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Mrazek</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Guo</surname>
              <given-names>X.X.</given-names>
            </name>
            <name>
              <surname>Shah</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Simple sequence repeats in prokaryotic genomes</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2007</year>
          <volume>104</volume>
          <fpage>8472</fpage>
          <lpage>8477</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.0702412104</pub-id><pub-id pub-id-type="pmid">17485665</pub-id></citation>
      </ref>
      <ref id="B60-genes-03-00461">
        <label>60.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lin</surname>
              <given-names>W.H.</given-names>
            </name>
            <name>
              <surname>Kussell</surname>
              <given-names>E.</given-names>
            </name>
          </person-group>
          <article-title>Evolutionary pressures on simple sequence repeats in prokaryotic coding regions</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2012</year>
          <volume>40</volume>
          <fpage>2399</fpage>
          <lpage>2413</lpage>
          <pub-id pub-id-type="doi">10.1093/nar/gkr1078</pub-id>
        </citation>
      </ref>
      <ref id="B61-genes-03-00461">
        <label>61.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Orr</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Zoghbi</surname>
              <given-names>H.</given-names>
            </name>
          </person-group>
          <article-title>Trinucleotide repeat disorders</article-title>
          <source>Annu. Rev. Neurosci.</source>
          <year>2007</year>
          <volume>30</volume>
          <fpage>575</fpage>
          <lpage>621</lpage>
          <pub-id pub-id-type="doi">10.1146/annurev.neuro.29.051605.113042</pub-id>
        </citation>
      </ref>
      <ref id="B62-genes-03-00461">
        <label>62.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Karlin</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Burge</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Trinucleotide repeats and long homopeptides in genes and proteins associated with nervous system disease and development</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>1996</year>
          <volume>93</volume>
          <fpage>1560</fpage>
          <lpage>1565</lpage>
          <pub-id pub-id-type="doi">10.1073/pnas.93.4.1560</pub-id>
        </citation>
      </ref>
      <ref id="B63-genes-03-00461">
        <label>63.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Froehlich</surname>
              <given-names>A.C.</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Loros</surname>
              <given-names>J.J.</given-names>
            </name>
            <name>
              <surname>Dunlap</surname>
              <given-names>J.C.</given-names>
            </name>
          </person-group>
          <article-title>White collar-1, a circadian blue light photoreceptor, binding to the frequency promoter</article-title>
          <source>Science</source>
          <year>2002</year>
          <volume>297</volume>
          <fpage>815</fpage>
          <lpage>819</lpage>
          <pub-id pub-id-type="doi">10.1126/science.1073681</pub-id>
        </citation>
      </ref>
      <ref id="B64-genes-03-00461">
        <label>64.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Michael</surname>
              <given-names>T.P.</given-names>
            </name>
            <name>
              <surname>Park</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Kim</surname>
              <given-names>T.S.</given-names>
            </name>
            <name>
              <surname>Booth</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Byer</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Sun</surname>
              <given-names>Q.</given-names>
            </name>
            <name>
              <surname>Chory</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Lee</surname>
              <given-names>K.</given-names>
            </name>
          </person-group>
          <article-title>Simple Sequence Repeats Provide a Substrate for Phenotypic Variation in the Neurospora crassa Circadian Clock</article-title>
          <source>PLoS One</source>
          <year>2007</year>
          <volume>2</volume>
          <fpage>10</fpage>
        </citation>
      </ref>
      <ref id="B65-genes-03-00461">
        <label>65.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Johnsen</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Fidler</surname>
              <given-names>A.E.</given-names>
            </name>
            <name>
              <surname>Kuhn</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Carter</surname>
              <given-names>K.L.</given-names>
            </name>
            <name>
              <surname>Hoffmann</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Barr</surname>
              <given-names>I.R.</given-names>
            </name>
            <name>
              <surname>Biard</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Charmantier</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Eens</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Korsten</surname>
              <given-names>P.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Avian Clock gene polymorphism: evidence for a latitudinal cline in allele frequencies</article-title>
          <source>Mol. Ecol.</source>
          <year>2007</year>
          <volume>16</volume>
          <fpage>4867</fpage>
          <lpage>4880</lpage>
          <pub-id pub-id-type="doi">10.1111/j.1365-294X.2007.03552.x</pub-id>
        </citation>
      </ref>
      <ref id="B66-genes-03-00461">
        <label>66.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Fondon</surname>
              <given-names>J.W.</given-names>
            </name>
            <name>
              <surname>Garner</surname>
              <given-names>H.R.</given-names>
            </name>
          </person-group>
          <article-title>Molecular origins of rapid and continuous morphological evolution</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2004</year>
          <volume>101</volume>
          <fpage>18058</fpage>
          <lpage>18063</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.0408118101</pub-id><pub-id pub-id-type="pmid">15596718</pub-id></citation>
      </ref>
      <ref id="B67-genes-03-00461">
        <label>67.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Sears</surname>
              <given-names>K.E.</given-names>
            </name>
            <name>
              <surname>Goswami</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Flynn</surname>
              <given-names>J.J.</given-names>
            </name>
            <name>
              <surname>Niswander</surname>
              <given-names>L.A.</given-names>
            </name>
          </person-group>
          <article-title>The correlated evolution of Runx2 tandem repeats, transcriptional activity, and facial length in Carnivora</article-title>
          <source>Evol. Dev.</source>
          <year>2007</year>
          <volume>9</volume>
          <fpage>555</fpage>
          <lpage>565</lpage>
          <pub-id pub-id-type="doi">10.1111/j.1525-142X.2007.00196.x</pub-id>
        </citation>
      </ref>
      <ref id="B68-genes-03-00461">
        <label>68.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gerber</surname>
              <given-names>H.P.</given-names>
            </name>
            <name>
              <surname>Seipel</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Georgiev</surname>
              <given-names>O.</given-names>
            </name>
            <name>
              <surname>Hofferer</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Hug</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Rusconi</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Schaffner</surname>
              <given-names>W.</given-names>
            </name>
          </person-group>
          <article-title>Transcriptional activation modulated by homopolymeric glutamine and proline stretches</article-title>
          <source>Science</source>
          <year>1994</year>
          <volume>263</volume>
          <fpage>808</fpage>
          <lpage>811</lpage>
        <pub-id pub-id-type="doi">10.1126/science.8303297</pub-id><pub-id pub-id-type="pmid">8303297</pub-id></citation>
      </ref>
      <ref id="B69-genes-03-00461">
        <label>69.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Friedman</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Shah</surname>
              <given-names>A.G.</given-names>
            </name>
            <name>
              <surname>Fang</surname>
              <given-names>Z.-H.</given-names>
            </name>
            <name>
              <surname>Ward</surname>
              <given-names>E.G.</given-names>
            </name>
            <name>
              <surname>Warren</surname>
              <given-names>S.T.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>X.-J.</given-names>
            </name>
          </person-group>
          <article-title>Polyglutamine domain modulates the TBP-TFIIB interaction: Implications for its normal function and neurodegeneration</article-title>
          <source>Nat. Neurosci.</source>
          <year>2007</year>
          <volume>10</volume>
          <fpage>1519</fpage>
          <lpage>1528</lpage>
          <pub-id pub-id-type="doi">10.1038/nn2011</pub-id>
        </citation>
      </ref>
      <ref id="B70-genes-03-00461">
        <label>70.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Jackson-Fisher</surname>
              <given-names>A.J.</given-names>
            </name>
            <name>
              <surname>Chitikila</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Mitra</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Pugh</surname>
              <given-names>B.F.</given-names>
            </name>
          </person-group>
          <article-title>A role for TBP dimerization in preventing unregulated gene expression</article-title>
          <source>Mol. Cell</source>
          <year>1999</year>
          <volume>3</volume>
          <fpage>717</fpage>
          <lpage>727</lpage>
          <pub-id pub-id-type="doi">10.1016/S1097-2765(01)80004-6</pub-id>
        </citation>
      </ref>
      <ref id="B71-genes-03-00461">
        <label>71.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Bachtrog</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Weiss</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Zangerl</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Brem</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Schlotterer</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Distribution of dinucleotide microsatellites in the Drosophila melanogaster genome</article-title>
          <source>Mol. Biol. Evol.</source>
          <year>1999</year>
          <volume>16</volume>
          <fpage>602</fpage>
          <lpage>610</lpage>
          <pub-id pub-id-type="doi">10.1093/oxfordjournals.molbev.a026142</pub-id>
        </citation>
      </ref>
      <ref id="B72-genes-03-00461">
        <label>72.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Schroth</surname>
              <given-names>G.P.</given-names>
            </name>
            <name>
              <surname>Chou</surname>
              <given-names>P.J.</given-names>
            </name>
            <name>
              <surname>Ho</surname>
              <given-names>P.S.</given-names>
            </name>
          </person-group>
          <article-title>Mapping Z-DNA in the human genome. Computer-aided mapping reveals a nonrandom distribution of potential Z-DNA-forming sequences in human genes</article-title>
          <source>J. Biol. Chem.</source>
          <year>1992</year>
          <volume>267</volume>
          <fpage>11846</fpage>
          <lpage>11855</lpage>
        <pub-id pub-id-type="pmid">1601856</pub-id></citation>
      </ref>
      <ref id="B73-genes-03-00461">
        <label>73.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Riley</surname>
              <given-names>D.E.</given-names>
            </name>
            <name>
              <surname>Krieger</surname>
              <given-names>J.N.</given-names>
            </name>
          </person-group>
          <article-title>UTR dinucleotide simple sequence repeat evolution exhibits recurring patterns including regulatory sequence motif replacements</article-title>
          <source>Gene</source>
          <year>2009</year>
          <volume>429</volume>
          <fpage>80</fpage>
          <lpage>86</lpage>
          <pub-id pub-id-type="doi">10.1016/j.gene.2008.09.030</pub-id>
        </citation>
      </ref>
      <ref id="B74-genes-03-00461">
        <label>74.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Riley</surname>
              <given-names>D.E.</given-names>
            </name>
            <name>
              <surname>Krieger</surname>
              <given-names>J.N.</given-names>
            </name>
          </person-group>
          <article-title>Embryonic nervous system genes predominate in searches for dinucleotide simple sequence repeats flanked by conserved sequences</article-title>
          <source>Gene</source>
          <year>2009</year>
          <volume>429</volume>
          <fpage>74</fpage>
          <lpage>79</lpage>
          <pub-id pub-id-type="doi">10.1016/j.gene.2008.09.025</pub-id>
        </citation>
      </ref>
      <ref id="B75-genes-03-00461">
        <label>75.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Morgante</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Hanafey</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Powell</surname>
              <given-names>W.</given-names>
            </name>
          </person-group>
          <article-title>Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes</article-title>
          <source>Nat. Genet.</source>
          <year>2002</year>
          <volume>30</volume>
          <fpage>194</fpage>
          <lpage>200</lpage>
          <pub-id pub-id-type="doi">10.1038/ng822</pub-id>
        </citation>
      </ref>
      <ref id="B76-genes-03-00461">
        <label>76.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Chen</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Tan</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Jiang</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Chen</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Shen</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Yu</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>Similar distribution of simple sequence repeats in diverse completed Human Immunodeficiency Virus Type 1 genomes</article-title>
          <source>FEBS Lett.</source>
          <year>2009</year>
          <volume>583</volume>
          <fpage>2959</fpage>
          <lpage>2963</lpage>
          <pub-id pub-id-type="doi">10.1016/j.febslet.2009.08.004</pub-id>
        </citation>
      </ref>
      <ref id="B77-genes-03-00461">
        <label>77.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>King</surname>
              <given-names>D.G.</given-names>
            </name>
            <name>
              <surname>Soller</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Kashi</surname>
              <given-names>Y.</given-names>
            </name>
          </person-group>
          <article-title>Evolutionary tuning knobs</article-title>
          <source>Endeavour</source>
          <year>1997</year>
          <volume>21</volume>
          <fpage>36</fpage>
          <lpage>40</lpage>
          <pub-id pub-id-type="doi">10.1016/S0160-9327(97)01005-3</pub-id>
        </citation>
      </ref>
      <ref id="B78-genes-03-00461">
        <label>78.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lo Sardo</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Zuccato</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Gaudenzi</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Vitali</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Ramos</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Tartari</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Myre</surname>
              <given-names>M.A.</given-names>
            </name>
            <name>
              <surname>Walker</surname>
              <given-names>J.A.</given-names>
            </name>
            <name>
              <surname>Pistocchi</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Conti</surname>
              <given-names>L.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>An evolutionary recent neuroepithelial cell adhesion function of huntingtin implicates ADAM10-Ncadherin</article-title>
          <source>Nat. Neurosci.</source>
          <year>2012</year>
          <volume>15</volume>
          <fpage>713</fpage>
          <lpage>721</lpage>
          <pub-id pub-id-type="doi">10.1038/nn.3080</pub-id>
        </citation>
      </ref>
    </ref-list>
  </back>
</article>
