<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="review-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">genes</journal-id>
      <journal-title>Genes</journal-title>
      <abbrev-journal-title abbrev-type="publisher">Genes</abbrev-journal-title>
      <abbrev-journal-title abbrev-type="pubmed">Genes</abbrev-journal-title>
      <issn pub-type="epub">2073-4425</issn>
      <publisher>
        <publisher-name>MDPI</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.3390/genes3030344</article-id>
      <article-id pub-id-type="publisher-id">genes-03-00344</article-id>
      <article-categories>
        <subj-group>
          <subject>Review</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>The Human Transcriptome: An Unfinished Story </article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Pertea</surname>
            <given-names>Mihaela</given-names>
          </name>
        </contrib>
      </contrib-group>
        <aff id="c1-genes-03-00344">McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA; E-Mail: <email>mpertea@jhu.edu</email>; Tel.: +1-443-287-0972</aff>
      <pub-date pub-type="epub">
        <day>29</day>
        <month>06</month>
        <year>2012</year>
      </pub-date>
      <pub-date pub-type="collection"><month>09</month>
        <year>2012</year>
      </pub-date>
      <volume>3</volume>
      <issue>3</issue>
      <fpage>344</fpage>
      <lpage>360</lpage>
      <history>
        <date date-type="received">
          <day>15</day>
          <month>05</month>
          <year>2012</year>
        </date>
        <date date-type="rev-recd">
          <day>14</day>
          <month>06</month>
          <year>2012</year>
        </date>
        <date date-type="accepted">
          <day>25</day>
          <month>06</month>
          <year>2012</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>© 2012 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
        <copyright-year>2012</copyright-year>
        <license xmlns:xlink="http://www.w3.org/1999/xlink" license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/">
          <p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p>
        </license>
      </permissions>
      <abstract>
        <p>Despite recent technological advances, the study of the human transcriptome is still in its early stages. Here we provide an overview of the complex human transcriptomic landscape, present the bioinformatics challenges posed by the vast quantities of transcriptomic data, and discuss some of the studies that have tried to determine how much of the human genome is transcribed. Recent evidence has suggested that more than 90% of the human genome is transcribed into RNA. However, this view has been strongly contested by groups of scientists who argued that many of the observed transcripts are simply the result of transcriptional noise. In this review, we conclude that the full extent of transcription remains an open question that will not be fully addressed until we decipher the complete range and biological diversity of the transcribed genomic sequences.</p>
      </abstract>
      <kwd-group>
        <kwd>transcriptome</kwd>
        <kwd>pervasive transcription</kwd>
        <kwd>RNA-seq</kwd>
        <kwd>mRNA</kwd>
        <kwd>ncRNA</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec>
      <title>1. Background and Introduction</title>
      <p>The transcriptome of a cell is the collection of all the RNA molecules, or transcripts, present in that cell. To generate the transcriptome, the DNA of an organism is first transcribed by RNA polymerase to create complementary RNA strands, which in turn are spliced to remove introns, producing mature transcripts that contain only exons. For many years, it was assumed that these RNA transcripts were primarily used as templates for translation to proteins. The vast majority of the remaining human genome, which is not protein coding, was thought to be non-functional and therefore considered “junk” DNA [<xref ref-type="bibr" rid="B1-genes-03-00344">1</xref>]. Soon after the publication of the human genome sequence in 2001 [<xref ref-type="bibr" rid="B2-genes-03-00344">2</xref>,<xref ref-type="bibr" rid="B3-genes-03-00344">3</xref>], a new view emerged, holding that only a small percentage of the human transcriptome is clearly translated into proteins [<xref ref-type="bibr" rid="B4-genes-03-00344">4</xref>,<xref ref-type="bibr" rid="B5-genes-03-00344">5</xref>,<xref ref-type="bibr" rid="B6-genes-03-00344">6</xref>], and most of the remaining transcripts have unknown purposes. In recent years, the number and variety of known RNA genes has grown dramatically, and in addition to protein-coding messenger RNAs (mRNAs), the catalog of transcribed elements now includes a myriad of non-coding RNAs (ncRNAs) that play multiple structural and regulatory roles in the molecular biology of the cell [<xref ref-type="bibr" rid="B7-genes-03-00344">7</xref>]. </p>
      <p>Ever since the discovery of the genetic code, scientists have labored to decipher the complete human transcriptome. It was only with the emergence of automated DNA sequencing in the 1980s that real progress was made in this direction [<xref ref-type="bibr" rid="B8-genes-03-00344">8</xref>]. In the 1990s, scientists realized the value of using expressed sequence tag (EST) sequencing to rapidly identify expressed genes, or at least fragments of those genes, in many human tissues [<xref ref-type="bibr" rid="B9-genes-03-00344">9</xref>,<xref ref-type="bibr" rid="B10-genes-03-00344">10</xref>]. Although at the time EST sequencing was considered a very high-throughput technique, both costs and technical limitations prevented it from producing a complete transcript catalog. As a consequence, much of our knowledge of the protein-coding portion of the human transcriptome relied on different computational gene prediction methods [<xref ref-type="bibr" rid="B11-genes-03-00344">11</xref>,<xref ref-type="bibr" rid="B12-genes-03-00344">12</xref>]. </p>
      <p>Various other technologies were developed to complement the traditional EST approach. These include tag-based methods such as serial analysis of gene expression (SAGE) [<xref ref-type="bibr" rid="B13-genes-03-00344">13</xref>], cap analysis of gene expression (CAGE) [<xref ref-type="bibr" rid="B14-genes-03-00344">14</xref>], and massively parallel signature sequencing (MPSS) [<xref ref-type="bibr" rid="B15-genes-03-00344">15</xref>]. Unlike the EST approach, the tag methods uniquely identify each transcript to achieve gene-level expression quantification. However they are generally unable to distinguish specific isoforms. In addition, most of them are based on traditional Sanger sequencing technology, making them very expensive to apply on a large scale. </p>
      <p>Hybridization-based microarrays provided the first relatively inexpensive way to detect and quantify transcripts on a large scale [<xref ref-type="bibr" rid="B16-genes-03-00344">16</xref>,<xref ref-type="bibr" rid="B17-genes-03-00344">17</xref>,<xref ref-type="bibr" rid="B18-genes-03-00344">18</xref>]. These include transcription tiling arrays, which allow the mapping of transcribed regions to a very high resolution, from 5 to 50 base pairs (bp), depending on probe density [<xref ref-type="bibr" rid="B19-genes-03-00344">19</xref>,<xref ref-type="bibr" rid="B20-genes-03-00344">20</xref>]. They have several advantages over previous methods, including their high throughput and their ability, with some designs, to quantify distinct spliced isoforms [<xref ref-type="bibr" rid="B21-genes-03-00344">21</xref>]. However, because of differences in hybridization strength, cross-hybridization, and other experimental variables, microarrays provide a noisy output signal. In addition, they can only measure genes for which the sequence and the precise exon-intron boundaries are known, making them unable to identify novel genes or novel splicing events [<xref ref-type="bibr" rid="B22-genes-03-00344">22</xref>,<xref ref-type="bibr" rid="B23-genes-03-00344">23</xref>]. </p>
      <p>Recently, RNA-seq methods technologies provide unprecedented opportunities for characterizing the set of RNA transcripts produced in a cell [<xref ref-type="bibr" rid="B24-genes-03-00344">24</xref>,<xref ref-type="bibr" rid="B25-genes-03-00344">25</xref>,<xref ref-type="bibr" rid="B26-genes-03-00344">26</xref>,<xref ref-type="bibr" rid="B27-genes-03-00344">27</xref>,<xref ref-type="bibr" rid="B28-genes-03-00344">28</xref>]. Called a “revolutionary tool for transcriptomics”, RNA-seq is the first sequencing-based method that allows the entire transcriptome to be surveyed in a very high-throughput and quantitative manner [<xref ref-type="bibr" rid="B29-genes-03-00344">29</xref>]. Unlike hybridization-based methods, it is not limited to the detection of known transcripts, and it can measure a much larger range of expression levels. Among its other advantages, RNA-seq data has relatively low background noise; it achieves base-pair resolution, allowing precise identification of exon and intron boundaries; and it can detect single nucleotide polymorphisms (SNPs) and other variants within transcripts. Although RNA-seq has already dramatically changed the landscape of genetic studies, it is clear that many years remain before we will have a complete catalogue of human genes and their expressed isoforms. </p>
    </sec>
    <sec>
      <title>2. The Diversity of the Transcriptome</title>
      <sec>
        <title>2.1. Various Classes of ncRNAs</title>
        <p>Over the past decade, many studies have revealed an unexpected level of diversity in the human transcriptome, which in turn has required scientists to expand their definition of a gene. The traditional definition of a gene—a DNA sequence that is transcribed to produce a functional product—has been expanded to include not only to the ~22,000 protein-coding genes present in the human genome [<xref ref-type="bibr" rid="B11-genes-03-00344">11</xref>], but also a myriad of non-protein coding sequences. These set of transcribed non-protein coding DNA sequences show complex patterns of expression and regulation [<xref ref-type="bibr" rid="B30-genes-03-00344">30</xref>], and they are no longer restricted to the well known ribosomal and transfer RNAs (rRNAs and tRNAs, respectively). Furthermore, when we introduce these new and growing functional RNAs into our gene counts, the number of genes in the human genome increases from ~22,000 (which includes only protein-coding genes) to the 2001 estimates of about 30,000–40,000 genes [<xref ref-type="bibr" rid="B31-genes-03-00344">31</xref>]. </p>
        <p>The discoveries of endogenous small interfering RNA (siRNA) [<xref ref-type="bibr" rid="B32-genes-03-00344">32</xref>] and microRNA (miRNA) [<xref ref-type="bibr" rid="B33-genes-03-00344">33</xref>] genes represented dramatic breakthroughs in our understanding of the transcriptome. These two classes of small ncRNAs play a central role in RNA interference by binding to specific mRNA molecules to either increase or decrease their activity. Various other classes of ncRNAs have a now-broadly recognized functional role. These include regulatory RNAs such as PIWI-interacting RNAs (piRNAs), promoter-associated RNAs (PARs), transcription initiation RNAs (tiRNAs), X-inactivation RNAs (xiRNAs), and many others [<xref ref-type="bibr" rid="B34-genes-03-00344">34</xref>,<xref ref-type="bibr" rid="B35-genes-03-00344">35</xref>]. Among them, the long non-coding RNAs (lncRNAs), defined as ncRNAs longer than 200 bp, are probably the least well-understood transcripts. Although few of them have been experimentally studied, a view is emerging that these are key regulators of epigenetic gene regulation in mammalian cells [<xref ref-type="bibr" rid="B36-genes-03-00344">36</xref>]. </p>
        <p>Large intergenic RNAs (lincRNAs) are a subclass of lncRNAs that do not overlap protein-coding regions. Cabili <italic>et al</italic>. [<xref ref-type="bibr" rid="B37-genes-03-00344">37</xref>] catalogued more than 8,000 lincRNAs (58% of which were novel) using an integrative approach that unifies existing annotation sources with transcripts assembled from RNA-seq data collected from 24 tissues and cell types. Several global properties of lincRNAs were evidenced by this study: </p>
        <list list-type="simple">
          <list-item>
            <p>- they are expressed in a highly tissue-specific manner compared to protein-coding genes,</p>
          </list-item>
          <list-item>
            <p>- they are typically co-expressed with their neighboring genes, and</p>
          </list-item>
          <list-item>
            <p>- they only show moderate conservation in other species.</p>
          </list-item>
        </list>
        <p>The functional classification of lincRNAs is far from complete, even though Cabili <italic>et al</italic>. assigned putative functions to many predicted lincRNAs based on the functions of protein-coding genes with similar expression patterns. </p>
      </sec>
      <sec>
        <title>2.2. Alternative Splicing</title>
        <p>Even when considering only protein-coding RNAs, the scientific community still does not have a complete picture of the transcriptome. Not only is there uncertainty about the exact number of human protein-coding genes, but recent evidence has emerged to show that different humans have slightly different individual gene sets [<xref ref-type="bibr" rid="B38-genes-03-00344">38</xref>,<xref ref-type="bibr" rid="B39-genes-03-00344">39</xref>,<xref ref-type="bibr" rid="B40-genes-03-00344">40</xref>]. The number of mature mRNA transcripts is even less certain, and varies across tissues and different stages during cell differentiation [<xref ref-type="bibr" rid="B41-genes-03-00344">41</xref>,<xref ref-type="bibr" rid="B42-genes-03-00344">42</xref>]. Further complicating matters, we now know that more than 90% of multi-exon protein-coding genes undergo alternative splicing [<xref ref-type="bibr" rid="B43-genes-03-00344">43</xref>,<xref ref-type="bibr" rid="B44-genes-03-00344">44</xref>], which is considered to play a major role in increasing cellular and functional diversity in the transcriptomes of higher eukaryotes [<xref ref-type="bibr" rid="B45-genes-03-00344">45</xref>]. However, we do not yet know the function of the vast majority of alternatively spliced human transcripts, and it is now clear that alternative splicing does not simply act to generate variant protein sequences [<xref ref-type="bibr" rid="B46-genes-03-00344">46</xref>].</p>
        <p>Alternative splicing also affects ncRNA genes, about 30% of which produce at least one alternatively spliced transcript [<xref ref-type="bibr" rid="B47-genes-03-00344">47</xref>]. Cabili <italic>et al</italic>. found that lincRNAs, although shorter and with fewer exons than mRNAs, are also alternatively spliced with an average of 2.3 isoforms per locus [<xref ref-type="bibr" rid="B37-genes-03-00344">37</xref>]. New transcripts are continuously being discovered [<xref ref-type="bibr" rid="B19-genes-03-00344">19</xref>,<xref ref-type="bibr" rid="B41-genes-03-00344">41</xref>,<xref ref-type="bibr" rid="B48-genes-03-00344">48</xref>,<xref ref-type="bibr" rid="B49-genes-03-00344">49</xref>], strengthening the observation that we are far from determining all transcript isoforms. </p>
      </sec>
      <sec>
        <title>2.3. Estimating the Annotated Human Transcript Count</title>
        <p>In an attempt to identify how many human transcripts are currently annotated, I combined all human gene annotations from Ensembl (release 64) [<xref ref-type="bibr" rid="B50-genes-03-00344">50</xref>], NCBI’s RefSeq database [<xref ref-type="bibr" rid="B51-genes-03-00344">51</xref>], and the UCSC Genome Browser [<xref ref-type="bibr" rid="B52-genes-03-00344">52</xref>] with the lincRNAs catalogued by Cabili <italic>et al</italic>. [<xref ref-type="bibr" rid="B37-genes-03-00344">37</xref>]. After eliminating redundant transcripts (<italic>i.e.</italic>, transcripts with identical annotation as an already included transcript from one of the databases), I divided the remaining ones into three categories: mRNAs if they were annotated as protein-coding transcripts, long ncRNAs if they were annotated as non-coding and were at least 200 bp long, and small ncRNAs otherwise. </p>
        <p>As also observed by others [<xref ref-type="bibr" rid="B53-genes-03-00344">53</xref>], I found a highly complex architecture in the human transcriptome, in which some base pairs could be part of many overlapping transcripts in any of the three categories, and emanating from both strands of the genome. Loci containing all three categories of transcripts were not frequent (see <xref ref-type="fig" rid="genes-03-00344-f001">Figure 1</xref>a). Not surprisingly, the annotations include more mRNA than ncRNA transcripts, possibly due to a bias towards annotating protein-coding transcripts, although loci with at least one ncRNA are more numerous than loci containing one or more mRNAs (see <xref ref-type="table" rid="genes-03-00344-t001">Table 1</xref>). Overall, annotated transcripts today cover 4.62% or 3.85% of the human genome, depending on whether or not we include pseudogenes. Expression of pseudogenes is controversial, with some reports suggesting that they might be transcribed and could play a significant part in gene regulation [<xref ref-type="bibr" rid="B54-genes-03-00344">54</xref>,<xref ref-type="bibr" rid="B55-genes-03-00344">55</xref>]. They cover about 30% of the total base pairs included in all ncRNA transcripts. <xref ref-type="fig" rid="genes-03-00344-f001">Figure 1</xref>b shows the base pair coverage of the human transcriptome (including pseudogenes) by the three categories of transcripts. I found that 62% of the base pairs in the transcriptome are part of mRNAs, supporting the fact that ncRNAs tend to be smaller in length than mRNAs. </p>
        <fig id="genes-03-00344-f001" position="float">
          <label>Figure 1</label>
          <caption>
            <p>Composition of the human transcriptome. (<bold>a</bold>) Venn diagram of the number of loci containing mRNA transcripts (green), long ncRNAs (blue), and small ncRNAs (red); (<bold>b</bold>) Base pair coverage of the transcriptome by the three categories of transcripts.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="genes-03-00344-g001.tif"/>
        </fig>
        <table-wrap id="genes-03-00344-t001" position="float">
          <object-id pub-id-type="pii">genes-03-00344-t001_Table 1</object-id>
          <label>Table 1</label>
          <caption>
            <p>Number of known annotated transcripts and human gene loci collected from Ensembl, NCBI’s RefSeq, UCSC Genome Browser, and Cabili <italic>et al</italic>.’s lincRNA catalog. A single locus typically contains multiple transcripts, particularly for mRNAs.</p>
          </caption>
          <table>
            <thead>
              <tr>
                <th align="left" valign="middle">Annotation</th>
                <th align="center" valign="middle">mRNA</th>
                <th align="center" valign="middle">Long ncRNA</th>
                <th align="center" valign="middle">Small ncRNA</th>
              </tr>
            </thead>
            <tbody>
              <tr>
                <td align="left" valign="middle">Transcripts</td>
                <td align="center" valign="middle">111,451</td>
                <td align="center" valign="middle">89,981</td>
                <td align="center" valign="middle">11,366</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Loci</td>
                <td align="center" valign="middle">20,944</td>
                <td align="center" valign="middle">40,765</td>
                <td align="center" valign="middle">11,195</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
      </sec>
      <sec>
        <title>2.4. RNA Editing</title>
        <p>RNA editing is another cellular process that contributes to the complex landscape of mammalian transcriptomes. In the RNA editing process, single nucleotide changes occur after DNA has been transcribed into RNA. The resulting RNA transcripts may produce altered proteins, or they may disrupt translation more severely [<xref ref-type="bibr" rid="B56-genes-03-00344">56</xref>]. Two RNA editing mechanisms are known in humans, causing two types of substitutions: adenosine to inosine, and cytosine to uracil. The A-to-I editing, also called A-to-G, is a process mediated by a family of adenosine deaminases (ADARs) that act on RNA and replace certain adenosines (A) with inosines, which then act as guanosines (G) during translation [<xref ref-type="bibr" rid="B57-genes-03-00344">57</xref>,<xref ref-type="bibr" rid="B58-genes-03-00344">58</xref>]. Similarly, the C-to-U switches are mediated by APOBEC1 [<xref ref-type="bibr" rid="B59-genes-03-00344">59</xref>,<xref ref-type="bibr" rid="B60-genes-03-00344">60</xref>,<xref ref-type="bibr" rid="B61-genes-03-00344">61</xref>]. </p>
        <p>Until recently considered a rare event, RNA editing is now believed to affect both coding and non-coding sequences of thousands of genes, including ncRNAs [<xref ref-type="bibr" rid="B56-genes-03-00344">56</xref>,<xref ref-type="bibr" rid="B62-genes-03-00344">62</xref>,<xref ref-type="bibr" rid="B63-genes-03-00344">63</xref>]. A 2011 study by Li <italic>et al</italic>. [<xref ref-type="bibr" rid="B64-genes-03-00344">64</xref>] looked at RNA-seq and DNA sequence data from 27 individuals and reported that RNA-DNA differences (RDDs) are not limited to the two previous types of substitutions described above. In their study, Li <italic>et al</italic>., observed all 12 possible RNA-DNA substitutions at more than 10,000 exonic sites, most of them present in multiple individuals and in different cell types. Their result suggests that previously unknown RNA editing mechanisms may be active in humans. However, this result has been strongly contested by several other groups, who argued that the vast majority of the observed RDDs were technical artifacts, mostly due to read mapping errors or systematic sequencing errors [<xref ref-type="bibr" rid="B65-genes-03-00344">65</xref>,<xref ref-type="bibr" rid="B66-genes-03-00344">66</xref>,<xref ref-type="bibr" rid="B67-genes-03-00344">67</xref>,<xref ref-type="bibr" rid="B68-genes-03-00344">68</xref>]. Nevertheless, RNA editing has an important role in molecular biology, and recent studies show that it may produce even more transcriptome diversity than alternative splicing [<xref ref-type="bibr" rid="B69-genes-03-00344">69</xref>]. </p>
      </sec>
    </sec>
    <sec>
      <title>3. Reconstructing the Transcriptome</title>
      <p>As discussed above, high-throughput RNA sequencing surpasses all previous technologies in its ability to profile the extent and complexity of eukaryotic transcriptomes. The latest generation of sequencing machines can generate up to 600 gigabases (Gb) in a single run, equivalent to 200-fold coverage of the human genome. The 600 Gb is produced in the form of 6 billion short reads, each approximately 100 bp in length (using the Illumina HiSeq sequencer), and assembling these reads into chromosomes is a very complex, highly specialized task. Therefore one of the main challenges posed by RNA-seq is a computational one. Here I will briefly mention some of the most common bioinformatics systems for transcriptome assembly, and the challenges faced by these systems. For a more comprehensive review of next-generation transcriptome assembly methods, the interested reader can consult several recent reviews [<xref ref-type="bibr" rid="B70-genes-03-00344">70</xref>,<xref ref-type="bibr" rid="B71-genes-03-00344">71</xref>,<xref ref-type="bibr" rid="B72-genes-03-00344">72</xref>].</p>
      <p>Although many programs have been developed for whole-genome assembly (e.g., [<xref ref-type="bibr" rid="B73-genes-03-00344">73</xref>,<xref ref-type="bibr" rid="B74-genes-03-00344">74</xref>,<xref ref-type="bibr" rid="B75-genes-03-00344">75</xref>]), these methods cannot be directly applied to transcriptome assembly due to specific characteristics of RNA-seq data sets. Genome assembly programs assume that the DNA sequence’s depth of coverage is relatively uniform across the genome. This is not true for transcripts, which have highly variable sequence coverage depending on their expression levels. Sequence depth is used to indicate repeats by genome assemblers, which are designed to take this into account. Another confounding fact for genome assemblers is that alternative transcripts from the same locus typically share exons that are difficult to assemble unambiguously. Specific features of RNA-seq data (e.g., strand-specific sequencing or partially covered gene transcripts from low-abundance genes [<xref ref-type="bibr" rid="B48-genes-03-00344">48</xref>]) can also confound a whole-genome assembly algorithm. Therefore new methods have had to be developed to address the particular characteristics of transcriptome assembly.</p>
      <p>There are two main approaches for assembly of a transcriptome: a genome-guided approach when a reference genome is available; or <italic>de novo</italic> assembly, which does not need a genome reference and can theoretically reconstruct transcripts that are transcribed even from parts missing from that genome’s assembly. <italic>De novo</italic> transcriptome assembly is far more challenging in higher eukaryotes due to the large number of genes, the great variation in their expression levels, and especially because of the large number of alternatively spliced transcript variants. For this reason, <italic>de novo</italic> methods are primarily used for organisms that lack a sequenced reference genome.</p>
      <p>Read mapping is one of the main technical challenges of genome-guided approaches. Alignment of short reads to the reference genome is a challenge in itself, but with RNA-seq data these reads may be sequenced from exons and exon-exon junction regions. Methods such as Bowtie [<xref ref-type="bibr" rid="B76-genes-03-00344">76</xref>] and BWA [<xref ref-type="bibr" rid="B77-genes-03-00344">77</xref>] can be used for the alignment of reads to either a reference genome or directly to the transcriptome, but this strategy will miss novel exons and novel splicing events. Spliced aligners were developed to overcome these limitations. Some of them (e.g., TopHat [<xref ref-type="bibr" rid="B78-genes-03-00344">78</xref>], SpliceMap [<xref ref-type="bibr" rid="B79-genes-03-00344">79</xref>], MapSplice [<xref ref-type="bibr" rid="B80-genes-03-00344">80</xref>]) use an ‘exon-first’ approach where reads are first mapped to the genome, and then the unmapped reads are split into shorter segments and aligned independently. Other spliced aligners, such as GSNAP [<xref ref-type="bibr" rid="B81-genes-03-00344">81</xref>] or BLAT [<xref ref-type="bibr" rid="B82-genes-03-00344">82</xref>], use a ‘seed-and-extend’ strategy in which the reads are first divided into small segments (seeds) that are individually aligned to the genome, and then candidate regions are locally aligned to obtain the final spliced alignment of the read. There are different advantages to these strategies, but in general ‘exon-first’ aligners are usually faster, while ‘seed-and-extend’ ones may be slightly more sensitive by reducing the bias towards unspliced alignments in the exon-first approach.</p>
      <p>After mapping all reads to the reference genome, transcriptome assemblers cluster the overlapping reads at each locus and build a connectivity graph representing all possible isoforms. Different transcriptome assembly programs, such as Cufflinks [<xref ref-type="bibr" rid="B41-genes-03-00344">41</xref>], Scripture [<xref ref-type="bibr" rid="B83-genes-03-00344">83</xref>], IsoInfer [<xref ref-type="bibr" rid="B84-genes-03-00344">84</xref>], and IsoLasso [<xref ref-type="bibr" rid="B85-genes-03-00344">85</xref>], use different criteria to parse the connectivity graph. Cufflinks uses a parsimony principle to generate the minimal number of transcripts that will explain all reads in the graph. If there are multiple ways to assemble a minimal number of transcripts, Cufflinks uses the read coverage across each path to decide which combination is most likely to originate from the same RNA transcript. Scripture reconstructs all possible isoforms by enumerating all possible paths in the connectivity graph that have statistically significant read coverage. While Cufflinks and Scripture estimate the abundance of transcripts after they are assembled, IsoInfer and IsoLasso assemble transcripts at the same time that they estimate their expression levels. They take two different approaches: IsoInfer uses a heuristic approach to reduce the huge search space of all valid isoforms, while IsoLasso uses a multivariate regression method that also minimizes the number of predicted transcripts. </p>
      <p><italic>De novo</italic> transcriptome assembly methods, generally based on de Brujin graphs, are less efficient and less sensitive than genome-guided methods for the human genome. Despite that, running a <italic>de novo</italic> assembler in addition to a genome-guided method may produce a more comprehensive transcriptome. Because <italic>de novo</italic> assemblers do not need a reference genome, they can identify genes that are missing from the reference genome, such as trans-spliced transcripts and similar transcripts originating from chromosomal rearrangements. Trinity [<xref ref-type="bibr" rid="B86-genes-03-00344">86</xref>], Oases [<xref ref-type="bibr" rid="B87-genes-03-00344">87</xref>], SOAPdenovo [<xref ref-type="bibr" rid="B88-genes-03-00344">88</xref>], and Trans-ABySS [<xref ref-type="bibr" rid="B89-genes-03-00344">89</xref>] are some of the programs used for <italic>de novo</italic> transcriptome assembly. A recent comparative study [<xref ref-type="bibr" rid="B90-genes-03-00344">90</xref>] evaluated the performance of different <italic>de novo</italic> transcriptome assembly programs and found that Trinity performed well across various conditions, but took the longest running time; Oases consumed the most memory; SOAPdenovo required the shortest runtime but performed poorly at reconstructing full-length transcripts; and Trans-ABySS showed a good balance between resource usage and quality of assemblies. Although it would undoubtedly prove useful, there is no automated software pipeline to carry out a combined assembly strategy to bring together the high sensitivity of genome-guided assemblers with the ability of <italic>de novo</italic> methods to detect novel and trans-spliced transcripts.</p>
    </sec>
    <sec>
      <title>4. The Size of the Transcriptome</title>
      <p>Less than 2% of the human genome codes for proteins [<xref ref-type="bibr" rid="B91-genes-03-00344">91</xref>]. As described above, if we add to this fraction the DNA sequences that correspond to annotated ncRNAs, we are still left with less than 5% of the human genome covered by known transcripts. Other reports have found that only ~5–10% of the genome is stably transcribed in cell lines [<xref ref-type="bibr" rid="B19-genes-03-00344">19</xref>,<xref ref-type="bibr" rid="B20-genes-03-00344">20</xref>,<xref ref-type="bibr" rid="B92-genes-03-00344">92</xref>]. My own independent analysis (<xref ref-type="fig" rid="genes-03-00344-f002">Figure 2</xref>) shows that it is rare to see more than 5% of the total base pairs in the genome covered by assembled transcripts in normal human tissue. While these studies don’t capture the expression of the transcriptome at all stages in the cell development, they suggest that only a small portion of the human genome is transcribed. And yet a mounting number of studies suggest that the vast majority of the genome is transcribed at some time or other. Beginning in the early 2000s, full length cDNAs from various mouse tissues at different developmental stages, and genome-wide tiling arrays in different human tissues and cell lines revealed that much more of the mammalian genomes is transcribed than what is annotated in public databases [<xref ref-type="bibr" rid="B5-genes-03-00344">5</xref>,<xref ref-type="bibr" rid="B19-genes-03-00344">19</xref>,<xref ref-type="bibr" rid="B20-genes-03-00344">20</xref>,<xref ref-type="bibr" rid="B49-genes-03-00344">49</xref>,<xref ref-type="bibr" rid="B93-genes-03-00344">93</xref>,<xref ref-type="bibr" rid="B94-genes-03-00344">94</xref>,<xref ref-type="bibr" rid="B95-genes-03-00344">95</xref>]. These studies culminated with the publication in 2007 of the results from the pilot phase of the ENCODE Project [<xref ref-type="bibr" rid="B96-genes-03-00344">96</xref>], which estimated that as much as 93% of the human genome is transcribed in at least one cell type. Does this broad pattern of transcription mean simply that the cell creates a great deal of transcriptional noise by RNA polymerase binding accidentally (or randomly) to many sites in the genome? Or does this result challenge the long-standing view that most of the human genome is not biologically active? Scientists have conflicting opinions on the answer to this question. </p>
      <p>A recent study published by van Bakel <italic>et al</italic>. [<xref ref-type="bibr" rid="B97-genes-03-00344">97</xref>] claims that most ‘dark matter’ transcripts—defined as ncRNAs of unknown function - are associated with known genes. In this paper, van Bakel <italic>et al</italic>. argue that there is a high false-positive rate associated with the tiling array technology that was the basis of most analyses that suggested the pervasiveness of transcription. When compared to RNA-seq data, tiling arrays produce a larger proportion of low-abundance transcripts originating from intergenic and intronic regions, although tiling arrays and RNA-seq data generally agree on the location of the greatest transcript “mass.” The low coverage of intronic transcripts suggests that they might in fact represent random sampling from partially processed or unprocessed RNAs. Supporting this idea is also the observation that the transcription mass in intergenic regions increases at much lower rates than in intronic regions as the number of reads is increased. Van Bakel <italic>et al</italic>., also identified several thousand small transcripts that map outside known genes, however most of them could be explained as accidental by-products of enhancer activity. Overall, the authors conclude that most of the genome is not appreciably transcribed, and the majority of intergenic and intronic transcripts observed in previous studies may be attributed to biological and/or technical background noise.</p>
      <fig id="genes-03-00344-f002" position="float">
        <label>Figure 2</label>
        <caption>
          <p>The size of the transcriptome, computed as the fraction of the total number of base pairs in the human genome covered by the assembled transcripts, for 16 normal human tissues included in the Illumina Body Map [<xref ref-type="bibr" rid="B98-genes-03-00344">98</xref>]. Each RNA-seq data set was mapped to the genome with TopHat [<xref ref-type="bibr" rid="B78-genes-03-00344">78</xref>] and assembled with Cufflinks [<xref ref-type="bibr" rid="B41-genes-03-00344">41</xref>]. Note that except for adrenal tissue, in which transcripts cover 5.3% of the human genome, all other reconstructed transcriptomes are smaller in size than the currently annotated transcriptome.</p>
        </caption>
        <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="genes-03-00344-g002.tif"/>
      </fig>
      <p>Clark <italic>et al</italic>. [<xref ref-type="bibr" rid="B99-genes-03-00344">99</xref>] acknowledge that indeed most dark matter transcripts are associated with known genes, but they strongly disagree with van Bakel <italic>et al</italic>.’s conclusion that the genome is not as pervasively transcribed as previously reported. In their study, Clark <italic>et al</italic>., argue that we cannot dismiss the observations from multiple independent techniques, including RT-PCR, RACE, and Northern blot analyses, which together validated more than 90% of the identified transcripts [<xref ref-type="bibr" rid="B100-genes-03-00344">100</xref>,<xref ref-type="bibr" rid="B101-genes-03-00344">101</xref>]. They also argue that van Bakel <italic>et al</italic>.’s RNA-seq data suffers from insufficient sequencing depth and poor assembly, and is biased towards polyadenylated RNA, which selectively omits significant amounts of RNA as has been shown earlier [<xref ref-type="bibr" rid="B102-genes-03-00344">102</xref>]. Overall, similarly to other studies [<xref ref-type="bibr" rid="B103-genes-03-00344">103</xref>,<xref ref-type="bibr" rid="B104-genes-03-00344">104</xref>], Clark <italic>et al</italic>. find that the detection accuracy of tiling arrays is not significantly lower than that of RNA-seq, and they conclude that a significant fraction of dark matter RNA comes from very long, intergenic transcribed regions. </p>
      <p>In a subsequent paper [<xref ref-type="bibr" rid="B105-genes-03-00344">105</xref>], van Bakel <italic>et al</italic>. agree with the fact that most of the genome appears to be transcribed. But given the various sources of extraneous reads, both biological and laboratory-derived, they expect that given sufficient sequencing depth the whole genome may be covered with transcripts. A recent study that sequenced total RNA from human brain and liver supports van Bakel <italic>et al</italic>.’s suggestion that unannotated transcripts within introns represent unspliced introns rather than unique independent transcriptional units [<xref ref-type="bibr" rid="B106-genes-03-00344">106</xref>]. And yet another study found that sequenced reads observed in conventional RNA sequencing data sets, previously dismissed as noise, are in fact indicative of unassembled rare transcripts [<xref ref-type="bibr" rid="B107-genes-03-00344">107</xref>]. Therefore the debate about the pervasiveness of transcription continues, but as van Bakel <italic>et al</italic>., and others [<xref ref-type="bibr" rid="B30-genes-03-00344">30</xref>,<xref ref-type="bibr" rid="B108-genes-03-00344">108</xref>] point out, it is time to stop arguing over the content of the transcriptome, and focus on finding evidence for dark matter functions. </p>
    </sec>
    <sec sec-type="discussion">
      <title>5. Discussion and Conclusions</title>
      <p>The unprecedented depth of sequence coverage achieved by RNA-seq has revealed how much of the human transcriptome is still uncharacterized. Many novel transcripts are still being discovered, stimulating the debate as to the extent to which the genome is transcribed. Non-coding RNAs represent the majority of the human transcripts, and there is no doubt that many of them, initially considered to be transcriptional artifacts, are in fact functional. They play important roles in transcriptional and post-transcriptional gene regulation via both <italic>cis-</italic> and <italic>trans-</italic>acting mechanisms, chromatin modification, control of transcription factor binding, regulation of alternative splicing. These functions have important consequences for development and for diseases, including cancer [<xref ref-type="bibr" rid="B30-genes-03-00344">30</xref>,<xref ref-type="bibr" rid="B36-genes-03-00344">36</xref>,<xref ref-type="bibr" rid="B109-genes-03-00344">109</xref>,<xref ref-type="bibr" rid="B110-genes-03-00344">110</xref>].</p>
      <p>Despite current intense research efforts, many of the novel transcripts identified thus far have an unknown function. Most of them have been found only in specific cell types, tissues, or developmental stages [<xref ref-type="bibr" rid="B37-genes-03-00344">37</xref>,<xref ref-type="bibr" rid="B100-genes-03-00344">100</xref>,<xref ref-type="bibr" rid="B111-genes-03-00344">111</xref>]. They lack functional ORFs, have lower expression levels, and are only modestly conserved, although conservation is only a week indicator of functionality [<xref ref-type="bibr" rid="B96-genes-03-00344">96</xref>,<xref ref-type="bibr" rid="B112-genes-03-00344">112</xref>,<xref ref-type="bibr" rid="B113-genes-03-00344">113</xref>]. Occasionally, entirely novel protein-coding genes with strong mRNA expression have been identified [<xref ref-type="bibr" rid="B114-genes-03-00344">114</xref>], but most unannotated transcripts that are protein-coding are alternatively spliced isoforms of known mRNAs [<xref ref-type="bibr" rid="B41-genes-03-00344">41</xref>]. However, as of today the vast majority of alternatively spliced transcripts lack described functions, and the role of alternative splicing itself in gene evolution remains largely unexplored [<xref ref-type="bibr" rid="B46-genes-03-00344">46</xref>]. </p>
      <p>Is low RNA polymerase fidelity the principal cause of the widespread transcription observed in the human genome? We do not have a definite answer to this question [<xref ref-type="bibr" rid="B115-genes-03-00344">115</xref>]. A focus on deciphering the biological functions of transcribed genomic sequences might provide us with a clearer picture. Over the last decade, the estimated proportion of the human genome that might be functional has been constantly adjusted upwards, and today it lies between 10% and 15% [<xref ref-type="bibr" rid="B116-genes-03-00344">116</xref>]. This estimate is still much lower than the ~93% estimate for the transcribed fraction of the genome [<xref ref-type="bibr" rid="B96-genes-03-00344">96</xref>]. In a 2009 review, Ponting <italic>et al.</italic>, argue that a large, but as yet unknown, number of noncoding RNAs cannot be explained solely as the product of transcriptional noise [<xref ref-type="bibr" rid="B30-genes-03-00344">30</xref>]. If ncRNAs were simply transcriptional noise, than their expression levels would not show the wide diversity that is often observed among different tissues. In addition, their nucleotide substitution rates would be very similar to neutrally evolving sequences. Instead, several evolutionary studies suggest that many ncRNAs exhibit signatures of functionality that are more usually associated with protein-coding genes [<xref ref-type="bibr" rid="B47-genes-03-00344">47</xref>,<xref ref-type="bibr" rid="B117-genes-03-00344">117</xref>], or that their low sequence conservation is due to the fact that they are frequently acted upon by positive selection [<xref ref-type="bibr" rid="B118-genes-03-00344">118</xref>,<xref ref-type="bibr" rid="B119-genes-03-00344">119</xref>]. Nevertheless, some percentage of the transcripts observed are very likely the result either of transcriptional noise [<xref ref-type="bibr" rid="B120-genes-03-00344">120</xref>] or of genomic DNA contamination [<xref ref-type="bibr" rid="B121-genes-03-00344">121</xref>]. Even if not functional themselves, these unannotated transcripts might reflect transcriptional processes that facilitate the expression of other genes. Until we can functionally validate these transcripts or gain a better understanding of the range of transcriptional mechanisms involved, the question of how much of the human genome is transcribed will remain an open question. </p>
    </sec>
  </body>
  <back>
    <ack>
      <title>Acknowledgements</title>
      <p>Thanks to Steven Salzberg for helpful comments and suggestions. This work was supported by NIH grants R01 HG006677 and R01 HG006102.</p>
    </ack>
    <ref-list>
      <title>References</title>
      <ref id="B1-genes-03-00344">
        <label>1.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ohno</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>So much “junk” DNA in our genome</article-title>
          <source>Brookhaven Symp. Biol.</source>
          <year>1972</year>
          <volume>23</volume>
          <fpage>366</fpage>
          <lpage>370</lpage>
        <pub-id pub-id-type="pmid">5065367</pub-id></citation>
      </ref>
      <ref id="B2-genes-03-00344">
        <label>2.</label>
        <citation citation-type="journal">
          <collab>The International Human Genome Sequencing Consortium</collab>
          <article-title>Initial sequencing and analysis of the human genome</article-title>
          <source>Nature</source>
          <year>2001</year>
          <volume>409</volume>
          <fpage>860</fpage>
          <lpage>921</lpage>
          <pub-id pub-id-type="doi">10.1038/35057062</pub-id>
        </citation>
      </ref>
      <ref id="B3-genes-03-00344">
        <label>3.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Venter</surname>
              <given-names>J.C.</given-names>
            </name>
            <name>
              <surname>Adams</surname>
              <given-names>M.D.</given-names>
            </name>
            <name>
              <surname>Myers</surname>
              <given-names>E.W.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>P.W.</given-names>
            </name>
            <name>
              <surname>Mural</surname>
              <given-names>R.J.</given-names>
            </name>
            <name>
              <surname>Sutton</surname>
              <given-names>G.G.</given-names>
            </name>
            <name>
              <surname>Smith</surname>
              <given-names>H.O.</given-names>
            </name>
            <name>
              <surname>Yandell</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Evans</surname>
              <given-names>C.A.</given-names>
            </name>
            <name>
              <surname>Holt</surname>
              <given-names>R.A.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The sequence of the human genome</article-title>
          <source>Science</source>
          <year>2001</year>
          <volume>291</volume>
          <fpage>1304</fpage>
          <lpage>1351</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1058040</pub-id><pub-id pub-id-type="pmid">11181995</pub-id></citation>
      </ref>
      <ref id="B4-genes-03-00344">
        <label>4.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Chen</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Sun</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Lee</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Zhou</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Rowley</surname>
              <given-names>J.D.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>S.M.</given-names>
            </name>
          </person-group>
          <article-title>Identifying novel transcripts and novel genes in the human genome by using novel SAGE tags</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2002</year>
          <volume>99</volume>
          <fpage>12257</fpage>
          <lpage>12262</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.192436499</pub-id><pub-id pub-id-type="pmid">12213963</pub-id></citation>
      </ref>
      <ref id="B5-genes-03-00344">
        <label>5.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kapranov</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Cawley</surname>
              <given-names>S.E.</given-names>
            </name>
            <name>
              <surname>Drenkow</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Bekiranov</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Strausberg</surname>
              <given-names>R.L.</given-names>
            </name>
            <name>
              <surname>Fodor</surname>
              <given-names>S.P.</given-names>
            </name>
            <name>
              <surname>Gingeras</surname>
              <given-names>T.R.</given-names>
            </name>
          </person-group>
          <article-title>Large-scale transcriptional activity in chromosomes 21 and 22</article-title>
          <source>Science</source>
          <year>2002</year>
          <volume>296</volume>
          <fpage>916</fpage>
          <lpage>919</lpage>
          <pub-id pub-id-type="doi">10.1126/science.1068597</pub-id>
        </citation>
      </ref>
      <ref id="B6-genes-03-00344">
        <label>6.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Saha</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Sparks</surname>
              <given-names>A.B.</given-names>
            </name>
            <name>
              <surname>Rago</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Akmaev</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>C.J.</given-names>
            </name>
            <name>
              <surname>Vogelstein</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Kinzler</surname>
              <given-names>K.W.</given-names>
            </name>
            <name>
              <surname>Velculescu</surname>
              <given-names>V.E.</given-names>
            </name>
          </person-group>
          <article-title>Using the transcriptome to annotate the genome</article-title>
          <source>Nat. Biotechnol.</source>
          <year>2002</year>
          <volume>20</volume>
          <fpage>508</fpage>
          <lpage>512</lpage>
          <pub-id pub-id-type="doi">10.1038/nbt0502-508</pub-id>
        </citation>
      </ref>
      <ref id="B7-genes-03-00344">
        <label>7.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Mattick</surname>
              <given-names>J.S.</given-names>
            </name>
          </person-group>
          <article-title>The central role of RNA in human development and cognition</article-title>
          <source>FEBS Lett.</source>
          <year>2011</year>
          <volume>585</volume>
          <fpage>1600</fpage>
          <lpage>1616</lpage>
          <pub-id pub-id-type="doi">10.1016/j.febslet.2011.05.001</pub-id>
        </citation>
      </ref>
      <ref id="B8-genes-03-00344">
        <label>8.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Griffin</surname>
              <given-names>H.G.</given-names>
            </name>
            <name>
              <surname>Griffin</surname>
              <given-names>A.M.</given-names>
            </name>
          </person-group>
          <article-title>DNA sequencing. Recent innovations and future trends</article-title>
          <source>Appl. Biochem. Biotechnol.</source>
          <year>1993</year>
          <volume>38</volume>
          <fpage>147</fpage>
          <lpage>159</lpage>
          <pub-id pub-id-type="doi">10.1007/BF02916418</pub-id>
        </citation>
      </ref>
      <ref id="B9-genes-03-00344">
        <label>9.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Adams</surname>
              <given-names>M.D.</given-names>
            </name>
            <name>
              <surname>Kerlavage</surname>
              <given-names>A.R.</given-names>
            </name>
            <name>
              <surname>Fields</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Venter</surname>
              <given-names>J.C.</given-names>
            </name>
          </person-group>
          <article-title>3,400 new expressed sequence tags identify diversity of transcripts in human brain</article-title>
          <source>Nat. Genet.</source>
          <year>1993</year>
          <volume>4</volume>
          <fpage>256</fpage>
          <lpage>267</lpage>
          <pub-id pub-id-type="doi">10.1038/ng0793-256</pub-id>
        </citation>
      </ref>
      <ref id="B10-genes-03-00344">
        <label>10.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Adams</surname>
              <given-names>M.D.</given-names>
            </name>
            <name>
              <surname>Kerlavage</surname>
              <given-names>A.R.</given-names>
            </name>
            <name>
              <surname>Fleischmann</surname>
              <given-names>R.D.</given-names>
            </name>
            <name>
              <surname>Fuldner</surname>
              <given-names>R.A.</given-names>
            </name>
            <name>
              <surname>Bult</surname>
              <given-names>C.J.</given-names>
            </name>
            <name>
              <surname>Lee</surname>
              <given-names>N.H.</given-names>
            </name>
            <name>
              <surname>Kirkness</surname>
              <given-names>E.F.</given-names>
            </name>
            <name>
              <surname>Weinstock</surname>
              <given-names>K.G.</given-names>
            </name>
            <name>
              <surname>Gocayne</surname>
              <given-names>J.D.</given-names>
            </name>
            <name>
              <surname>White</surname>
              <given-names>O.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Initial assessment of human gene diversity and expression patterns based upon 83 million nucleotides of cDNA sequence</article-title>
          <source>Nature</source>
          <year>1995</year>
          <volume>377</volume>
          <fpage>3</fpage>
          <lpage>174</lpage>
        <pub-id pub-id-type="pmid">7566098</pub-id></citation>
      </ref>
      <ref id="B11-genes-03-00344">
        <label>11.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Pertea</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Salzberg</surname>
              <given-names>S.L.</given-names>
            </name>
          </person-group>
          <article-title>Between a chicken and a grape: Estimating the number of human genes</article-title>
          <source>Genome Biol.</source>
          <year>2010</year>
          <volume>11</volume>
          <fpage>206</fpage>
          <pub-id pub-id-type="doi">10.1186/gb-2010-11-5-206</pub-id>
        </citation>
      </ref>
      <ref id="B12-genes-03-00344">
        <label>12.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Strausberg</surname>
              <given-names>R.L.</given-names>
            </name>
            <name>
              <surname>Riggins</surname>
              <given-names>G.J.</given-names>
            </name>
          </person-group>
          <article-title>Navigating the human transcriptome</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2001</year>
          <volume>98</volume>
          <fpage>11837</fpage>
          <lpage>11838</lpage>
          <pub-id pub-id-type="doi">10.1073/pnas.221463598</pub-id>
        </citation>
      </ref>
      <ref id="B13-genes-03-00344">
        <label>13.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Velculescu</surname>
              <given-names>V.E.</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Vogelstein</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Kinzler</surname>
              <given-names>K.W.</given-names>
            </name>
          </person-group>
          <article-title>Serial analysis of gene expression</article-title>
          <source>Science</source>
          <year>1995</year>
          <volume>270</volume>
          <fpage>484</fpage>
          <lpage>487</lpage>
        <pub-id pub-id-type="doi">10.1126/science.270.5235.484</pub-id><pub-id pub-id-type="pmid">7570003</pub-id></citation>
      </ref>
      <ref id="B14-genes-03-00344">
        <label>14.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Shiraki</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Kondo</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Katayama</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Waki</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Kasukawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Kawaji</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Kodzius</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Watahiki</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Nakamura</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Arakawa</surname>
              <given-names>T.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>2003</year>
          <volume>100</volume>
          <fpage>15776</fpage>
          <lpage>15781</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.2136655100</pub-id><pub-id pub-id-type="pmid">14663149</pub-id></citation>
      </ref>
      <ref id="B15-genes-03-00344">
        <label>15.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Brenner</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Johnson</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Bridgham</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Golda</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Lloyd</surname>
              <given-names>D.H.</given-names>
            </name>
            <name>
              <surname>Johnson</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Luo</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>McCurdy</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Foy</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Ewan</surname>
              <given-names>M.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays</article-title>
          <source>Nat. Biotechnol.</source>
          <year>2000</year>
          <volume>18</volume>
          <fpage>630</fpage>
          <lpage>634</lpage>
        <pub-id pub-id-type="doi">10.1038/76469</pub-id><pub-id pub-id-type="pmid">10835600</pub-id></citation>
      </ref>
      <ref id="B16-genes-03-00344">
        <label>16.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Clark</surname>
              <given-names>T.A.</given-names>
            </name>
            <name>
              <surname>Sugnet</surname>
              <given-names>C.W.</given-names>
            </name>
            <name>
              <surname>Ares</surname>
              <given-names>M.</given-names>
              <suffix>Jr.</suffix>
            </name>
          </person-group>
          <article-title>Genomewide analysis of mRNA processing in yeast using splicing-specific microarrays</article-title>
          <source>Science</source>
          <year>2002</year>
          <volume>296</volume>
          <fpage>907</fpage>
          <lpage>910</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1069415</pub-id><pub-id pub-id-type="pmid">11988574</pub-id></citation>
      </ref>
      <ref id="B17-genes-03-00344">
        <label>17.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Schena</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Shalon</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Davis</surname>
              <given-names>R.W.</given-names>
            </name>
            <name>
              <surname>Brown</surname>
              <given-names>P.O.</given-names>
            </name>
          </person-group>
          <article-title>Quantitative monitoring of gene expression patterns with a complementary DNA microarray</article-title>
          <source>Science</source>
          <year>1995</year>
          <volume>270</volume>
          <fpage>467</fpage>
          <lpage>470</lpage>
        <pub-id pub-id-type="doi">10.1126/science.270.5235.467</pub-id><pub-id pub-id-type="pmid">7569999</pub-id></citation>
      </ref>
      <ref id="B18-genes-03-00344">
        <label>18.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lashkari</surname>
              <given-names>D.A.</given-names>
            </name>
            <name>
              <surname>DeRisi</surname>
              <given-names>J.L.</given-names>
            </name>
            <name>
              <surname>McCusker</surname>
              <given-names>J.H.</given-names>
            </name>
            <name>
              <surname>Namath</surname>
              <given-names>A.F.</given-names>
            </name>
            <name>
              <surname>Gentile</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Hwang</surname>
              <given-names>S.Y.</given-names>
            </name>
            <name>
              <surname>Brown</surname>
              <given-names>P.O.</given-names>
            </name>
            <name>
              <surname>Davis</surname>
              <given-names>R.W.</given-names>
            </name>
          </person-group>
          <article-title>Yeast microarrays for genome wide parallel genetic and gene expression analysis</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>1997</year>
          <volume>94</volume>
          <fpage>13057</fpage>
          <lpage>13062</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.94.24.13057</pub-id><pub-id pub-id-type="pmid">9371799</pub-id></citation>
      </ref>
      <ref id="B19-genes-03-00344">
        <label>19.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Bertone</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Stolc</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Royce</surname>
              <given-names>T.E.</given-names>
            </name>
            <name>
              <surname>Rozowsky</surname>
              <given-names>J.S.</given-names>
            </name>
            <name>
              <surname>Urban</surname>
              <given-names>A.E.</given-names>
            </name>
            <name>
              <surname>Zhu</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Rinn</surname>
              <given-names>J.L.</given-names>
            </name>
            <name>
              <surname>Tongprasit</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Samanta</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Weissman</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Gerstein</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Snyder</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Global identification of human transcribed sequences with genome tiling arrays</article-title>
          <source>Science</source>
          <year>2004</year>
          <volume>306</volume>
          <fpage>2242</fpage>
          <lpage>2246</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1103388</pub-id><pub-id pub-id-type="pmid">15539566</pub-id></citation>
      </ref>
      <ref id="B20-genes-03-00344">
        <label>20.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cheng</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Kapranov</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Drenkow</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Dike</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Brubaker</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Patel</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Long</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Stern</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Tammana</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Helt</surname>
              <given-names>G.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution</article-title>
          <source>Science</source>
          <year>2005</year>
          <volume>308</volume>
          <fpage>1149</fpage>
          <lpage>1154</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1108625</pub-id><pub-id pub-id-type="pmid">15790807</pub-id></citation>
      </ref>
      <ref id="B21-genes-03-00344">
        <label>21.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Castle</surname>
              <given-names>J.C.</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Shah</surname>
              <given-names>J.K.</given-names>
            </name>
            <name>
              <surname>Kulkarni</surname>
              <given-names>A.V.</given-names>
            </name>
            <name>
              <surname>Kalsotra</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Cooper</surname>
              <given-names>T.A.</given-names>
            </name>
            <name>
              <surname>Johnson</surname>
              <given-names>J.M.</given-names>
            </name>
          </person-group>
          <article-title>Expression of 24,426 human alternative splicing events and predicted cis regulation in 48 tissues and cell lines</article-title>
          <source>Nat. Genet.</source>
          <year>2008</year>
          <volume>40</volume>
          <fpage>1416</fpage>
          <lpage>1425</lpage>
        <pub-id pub-id-type="doi">10.1038/ng.264</pub-id><pub-id pub-id-type="pmid">18978788</pub-id></citation>
      </ref>
      <ref id="B22-genes-03-00344">
        <label>22.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Okoniewski</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Miller</surname>
              <given-names>C.J.</given-names>
            </name>
          </person-group>
          <article-title>Hybridization interactions between probesets in short oligo microarrays lead to spurious correlations</article-title>
          <source>BMC Bioinformatics</source>
          <year>2006</year>
          <volume>7</volume>
          <fpage>276</fpage>
          <pub-id pub-id-type="doi">10.1186/1471-2105-7-276</pub-id>
        </citation>
      </ref>
      <ref id="B23-genes-03-00344">
        <label>23.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Pan</surname>
              <given-names>Q.</given-names>
            </name>
            <name>
              <surname>Shai</surname>
              <given-names>O.</given-names>
            </name>
            <name>
              <surname>Misquitta</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Saltzman</surname>
              <given-names>A.L.</given-names>
            </name>
            <name>
              <surname>Mohammad</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Babak</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Siu</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Hughes</surname>
              <given-names>T.R.</given-names>
            </name>
            <name>
              <surname>Morris</surname>
              <given-names>Q.D.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Revealing global regulatory features of mammalian alternative splicing using a quantitative microarray platform</article-title>
          <source>Mol. Cell</source>
          <year>2004</year>
          <volume>16</volume>
          <fpage>929</fpage>
          <lpage>941</lpage>
          <pub-id pub-id-type="doi">10.1016/j.molcel.2004.12.004</pub-id>
        </citation>
      </ref>
      <ref id="B24-genes-03-00344">
        <label>24.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lister</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>O’Malley</surname>
              <given-names>R.C.</given-names>
            </name>
            <name>
              <surname>Tonti-Filippini</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Gregory</surname>
              <given-names>B.D.</given-names>
            </name>
            <name>
              <surname>Berry</surname>
              <given-names>C.C.</given-names>
            </name>
            <name>
              <surname>Millar</surname>
              <given-names>A.H.</given-names>
            </name>
            <name>
              <surname>Ecker</surname>
              <given-names>J.R.</given-names>
            </name>
          </person-group>
          <article-title>Highly integrated single-base resolution maps of the epigenome in Arabidopsis</article-title>
          <source>Cell</source>
          <year>2008</year>
          <volume>133</volume>
          <fpage>523</fpage>
          <lpage>536</lpage>
          <pub-id pub-id-type="doi">10.1016/j.cell.2008.03.029</pub-id>
        </citation>
      </ref>
      <ref id="B25-genes-03-00344">
        <label>25.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Mortazavi</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Williams</surname>
              <given-names>B.A.</given-names>
            </name>
            <name>
              <surname>McCue</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Schaeffer</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Wold</surname>
              <given-names>B.</given-names>
            </name>
          </person-group>
          <article-title>Mapping and quantifying mammalian transcriptomes by RNA-Seq</article-title>
          <source>Nat. Methods</source>
          <year>2008</year>
          <volume>5</volume>
          <fpage>621</fpage>
          <lpage>628</lpage>
        <pub-id pub-id-type="doi">10.1038/nmeth.1226</pub-id><pub-id pub-id-type="pmid">18516045</pub-id></citation>
      </ref>
      <ref id="B26-genes-03-00344">
        <label>26.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Nagalakshmi</surname>
              <given-names>U.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Waern</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Shou</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Raha</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Gerstein</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Snyder</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>The transcriptional landscape of the yeast genome defined by RNA sequencing</article-title>
          <source>Science</source>
          <year>2008</year>
          <volume>320</volume>
          <fpage>1344</fpage>
          <lpage>1349</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1158441</pub-id><pub-id pub-id-type="pmid">18451266</pub-id></citation>
      </ref>
      <ref id="B27-genes-03-00344">
        <label>27.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Salzberg</surname>
              <given-names>S.L.</given-names>
            </name>
          </person-group>
          <article-title>Recent advances in RNA sequence analysis</article-title>
          <source>F1000 Biol. Rep.</source>
          <year>2010</year>
          <volume>2</volume>
          <fpage>64</fpage>
        <pub-id pub-id-type="pmid">21173855</pub-id></citation>
      </ref>
      <ref id="B28-genes-03-00344">
        <label>28.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cloonan</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Forrest</surname>
              <given-names>A.R.</given-names>
            </name>
            <name>
              <surname>Kolle</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Gardiner</surname>
              <given-names>B.B.</given-names>
            </name>
            <name>
              <surname>Faulkner</surname>
              <given-names>G.J.</given-names>
            </name>
            <name>
              <surname>Brown</surname>
              <given-names>M.K.</given-names>
            </name>
            <name>
              <surname>Taylor</surname>
              <given-names>D.F.</given-names>
            </name>
            <name>
              <surname>Steptoe</surname>
              <given-names>A.L.</given-names>
            </name>
            <name>
              <surname>Wani</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Bethel</surname>
              <given-names>G.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Stem cell transcriptome profiling via massive-scale mRNA sequencing</article-title>
          <source>Nat. Methods</source>
          <year>2008</year>
          <volume>5</volume>
          <fpage>613</fpage>
          <lpage>619</lpage>
        <pub-id pub-id-type="doi">10.1038/nmeth.1223</pub-id><pub-id pub-id-type="pmid">18516046</pub-id></citation>
      </ref>
      <ref id="B29-genes-03-00344">
        <label>29.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wang</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Gerstein</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Snyder</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>RNA-Seq: A revolutionary tool for transcriptomics</article-title>
          <source>Nat. Rev. Genet.</source>
          <year>2009</year>
          <volume>10</volume>
          <fpage>57</fpage>
          <lpage>63</lpage>
          <pub-id pub-id-type="doi">10.1038/nrg2484</pub-id>
        </citation>
      </ref>
      <ref id="B30-genes-03-00344">
        <label>30.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ponting</surname>
              <given-names>C.P.</given-names>
            </name>
            <name>
              <surname>Oliver</surname>
              <given-names>P.L.</given-names>
            </name>
            <name>
              <surname>Reik</surname>
              <given-names>W.</given-names>
            </name>
          </person-group>
          <article-title>Evolution and functions of long noncoding RNAs</article-title>
          <source>Cell</source>
          <year>2009</year>
          <volume>136</volume>
          <fpage>629</fpage>
          <lpage>641</lpage>
        <pub-id pub-id-type="doi">10.1016/j.cell.2009.02.006</pub-id><pub-id pub-id-type="pmid">19239885</pub-id></citation>
      </ref>
      <ref id="B31-genes-03-00344">
        <label>31.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Dinger</surname>
              <given-names>M.E.</given-names>
            </name>
          </person-group>
          <article-title>lncRNAs: Finding the forest among the trees?</article-title>
          <source>Mol. Ther.</source>
          <year>2011</year>
          <volume>19</volume>
          <fpage>2109</fpage>
          <lpage>2111</lpage>
          <pub-id pub-id-type="doi">10.1038/mt.2011.251</pub-id>
        </citation>
      </ref>
      <ref id="B32-genes-03-00344">
        <label>32.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Fire</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Xu</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Montgomery</surname>
              <given-names>M.K.</given-names>
            </name>
            <name>
              <surname>Kostas</surname>
              <given-names>S.A.</given-names>
            </name>
            <name>
              <surname>Driver</surname>
              <given-names>S.E.</given-names>
            </name>
            <name>
              <surname>Mello</surname>
              <given-names>C.C.</given-names>
            </name>
          </person-group>
          <article-title>Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans</article-title>
          <source>Nature</source>
          <year>1998</year>
          <volume>391</volume>
          <fpage>806</fpage>
          <lpage>811</lpage>
        <pub-id pub-id-type="doi">10.1038/35888</pub-id><pub-id pub-id-type="pmid">9486653</pub-id></citation>
      </ref>
      <ref id="B33-genes-03-00344">
        <label>33.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lee</surname>
              <given-names>R.C.</given-names>
            </name>
            <name>
              <surname>Feinbaum</surname>
              <given-names>R.L.</given-names>
            </name>
            <name>
              <surname>Ambros</surname>
              <given-names>V.</given-names>
            </name>
          </person-group>
          <article-title>The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14</article-title>
          <source>Cell</source>
          <year>1993</year>
          <volume>75</volume>
          <fpage>843</fpage>
          <lpage>854</lpage>
          <pub-id pub-id-type="doi">10.1016/0092-8674(93)90529-Y</pub-id>
        </citation>
      </ref>
      <ref id="B34-genes-03-00344">
        <label>34.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Jacquier</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>The complex eukaryotic transcriptome: Unexpected pervasive transcription and novel small RNAs</article-title>
          <source>Nat. Rev. Genet.</source>
          <year>2009</year>
          <volume>10</volume>
          <fpage>833</fpage>
          <lpage>844</lpage>
          <pub-id pub-id-type="doi">10.1038/nrg2683</pub-id>
        </citation>
      </ref>
      <ref id="B35-genes-03-00344">
        <label>35.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Taft</surname>
              <given-names>R.J.</given-names>
            </name>
            <name>
              <surname>Pang</surname>
              <given-names>K.C.</given-names>
            </name>
            <name>
              <surname>Mercer</surname>
              <given-names>T.R.</given-names>
            </name>
            <name>
              <surname>Dinger</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Mattick</surname>
              <given-names>J.S.</given-names>
            </name>
          </person-group>
          <article-title>Non-coding RNAs: Regulators of disease</article-title>
          <source>J. Pathol.</source>
          <year>2010</year>
          <volume>220</volume>
          <fpage>126</fpage>
          <lpage>139</lpage>
          <pub-id pub-id-type="doi">10.1002/path.2638</pub-id>
        </citation>
      </ref>
      <ref id="B36-genes-03-00344">
        <label>36.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Derrien</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Guigo</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Johnson</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>The long non-coding RNAs: A New (P)layer in the “Dark Matter”</article-title>
          <source>Front Genet.</source>
          <year>2011</year>
          <volume>2</volume>
          <fpage>107</fpage>
        <pub-id pub-id-type="pmid">22303401</pub-id></citation>
      </ref>
      <ref id="B37-genes-03-00344">
        <label>37.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cabili</surname>
              <given-names>M.N.</given-names>
            </name>
            <name>
              <surname>Trapnell</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Goff</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Koziol</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Tazon-Vega</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Regev</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Rinn</surname>
              <given-names>J.L.</given-names>
            </name>
          </person-group>
          <article-title>Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses</article-title>
          <source>Genes Dev.</source>
          <year>2011</year>
          <volume>25</volume>
          <fpage>1915</fpage>
          <lpage>1927</lpage>
        <pub-id pub-id-type="doi">10.1101/gad.17446611</pub-id><pub-id pub-id-type="pmid">21890647</pub-id></citation>
      </ref>
      <ref id="B38-genes-03-00344">
        <label>38.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Iafrate</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Feuk</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Rivera</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Listewnik</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Donahoe</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Qi</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Scherer</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Lee</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Detection of large-scale variation in the human genome</article-title>
          <source>Nat Genet.</source>
          <year>2004</year>
          <volume>36</volume>
          <fpage>949</fpage>
          <lpage>951</lpage>
        <pub-id pub-id-type="doi">10.1038/ng1416</pub-id><pub-id pub-id-type="pmid">15286789</pub-id></citation>
      </ref>
      <ref id="B39-genes-03-00344">
        <label>39.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Sebat</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Lakshmi</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Troge</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Alexander</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Young</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Lundin</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Maner</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Massa</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Walker</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Chi</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Navin</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Lucito</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Healy</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Hicks</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Ye</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Reiner</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Gilliam</surname>
              <given-names>T.C.</given-names>
            </name>
            <name>
              <surname>Trask</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Patterson</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Zetterberg</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Wigler</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Large-scale copy number polymorphism in the human genome</article-title>
          <source>Science</source>
          <year>2004</year>
          <volume>305</volume>
          <fpage>525</fpage>
          <lpage>528</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1098918</pub-id><pub-id pub-id-type="pmid">15273396</pub-id></citation>
      </ref>
      <ref id="B40-genes-03-00344">
        <label>40.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Li</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Zheng</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Luo</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Zhu</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>Q.</given-names>
            </name>
            <name>
              <surname>Qian</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Ren</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Tian</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>J.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Building the sequence map of the human pan-genome</article-title>
          <source>Nat. Biotechnol.</source>
          <year>2009</year>
          <volume>28</volume>
          <fpage>57</fpage>
          <lpage>63</lpage>
        <pub-id pub-id-type="pmid">19997067</pub-id></citation>
      </ref>
      <ref id="B41-genes-03-00344">
        <label>41.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Trapnell</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Williams</surname>
              <given-names>B.A.</given-names>
            </name>
            <name>
              <surname>Pertea</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Mortazavi</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Kwan</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>van Baren</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Salzberg</surname>
              <given-names>S.L.</given-names>
            </name>
            <name>
              <surname>Wold</surname>
              <given-names>B.J.</given-names>
            </name>
            <name>
              <surname>Pachter</surname>
              <given-names>L.</given-names>
            </name>
          </person-group>
          <article-title>Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation</article-title>
          <source>Nat. Biotechnol.</source>
          <year>2010</year>
          <volume>28</volume>
          <fpage>511</fpage>
          <lpage>515</lpage>
        <pub-id pub-id-type="doi">10.1038/nbt.1621</pub-id><pub-id pub-id-type="pmid">20436464</pub-id></citation>
      </ref>
      <ref id="B42-genes-03-00344">
        <label>42.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Pan</surname>
              <given-names>Q.</given-names>
            </name>
            <name>
              <surname>Shai</surname>
              <given-names>O.</given-names>
            </name>
            <name>
              <surname>Lee</surname>
              <given-names>L.J.</given-names>
            </name>
            <name>
              <surname>Frey</surname>
              <given-names>B.J.</given-names>
            </name>
            <name>
              <surname>Blencowe</surname>
              <given-names>B.J.</given-names>
            </name>
          </person-group>
          <article-title>Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing</article-title>
          <source>Nat. Genet.</source>
          <year>2008</year>
          <volume>40</volume>
          <fpage>1413</fpage>
          <lpage>1415</lpage>
        <pub-id pub-id-type="doi">10.1038/ng.259</pub-id><pub-id pub-id-type="pmid">18978789</pub-id></citation>
      </ref>
      <ref id="B43-genes-03-00344">
        <label>43.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kampa</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Cheng</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Kapranov</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Yamanaka</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Brubaker</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Cawley</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Drenkow</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Piccolboni</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Bekiranov</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Helt</surname>
              <given-names>G.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22</article-title>
          <source>Genome Res.</source>
          <year>2004</year>
          <volume>14</volume>
          <fpage>331</fpage>
          <lpage>342</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.2094104</pub-id>
        </citation>
      </ref>
      <ref id="B44-genes-03-00344">
        <label>44.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wang</surname>
              <given-names>E.T.</given-names>
            </name>
            <name>
              <surname>Sandberg</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Luo</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Khrebtukova</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Mayr</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Kingsmore</surname>
              <given-names>S.F.</given-names>
            </name>
            <name>
              <surname>Schroth</surname>
              <given-names>G.P.</given-names>
            </name>
            <name>
              <surname>Burge</surname>
              <given-names>C.B.</given-names>
            </name>
          </person-group>
          <article-title>Alternative isoform regulation in human tissue transcriptomes</article-title>
          <source>Nature</source>
          <year>2008</year>
          <volume>456</volume>
          <fpage>470</fpage>
          <lpage>476</lpage>
        <pub-id pub-id-type="doi">10.1038/nature07509</pub-id><pub-id pub-id-type="pmid">18978772</pub-id></citation>
      </ref>
      <ref id="B45-genes-03-00344">
        <label>45.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Blencowe</surname>
              <given-names>B.J.</given-names>
            </name>
          </person-group>
          <article-title>Alternative splicing: New insights from global analyses</article-title>
          <source>Cell</source>
          <year>2006</year>
          <volume>126</volume>
          <fpage>37</fpage>
          <lpage>47</lpage>
          <pub-id pub-id-type="doi">10.1016/j.cell.2006.06.023</pub-id>
        </citation>
      </ref>
      <ref id="B46-genes-03-00344">
        <label>46.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Mudge</surname>
              <given-names>J.M.</given-names>
            </name>
            <name>
              <surname>Frankish</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Fernandez-Banet</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Alioto</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Derrien</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Howald</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Reymond</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Guigo</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Hubbard</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Harrow</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>The origins, evolution, and functional potential of alternative splicing in vertebrates</article-title>
          <source>Mol. Biol. Evol.</source>
          <year>2011</year>
          <volume>28</volume>
          <fpage>2949</fpage>
          <lpage>2959</lpage>
          <pub-id pub-id-type="doi">10.1093/molbev/msr127</pub-id>
        </citation>
      </ref>
      <ref id="B47-genes-03-00344">
        <label>47.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ravasi</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Suzuki</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Pang</surname>
              <given-names>K.C.</given-names>
            </name>
            <name>
              <surname>Katayama</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Furuno</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Okunishi</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Fukuda</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Ru</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Frith</surname>
              <given-names>M.C.</given-names>
            </name>
            <name>
              <surname>Gongora</surname>
              <given-names>M.M.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Experimental validation of the regulated expression of large numbers of non-coding RNAs from the mouse genome</article-title>
          <source>Genome Res.</source>
          <year>2006</year>
          <volume>16</volume>
          <fpage>11</fpage>
          <lpage>19</lpage>
        <pub-id pub-id-type="pmid">16344565</pub-id></citation>
      </ref>
      <ref id="B48-genes-03-00344">
        <label>48.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Seok</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Xu</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Jiang</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Davis</surname>
              <given-names>R.W.</given-names>
            </name>
            <name>
              <surname>Xiao</surname>
              <given-names>W.</given-names>
            </name>
          </person-group>
          <article-title>Knowledge-based reconstruction of mRNA transcripts with short sequencing reads for transcriptome research</article-title>
          <source>PLoS One</source>
          <year>2012</year>
          <volume>7</volume>
          <fpage>e31440</fpage>
        <pub-id pub-id-type="doi">10.1371/journal.pone.0031440</pub-id><pub-id pub-id-type="pmid">22312447</pub-id></citation>
      </ref>
      <ref id="B49-genes-03-00344">
        <label>49.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Carninci</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Kasukawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Katayama</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Gough</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Frith</surname>
              <given-names>M.C.</given-names>
            </name>
            <name>
              <surname>Maeda</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Oyama</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Ravasi</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Lenhard</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Wells</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Kodzius</surname>
              <given-names>R.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The transcriptional landscape of the mammalian genome</article-title>
          <source>Science</source>
          <year>2005</year>
          <volume>309</volume>
          <fpage>1559</fpage>
          <lpage>1563</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1112014</pub-id><pub-id pub-id-type="pmid">16141072</pub-id></citation>
      </ref>
      <ref id="B50-genes-03-00344">
        <label>50.</label>
        <citation citation-type="web">
          <article-title>Ensembl Genome Browser</article-title>
          <comment>Available online:<ext-link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://useast.ensembl.org/Homo_sapiens/Info/Index" ext-link-type="uri">http://useast.ensembl.org/Homo_sapiens/Info/Index</ext-link></comment>
          <access-date>(accessed on 5 September 2011)</access-date>
        </citation>
      </ref>
      <ref id="B51-genes-03-00344">
        <label>51.</label>
        <citation citation-type="web">
          <article-title>NCBI’s RefSeq Database</article-title>
          <comment>Available online:<ext-link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://www.ncbi.nlm.nih.gov/RefSeq/" ext-link-type="uri">http://www.ncbi.nlm.nih.gov/RefSeq/</ext-link></comment>
          <access-date>(accessed on 5 September 2011)</access-date>
        </citation>
      </ref>
      <ref id="B52-genes-03-00344">
        <label>52.</label>
        <citation citation-type="web">
          <article-title>UCSC Genome Table Browser</article-title>
          <comment>Available online:<ext-link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://genome.ucsc.edu/cgi-bin/hgTables" ext-link-type="uri">http://genome.ucsc.edu/cgi-bin/hgTables</ext-link></comment>
          <access-date>(accessed on 5 September 2011)</access-date>
        </citation>
      </ref>
      <ref id="B53-genes-03-00344">
        <label>53.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kapranov</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Drenkow</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Cheng</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Long</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Helt</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Dike</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Gingeras</surname>
              <given-names>T.R.</given-names>
            </name>
          </person-group>
          <article-title>Examples of the complex architecture of the human transcriptome revealed by RACE and high-density tiling arrays</article-title>
          <source>Genome Res.</source>
          <year>2005</year>
          <volume>15</volume>
          <fpage>987</fpage>
          <lpage>997</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.3455305</pub-id>
        </citation>
      </ref>
      <ref id="B54-genes-03-00344">
        <label>54.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zheng</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Frankish</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Baertsch</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Kapranov</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Reymond</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Choo</surname>
              <given-names>S.W.</given-names>
            </name>
            <name>
              <surname>Lu</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Denoeud</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Antonarakis</surname>
              <given-names>S.E.</given-names>
            </name>
            <name>
              <surname>Snyder</surname>
              <given-names>M.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Pseudogenes in the ENCODE regions: Consensus annotation, analysis of transcription, and evolution</article-title>
          <source>Genome Res.</source>
          <year>2007</year>
          <volume>17</volume>
          <fpage>839</fpage>
          <lpage>851</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.5586307</pub-id>
        </citation>
      </ref>
      <ref id="B55-genes-03-00344">
        <label>55.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Sasidharan</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Gerstein</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Genomics: Protein fossils live on as RNA</article-title>
          <source>Nature</source>
          <year>2008</year>
          <volume>453</volume>
          <fpage>729</fpage>
          <lpage>731</lpage>
          <pub-id pub-id-type="doi">10.1038/453729a</pub-id>
        </citation>
      </ref>
      <ref id="B56-genes-03-00344">
        <label>56.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Sie</surname>
              <given-names>C.P.</given-names>
            </name>
            <name>
              <surname>Kuchka</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>RNA editing adds flavor to complexity</article-title>
          <source>Biochemistry (Mosc)</source>
          <year>2011</year>
          <volume>76</volume>
          <fpage>869</fpage>
          <lpage>881</lpage>
          <pub-id pub-id-type="doi">10.1134/S0006297911080025</pub-id>
        </citation>
      </ref>
      <ref id="B57-genes-03-00344">
        <label>57.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Bass</surname>
              <given-names>B.L.</given-names>
            </name>
            <name>
              <surname>Weintraub</surname>
              <given-names>H.</given-names>
            </name>
          </person-group>
          <article-title>An unwinding activity that covalently modifies its double-stranded RNA substrate</article-title>
          <source>Cell</source>
          <year>1988</year>
          <volume>55</volume>
          <fpage>1089</fpage>
          <lpage>1098</lpage>
          <pub-id pub-id-type="doi">10.1016/0092-8674(88)90253-X</pub-id>
        </citation>
      </ref>
      <ref id="B58-genes-03-00344">
        <label>58.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wagner</surname>
              <given-names>R.W.</given-names>
            </name>
            <name>
              <surname>Smith</surname>
              <given-names>J.E.</given-names>
            </name>
            <name>
              <surname>Cooperman</surname>
              <given-names>B.S.</given-names>
            </name>
            <name>
              <surname>Nishikura</surname>
              <given-names>K.</given-names>
            </name>
          </person-group>
          <article-title>A double-stranded RNA unwinding activity introduces structural alterations by means of adenosine to inosine conversions in mammalian cells and Xenopus eggs</article-title>
          <source>Proc. Natl. Acad. Sci. USA</source>
          <year>1989</year>
          <volume>86</volume>
          <fpage>2647</fpage>
          <lpage>2651</lpage>
        <pub-id pub-id-type="doi">10.1073/pnas.86.8.2647</pub-id><pub-id pub-id-type="pmid">2704740</pub-id></citation>
      </ref>
      <ref id="B59-genes-03-00344">
        <label>59.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Powell</surname>
              <given-names>L.M.</given-names>
            </name>
            <name>
              <surname>Wallis</surname>
              <given-names>S.C.</given-names>
            </name>
            <name>
              <surname>Pease</surname>
              <given-names>R.J.</given-names>
            </name>
            <name>
              <surname>Edwards</surname>
              <given-names>Y.H.</given-names>
            </name>
            <name>
              <surname>Knott</surname>
              <given-names>T.J.</given-names>
            </name>
            <name>
              <surname>Scott</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>A novel form of tissue-specific RNA processing produces apolipoprotein-B48 in intestine</article-title>
          <source>Cell</source>
          <year>1987</year>
          <volume>50</volume>
          <fpage>831</fpage>
          <lpage>840</lpage>
          <pub-id pub-id-type="doi">10.1016/0092-8674(87)90510-1</pub-id>
        </citation>
      </ref>
      <ref id="B60-genes-03-00344">
        <label>60.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Chen</surname>
              <given-names>S.H.</given-names>
            </name>
            <name>
              <surname>Habib</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Yang</surname>
              <given-names>C.Y.</given-names>
            </name>
            <name>
              <surname>Gu</surname>
              <given-names>Z.W.</given-names>
            </name>
            <name>
              <surname>Lee</surname>
              <given-names>B.R.</given-names>
            </name>
            <name>
              <surname>Weng</surname>
              <given-names>S.A.</given-names>
            </name>
            <name>
              <surname>Silberman</surname>
              <given-names>S.R.</given-names>
            </name>
            <name>
              <surname>Cai</surname>
              <given-names>S.J.</given-names>
            </name>
            <name>
              <surname>Deslypere</surname>
              <given-names>J.P.</given-names>
            </name>
            <name>
              <surname>Rosseneu</surname>
              <given-names>M.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Apolipoprotein B-48 is the product of a messenger RNA with an organ-specific in-frame stop codon</article-title>
          <source>Science</source>
          <year>1987</year>
          <volume>238</volume>
          <fpage>363</fpage>
          <lpage>366</lpage>
        <pub-id pub-id-type="doi">10.1126/science.3659919</pub-id><pub-id pub-id-type="pmid">3659919</pub-id></citation>
      </ref>
      <ref id="B61-genes-03-00344">
        <label>61.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Teng</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Burant</surname>
              <given-names>C.F.</given-names>
            </name>
            <name>
              <surname>Davidson</surname>
              <given-names>N.O.</given-names>
            </name>
          </person-group>
          <article-title>Molecular cloning of an apolipoprotein B messenger RNA editing protein</article-title>
          <source>Science</source>
          <year>1993</year>
          <volume>260</volume>
          <fpage>1816</fpage>
          <lpage>1819</lpage>
        <pub-id pub-id-type="doi">10.1126/science.8511591</pub-id><pub-id pub-id-type="pmid">8511591</pub-id></citation>
      </ref>
      <ref id="B62-genes-03-00344">
        <label>62.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Athanasiadis</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Rich</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Maas</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>Widespread A-to-I RNA editing of Alu-containing mRNAs in the human transcriptome</article-title>
          <source>PLoS Biol.</source>
          <year>2004</year>
          <volume>2</volume>
          <fpage>e391</fpage>
          <pub-id pub-id-type="doi">10.1371/journal.pbio.0020391</pub-id>
        </citation>
      </ref>
      <ref id="B63-genes-03-00344">
        <label>63.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Levanon</surname>
              <given-names>E.Y.</given-names>
            </name>
            <name>
              <surname>Eisenberg</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Yelin</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Nemzer</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Hallegger</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Shemesh</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Fligelman</surname>
              <given-names>Z.Y.</given-names>
            </name>
            <name>
              <surname>Shoshan</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Pollock</surname>
              <given-names>S.R.</given-names>
            </name>
            <name>
              <surname>Sztybel</surname>
              <given-names>D.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Systematic identification of abundant A-to-I editing sites in the human transcriptome</article-title>
          <source>Nat. Biotechnol.</source>
          <year>2004</year>
          <volume>22</volume>
          <fpage>1001</fpage>
          <lpage>1005</lpage>
        <pub-id pub-id-type="doi">10.1038/nbt996</pub-id><pub-id pub-id-type="pmid">15258596</pub-id></citation>
      </ref>
      <ref id="B64-genes-03-00344">
        <label>64.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Li</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>I.X.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Bruzel</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Richards</surname>
              <given-names>A.L.</given-names>
            </name>
            <name>
              <surname>Toung</surname>
              <given-names>J.M.</given-names>
            </name>
            <name>
              <surname>Cheung</surname>
              <given-names>V.G.</given-names>
            </name>
          </person-group>
          <article-title>Widespread RNA and DNA sequence differences in the human transcriptome</article-title>
          <source>Science</source>
          <year>2011</year>
          <volume>333</volume>
          <fpage>53</fpage>
          <lpage>58</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1207018</pub-id><pub-id pub-id-type="pmid">21596952</pub-id></citation>
      </ref>
      <ref id="B65-genes-03-00344">
        <label>65.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kleinman</surname>
              <given-names>C.L.</given-names>
            </name>
            <name>
              <surname>Majewski</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>Comment on “Widespread RNA and DNA sequence differences in the human transcriptome”</article-title>
          <source>Science</source>
          <year>2012</year>
          <volume>335</volume>
          <fpage>1302</fpage>
          <pub-id pub-id-type="doi">10.1126/science.1209658</pub-id>
        </citation>
      </ref>
      <ref id="B66-genes-03-00344">
        <label>66.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lin</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Piskol</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Tan</surname>
              <given-names>M.H.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>J.B.</given-names>
            </name>
          </person-group>
          <article-title>Comment on “Widespread RNA and DNA sequence differences in the human transcriptome”</article-title>
          <source>Science</source>
          <year>2012</year>
          <volume>335</volume>
          <fpage>1302-e</fpage>
        </citation>
      </ref>
      <ref id="B67-genes-03-00344">
        <label>67.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Pickrell</surname>
              <given-names>J.K.</given-names>
            </name>
            <name>
              <surname>Gilad</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Pritchard</surname>
              <given-names>J.K.</given-names>
            </name>
          </person-group>
          <article-title>Comment on “Widespread RNA and DNA sequence differences in the human transcriptome”</article-title>
          <source>Science</source>
          <year>2012</year>
          <volume>335</volume>
          <pub-id pub-id-type="doi">10.1126/science.1210484</pub-id>
        </citation>
      </ref>
      <ref id="B68-genes-03-00344">
        <label>68.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Schrider</surname>
              <given-names>D.R.</given-names>
            </name>
            <name>
              <surname>Gout</surname>
              <given-names>J.F.</given-names>
            </name>
            <name>
              <surname>Hahn</surname>
              <given-names>M.W.</given-names>
            </name>
          </person-group>
          <article-title>Very few RNA and DNA sequence differences in the human transcriptome</article-title>
          <source>PLoS One</source>
          <year>2011</year>
          <volume>6</volume>
          <fpage>e25842</fpage>
        <pub-id pub-id-type="doi">10.1371/journal.pone.0025842</pub-id><pub-id pub-id-type="pmid">22022455</pub-id></citation>
      </ref>
      <ref id="B69-genes-03-00344">
        <label>69.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Barak</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Levanon</surname>
              <given-names>E.Y.</given-names>
            </name>
            <name>
              <surname>Eisenberg</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Paz</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Rechavi</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Church</surname>
              <given-names>G.M.</given-names>
            </name>
            <name>
              <surname>Mehr</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>Evidence for large diversity in the human transcriptome created by Alu RNA editing</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2009</year>
          <volume>37</volume>
          <fpage>6905</fpage>
          <lpage>6915</lpage>
          <pub-id pub-id-type="doi">10.1093/nar/gkp729</pub-id>
        </citation>
      </ref>
      <ref id="B70-genes-03-00344">
        <label>70.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Martin</surname>
              <given-names>J.A.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Z.</given-names>
            </name>
          </person-group>
          <article-title>Next-generation transcriptome assembly</article-title>
          <source>Nat. Rev. Genet.</source>
          <year>2011</year>
          <volume>12</volume>
          <fpage>671</fpage>
          <lpage>682</lpage>
          <pub-id pub-id-type="doi">10.1038/nrg3068</pub-id>
        </citation>
      </ref>
      <ref id="B71-genes-03-00344">
        <label>71.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Costa</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Angelini</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>de Feis</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Ciccodicola</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Uncovering the complexity of transcriptomes with RNA-Seq</article-title>
          <source>J. Biomed. Biotechnol.</source>
          <year>2010</year>
          <fpage>853916</fpage>
        </citation>
      </ref>
      <ref id="B72-genes-03-00344">
        <label>72.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Garber</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Grabherr</surname>
              <given-names>M.G.</given-names>
            </name>
            <name>
              <surname>Guttman</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Trapnell</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Computational methods for transcriptome annotation and quantification using RNA-seq</article-title>
          <source>Nat. Methods</source>
          <year>2011</year>
          <volume>8</volume>
          <fpage>469</fpage>
          <lpage>477</lpage>
        <pub-id pub-id-type="doi">10.1038/nmeth.1613</pub-id><pub-id pub-id-type="pmid">21623353</pub-id></citation>
      </ref>
      <ref id="B73-genes-03-00344">
        <label>73.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zerbino</surname>
              <given-names>D.R.</given-names>
            </name>
            <name>
              <surname>Birney</surname>
              <given-names>E.</given-names>
            </name>
          </person-group>
          <article-title>Velvet: Algorithms for de novo short read assembly using de Bruijn graphs</article-title>
          <source>Genome Res.</source>
          <year>2008</year>
          <volume>18</volume>
          <fpage>821</fpage>
          <lpage>829</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.074492.107</pub-id>
        </citation>
      </ref>
      <ref id="B74-genes-03-00344">
        <label>74.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Simpson</surname>
              <given-names>J.T.</given-names>
            </name>
            <name>
              <surname>Wong</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Jackman</surname>
              <given-names>S.D.</given-names>
            </name>
            <name>
              <surname>Schein</surname>
              <given-names>J.E.</given-names>
            </name>
            <name>
              <surname>Jones</surname>
              <given-names>S.J.</given-names>
            </name>
            <name>
              <surname>Birol</surname>
              <given-names>I.</given-names>
            </name>
          </person-group>
          <article-title>ABySS: A parallel assembler for short read sequence data</article-title>
          <source>Genome Res.</source>
          <year>2009</year>
          <volume>19</volume>
          <fpage>1117</fpage>
          <lpage>1123</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.089532.108</pub-id>
        </citation>
      </ref>
      <ref id="B75-genes-03-00344">
        <label>75.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Butler</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>MacCallum</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Kleber</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Shlyakhter</surname>
              <given-names>I.A.</given-names>
            </name>
            <name>
              <surname>Belmonte</surname>
              <given-names>M.K.</given-names>
            </name>
            <name>
              <surname>Lander</surname>
              <given-names>E.S.</given-names>
            </name>
            <name>
              <surname>Nusbaum</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Jaffe</surname>
              <given-names>D.B.</given-names>
            </name>
          </person-group>
          <article-title>ALLPATHS: De novo assembly of whole-genome shotgun microreads</article-title>
          <source>Genome Res.</source>
          <year>2008</year>
          <volume>18</volume>
          <fpage>810</fpage>
          <lpage>820</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.7337908</pub-id>
        </citation>
      </ref>
      <ref id="B76-genes-03-00344">
        <label>76.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Langmead</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Trapnell</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Pop</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Salzberg</surname>
              <given-names>S.L.</given-names>
            </name>
          </person-group>
          <article-title>Ultrafast and memory-efficient alignment of short DNA sequences to the human genome</article-title>
          <source>Genome Biol.</source>
          <year>2009</year>
          <volume>10</volume>
          <fpage>R25</fpage>
          <pub-id pub-id-type="doi">10.1186/gb-2009-10-3-r25</pub-id>
        </citation>
      </ref>
      <ref id="B77-genes-03-00344">
        <label>77.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Li</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Durbin</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>Fast and accurate short read alignment with Burrows-Wheeler transform</article-title>
          <source>Bioinformatics</source>
          <year>2009</year>
          <volume>25</volume>
          <fpage>1754</fpage>
          <lpage>1760</lpage>
          <pub-id pub-id-type="doi">10.1093/bioinformatics/btp324</pub-id>
        </citation>
      </ref>
      <ref id="B78-genes-03-00344">
        <label>78.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Trapnell</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Pachter</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Salzberg</surname>
              <given-names>S.L.</given-names>
            </name>
          </person-group>
          <article-title>TopHat: Discovering splice junctions with RNA-Seq</article-title>
          <source>Bioinformatics</source>
          <year>2009</year>
          <volume>25</volume>
          <fpage>1105</fpage>
          <lpage>1111</lpage>
          <pub-id pub-id-type="doi">10.1093/bioinformatics/btp120</pub-id>
        </citation>
      </ref>
      <ref id="B79-genes-03-00344">
        <label>79.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Au</surname>
              <given-names>K.F.</given-names>
            </name>
            <name>
              <surname>Jiang</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Lin</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Xing</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Wong</surname>
              <given-names>W.H.</given-names>
            </name>
          </person-group>
          <article-title>Detection of splice junctions from paired-end RNA-seq data by SpliceMap</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2010</year>
          <volume>38</volume>
          <fpage>4570</fpage>
          <lpage>4578</lpage>
          <pub-id pub-id-type="doi">10.1093/nar/gkq211</pub-id>
        </citation>
      </ref>
      <ref id="B80-genes-03-00344">
        <label>80.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wang</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Singh</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Zeng</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Coleman</surname>
              <given-names>S.J.</given-names>
            </name>
            <name>
              <surname>Huang</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Savich</surname>
              <given-names>G.L.</given-names>
            </name>
            <name>
              <surname>He</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Mieczkowski</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Grimm</surname>
              <given-names>S.A.</given-names>
            </name>
            <name>
              <surname>Perou</surname>
              <given-names>C.M.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2010</year>
          <volume>38</volume>
          <fpage>e178</fpage>
        <pub-id pub-id-type="doi">10.1093/nar/gkq622</pub-id><pub-id pub-id-type="pmid">20802226</pub-id></citation>
      </ref>
      <ref id="B81-genes-03-00344">
        <label>81.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wu</surname>
              <given-names>T.D.</given-names>
            </name>
            <name>
              <surname>Nacu</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>Fast and SNP-tolerant detection of complex variants and splicing in short reads</article-title>
          <source>Bioinformatics</source>
          <year>2010</year>
          <volume>26</volume>
          <fpage>873</fpage>
          <lpage>881</lpage>
          <pub-id pub-id-type="doi">10.1093/bioinformatics/btq057</pub-id>
        </citation>
      </ref>
      <ref id="B82-genes-03-00344">
        <label>82.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kent</surname>
              <given-names>W.J.</given-names>
            </name>
          </person-group>
          <article-title>BLAT--the BLAST-like alignment tool</article-title>
          <source>Genome Res.</source>
          <year>2002</year>
          <volume>12</volume>
          <fpage>656</fpage>
          <lpage>664</lpage>
        <pub-id pub-id-type="pmid">11932250</pub-id></citation>
      </ref>
      <ref id="B83-genes-03-00344">
        <label>83.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Guttman</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Garber</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Levin</surname>
              <given-names>J.Z.</given-names>
            </name>
            <name>
              <surname>Donaghey</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Robinson</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Adiconis</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Fan</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Koziol</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Gnirke</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Nusbaum</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Rinn</surname>
              <given-names>J.L.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs</article-title>
          <source>Nat. Biotechnol.</source>
          <year>2010</year>
          <volume>28</volume>
          <fpage>503</fpage>
          <lpage>510</lpage>
        <pub-id pub-id-type="doi">10.1038/nbt.1633</pub-id><pub-id pub-id-type="pmid">20436462</pub-id></citation>
      </ref>
      <ref id="B84-genes-03-00344">
        <label>84.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Feng</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Jiang</surname>
              <given-names>T.</given-names>
            </name>
          </person-group>
          <article-title>Inference of isoforms from short sequence reads</article-title>
          <source>J. Comput. Biol.</source>
          <year>2011</year>
          <volume>18</volume>
          <fpage>305</fpage>
          <lpage>321</lpage>
          <pub-id pub-id-type="doi">10.1089/cmb.2010.0243</pub-id>
        </citation>
      </ref>
      <ref id="B85-genes-03-00344">
        <label>85.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Li</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Feng</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Jiang</surname>
              <given-names>T.</given-names>
            </name>
          </person-group>
          <article-title>IsoLasso: A LASSO regression approach to RNA-Seq based transcriptome assembly</article-title>
          <source>J. Comput. Biol.</source>
          <year>2011</year>
          <volume>18</volume>
          <fpage>1693</fpage>
          <lpage>1707</lpage>
          <pub-id pub-id-type="doi">10.1089/cmb.2011.0171</pub-id>
        </citation>
      </ref>
      <ref id="B86-genes-03-00344">
        <label>86.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Grabherr</surname>
              <given-names>M.G.</given-names>
            </name>
            <name>
              <surname>Haas</surname>
              <given-names>B.J.</given-names>
            </name>
            <name>
              <surname>Yassour</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Levin</surname>
              <given-names>J.Z.</given-names>
            </name>
            <name>
              <surname>Thompson</surname>
              <given-names>D.A.</given-names>
            </name>
            <name>
              <surname>Amit</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Adiconis</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Fan</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Raychowdhury</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Zeng</surname>
              <given-names>Q.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Full-length transcriptome assembly from RNA-Seq data without a reference genome</article-title>
          <source>Nat. Biotechnol.</source>
          <year>2011</year>
          <volume>29</volume>
          <fpage>644</fpage>
          <lpage>652</lpage>
        <pub-id pub-id-type="doi">10.1038/nbt.1883</pub-id><pub-id pub-id-type="pmid">21572440</pub-id></citation>
      </ref>
      <ref id="B87-genes-03-00344">
        <label>87.</label>
        <citation citation-type="web">
          <article-title>Oases: De novo transcriptome assembler for very short reads</article-title>
          <comment>Available online:<ext-link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://www.ebi.ac.uk/~zerbino/oases/" ext-link-type="uri">http://www.ebi.ac.uk/~zerbino/oases/</ext-link></comment>
          <access-date>(accessed on 12 April 2012)</access-date>
        </citation>
      </ref>
      <ref id="B88-genes-03-00344">
        <label>88.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Li</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Yu</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Lam</surname>
              <given-names>T.W.</given-names>
            </name>
            <name>
              <surname>Yiu</surname>
              <given-names>S.M.</given-names>
            </name>
            <name>
              <surname>Kristiansen</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>SOAP2: An improved ultrafast tool for short read alignment</article-title>
          <source>Bioinformatics</source>
          <year>2009</year>
          <volume>25</volume>
          <fpage>1966</fpage>
          <lpage>1967</lpage>
        <pub-id pub-id-type="doi">10.1093/bioinformatics/btp336</pub-id><pub-id pub-id-type="pmid">19497933</pub-id></citation>
      </ref>
      <ref id="B89-genes-03-00344">
        <label>89.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Birol</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Jackman</surname>
              <given-names>S.D.</given-names>
            </name>
            <name>
              <surname>Nielsen</surname>
              <given-names>C.B.</given-names>
            </name>
            <name>
              <surname>Qian</surname>
              <given-names>J.Q.</given-names>
            </name>
            <name>
              <surname>Varhol</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Stazyk</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Morin</surname>
              <given-names>R.D.</given-names>
            </name>
            <name>
              <surname>Zhao</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Hirst</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Schein</surname>
              <given-names>J.E.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>De novo transcriptome assembly with ABySS</article-title>
          <source>Bioinformatics</source>
          <year>2009</year>
          <volume>25</volume>
          <fpage>2872</fpage>
          <lpage>2877</lpage>
        <pub-id pub-id-type="doi">10.1093/bioinformatics/btp367</pub-id><pub-id pub-id-type="pmid">19528083</pub-id></citation>
      </ref>
      <ref id="B90-genes-03-00344">
        <label>90.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zhao</surname>
              <given-names>Q.Y.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Kong</surname>
              <given-names>Y.M.</given-names>
            </name>
            <name>
              <surname>Luo</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Hao</surname>
              <given-names>P.</given-names>
            </name>
          </person-group>
          <article-title>Optimizing de novo transcriptome assembly from short-read RNA-Seq data: A comparative study</article-title>
          <source>BMC Bioinformatics</source>
          <year>2011</year>
          <volume>12</volume>
          <fpage>S2</fpage>
        <pub-id pub-id-type="pmid">22373013</pub-id></citation>
      </ref>
      <ref id="B91-genes-03-00344">
        <label>91.</label>
        <citation citation-type="journal">
          <collab>International Human Genome Sequencing Consortium</collab>
          <article-title>Finishing the euchromatic sequence of the human genome</article-title>
          <source>Nature</source>
          <year>2004</year>
          <volume>431</volume>
          <fpage>931</fpage>
          <lpage>945</lpage>
          <pub-id pub-id-type="doi">10.1038/nature03001</pub-id>
        </citation>
      </ref>
      <ref id="B92-genes-03-00344">
        <label>92.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kapranov</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Cheng</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Dike</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Nix</surname>
              <given-names>D.A.</given-names>
            </name>
            <name>
              <surname>Duttagupta</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Willingham</surname>
              <given-names>A.T.</given-names>
            </name>
            <name>
              <surname>Stadler</surname>
              <given-names>P.F.</given-names>
            </name>
            <name>
              <surname>Hertel</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Hackermuller</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Hofacker</surname>
              <given-names>I.L.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>RNA maps reveal new RNA classes and a possible function for pervasive transcription</article-title>
          <source>Science</source>
          <year>2007</year>
          <volume>316</volume>
          <fpage>1484</fpage>
          <lpage>1488</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1138341</pub-id><pub-id pub-id-type="pmid">17510325</pub-id></citation>
      </ref>
      <ref id="B93-genes-03-00344">
        <label>93.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Okazaki</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Furuno</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Kasukawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Adachi</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Bono</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Kondo</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Nikaido</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Osato</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Saito</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Suzuki</surname>
              <given-names>H.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs</article-title>
          <source>Nature</source>
          <year>2002</year>
          <volume>420</volume>
          <fpage>563</fpage>
          <lpage>573</lpage>
          <pub-id pub-id-type="doi">10.1038/nature01266</pub-id>
        </citation>
      </ref>
      <ref id="B94-genes-03-00344">
        <label>94.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Katayama</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Tomaru</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Kasukawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Waki</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Nakanishi</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Nakamura</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Nishida</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Yap</surname>
              <given-names>C.C.</given-names>
            </name>
            <name>
              <surname>Suzuki</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Kawai</surname>
              <given-names>J.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Antisense transcription in the mammalian transcriptome</article-title>
          <source>Science</source>
          <year>2005</year>
          <volume>309</volume>
          <fpage>1564</fpage>
          <lpage>1566</lpage>
        <pub-id pub-id-type="doi">10.1126/science.1112009</pub-id><pub-id pub-id-type="pmid">16141073</pub-id></citation>
      </ref>
      <ref id="B95-genes-03-00344">
        <label>95.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Rinn</surname>
              <given-names>J.L.</given-names>
            </name>
            <name>
              <surname>Euskirchen</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Bertone</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Martone</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Luscombe</surname>
              <given-names>N.M.</given-names>
            </name>
            <name>
              <surname>Hartman</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Harrison</surname>
              <given-names>P.M.</given-names>
            </name>
            <name>
              <surname>Nelson</surname>
              <given-names>F.K.</given-names>
            </name>
            <name>
              <surname>Miller</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Gerstein</surname>
              <given-names>M.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The transcriptional activity of human Chromosome 22</article-title>
          <source>Genes Dev.</source>
          <year>2003</year>
          <volume>17</volume>
          <fpage>529</fpage>
          <lpage>540</lpage>
        <pub-id pub-id-type="doi">10.1101/gad.1055203</pub-id><pub-id pub-id-type="pmid">12600945</pub-id></citation>
      </ref>
      <ref id="B96-genes-03-00344">
        <label>96.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Birney</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Stamatoyannopoulos</surname>
              <given-names>J.A.</given-names>
            </name>
            <name>
              <surname>Dutta</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Guigo</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Gingeras</surname>
              <given-names>T.R.</given-names>
            </name>
            <name>
              <surname>Margulies</surname>
              <given-names>E.H.</given-names>
            </name>
            <name>
              <surname>Weng</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Snyder</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Dermitzakis</surname>
              <given-names>E.T.</given-names>
            </name>
            <name>
              <surname>Thurman</surname>
              <given-names>R.E.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project</article-title>
          <source>Nature</source>
          <year>2007</year>
          <volume>447</volume>
          <fpage>799</fpage>
          <lpage>816</lpage>
        <pub-id pub-id-type="doi">10.1038/nature05874</pub-id><pub-id pub-id-type="pmid">17571346</pub-id></citation>
      </ref>
      <ref id="B97-genes-03-00344">
        <label>97.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Van Bakel</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Nislow</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Blencowe</surname>
              <given-names>B.J.</given-names>
            </name>
            <name>
              <surname>Hughes</surname>
              <given-names>T.R.</given-names>
            </name>
          </person-group>
          <article-title>Most “dark matter” transcripts are associated with known genes</article-title>
          <source>PLoS Biol.</source>
          <year>2010</year>
          <volume>8</volume>
          <fpage>e1000371</fpage>
          <pub-id pub-id-type="doi">10.1371/journal.pbio.1000371</pub-id>
        </citation>
      </ref>
      <ref id="B98-genes-03-00344">
        <label>98.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Asmann</surname>
              <given-names>Y.W.</given-names>
            </name>
            <name>
              <surname>Necela</surname>
              <given-names>B.M.</given-names>
            </name>
            <name>
              <surname>Kalari</surname>
              <given-names>K.R.</given-names>
            </name>
            <name>
              <surname>Hossain</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Baker</surname>
              <given-names>T.R.</given-names>
            </name>
            <name>
              <surname>Carr</surname>
              <given-names>J.M.</given-names>
            </name>
            <name>
              <surname>Davis</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Getz</surname>
              <given-names>J.E.</given-names>
            </name>
            <name>
              <surname>Hostetter</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>X.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Detection of redundant fusion transcripts as biomarkers or disease-specific therapeutic targets in breast cancer</article-title>
          <source>Cancer Res.</source>
          <year>2012</year>
          <volume>72</volume>
          <fpage>1921</fpage>
          <lpage>1928</lpage>
          <pub-id pub-id-type="doi">10.1158/0008-5472.CAN-11-3142</pub-id>
        </citation>
      </ref>
      <ref id="B99-genes-03-00344">
        <label>99.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Clark</surname>
              <given-names>M.B.</given-names>
            </name>
            <name>
              <surname>Amaral</surname>
              <given-names>P.P.</given-names>
            </name>
            <name>
              <surname>Schlesinger</surname>
              <given-names>F.J.</given-names>
            </name>
            <name>
              <surname>Dinger</surname>
              <given-names>M.E.</given-names>
            </name>
            <name>
              <surname>Taft</surname>
              <given-names>R.J.</given-names>
            </name>
            <name>
              <surname>Rinn</surname>
              <given-names>J.L.</given-names>
            </name>
            <name>
              <surname>Ponting</surname>
              <given-names>C.P.</given-names>
            </name>
            <name>
              <surname>Stadler</surname>
              <given-names>P.F.</given-names>
            </name>
            <name>
              <surname>Morris</surname>
              <given-names>K.V.</given-names>
            </name>
            <name>
              <surname>Morillon</surname>
              <given-names>A.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The reality of pervasive transcription</article-title>
          <source>PLoS Biol.</source>
          <year>2011</year>
          <volume>9</volume>
          <fpage>e1000625</fpage>
          <pub-id pub-id-type="doi">10.1371/journal.pbio.1000625</pub-id>
        </citation>
      </ref>
      <ref id="B100-genes-03-00344">
        <label>100.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Amaral</surname>
              <given-names>P.P.</given-names>
            </name>
            <name>
              <surname>Mattick</surname>
              <given-names>J.S.</given-names>
            </name>
          </person-group>
          <article-title>Noncoding RNA in development</article-title>
          <source>Mamm. Genome</source>
          <year>2008</year>
          <volume>19</volume>
          <fpage>454</fpage>
          <lpage>492</lpage>
          <pub-id pub-id-type="doi">10.1007/s00335-008-9136-7</pub-id>
        </citation>
      </ref>
      <ref id="B101-genes-03-00344">
        <label>101.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Berretta</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Morillon</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Pervasive transcription constitutes a new level of eukaryotic genome regulation</article-title>
          <source>EMBO Rep.</source>
          <year>2009</year>
          <volume>10</volume>
          <fpage>973</fpage>
          <lpage>982</lpage>
          <pub-id pub-id-type="doi">10.1038/embor.2009.181</pub-id>
        </citation>
      </ref>
      <ref id="B102-genes-03-00344">
        <label>102.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kapranov</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>St Laurent</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Raz</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Ozsolak</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Reynolds</surname>
              <given-names>C.P.</given-names>
            </name>
            <name>
              <surname>Sorensen</surname>
              <given-names>P.H.</given-names>
            </name>
            <name>
              <surname>Reaman</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Milos</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Arceci</surname>
              <given-names>R.J.</given-names>
            </name>
            <name>
              <surname>Thompson</surname>
              <given-names>J.F.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The majority of total nuclear-encoded non-ribosomal RNA in a human cell is ‘dark matter’ un-annotated RNA</article-title>
          <source>BMC Biol.</source>
          <year>2010</year>
          <volume>8</volume>
          <fpage>149</fpage>
          <pub-id pub-id-type="doi">10.1186/1741-7007-8-149</pub-id>
        </citation>
      </ref>
      <ref id="B103-genes-03-00344">
        <label>103.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Agarwal</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Koppstein</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Rozowsky</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Sboner</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Habegger</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Hillier</surname>
              <given-names>L.W.</given-names>
            </name>
            <name>
              <surname>Sasidharan</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Reinke</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Waterston</surname>
              <given-names>R.H.</given-names>
            </name>
            <name>
              <surname>Gerstein</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays</article-title>
          <source>BMC Genomics</source>
          <year>2010</year>
          <volume>11</volume>
          <fpage>383</fpage>
        <pub-id pub-id-type="doi">10.1186/1471-2164-11-383</pub-id><pub-id pub-id-type="pmid">20565764</pub-id></citation>
      </ref>
      <ref id="B104-genes-03-00344">
        <label>104.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Malone</surname>
              <given-names>J.H.</given-names>
            </name>
            <name>
              <surname>Oliver</surname>
              <given-names>B.</given-names>
            </name>
          </person-group>
          <article-title>Microarrays, deep sequencing and the true measure of the transcriptome</article-title>
          <source>BMC Biol.</source>
          <year>2011</year>
          <volume>9</volume>
          <fpage>34</fpage>
          <pub-id pub-id-type="doi">10.1186/1741-7007-9-34</pub-id>
        </citation>
      </ref>
      <ref id="B105-genes-03-00344">
        <label>105.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Van Bakel</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Nislow</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Blencowe</surname>
              <given-names>B.J.</given-names>
            </name>
            <name>
              <surname>Hughes</surname>
              <given-names>T.R.</given-names>
            </name>
          </person-group>
          <article-title>Response to “The reality of pervasive transcription”</article-title>
          <source>PLoS Biol.</source>
          <year>2011</year>
          <volume>9</volume>
          <fpage>e1001102</fpage>
          <pub-id pub-id-type="doi">10.1371/journal.pbio.1001102</pub-id>
        </citation>
      </ref>
      <ref id="B106-genes-03-00344">
        <label>106.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ameur</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Zaghlool</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Halvardson</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Wetterbom</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Gyllensten</surname>
              <given-names>U.</given-names>
            </name>
            <name>
              <surname>Cavelier</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Feuk</surname>
              <given-names>L.</given-names>
            </name>
          </person-group>
          <article-title>Total RNA sequencing reveals nascent transcription and widespread co-transcriptional splicing in the human brain</article-title>
          <source>Nat. Struct. Mol. Biol.</source>
          <year>2011</year>
          <volume>18</volume>
          <fpage>1435</fpage>
          <lpage>1440</lpage>
          <pub-id pub-id-type="doi">10.1038/nsmb.2143</pub-id>
        </citation>
      </ref>
      <ref id="B107-genes-03-00344">
        <label>107.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Mercer</surname>
              <given-names>T.R.</given-names>
            </name>
            <name>
              <surname>Gerhardt</surname>
              <given-names>D.J.</given-names>
            </name>
            <name>
              <surname>Dinger</surname>
              <given-names>M.E.</given-names>
            </name>
            <name>
              <surname>Crawford</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Trapnell</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Jeddeloh</surname>
              <given-names>J.A.</given-names>
            </name>
            <name>
              <surname>Mattick</surname>
              <given-names>J.S.</given-names>
            </name>
            <name>
              <surname>Rinn</surname>
              <given-names>J.L.</given-names>
            </name>
          </person-group>
          <article-title>Targeted RNA sequencing reveals the deep complexity of the human transcriptome</article-title>
          <source>Nat. Biotechnol.</source>
          <year>2011</year>
          <volume>30</volume>
          <fpage>99</fpage>
          <lpage>104</lpage>
          <pub-id pub-id-type="doi">10.1038/nbt.2024</pub-id>
        </citation>
      </ref>
      <ref id="B108-genes-03-00344">
        <label>108.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Jarvis</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Robertson</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>The noncoding universe</article-title>
          <source>BMC Biol.</source>
          <year>2011</year>
          <volume>9</volume>
          <fpage>52</fpage>
          <pub-id pub-id-type="doi">10.1186/1741-7007-9-52</pub-id>
        </citation>
      </ref>
      <ref id="B109-genes-03-00344">
        <label>109.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Louro</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Smirnova</surname>
              <given-names>A.S.</given-names>
            </name>
            <name>
              <surname>Verjovski-Almeida</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>Long intronic noncoding RNA transcription: Expression noise or expression choice?</article-title>
          <source>Genomics</source>
          <year>2009</year>
          <volume>93</volume>
          <fpage>291</fpage>
          <lpage>298</lpage>
        <pub-id pub-id-type="doi">10.1016/j.ygeno.2008.11.009</pub-id><pub-id pub-id-type="pmid">19071207</pub-id></citation>
      </ref>
      <ref id="B110-genes-03-00344">
        <label>110.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Mercer</surname>
              <given-names>T.R.</given-names>
            </name>
            <name>
              <surname>Dinger</surname>
              <given-names>M.E.</given-names>
            </name>
            <name>
              <surname>Mattick</surname>
              <given-names>J.S.</given-names>
            </name>
          </person-group>
          <article-title>Long non-coding RNAs: Insights into functions</article-title>
          <source>Nat. Rev. Genet.</source>
          <year>2009</year>
          <volume>10</volume>
          <fpage>155</fpage>
          <lpage>159</lpage>
          <pub-id pub-id-type="doi">10.1038/nrg2521</pub-id>
        </citation>
      </ref>
      <ref id="B111-genes-03-00344">
        <label>111.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Dinger</surname>
              <given-names>M.E.</given-names>
            </name>
            <name>
              <surname>Amaral</surname>
              <given-names>P.P.</given-names>
            </name>
            <name>
              <surname>Mercer</surname>
              <given-names>T.R.</given-names>
            </name>
            <name>
              <surname>Pang</surname>
              <given-names>K.C.</given-names>
            </name>
            <name>
              <surname>Bruce</surname>
              <given-names>S.J.</given-names>
            </name>
            <name>
              <surname>Gardiner</surname>
              <given-names>B.B.</given-names>
            </name>
            <name>
              <surname>Askarian-Amiri</surname>
              <given-names>M.E.</given-names>
            </name>
            <name>
              <surname>Ru</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Solda</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Simons</surname>
              <given-names>C.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>S. Long noncoding RNAs in mouse embryonic stem cell pluripotency and differentiation</article-title>
          <source>Genome Res.</source>
          <year>2008</year>
          <volume>18</volume>
          <fpage>1433</fpage>
          <lpage>1445</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.078378.108</pub-id>
        </citation>
      </ref>
      <ref id="B112-genes-03-00344">
        <label>112.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ahituv</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Zhu</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Visel</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Holt</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Afzal</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Pennacchio</surname>
              <given-names>L.A.</given-names>
            </name>
            <name>
              <surname>Rubin</surname>
              <given-names>E.M.</given-names>
            </name>
          </person-group>
          <article-title>Deletion of ultraconserved elements yields viable mice</article-title>
          <source>PLoS Biol.</source>
          <year>2007</year>
          <volume>5</volume>
          <fpage>e234</fpage>
          <pub-id pub-id-type="doi">10.1371/journal.pbio.0050234</pub-id>
        </citation>
      </ref>
      <ref id="B113-genes-03-00344">
        <label>113.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Monroe</surname>
              <given-names>D.</given-names>
            </name>
          </person-group>
          <article-title>Genetics. Genomic clues to DNA treasure sometimes lead nowhere</article-title>
          <source>Science</source>
          <year>2009</year>
          <volume>325</volume>
          <fpage>142</fpage>
          <lpage>143</lpage>
          <pub-id pub-id-type="doi">10.1126/science.325_142</pub-id>
        </citation>
      </ref>
      <ref id="B114-genes-03-00344">
        <label>114.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Knowles</surname>
              <given-names>D.G.</given-names>
            </name>
            <name>
              <surname>McLysaght</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Recent de novo origin of human protein-coding genes</article-title>
          <source>Genome Res.</source>
          <year>2009</year>
          <volume>19</volume>
          <fpage>1752</fpage>
          <lpage>1759</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.095026.109</pub-id>
        </citation>
      </ref>
      <ref id="B115-genes-03-00344">
        <label>115.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kaplan</surname>
              <given-names>C.D.</given-names>
            </name>
          </person-group>
          <article-title>The architecture of RNA polymerase fidelity</article-title>
          <source>BMC Biol.</source>
          <year>2010</year>
          <volume>8</volume>
          <fpage>85</fpage>
          <pub-id pub-id-type="doi">10.1186/1741-7007-8-85</pub-id>
        </citation>
      </ref>
      <ref id="B116-genes-03-00344">
        <label>116.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ponting</surname>
              <given-names>C.P.</given-names>
            </name>
            <name>
              <surname>Hardison</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>What fraction of the human genome is functional?</article-title>
          <source>Genome Res.</source>
          <year>2011</year>
          <volume>21</volume>
          <fpage>1769</fpage>
          <lpage>1776</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.116814.110</pub-id>
        </citation>
      </ref>
      <ref id="B117-genes-03-00344">
        <label>117.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cawley</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Bekiranov</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Ng</surname>
              <given-names>H.H.</given-names>
            </name>
            <name>
              <surname>Kapranov</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Sekinger</surname>
              <given-names>E.A.</given-names>
            </name>
            <name>
              <surname>Kampa</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Piccolboni</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Sementchenko</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Cheng</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Williams</surname>
              <given-names>A.J.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Unbiased mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs</article-title>
          <source>Cell</source>
          <year>2004</year>
          <volume>116</volume>
          <fpage>499</fpage>
          <lpage>509</lpage>
          <pub-id pub-id-type="doi">10.1016/S0092-8674(04)00127-8</pub-id>
        </citation>
      </ref>
      <ref id="B118-genes-03-00344">
        <label>118.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wang</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Zheng</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Samudrala</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Yu</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Wong</surname>
              <given-names>G.K. </given-names>
            </name>
          </person-group>
          <article-title>Mouse transcriptome: Neutral evolution of ‘non-coding’ complementary DNAs</article-title>
          <source>Nature</source>
          <year>2004</year>
          <volume>431</volume>
          <pub-id pub-id-type="doi">10.1038/nature03016</pub-id>
        </citation>
      </ref>
      <ref id="B119-genes-03-00344">
        <label>119.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Pang</surname>
              <given-names>K.C.</given-names>
            </name>
            <name>
              <surname>Frith</surname>
              <given-names>M.C.</given-names>
            </name>
            <name>
              <surname>Mattick</surname>
              <given-names>J.S.</given-names>
            </name>
          </person-group>
          <article-title>Rapid evolution of noncoding RNAs: Lack of conservation does not mean lack of function</article-title>
          <source>Trends Genet.</source>
          <year>2006</year>
          <volume>22</volume>
          <fpage>1</fpage>
          <lpage>5</lpage>
          <pub-id pub-id-type="doi">10.1016/j.tig.2005.10.003</pub-id>
        </citation>
      </ref>
      <ref id="B120-genes-03-00344">
        <label>120.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ebisuya</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Yamamoto</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Nakajima</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Nishida</surname>
              <given-names>E.</given-names>
            </name>
          </person-group>
          <article-title>Ripples from neighbouring transcription</article-title>
          <source>Nat. Cell Biol.</source>
          <year>2008</year>
          <volume>10</volume>
          <fpage>1106</fpage>
          <lpage>1113</lpage>
          <pub-id pub-id-type="doi">10.1038/ncb1771</pub-id>
        </citation>
      </ref>
      <ref id="B121-genes-03-00344">
        <label>121.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Johnson</surname>
              <given-names>J.M.</given-names>
            </name>
            <name>
              <surname>Edwards</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Shoemaker</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Schadt</surname>
              <given-names>E.E.</given-names>
            </name>
          </person-group>
          <article-title>Dark matter in the genome: Evidence of widespread transcription detected by microarray tiling experiments</article-title>
          <source>Trends Genet.</source>
          <year>2005</year>
          <volume>21</volume>
          <fpage>93</fpage>
          <lpage>102</lpage>
          <pub-id pub-id-type="doi">10.1016/j.tig.2004.12.009</pub-id>
        </citation>
      </ref>
    </ref-list>
  </back>
</article>
