<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">microarrays</journal-id>
      <journal-title>Microarrays</journal-title>
      <abbrev-journal-title abbrev-type="publisher">Microarrays</abbrev-journal-title>
      <abbrev-journal-title abbrev-type="pubmed">Microarrays</abbrev-journal-title>
      <issn pub-type="epub">2076-3905</issn>
      <publisher>
        <publisher-name>MDPI</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.3390/microarrays1020084</article-id>
      <article-id pub-id-type="publisher-id">microarrays-01-00084</article-id>
      <article-categories>
        <subj-group>
          <subject>Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Quality Visualization of Microarray Datasets Using Circos</article-title>
      </title-group>
      
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Koch</surname>
            <given-names>Martin</given-names>
          </name>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Wiese</surname>
            <given-names>Michael</given-names>
          </name>
          <xref rid="c1-microarrays-01-00084" ref-type="corresp">*</xref>
        </contrib>
      </contrib-group>
      <aff id="af1-microarrays-01-00084">Pharmaceutical Institute, Rheinische Friedrich Wilhelms University Bonn, An der Immenburg 4, Bonn 53121, Germany; Email: <email>martin.koch@uni-bonn.de</email> </aff>
	  <author-notes>
        <corresp id="c1-microarrays-01-00084"><label>*</label> Author to whom correspondence should be addressed; Email: <email>mwiese@uni-bonn.de</email>; Tel.: +49-228-735-213; Fax: +49-228-737-929. </corresp>
      </author-notes>
      <pub-date pub-type="epub">
        <day>07</day>
        <month>08</month>
        <year>2012</year>
      </pub-date>
      <pub-date pub-type="collection"> <month>09</month>
        <year>2012</year>
      </pub-date>
      <volume>1</volume>
      <issue>2</issue>
      <fpage>84</fpage>
      <lpage>94</lpage>
      <history>
        <date date-type="received">
          <day>25</day>
          <month>06</month>
          <year>2012</year>
        </date>
        <date date-type="rev-recd">
          <day>25</day>
          <month>07</month>
          <year>2012</year>
        </date>
        <date date-type="accepted">
          <day>03</day>
          <month>08</month>
          <year>2012</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>© 2012 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
        <copyright-year>2012</copyright-year>
        <license xmlns:xlink="http://www.w3.org/1999/xlink" license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/">
          <p>This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p>
        </license>
      </permissions>
      <abstract>
        <p>Quality control and normalization is considered the most important step in the analysis of microarray data. At present there are various methods available for quality assessments of microarray datasets. However there seems to be no standard visualization routine, which also depicts individual microarray quality. Here we present a convenient method for visualizing the results of standard quality control tests using Circos plots. In these plots various quality measurements are drawn in a circular fashion, thus allowing for visualization of the quality and all outliers of each distinct array within a microarray dataset. The proposed method is intended for use with the Affymetrix Human Genome platform (<italic>i.e.</italic>, GPL 96, GPL570 and GPL571). Circos quality measurement plots are a convenient way for the initial quality estimate of Affymetrix datasets that are stored in publicly available databases.</p>
      </abstract>
      <kwd-group>
        <kwd>circos</kwd>
        <kwd>yaqcaffy</kwd>
        <kwd>quality monitoring</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec sec-type="intro">
      <title>1. Introduction</title>
      <sec>
        <title>1.1. Microarray Raw Data and Quality Control</title>
        <p>Microarray technology has successfully made the transition from a specialized method to a common method that is widely adopted in biological research. However, marginal overlap in gene expression profiling studies [<xref ref-type="bibr" rid="B1-microarrays-01-00084">1</xref>] and discussions about analysis approaches for identifying differentially expressed genes have prompted a microarray quality initiative. The microarray quality control project (MAQC) aims to estimate the accuracy of the technology [<xref ref-type="bibr" rid="B2-microarrays-01-00084">2</xref>] and examines in a second phase analysis methods and obtained biological models [<xref ref-type="bibr" rid="B3-microarrays-01-00084">3</xref>]. In general, the aim of the MAQC-II study was to resolve the concerns about the reproducibility and the generalization capability of microarray analysis results, which may stem from: </p>
        <list>
          <list-item>
            <p>(i) a lack of information about the analysis protocol.</p>
          </list-item>
          <list-item>
            <p>(ii) choosing different normalization methods.</p>
          </list-item>
          <list-item>
            <p>(iii) the use of defective analytical methods.</p>
          </list-item>
        </list>
        <p>It was found that reproducibility seems to rely greatly on the availability and completeness of the documentation of the analysis process; however it is just as essential to start the analysis with high quality data. Therefore, prior to the analysis of the raw data, the quality of the dataset needs to be assessed. An important goal of quality assessment is detection of outliers. However, currently there is no common quality measure to estimate the soundness of a microarray dataset, although there are a variety of methods available for the analysis of microarray quality in the Bioconductor project [<xref ref-type="bibr" rid="B4-microarrays-01-00084">4</xref>]. A remedy to this situation is given by post-normalization quality assessment, which detects systematically wrong appliance of normalization. It was earlier suggested that a quality check should be performed before the actual analysis as well, thus preventing the attempt of normalizing erroneous data in the first place [<xref ref-type="bibr" rid="B5-microarrays-01-00084">5</xref>]. </p>
        <p>In general normalization methods are specifically designed either for dual channel microarray datasets [<xref ref-type="bibr" rid="B6-microarrays-01-00084">6</xref>,<xref ref-type="bibr" rid="B7-microarrays-01-00084">7</xref>,<xref ref-type="bibr" rid="B8-microarrays-01-00084">8</xref>,<xref ref-type="bibr" rid="B9-microarrays-01-00084">9</xref>,<xref ref-type="bibr" rid="B10-microarrays-01-00084">10</xref>] or single channel microarray datasets [<xref ref-type="bibr" rid="B11-microarrays-01-00084">11</xref>,<xref ref-type="bibr" rid="B12-microarrays-01-00084">12</xref>]. However, there are also manufacturer dependent normalization methods available for the different flavors of microarray data [<xref ref-type="bibr" rid="B12-microarrays-01-00084">12</xref>,<xref ref-type="bibr" rid="B13-microarrays-01-00084">13</xref>,<xref ref-type="bibr" rid="B14-microarrays-01-00084">14</xref>,<xref ref-type="bibr" rid="B15-microarrays-01-00084">15</xref>,<xref ref-type="bibr" rid="B16-microarrays-01-00084">16</xref>,<xref ref-type="bibr" rid="B17-microarrays-01-00084">17</xref>,<xref ref-type="bibr" rid="B18-microarrays-01-00084">18</xref>,<xref ref-type="bibr" rid="B19-microarrays-01-00084">19</xref>,<xref ref-type="bibr" rid="B20-microarrays-01-00084">20</xref>,<xref ref-type="bibr" rid="B21-microarrays-01-00084">21</xref>,<xref ref-type="bibr" rid="B22-microarrays-01-00084">22</xref>]. </p>
      </sec>
      <sec>
        <title>1.2. Aim of the Present Study</title>
        <p>Here we present a convenient visualization approach for assessment of individual microarray quality in large data sets using Circos. We have applied state of the art Bioconductor packages, to obtain quality control values. To condense the quality control analysis results, only outliers in the assessed quality measurements are highlighted, <italic>i.e</italic>., via red dots, denoting absence or deviations of probes in standard technical controls. Additionally RNA degradation assessment is presented in tiles using a gradient from blue to red for depicting potential outlier probes. Finally a principal component plot shows the actual array as a red dot and the other arrays in blue, thus facilitating the detection of potential outlier candidates. Presenting the analysis results in a circular format might be a suitable and convenient way for the initial quality estimate of datasets that are stored in publicly available databases. R code and a demo are publicly available at: <uri>https://github.com/buzzmak/circos-arrayQC</uri>. </p>
      </sec>
    </sec>
    <sec>
      <title>2. Experimental Section</title>
      <sec>
        <title>2.1. Publicly Available Microarray Studies in GEO</title>
        <p>Microarray raw data of publicly available studies was obtained from Gene Expression Omnibus (GEO, [<xref ref-type="bibr" rid="B23-microarrays-01-00084">23</xref>]). <xref ref-type="table" rid="microarrays-01-00084-t001">Table 1</xref> lists all studies, which were evaluated for array quality; note that all studies comprise Affymetrix Human Genome U133A arrays and became available recently, <italic>i.e</italic>., 2011, except for GSE9936 [<xref ref-type="bibr" rid="B24-microarrays-01-00084">24</xref>] which was submitted to GEO in 2008.</p>
        <table-wrap id="microarrays-01-00084-t001" position="anchor">
          <object-id pub-id-type="pii">microarrays-01-00084-t001_Table 1</object-id>
          <label>Table 1</label>
          <caption>
            <p>The following studies are publicly available as microarray raw data in the Gene Expression Omnibus (GEO) database.</p>
          </caption>
          <table>
            <thead>
              <tr align="center">
                <th valign="middle">GEO ID</th>
                <th valign="middle">Samples</th>
                <th valign="middle">Reference</th>
                <th valign="middle">Published on</th>
              </tr>
            </thead>
            <tbody>
              <tr align="center">
                <td valign="middle">GSE9801</td>
                <td valign="middle">6</td>
                <td valign="middle">[<xref ref-type="bibr" rid="B25-microarrays-01-00084">25</xref>]</td>
                <td valign="middle">7 November 2011</td>
              </tr>
              <tr align="center">
                <td valign="middle">GSE32700</td>
                <td valign="middle">46</td>
                <td valign="middle">[<xref ref-type="bibr" rid="B26-microarrays-01-00084">26</xref>]</td>
                <td valign="middle">7 October 2011</td>
              </tr>
              <tr align="center">
                <td valign="middle">GSE9936</td>
                <td valign="middle">105</td>
                <td valign="middle">[<xref ref-type="bibr" rid="B24-microarrays-01-00084">24</xref>]</td>
                <td valign="middle">7 February 2008</td>
              </tr>
            </tbody>
          </table>
		  </table-wrap>
      </sec>
      <sec>
        <title>2.2. Data Processing, Normalization and Principal Component Analysis</title>
        <p>Microarray data handling was performed in R (2.15.0), using the latest version of Bioconductor [<xref ref-type="bibr" rid="B4-microarrays-01-00084">4</xref>]. Microarray raw data was obtained using the <italic>GEOquery</italic> package [<xref ref-type="bibr" rid="B27-microarrays-01-00084">27</xref>] and batch processed using the <italic>affy</italic> package [<xref ref-type="bibr" rid="B28-microarrays-01-00084">28</xref>]. Additionally we used <italic>affy</italic> to calculate potential candidate arrays for RNA degradation. We performed quality control tests using the <italic>yaqcaffy</italic> package, which provides routines that are dedicated to Affymetrix arrays. We calculated all outliers in average background and average noise. Additionally we estimated outliers in both house-keeping probes (<italic>i.e.</italic>, β-Actin and GADPH), as well as outliers in the internal spike-in probe calls and poly-A controls. </p>
        <p>Finally all microarrays were processed using quantile normalization of the <italic>RMA</italic> package without background adjustment [<xref ref-type="bibr" rid="B18-microarrays-01-00084">18</xref>]. Subsequently principal component analysis was performed using the <italic>pcaMethods</italic> package [<xref ref-type="bibr" rid="B29-microarrays-01-00084">29</xref>]. The scores of the mean centred first principal component were obtained for visualization.</p>
      </sec>
      <sec>
        <title>2.3. Data Visualization Using Circos</title>
        <p>All quality control analysis results were summarized using R and then all dedicated Circos input files were generated by use of the proposed method (R code and a demo are publicly available at: <uri>http://github.com/buzzmak/circos-arrayQC</uri>). </p>
        <p>Inside an R command shell the proposed method can easily be executed, calling the following routine:</p>
        <p>
          <bold>
            <italic>&gt; writeCircos.files(data, celNames, workdir, fileName, pathToCircos)</italic>
          </bold>
        </p>
        <p>Where <bold><italic>data</italic></bold> presents an <italic>affyBatch</italic> object containing the raw data, <bold><italic>celNames</italic></bold> denotes all Affymetrix cel-file names used in the analysis, <italic><bold>workdir</bold></italic> is the path to the current directory, <italic><bold>filename</bold></italic> names the resulting Circos QC plot and <bold><italic>pathToCircos</italic></bold> leads to the home directory of Circos. </p>
        <p>Circos [<xref ref-type="bibr" rid="B30-microarrays-01-00084">30</xref>] version 0.6 and strawberry Perl (<uri>http://strawberryperl.com/</uri>) version 5.1.16 was used to generate all circular quality plots. Note that the Circos method needs to be called by the Perl interpreter, <italic>i.e</italic>.,</p>
        <p>
          <bold>
            <italic>&gt; perl bin/circos –conf C:\Path\to\where\circosFiles\are\located\circosQCconfig.txt</italic>
          </bold>
        </p>
      </sec>
    </sec>
    <sec sec-type="results">
      <title>3. Results and Discussion</title>
      <sec>
        <title>3.1. Quality Assessment and Normalization of Diverse Studies Available in GEO</title>
        <p>Since currently the majority of microarray datasets in GEO are based on Affymetrix arrays, we focused our investigation on Human Genome U133A arrays (GPL96 and GPL570) from this platform. Three representative example datasets varying mainly in the number of comprised arrays were selected to show the visualization capability of our approach (<xref ref-type="table" rid="microarrays-01-00084-t001">Table 1</xref>). The principal approach is introduced by using arrays from the study GSE9801. The principal component analysis results, which are made with the <italic>pcaMethods</italic> package [<xref ref-type="bibr" rid="B29-microarrays-01-00084">29</xref>], are depicted in <xref ref-type="fig" rid="microarrays-01-00084-f001">Figure 1</xref> for this dataset. Here the first principal component is plotted against the second principal component. Two clusters and one potential outlier array can be seen. In <xref ref-type="fig" rid="microarrays-01-00084-f002">Figure 2</xref>, a plot of potential RNA degradation is presented, generated with the <italic>affy</italic> package [<xref ref-type="bibr" rid="B28-microarrays-01-00084">28</xref>]. One array, which represents the topmost line, is probably an outlier. The <italic>yaqcaffy</italic> (<uri>http://www.bioconductor.org/packages/release/bioc/html/yaqcaffy.html</uri>) package in Bioconductor provides several quality assessment methods for Affymetrix arrays. In <xref ref-type="fig" rid="microarrays-01-00084-f003">Figure 3</xref> there is a quality analysis plot of study GSE9801 made with <italic>yaqcaffy</italic>. All quality control measurements are demonstrated and the suggested cut-off values are displayed as well. The box-plot showing the GADPH present calls reveals that array GSM247404 is probably an outlier. In addition, the same array is shown also in the Affymetrix specific spike-in and poly-A control plots and therefore most probably presents an outlier. Also there is array GSM247406, which has low values in the dap poly-A control.</p>
        <fig id="microarrays-01-00084-f001" position="anchor">
          <label>Figure 1</label>
          <caption>
            <p>Principal component analysis was performed using the <italic>pcaMethods</italic> package. The figure depicts the first <italic>versus</italic> the second principal component in a scatter plot. There are two clusters and one potential outlier to the right in dataset GSE9801. The first PC explains 67% and the second PC explains 19% of the variance of the data.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="microarrays-01-00084-g001.tif"/>
        </fig>
        <fig id="microarrays-01-00084-f002" position="anchor">
          <label>Figure 2</label>
          <caption>
            <p>The <italic>affy</italic> package enables the examination of potential RNA degradation probes, <italic>i.e.</italic>, eleven control probes which can reveal a potential fragmentation with high significance. Here we depict potential RNA degradation of arrays in dataset GSE9801. Note that the topmost blue line represents an outlier.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="microarrays-01-00084-g002.tif"/>
        </fig>
        <fig id="microarrays-01-00084-f003" position="anchor">
          <label>Figure 3</label>
          <caption>
            <p>Different quality measurements, which are available in the Bioconductor package yaqcaffy, shown for the example of dataset GSE9801. The first row contains two box-plots, which denote the average background and noise. The second row contains another two box-plots, showing the expression status of so called housekeeping genes. The third row contains box-plots of the Affymetrix specific spike-in and poly-A controls. Note that array GSM247404 is highlighted as potential outlier by black dots. Array GSM247406 is also flagged as outlier in the dap poly-A control, as seen at the bottom right.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="microarrays-01-00084-g003.tif"/>
        </fig>
      </sec>
      <sec>
        <title>3.2. Visualization by Use of Circos</title>
        <p>In <xref ref-type="fig" rid="microarrays-01-00084-f004">Figure 4</xref> we present an overview of the proposed quality measurement plot, using Circos. The plot combines all previously mentioned quality measurement methods and shows also quality measurements of individual arrays. Here we depict only two arrays, for explanatory purposes. <xref ref-type="fig" rid="microarrays-01-00084-f005">Figure 5</xref> depicts all arrays of a dataset in a circular view. Also we wished to condense quality information almost to the presence of outliers, in this way focusing only on erroneous outlier arrays. The first rim depicts the first principal component and highlights the actual array as a red dot, which is slightly bigger than the blue dots that represent the other arrays. Here arrays having similar principal component scores are clustered together, whereas arrays having different principal component scores are located apart.</p>
        <fig id="microarrays-01-00084-f004" position="anchor">
          <label>Figure 4</label>
          <caption>
            <p>Plot of the different quality measurements, which are shown in <xref ref-type="fig" rid="microarrays-01-00084-f001">Figure 1</xref>, <xref ref-type="fig" rid="microarrays-01-00084-f002">Figure 2</xref>, <xref ref-type="fig" rid="microarrays-01-00084-f003">Figure 3</xref> of the dataset GSE9801 combined with Circos. The outer rim depicts the first principal component, which shows the actual array in red. The second rim displays the result of RNA degradation assessment. Here the red tiles present high values and the blue tiles denote low values of degradation. In addition, white tiles show medium degradation. The next four rims present outliers based on different quality control levels, first average noise and background, second outliers in the level of housekeeping genes and lastly two rims, which are showing outliers in the control spike-in probes. The index in the figure denotes: 1 = first principal component. Note that the actual array is presented as a red dot for better visualisation. 2 = RNA degradation, 3 = background and noise, 4 = β-Actin and GADPH, 5 = internal spike-in probe calls, 6 = poly-A controls. Note that array GSM247404 presents outliers on each level, even when the scores of the first principal component clusters are apart. In contrast to that, the array GSM247405 depicts inconspicuous values for almost all quality measurements, even GADPH and β-Actin shown on the fourth level in blue appear to be of good quality.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="microarrays-01-00084-g004.tif"/>
        </fig>
        <p>In the next rim potential RNA degradation is depicted. A low overall quality is shown as red tile, while in the case when no degradation was measured the tiles are blue. Medium RNA degradation is depicted in white; these probes are in the range of the tolerance limits. For a better visual inspection of the whole dataset, all arrays are grouped together according to the RNA degradation measure. This way, arrays of similar RNA quality are typically positioned aside in the plot. In the third rim we find red dots if the average background or average noise measurements is off scale. The following three rims are dedicated to several affymetrix specific technical quality control probes and are introduced as follows. The fourth rim comprises the house-keeping genes β-actin and GADPH in blue and flagged in red in case the probes are off scale, <italic>i.e</italic>., induced more than threefold, these probes will be depicted in red. The fifth rim presents potential outliers of the internal probe calls, <italic>i.e.</italic>, three probes as depicted in <xref ref-type="fig" rid="microarrays-01-00084-f003">Figure 3</xref> on the bottom left part. The inner ring reveals outliers in poly-A control probes, <italic>i.e.</italic>, maximum of four probes, as depicted in the <xref ref-type="fig" rid="microarrays-01-00084-f003">Figure 3</xref> on the bottom right part. In <xref ref-type="fig" rid="microarrays-01-00084-f004">Figure 4</xref> we present only two arrays from dataset GSE9801, there is one array of low (GSM247404) and another array (GSM247405) of good quality. Array GSM247404 can easily be identified as a potential outlier on all measured quality scales. All technical spike-in probes in rim five are problematic and 50 percent of the other technical controls are outliers as well. The other array (GSM247405) in this dataset however is of good quality, since it shows blue tiles in the RNA degradation plot and no outliers in the technical controls.</p>
        <fig id="microarrays-01-00084-f005" position="anchor">
          <label>Figure 5</label>
          <caption>
            <p>Quality measurement plots of the two datasets listed in <xref ref-type="table" rid="microarrays-01-00084-t001">Table 1</xref>, visualized by Circos, investigating the visualization limit of the proposed method. The plot on the left shows the results of the quality assessment from 46 arrays. The plot on the right summarizes all quality measurements for more than 100 arrays. In this plot one can only trace the quality of the whole dataset; as assessing the quality of single arrays without magnifying is difficult. Therefore we suggest inspecting the same plot in scalable vector format (<italic>i.e</italic>., svg), as provided in <xref ref-type="fig" rid="microarrays-01-00084-f006">Figure S1</xref> of the supplement. </p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="microarrays-01-00084-g005.tif"/>
        </fig>
        
		<p>We wanted to assess the visual capability of our approach and therefore we generated Circos quality plots for two microarray studies as shown in <xref ref-type="fig" rid="microarrays-01-00084-f005">Figure 5</xref>. These studies are listed in <xref ref-type="table" rid="microarrays-01-00084-t001">Table 1</xref> in detail. The left plot in <xref ref-type="fig" rid="microarrays-01-00084-f005">Figure 5</xref> presents all arrays in study GSE37200 comprising 46 arrays and the right plot shows study GSE9936, which contains 105 arrays. In the dataset GSE32700, there are only three arrays of questionable quality. Two of them present outliers, since RNA degradation is detected and there are outliers in average background or noise levels. In addition to this, on the second innermost ring all three spike-in control probes are potential outliers. The majority of the arrays in the dataset are of good quality, there are only occasionally some outliers, mostly in the poly-A controls. </p>
        <p>The dataset GSE9936 has several RNA degradation candidate arrays, however only one of these arrays has also additional outliers on all other scales. The other RNA degradation candidates have mostly outliers in the average background intensities. The arrays, where RNA degradation is not problematic show off limit values in average background and noise levels instead. Additionally these arrays have off-scale measurements in both house-keeping control genes and also several absent probes and outliers in the technical spike-in and poly-A controls. In total we can count seven arrays, where quality measurements would imply outlier candidates. This underlines the fact that it might be important to access a plurality of quality measurement methods, to gain outlier arrays with greater confidence. In summary our method suits the majority of the dataset series based on GPL96 and GPL570, since most of these comprise less than 100 arrays. However, we find also that datasets containing more than 100 arrays are not reasonably well pictured using our method. Nonetheless, our method is applicable especially for use in web-resources, since Circos produces scalable vector graphic images in which the area of interest can be easily magnified.</p>
      </sec>
    </sec>
    <sec sec-type="discussion">
      <title>4. Discussion</title>
      <p>The NCBI resource GEO contains at present over 10,000 platforms, of which 1,895 comprise human genomic sequences. Among all samples in GEO (776,566) there are 108,119 samples (13.9%), which are based on the Affymetrix Human Genome platform (<italic>i.e.</italic>, GPL 96, GPL570 and GPL571). Over 13,000 samples are based on the Agilent 4x44k platform (0.94%) and there are 3550 samples based on the Illumina human-6 2.0 expression beadchip platform (0.45%). The proposed Circos method supports quality estimation for the majority of microarray-based experiments, which are based on the Affymetrix platform. However, there are quality-reporting methods for other platforms <italic>i.e.</italic>, beadArray [<xref ref-type="bibr" rid="B31-microarrays-01-00084">31</xref>] for Illumina data and a proprietary method for Agilent arrays. A standard visualization routine, which also depicts individual microarray quality, could allow for an initial quality estimation of a microarray dataset. In the proposed Circos quality plots we find a promising approach providing quick quality assessment, which could in future be generalized to other microarray platforms as well.</p>
    </sec>
    <sec>
      <title>5. Conclusion</title>
      <p>Quality control is the most important initial step in the analysis of microarray data. However, until now there has been no standard visualization routine to access individual microarray quality control values of large datasets. Here we present a convenient method for accessing the results of standard quality control test results using Circos. Currently the method works only for Microarray datasets, which are based on the Affymetrix Human Genome platform (<italic>i.e.</italic>, GPL 96, GPL570 and GPL571). In future, this method could be adopted for quality estimation of genomic high throughput data.</p>
    </sec>
<sec><title>Appendix</title>
<fig id="microarrays-01-00084-f006" position="anchor">
        <label>Figure S1</label>
        <caption>
          <p>Larger version of the quality measurement plots of dataset GSE9936 as visualized by Circos. </p>
        </caption>
        <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="microarrays-01-00084-g006.tif"/>
      </fig>
</sec>
  </body>
  <back>
	<ack>
      <title>Acknowledgments</title>
      <p>We wish to acknowledge the kind help of Martin Krzywinski and Laurent Gatto. Martin Krzywinski did also provide the improved layout of the plots and helped substantially to improve our understanding of the circos method.</p>
    </ack>
	<ref-list>
      <title>References</title>
      <ref id="B1-microarrays-01-00084">
        <label>1.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Bachtiary</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Boutros</surname>
              <given-names>P.C.</given-names>
            </name>
            <name>
              <surname>Pintilie</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Shi</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Bastianutto</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>J.-H.</given-names>
            </name>
            <name>
              <surname>Schwock</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Penn</surname>
              <given-names>L.Z.</given-names>
            </name>
            <name>
              <surname>Jurisica</surname>
              <given-names>I.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Gene expression profiling in cervical cancer: An exploration of intratumor heterogeneity</article-title>
          <source>Clin. Cancer Res.</source>
          <year>2006</year>
          <volume>12</volume>
          <fpage>5632</fpage>
          <lpage>5640</lpage>
        <pub-id pub-id-type="doi">10.1158/1078-0432.CCR-06-0357</pub-id><pub-id pub-id-type="pmid">17020965</pub-id></citation>
      </ref>
      <ref id="B2-microarrays-01-00084">
        <label>2.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Shi</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Reid</surname>
              <given-names>L.H.</given-names>
            </name>
            <name>
              <surname>Jones</surname>
              <given-names>W.D.</given-names>
            </name>
            <name>
              <surname>Shippy</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Warrington</surname>
              <given-names>J.A.</given-names>
            </name>
            <name>
              <surname>Baker</surname>
              <given-names>S.C.</given-names>
            </name>
            <name>
              <surname>Collins</surname>
              <given-names>P.J.</given-names>
            </name>
            <name>
              <surname>de Longueville</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Kawasaki</surname>
              <given-names>E.S.</given-names>
            </name>
            <name>
              <surname>Lee</surname>
              <given-names>K.Y.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The Microarray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements</article-title>
          <source>Nat. Biotechnol.</source>
          <year>2006</year>
          <volume>24</volume>
          <fpage>1151</fpage>
          <lpage>1161</lpage>
          <pub-id pub-id-type="doi">10.1038/nbt1239</pub-id>
        </citation>
      </ref>
      <ref id="B3-microarrays-01-00084">
        <label>3.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Shi</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Campbell</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Jones</surname>
              <given-names>W.D.</given-names>
            </name>
            <name>
              <surname>Campagne</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Wen</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Walker</surname>
              <given-names>S.J.</given-names>
            </name>
            <name>
              <surname>Su</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Chu</surname>
              <given-names>T.M.</given-names>
            </name>
            <name>
              <surname>Goodsaid</surname>
              <given-names>F.M.</given-names>
            </name>
            <name>
              <surname>Pusztai</surname>
              <given-names>L.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>The Microarray Quality Control (MAQC)-II study of  common practices for the development and validation of microarray-based predictive models</article-title>
          <source>Nat. Biotechnol.</source>
          <year>2010</year>
          <volume>28</volume>
          <fpage>827</fpage>
          <lpage>838</lpage>
          <pub-id pub-id-type="doi">10.1038/nbt.1665</pub-id>
        </citation>
      </ref>
      <ref id="B4-microarrays-01-00084">
        <label>4.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gentleman</surname>
              <given-names>R.C.</given-names>
            </name>
            <name>
              <surname>Carey</surname>
              <given-names>V.J.</given-names>
            </name>
            <name>
              <surname>Bates</surname>
              <given-names>D.M.</given-names>
            </name>
            <name>
              <surname>Bolstad</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Dettling</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Dudoit</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Ellis</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Gautier</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Ge</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Gentry</surname>
              <given-names>J.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Bioconductor: Open software development for computational biology and bioinformatics</article-title>
          <source>Genome Biol.</source>
          <year>2004</year>
          <pub-id pub-id-type="doi">10.1186/gb-2004-5-10-r80</pub-id>
        </citation>
      </ref>
      <ref id="B5-microarrays-01-00084">
        <label>5.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>McClure</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Wit</surname>
              <given-names>E.</given-names>
            </name>
          </person-group>
          <article-title>Post-normalization quality assessment visualization of microarray data</article-title>
          <source>Comp. Funct. Genomics</source>
          <year>2003</year>
          <volume>4</volume>
          <fpage>460</fpage>
          <lpage>467</lpage>
          <pub-id pub-id-type="doi">10.1002/cfg.317</pub-id>
        </citation>
      </ref>
      <ref id="B6-microarrays-01-00084">
        <label>6.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Yang</surname>
              <given-names>Y.H.</given-names>
            </name>
            <name>
              <surname>Dudoit</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Luu</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Lin</surname>
              <given-names>D.M.</given-names>
            </name>
            <name>
              <surname>Peng</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Ngai</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Speed</surname>
              <given-names>T.P.</given-names>
            </name>
          </person-group>
          <article-title>Normalization for cdna microarray data: A robust composite method addressing single and multiple slide systematic variation</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2002</year>
          <pub-id pub-id-type="doi">10.1093/nar/30.4.e15</pub-id>
        </citation>
      </ref>
      <ref id="B7-microarrays-01-00084">
        <label>7.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Edwards</surname>
              <given-names>D.</given-names>
            </name>
          </person-group>
          <article-title>Non-linear normalization and background correction in one-channel cdna microarray studies</article-title>
          <source>Bioinformatics</source>
          <year>2003</year>
          <volume>19</volume>
          <fpage>825</fpage>
          <lpage>833</lpage>
          <pub-id pub-id-type="doi">10.1093/bioinformatics/btg083</pub-id>
        </citation>
      </ref>
      <ref id="B8-microarrays-01-00084">
        <label>8.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Xiong</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Martyniuk</surname>
              <given-names>C.J.</given-names>
            </name>
            <name>
              <surname>Trudeau</surname>
              <given-names>V.L.</given-names>
            </name>
            <name>
              <surname>Xia</surname>
              <given-names>X.</given-names>
            </name>
          </person-group>
          <article-title>Using generalized procrustes analysis (gpa) for normalization of cdna microarray data</article-title>
          <source>BMC Bioinformatics</source>
          <year>2008</year>
          <pub-id pub-id-type="doi">10.1186/ 1471-2105-9-25</pub-id>
        </citation>
      </ref>
      <ref id="B9-microarrays-01-00084">
        <label>9.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wu</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Yan</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Sun</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Xie</surname>
              <given-names>H.</given-names>
            </name>
          </person-group>
          <article-title>A new outlier removal approach for cdna microarray normalization</article-title>
          <source>BioTechniques</source>
		  <year>2009</year>
          <volume>47</volume>
          <fpage>691</fpage>
          <lpage>692</lpage>
		  <fpage>694</fpage>
          <lpage>700</lpage>
        </citation>
      </ref>
      <ref id="B10-microarrays-01-00084">
        <label>10.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wu</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Aryee</surname>
              <given-names>M.J.</given-names>
            </name>
          </person-group>
          <article-title>Subset quantile normalization using negative control features</article-title>
          <source>J. Comput. Biol.</source>
          <year>2010</year>
          <volume>17</volume>
          <fpage>1385</fpage>
          <lpage>1395</lpage>
          <pub-id pub-id-type="doi">10.1089/cmb.2010.0049</pub-id>
        </citation>
      </ref>
      <ref id="B11-microarrays-01-00084">
        <label>11.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Geller</surname>
              <given-names>S.C.</given-names>
            </name>
            <name>
              <surname>Gregg</surname>
              <given-names>J.P.</given-names>
            </name>
            <name>
              <surname>Hagerman</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Rocke</surname>
              <given-names>D.M.</given-names>
            </name>
          </person-group>
          <article-title>Transformation and normalization of oligonucleotide microarray data</article-title>
          <source>Bioinformatics</source>
          <year>2003</year>
          <volume>19</volume>
          <fpage>1817</fpage>
          <lpage>1823</lpage>
          <pub-id pub-id-type="doi">10.1093/bioinformatics/btg245</pub-id>
        </citation>
      </ref>
      <ref id="B12-microarrays-01-00084">
        <label>12.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Calza</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Valentini</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Pawitan</surname>
              <given-names>Y.</given-names>
            </name>
          </person-group>
          <article-title>Normalization of oligonucleotide arrays based on the  least-variant set of genes</article-title>
          <source>BMC Bioinforma.</source>
          <year>2008</year>
          <pub-id pub-id-type="doi">10.1186/1471-2105-9-140</pub-id>
        </citation>
      </ref>
      <ref id="B13-microarrays-01-00084">
        <label>13.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Carvalho</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Bengtsson</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Speed</surname>
              <given-names>T.P.</given-names>
            </name>
            <name>
              <surname>Irizarry</surname>
              <given-names>R.A.</given-names>
            </name>
          </person-group>
          <article-title>Exploration, normalization, and genotype calls of high-density oligonucleotide snp array data</article-title>
          <source>Biostatistics</source>
          <year>2007</year>
          <volume>8</volume>
          <fpage>485</fpage>
          <lpage>499</lpage>
        <pub-id pub-id-type="pmid">17189563</pub-id></citation>
      </ref>
      <ref id="B14-microarrays-01-00084">
        <label>14.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Rigaill</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Hupé</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Almeida</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Rosa</surname>
              <given-names>P.L.</given-names>
            </name>
            <name>
              <surname>Meyniel</surname>
              <given-names>J.-P.</given-names>
            </name>
            <name>
              <surname>Decraene</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Barillot</surname>
              <given-names>E.</given-names>
            </name>
          </person-group>
          <article-title>Italics: An algorithm for normalization and DNA copy number calling for affymetrix snp arrays</article-title>
          <source>Bioinformatics</source>
          <year>2008</year>
          <volume>24</volume>
          <fpage>768</fpage>
          <lpage>774</lpage>
          <pub-id pub-id-type="doi">10.1093/bioinformatics/btn048</pub-id>
        </citation>
      </ref>
      <ref id="B15-microarrays-01-00084">
        <label>15.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zeller</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Henz</surname>
              <given-names>S.R.</given-names>
            </name>
            <name>
              <surname>Laubinger</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Weigel</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Rätsch</surname>
              <given-names>G.</given-names>
            </name>
          </person-group>
          <article-title>Transcript normalization and segmentation of tiling array data</article-title>
          <source>Pac. Symp. Biocomput.</source>
          <year>2008</year>
          <volume>538</volume>
          <fpage>527</fpage>
          <lpage>538</lpage>
        </citation>
      </ref>
      <ref id="B16-microarrays-01-00084">
        <label>16.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Autio</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Kilpinen</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Saarela</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Kallioniemi</surname>
              <given-names>O.</given-names>
            </name>
            <name>
              <surname>Hautaniemi</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Astola</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>Comparison of affymetrix data normalization methods using 6,926 experiments across five array generations</article-title>
          <source>BMC Bioinforma.</source>
          <year>2009</year>
          <pub-id pub-id-type="doi">10.1186/1471-2105-10-S1-S24</pub-id>
        </citation>
      </ref>
      <ref id="B17-microarrays-01-00084">
        <label>17.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Barbacioru</surname>
              <given-names>C.C.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Canales</surname>
              <given-names>R.D.</given-names>
            </name>
            <name>
              <surname>Sun</surname>
              <given-names>Y.A.</given-names>
            </name>
            <name>
              <surname>Keys</surname>
              <given-names>D.N.</given-names>
            </name>
            <name>
              <surname>Chan</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Poulter</surname>
              <given-names>K.A.</given-names>
            </name>
            <name>
              <surname>Samaha</surname>
              <given-names>R.R.</given-names>
            </name>
          </person-group>
          <article-title>Effect of various normalization methods on applied biosystems expression array system data</article-title>
          <source>BMC Bioinforma.</source>
          <year>2006</year>
          <pub-id pub-id-type="doi">10.1186/1471-2105-7-533</pub-id>
        </citation>
      </ref>
      <ref id="B18-microarrays-01-00084">
        <label>18.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Irizarry</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Hobbs</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Collin</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Beazer-Barclay</surname>
              <given-names>Y.D.</given-names>
            </name>
            <name>
              <surname>Antonellis</surname>
              <given-names>K.J.</given-names>
            </name>
            <name>
              <surname>Scherf</surname>
              <given-names>U.</given-names>
            </name>
            <name>
              <surname>Speed</surname>
              <given-names>T.P.</given-names>
            </name>
          </person-group>
          <article-title>Exploration, normalization, and summaries of high density oligonucleotide array probe level data</article-title>
          <source>Biostatistics</source>
          <year>2003</year>
          <volume>4</volume>
          <fpage>249</fpage>
          <lpage>264</lpage>
          <pub-id pub-id-type="doi">10.1093/biostatistics/4.2.249</pub-id>
        </citation>
      </ref>
      <ref id="B19-microarrays-01-00084">
        <label>19.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wu</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Dave</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Tseng</surname>
              <given-names>G.C.</given-names>
            </name>
            <name>
              <surname>Richards</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Xing</surname>
              <given-names>E.P.</given-names>
            </name>
            <name>
              <surname>Kaminski</surname>
              <given-names>N.</given-names>
            </name>
          </person-group>
          <article-title>Comparison of normalization methods for codelink bioarray data</article-title>
          <source>BMC Bioinforma.</source>
          <year>2005</year>
          <pub-id pub-id-type="doi">10.1186/1471-2105-6-309</pub-id>
        </citation>
      </ref>
      <ref id="B20-microarrays-01-00084">
        <label>20.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Du</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Kibbe</surname>
              <given-names>W.A.</given-names>
            </name>
            <name>
              <surname>Lin</surname>
              <given-names>S.M.</given-names>
            </name>
          </person-group>
          <article-title>Lumi: A pipeline for processing illumina microarray</article-title>
          <source>Bioinformatics</source>
          <year>2008</year>
          <volume>24</volume>
          <fpage>1547</fpage>
          <lpage>1548</lpage>
          <pub-id pub-id-type="doi">10.1093/bioinformatics/btn224</pub-id>
        </citation>
      </ref>
      <ref id="B21-microarrays-01-00084">
        <label>21.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zahurak</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Parmigiani</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Yu</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Scharpf</surname>
              <given-names>R.B.</given-names>
            </name>
            <name>
              <surname>Berman</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Schaeffer</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Shabbeer</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Cope</surname>
              <given-names>L.</given-names>
            </name>
          </person-group>
          <article-title>Pre-processing agilent microarray data</article-title>
          <source>BMC Bioinforma.</source>
          <year>2007</year>
          <pub-id pub-id-type="doi">10.1186/1471-2105-8-142</pub-id>
        </citation>
      </ref>
      <ref id="B22-microarrays-01-00084">
        <label>22.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kerr</surname>
              <given-names>K.F.</given-names>
            </name>
          </person-group>
          <article-title>Extended analysis of benchmark datasets for agilent two-color microarrays</article-title>
          <source>BMC Bioinforma.</source>
          <year>2007</year>
          <pub-id pub-id-type="doi">10.1186/1471-2105-8-371</pub-id>
        </citation>
      </ref>
      <ref id="B23-microarrays-01-00084">
        <label>23.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Barrett</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Suzek</surname>
              <given-names>T.O.</given-names>
            </name>
            <name>
              <surname>Troup</surname>
              <given-names>D.B.</given-names>
            </name>
            <name>
              <surname>Wilhite</surname>
              <given-names>S.E.</given-names>
            </name>
            <name>
              <surname>Ngau</surname>
              <given-names>W.C.</given-names>
            </name>
            <name>
              <surname>Ledoux</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Rudnev</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Lash</surname>
              <given-names>A.E.</given-names>
            </name>
            <name>
              <surname>Fujibuchi</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Edgar</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>NCBI GEO: Mining millions of expression profiles—Database and tools</article-title>
          <source>Nucleic Acids Res.</source>
          <year>2005</year>
          <volume>33</volume>
          <fpage>D562</fpage>
          <lpage>D566</lpage>
        <pub-id pub-id-type="pmid">15608262</pub-id></citation>
      </ref>
      <ref id="B24-microarrays-01-00084">
        <label>24.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Chang</surname>
              <given-names>E.C.</given-names>
            </name>
            <name>
              <surname>Charn</surname>
              <given-names>T.H.</given-names>
            </name>
            <name>
              <surname>Park</surname>
              <given-names>S.H.</given-names>
            </name>
            <name>
              <surname>Helferich</surname>
              <given-names>W.G.</given-names>
            </name>
            <name>
              <surname>Komm</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Katzenellenbogen</surname>
              <given-names>J.A.</given-names>
            </name>
            <name>
              <surname>Katzenellenbogen</surname>
              <given-names>B.S.</given-names>
            </name>
          </person-group>
          <article-title>Estrogen receptors alpha and beta as determinants of gene expression: Influence of ligand, dose, and chromatin binding</article-title>
          <source>Mol. Endocrinol.</source>
          <year>2008</year>
          <volume>22</volume>
          <fpage>1032</fpage>
          <lpage>1043</lpage>
          <pub-id pub-id-type="doi">10.1210/me.2007-0356</pub-id>
        </citation>
      </ref>
      <ref id="B25-microarrays-01-00084">
        <label>25.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lutter</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Ugocsai</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Grandl</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Orso</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Theis</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Lang</surname>
              <given-names>E.W.</given-names>
            </name>
            <name>
              <surname>Schmitz</surname>
              <given-names>G.</given-names>
            </name>
          </person-group>
          <article-title>Analyzing m-csf dependent monocyte/macrophage differentiation: Expression modes and meta-modes derived from an independent component analysis</article-title>
          <source>BMC Bioinformatics</source>
          <year>2008</year>
          <pub-id pub-id-type="doi">10.1186/1471-2105-9-100</pub-id>
        </citation>
      </ref>
      <ref id="B26-microarrays-01-00084">
        <label>26.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Aoyagi</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Minashi</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Igaki</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Tachimori</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Nishimura</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Hokamura</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Ashida</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Daiko</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Ochiai</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Muto</surname>
              <given-names>M.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Artificially induced epithelial-mesenchymal transition in surgical subjects: Its implications in clinical and basic cancer research</article-title>
          <source>PLoS One</source>
          <year>2011</year>
          <volume>6</volume>
          <fpage>e18196</fpage>
        <pub-id pub-id-type="doi">10.1371/journal.pone.0018196</pub-id><pub-id pub-id-type="pmid">21533028</pub-id></citation>
      </ref>
      <ref id="B27-microarrays-01-00084">
        <label>27.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Sean</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Meltzer</surname>
              <given-names>P.S.</given-names>
            </name>
          </person-group>
          <article-title>GEOquery: A bridge between the Gene Expression Omnibus (GEO) and bioconductor</article-title>
          <source>Bioinformatics</source>
          <year>2007</year>
          <volume>23</volume>
          <fpage>1846</fpage>
          <lpage>1847</lpage>
          <pub-id pub-id-type="doi">10.1093/bioinformatics/btm254</pub-id>
        </citation>
      </ref>
      <ref id="B28-microarrays-01-00084">
        <label>28.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gautier</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Cope</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Bolstad</surname>
              <given-names>B.M.</given-names>
            </name>
            <name>
              <surname>Irizarry</surname>
              <given-names>R.A.</given-names>
            </name>
          </person-group>
          <article-title>Affy—Analysis of affymetrix genechip data at the probe level</article-title>
          <source>Bioinformatics</source>
          <year>2004</year>
          <volume>20</volume>
          <fpage>307</fpage>
          <lpage>315</lpage>
          <pub-id pub-id-type="doi">10.1093/bioinformatics/btg405</pub-id>
        </citation>
      </ref>
      <ref id="B29-microarrays-01-00084">
        <label>29.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Stacklies</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Redestig</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Scholz</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Walther</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Selbig</surname>
              <given-names>J.</given-names>
            </name>
          </person-group>
          <article-title>Pcamethods—A bioconductor package providing pca methods for incomplete data</article-title>
          <source>Bioinformatics</source>
          <year>2007</year>
          <volume>23</volume>
          <fpage>1164</fpage>
          <lpage>1167</lpage>
        <pub-id pub-id-type="doi">10.1093/bioinformatics/btm069</pub-id><pub-id pub-id-type="pmid">17344241</pub-id></citation>
      </ref>
      <ref id="B30-microarrays-01-00084">
        <label>30.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Krzywinski</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Schein</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Birol</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Connors</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Gascoyne</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Horsman</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Jones</surname>
              <given-names>S.J.</given-names>
            </name>
            <name>
              <surname>Marra</surname>
              <given-names>M.A.</given-names>
            </name>
          </person-group>
          <article-title>Circos: An information aesthetic for comparative genomics</article-title>
          <source>Genome Res.</source>
          <year>2009</year>
          <volume>19</volume>
          <fpage>1639</fpage>
          <lpage>1645</lpage>
          <pub-id pub-id-type="doi">10.1101/gr.092759.109</pub-id>
        </citation>
      </ref>
      <ref id="B31-microarrays-01-00084">
        <label>31.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ritchie</surname>
              <given-names>M.E.</given-names>
            </name>
            <name>
              <surname>Dunning</surname>
              <given-names>M.J.</given-names>
            </name>
            <name>
              <surname>Smith</surname>
              <given-names>M.L.</given-names>
            </name>
            <name>
              <surname>Shi</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Lynch</surname>
              <given-names>A.G.</given-names>
            </name>
          </person-group>
          <article-title>BeadArray expression analysis using bioconductor</article-title>
          <source>PLoS Comput. Biol.</source>
          <year>2011</year>
          <volume>7</volume>
          <fpage>e1002276</fpage>
          <pub-id pub-id-type="doi">10.1371/journal.pcbi.1002276</pub-id>
        </citation>
      </ref>
    </ref-list>
  </back>
</article>
