<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-5-12</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p>Array-A-Lizer: A serial DNA microarray quality analyzer</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Petri</snm>
               <fnm>Andreas</fnm>
               <insr iid="I1"/>
               <email>andp@novonordisk.com</email>
            </au>
            <au id="A2">
               <snm>Fleckner</snm>
               <fnm>Jan</fnm>
               <insr iid="I1"/>
               <email>jafl@novonordisk.com</email>
            </au>
            <au id="A3">
               <snm>Matthiessen</snm>
               <mnm>Wichmann</mnm>
               <fnm>Mads</fnm>
               <insr iid="I1"/>
               <email>bmc@madswichmann.dk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Molecular Genetics, Novo Nordisk A/S, Bagsv&#230;rd, 2880, Denmark</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2004</pubdate>
         <volume>5</volume>
         <issue>1</issue>
         <fpage>12</fpage>
         <url>http://www.biomedcentral.com/1471-2105/5/12</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/1471-2105-5-12</pubid>
               <pubid idtype="pmpid">15018654</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>15</day>
               <month>10</month>
               <year>2003</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>09</day>
               <month>2</month>
               <year>2004</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>09</day>
               <month>2</month>
               <year>2004</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2004</year>
         <collab>Petri et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The proliferate nature of DNA microarray results have made it necessary to implement a uniform and quick quality control of experimental results to ensure the consistency of data across multiple experiments prior to actual data analysis.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Array-A-Lizer is a small and convenient stand-alone tool providing the necessary initial analysis of hybridization quality of an unlimited number of microarray experiments. The experiments are analyzed for even hybridization across the slide and between fluorescent dyes in two-color experiments in spotted DNA microarrays.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>Array-A-Lizer allows the expedient determination of the quality of multiple DNA microarray experiments allowing for a rapid initial screening of results before progressing to further data analysis. Array-A-Lizer is directed towards speed and ease-of-use allowing both the expert and non-expert microarray researcher to rapidly assess the quality of multiple microarray hybridizations. Array-A-Lizer is available from the Internet as both source code and as a binary installation package.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The ongoing development of DNA microarray analysis equipment have diminished both the price and workload associated with microarray experiments leading to development of data at a tremendous rate. It is not unusual for a group of researchers to be able to produce and scan 50&#8211;100 microarray slides per week. The processing of such large amounts of experimental data, first requires verification of the overall quality of the experiments. Array-A-Lizer employs two tests to monitor the quality of the hybridization with respect to uniformity across the slide as well as relative intensity of the fluorescent dyes in two color experiments: 1) spectrum analysis of the signal across the microarray slide and 2) comparison of the two dyes that are used in two-color experiments (for instance Cy3 and Cy5).</p>
      </sec>
      <sec>
         <st>
            <p>Implementation</p>
         </st>
         <p>The Array-A-Lizer graphical user interface (GUI) is created in Borland Delphi and the statistical calculations are carried out in the R-project statistical scripting language <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Array-A-Lizer includes a microdistribution of the R-project and contains options for specifying the graphical output type as either bitmaps or postscript. Array-A-Lizer supports experiment files from GenePixPro and Spotfinder through an open architecture, which can be extended to include other file formats. Array-A-Lizer runs on the Microsoft Windows platform.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <p>Array-A-Lizer is an application for rapid quality control of large DNA microarray experiments. The program consists of a collection of scripts, that are contained and accessed through a GUI to ease their use (figure <figr fid="F1">1</figr>). The main advantage of the program is the rapid processing of an unlimited number of experiments. Array-A-Lizer generates reports with a graphical analysis of each experiment, providing the researcher with a rapid survey of the quality of experiments (figures <figr fid="F2">2</figr> and <figr fid="F3">3</figr>). Additionally, the program returns an overview of the results in the system browser with hyperlinks to each analysis report (figure <figr fid="F4">4</figr>).</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Graphical User Interface</p>
            </caption>
            <text>
               <p><b>Graphical User Interface. </b>An easily accessible graphical user interface is used to select experiments and analysis method.</p>
            </text>
            <graphic file="1471-2105-5-12-1"/>
         </fig>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Diagnostic report</p>
            </caption>
            <text>
               <p><b>Diagnostic report. </b>A + B) The diagnostic method generates scatter plots of the dual color hybridization data, both as MvA plots and red/green-scatter plots. In these plots the green line represents M = 0 and red = green respectively, making it easy to assess the balance between the green and the red channel. C + D) The distribution of log<sub>2 </sub>transformed intensities from the individual channels are visualized by plotting the data as histograms. Finally, the report contains information on the files used to generate the diagnostic plots, the number of saturated spots, and the number of negative spots (background is higher than foreground).</p>
            </text>
            <graphic file="1471-2105-5-12-2"/>
         </fig>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Spatial report</p>
            </caption>
            <text>
               <p><b>Spatial report. </b>Output from the spatial analysis method shows various representations of analyzed data according to the location of the spot on the array. A pseudo color representation is used to display log<sub>2 </sub>transformed foreground intensities and raw backgroundintensities. A plot showing the location of the negative values across the array is also generated. The cutoff value for the background data plot can be set in the GUI prior to analysis. A) Spatial plot from a hybridization showing clear fading at the right part of the slide. B) High background values results in increased frequency of negative spots. C) Inadequate post-hybridization washing often results in distinct background patterns. D) Bleeding of signal from the actual spot into the background area results in regions of high background values that are also visible in the foreground plot.</p>
            </text>
            <graphic file="1471-2105-5-12-3"/>
         </fig>
         <fig id="F4">
            <title>
               <p>Figure 4</p>
            </title>
            <caption>
               <p>Analysis reports</p>
            </caption>
            <text>
               <p><b>Analysis reports. </b>For easy accessibility, reports of the quality analyses including thumbnail pictures are presented in a hyperlinked document in the default Internet browser.</p>
            </text>
            <graphic file="1471-2105-5-12-4"/>
         </fig>
         <p>Array-A-Lizer facilitates the generation of several plots that detail the quality of the experiments. Two different analysis modes can be chosen, resulting in either a set of diagnostic plots or a spatial representation of the data.</p>
         <p>In comparison to existing analysis packages, Array-A-Lizer is both quick and easy to use. It is a stand-alone application that can be installed on any desktop computer running MS Windows. It is intended for easy visualization of microarray data allowing both the expert and non-expert microarray researcher to assess the quality of multiple microarray hybridizations.</p>
         <sec>
            <st>
               <p>Diagnostic report</p>
            </st>
            <p>In this mode, the experimental data are used to generate several diagnostic plots (figure <figr fid="F2">2</figr>) as well as statistics on the identified spots. The Array-A-Lizer diagnostic report includes both MvA plots (figure <figr fid="F2">2A</figr> left)<abbrgrp><abbr bid="B2">2</abbr></abbrgrp> and red/green-scatter plots (figure <figr fid="F2">2A</figr> right), both of which show spot intensities after local background subtraction.</p>
            <p>MvA plots display the log intensity ratio M = log<sub>2</sub>(R/G) versus the mean log intensity <graphic file="1471-2105-5-12-i1.gif"/>. This plot type is widely use to visualize array data because it directly displays the red to green ratios, which are often the quantities of interest in most experiments. Furthermore, MvA plots make it easy to identify intensity dependent biases in the data (i.e. curvature or 'banana shape'). In scatter plots, the intensities from the green channel are plotted against the red channel after log<sub>2 </sub>transformation. Genes displaying difference in signal intensities in the two channels are plotted off the diagonal and genes showing similar intensities are plotted close to the the diagonal.</p>
            <p>A common source of variation in microarray data acquisition is attributed by incorrectly balanced photomultiplier tube (PMT) settings during scanning. This results in overall differences in signal intensities obtained from either channel and a shift of the data from the x-axis (M = 0) or the diagonal (red = green) of the ideal MvA and scatter-plot respectively (figure <figr fid="F2">2B</figr>).</p>
            <p>Finally, the diagnostic analysis generates histograms of the log<sub>2 </sub>transformed data for comparison of the distribution of intensities between the two channels. The histograms display the signal intensities across the slide (figure <figr fid="F2">2C</figr>). Overamplified channels (PMT levels are set too high) will result in many saturated spots, which is revealed as an over representation of high intensity values (figure <figr fid="F2">2D</figr>).</p>
            <p>The diagnostic report includes information on which files were used for the analysis, the number of saturated spots, and the number of negative values, i.e. the number of spots where the background intensity was higher than the foreground intensity.</p>
         </sec>
         <sec>
            <st>
               <p>Spatial report</p>
            </st>
            <p>The spatial analysis results in a graphical representation of microarray data according to the location on the slide (figure <figr fid="F3">3</figr>). From each channel, three different plots are generated showing the log<sub>2 </sub>transformed foreground intensities, the background intensities, and a plot showing the location of negative values (background higher than foreground). This analysis method can be used to identify spatial effects on the hybridized arrays such as fading or illumination at the edges due to cover-slip effects (figure <figr fid="F3">3A</figr> and <figr fid="F3">3B</figr>) or scratches and artifacts resulting from inadequate washing of slides (figure <figr fid="F3">3C</figr> and <figr fid="F3">3D</figr>).</p>
            <p>The cut-off values on the background plot can be set from the GUI prior to starting the analysis. Keeping these limits fixed will allow easy detection of pronounced fluctuations in background intensities both between and within slides.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>With the reduced cost and labor of DNA microarray experiments, it is important that the inherent high through-put nature of the technology does not lower the quality of data and it is therefore vital that the control of experimental variability is consistently monitored, so the quality of subsequent data analysis is not severely weakened by the infusion of low quality data. Initial quality control is necessary and Array-A-Lizer delivers an easy-to-use application for rapid determination of experiment quality.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p>Project name: Array-A-Lizer</p>
         <p>Project home page: <url>http://www.bioinformatics.org/arrayalizer</url></p>
         <p>Operating system: MS Windows</p>
         <p>Programming language: Delphi and R</p>
         <p>License: GNU GPL</p>
         <p>Any restrictions to use by non-academics: None</p>
      </sec>
      <sec>
         <st>
            <p>List of Abbreviations</p>
         </st>
         <p>GNU: GNU's Not Unix</p>
         <p>GPL: General public license</p>
         <p>GUI: Graphical user interface</p>
         <p>PMT: Photomultiplier tube</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>AP and MWM conceived of the project and contributed equally to the work presented in this manuscript. JF supervised the project and provided the funding. All authors read and approved the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The R project in statistical computing</p>
            </title>
            <aug>
               <au>
                  <snm>Ripley</snm>
                  <fnm>BD</fnm>
               </au>
            </aug>
            <source>MSOR Connections. The newsletter of the LTSN Maths, Stats &amp; OR Network</source>
            <pubdate>2001</pubdate>
            <volume>1</volume>
            <issue>1</issue>
            <fpage>23</fpage>
            <lpage>25</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Normalization for cdna microarray data: a robust composite method addressing single and multiple slide systematic variation</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>HV</fnm>
               </au>
               <au>
                  <snm>Dudoit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Luu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ngai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>e15</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">11842121</pubid>
                  <pubid idtype="doi">10.1093/nar/30.4.e15</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
