<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-7-205</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p><it>SIMAGE</it>: <it>si</it>mulation of DNA-<it>m</it>icro<it>a</it>rray <it>g</it>ene <it>e</it>xpression data</p>
         </title>
         <aug>
            <au id="A1" ca="yes" ce="yes">
               <snm>Albers</snm>
               <mi>J</mi>
               <fnm>Casper</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>c.j.albers@open.ac.uk</email>
            </au>
            <au id="A2">
               <snm>Jansen</snm>
               <mi>C</mi>
               <fnm>Ritsert</fnm>
               <insr iid="I1"/>
               <email>r.c.jansen@rug.nl</email>
            </au>
            <au id="A3">
               <snm>Kok</snm>
               <fnm>Jan</fnm>
               <insr iid="I2"/>
               <email>jan.kok@rug.nl</email>
            </au>
            <au id="A4">
               <snm>Kuipers</snm>
               <mi>P</mi>
               <fnm>Oscar</fnm>
               <insr iid="I2"/>
               <email>o.p.kuipers@rug.nl</email>
            </au>
            <au id="A5" ca="yes" ce="yes">
               <snm>van Hijum</snm>
               <mi>AFT</mi>
               <fnm>Sacha</fnm>
               <insr iid="I2"/>
               <email>s.a.f.t.van.hijum@rug.nl</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Groningen Bioinformatics Centre, University of Groningen, Groningen Biomolecular Sciences and Biotechnology Institute, PO Box 14, 9750 AA Haren, The Netherlands</p>
            </ins>
            <ins id="I2">
               <p>Department of Molecular Genetics, University of Groningen, Groningen Biomolecular Sciences and Biotechnology Institute, PO Box 14, 9750 AA Haren, The Netherlands</p>
            </ins>
            <ins id="I3">
               <p>Department of Statistics, The Open University, Walton Hall, Milton Keynes MK7 6AA, UK</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>205</fpage>
         <url>http://www.biomedcentral.com/1471-2105/7/205</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16613602</pubid>
               <pubid idtype="doi">10.1186/1471-2105-7-205</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>28</day>
               <month>2</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>13</day>
               <month>4</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>13</day>
               <month>4</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Albers et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Simulation of DNA-microarray data serves at least three purposes: (i) optimizing the design of an intended DNA microarray experiment, (ii) comparing existing pre-processing and processing methods for best analysis of a given DNA microarray experiment, (iii) educating students, lab-workers and other researchers by making them aware of the many factors influencing DNA microarray experiments.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Our model has multiple layers of factors influencing the experiment. The relative influence of such factors can differ significantly between labs, experiments within labs, etc. Therefore, we have added a module to roughly estimate their parameters from a given data set. This guarantees that our simulated data mimics real data as closely as possible.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>We introduce a model for the simulation of dual-dye cDNA-microarray data closely resembling real data and coin the model and its software implementation "<it>SIMAGE</it>" which stands for <ul>si</ul>mulation of <ul>m</ul>icro<ul>a</ul>rray <ul>g</ul>ene <ul>e</ul>xpression data. The software is freely accessible at: <url>http://bioinformatics.biol.rug.nl/websoftware/simage</url>.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="refman"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>No two laboratories produce the same expression data when performing seemingly identical DNA microarray experiments. This is simply due to the fact that experimental conditions and factors such as growth media composition, RNA sampling methodology and scanner calibration are never exactly identical <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Even within one and the same laboratory differences in the outcome of experiments executed by different laborants can be observed <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. These and many other factors lead to sometimes unexpected gene expression variations that can occur at several levels. Figure <figr fid="F1">1</figr> shows a schematic and simplified overview of those levels in dual-color DNA microarray data (top) as well as the effect of some of these levels on the composition of the expression signals (bottom).</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Schematic overview of the <it>SIMAGE </it>model (A) and a few layers visualized (B and C)</p>
            </caption>
            <text>
               <p><b>Schematic overview of the <it>SIMAGE </it>model (A) and a few layers visualized (B and C). </b>The blue-marked boxes (A) indicate layers that are further visualized (B and C). A simulation of the entire 'non-biological' signal is shown in B and C. Top row, sum of the gradient effects and density effects; second row, spot pin effects; third row: Gaussian noise. The bottom row shows the sum of all the effects pictured in the top three rows. The signals are plotted three-dimensionally (left side view) and two-dimensionally (right side view).</p>
            </text>
            <graphic file="1471-2105-7-205-1"/>
         </fig>
         <p>It is obvious that knowledge about the properties that gene expression signals hold as shown in Figure <figr fid="F1">1</figr> is very important to any researcher. This knowledge can be used to design an optimal new DNA microarray experiment and to carry out the best analysis of that generated data set. To this end, real and simulated data has been used for a long time in data analysis workshops. And in case it is not (yet) obvious for a novice in the field, the visualization as shown in Figure <figr fid="F1">1</figr> can be of educational value. In all three cases &#8211; design, analysis and education &#8211; the availability of simulation software would be very helpful. It is of eminent importance that the simulated data resembles experimental data as much as possible <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> and, therefore, there is a need for software that can (roughly) estimate levels of variation.</p>
         <p>Various software packages for simulating dual-color DNA microarray data have been described in the literature (Table <tblr tid="T1">1</tblr>). Some of these packages aim at validating image analysis and spot quantification and simulate TIF-images of the visualized spots. Other software focuses on gene networks. In this paper, we present new software that fits in between these two categories: we simulate the gene expression data as it is tabulated after image analysis. In addition, we provide software that can be used to roughly estimate the levels of variation in real data. These computer programs, named <it>SIMAGE</it>, are freely available via a web-resource.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Comparison of the properties of different DNA-microarray simulation models described in literature. '+' and '-' indicate availability of the indicated feature in the specified model. Note that the modeling of features in the specific models is usually not the same.</p>
            </caption>
            <tblbdy cols="8">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>Method</p>
                  </c>
                  <c ca="left">
                     <p><it>SIMAGE</it>, this study</p>
                  </c>
                  <c ca="left">
                     <p>Lalush <it>et al. </it>[23]</p>
                  </c>
                  <c ca="left">
                     <p>Balagurunathan <it>et al. </it>[24]</p>
                  </c>
                  <c ca="left">
                     <p>Lonnstedt <it>et al. </it>[25]</p>
                  </c>
                  <c ca="left">
                     <p>Wierling <it>et al. </it>[26]</p>
                  </c>
                  <c ca="left">
                     <p>GE2</p>
                  </c>
               </r>
               <r>
                  <c cspan="8">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>model implementation</p>
                  </c>
                  <c ca="left">
                     <p>tabulated gene expression data</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>modeling of gene networks</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>TIFF output</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>software available</p>
                  </c>
                  <c ca="left">
                     <p>+ [5]</p>
                  </c>
                  <c ca="left">
                     <p>+ <sup>1</sup></p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+ [27]</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>adjustable model</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+/-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>model effects</p>
                  </c>
                  <c ca="left">
                     <p>background surface pattern</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>spot pin / grid effects</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>channel effects</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>non-linearity effects</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>missing data</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>'fishtailing'</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>spot shape / size</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>+</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>1) C++ code available from author.</p>
            </tblfn>
         </tbl>
      </sec>
      <sec>
         <st>
            <p>Implementation</p>
         </st>
         <sec>
            <st>
               <p>Software overview</p>
            </st>
            <p>The <it>SIMAGE </it>web site <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> consists of three parts: (i) a start page where the user may upload parameters for a simulation; (ii) a web-page where the user specifies a number of parameters, after which (iii) the results, namely simulated data (see below), are presented. Each run is assigned a unique session, which allows the user to inspect the results at any given time. <it>SIMAGE </it>provides the user with three types of text files containing: (i) the parameters used for the simulation. This file can, in turn, be provided to <it>SIMAGE </it>for future simulations; (ii) the (differential) gene expressions, which are obtained after applying only the gene-layer; and (iii) the observed signals (including their location on the slide and the corresponding genes) after applying the layers explained in the following paragraphs. The estimates from a provided dataset are determined by a self-explanatory web site which is linked from the <it>SIMAGE </it>web site.</p>
            <p><it>SIMAGE </it>was written in Pascal and compiled by FreePascal version 1.0.10 <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Estimates for parameters, based on real data, are provided by a script developed in <it>R </it><abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The software requires an Apache web-server and is integrated with a PHP web-interface.</p>
         </sec>
         <sec>
            <st>
               <p>The general model</p>
            </st>
            <p>Figure <figr fid="F1">1</figr> shows a schematic representation of the model used in this study. Throughout this manuscript, and in the software package, the base-2 logarithm is used, unless stated otherwise. In this model the (log) expression signals are composed of the following components: (i) gene expression, (ii) a raw background gradient signal, (iii) a channel effect, (iv) a spot pin effect, (v) a nonlinear effect, (vi) a quantization and saturation effect, and (vii) random error due to unknown and/or unmodeled factors. The model is explained in more detail in the following text.</p>
         </sec>
         <sec>
            <st>
               <p>Dimensions and notation</p>
            </st>
            <p>A DNA-microarray slide consists of a number of spots arranged in a two-dimensional matrix, which, in turn is divided into <it>n</it><sub><it>row </it></sub>&#215; <it>n</it><sub><it>col </it></sub>square (sub)grids (see supplementary Fig. S1). Each grid contains <m:math name="1471-2105-7-205-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>n</m:mi><m:mrow><m:mi>s</m:mi><m:mi>p</m:mi><m:mi>o</m:mi><m:mi>t</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGUbGBdaqhaaWcbaGaem4CamNaemiCaaNaem4Ba8MaemiDaqhabaGaeGOmaidaaaaa@34E0@</m:annotation></m:semantics></m:math> spots (<it>n</it><sub><it>spot </it></sub>in both the horizontal and vertical direction), amounting to a total number of <it>n</it><sub><it>row </it></sub>&#215; <it>n</it><sub><it>col </it></sub>&#215; <m:math name="1471-2105-7-205-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>n</m:mi><m:mrow><m:mi>s</m:mi><m:mi>p</m:mi><m:mi>o</m:mi><m:mi>t</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGUbGBdaqhaaWcbaGaem4CamNaemiCaaNaem4Ba8MaemiDaqhabaGaeGOmaidaaaaa@34E0@</m:annotation></m:semantics></m:math> spots per slide. In total, <it>n</it><sub><it>slide </it></sub>slides are simulated with each spot providing a measure of the green (Cy3) and the red (Cy5) signal. The simulated log expression at spot (<it>i, j</it>) of array <it>l </it>and channel <it>k </it>is denoted by <it>y</it><sub><it>ijkl</it></sub>.</p>
         </sec>
         <sec>
            <st>
               <p>Gene expressions</p>
            </st>
            <p>To distinguish between non-regulated and regulated genes, the log ratio is considered. Assuming absence of other systematic deviations, the log ratio of the Cy3 and C5 channel signals log(Cy3/Cy5) = log(Cy3) - log(Cy5), is zero for non-regulated genes (with deviances only due to randomness), positive for up-regulated genes, and negative for down-regulated genes. The part of <it>y</it><sub><it>jikl </it></sub>that is affected in this gene expression layer will be called <m:math name="1471-2105-7-205-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>y</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi><m:mi>k</m:mi><m:mi>l</m:mi></m:mrow><m:mo>*</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG5bqEdaqhaaWcbaGaemyAaKMaemOAaOMaem4AaSMaemiBaWgabaGaeiOkaOcaaaaa@34A8@</m:annotation></m:semantics></m:math>. This <m:math name="1471-2105-7-205-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>y</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi><m:mi>k</m:mi><m:mi>l</m:mi></m:mrow><m:mo>*</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG5bqEdaqhaaWcbaGaemyAaKMaemOAaOMaem4AaSMaemiBaWgabaGaeiOkaOcaaaaa@34A8@</m:annotation></m:semantics></m:math> will be the same on all spots where the same gene <it>g </it>is spotted (see also formula 3), hence we can use the notation <m:math name="1471-2105-7-205-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>y</m:mi><m:mi>g</m:mi><m:mo>*</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG5bqEdaqhaaWcbaGaem4zaCgabaGaeiOkaOcaaaaa@3087@</m:annotation></m:semantics></m:math>. We split <m:math name="1471-2105-7-205-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>y</m:mi><m:mi>g</m:mi><m:mo>*</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG5bqEdaqhaaWcbaGaem4zaCgabaGaeiOkaOcaaaaa@3087@</m:annotation></m:semantics></m:math> into two parts: <it>G</it><sub><it>g</it>;<it>k</it></sub>, which is concerned with the 'true' expression of gene <it>g </it>on channel <it>k </it>before any other layer is applied, and <it>D</it><sub><it>g</it>;<it>k</it></sub>, which is a (possible) deviation due to up- or down-regulation of gene <it>g</it>. The modeling of <m:math name="1471-2105-7-205-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>y</m:mi><m:mi>g</m:mi><m:mo>*</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG5bqEdaqhaaWcbaGaem4zaCgabaGaeiOkaOcaaaaa@3087@</m:annotation></m:semantics></m:math> is done via <m:math name="1471-2105-7-205-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>y</m:mi><m:mrow><m:mi>g</m:mi><m:mo>;</m:mo><m:mi>k</m:mi></m:mrow><m:mo>*</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG5bqEdaqhaaWcbaGaem4zaCMaei4oaSJaem4AaSgabaGaeiOkaOcaaaaa@32E4@</m:annotation></m:semantics></m:math> = <it>G</it><sub><it>g</it>;<it>k </it></sub>+ <it>D</it><sub><it>g</it>;<it>k</it></sub>. To start with <it>G</it><sub><it>g</it>;<it>k</it></sub>, we assume that this latent (i.e. unobservable) variable is distributed as <it>G</it><sub><it>g</it>;<it>k </it></sub>~ N(<it>&#956;</it>,<m:math name="1471-2105-7-205-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>G</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaem4raCeabaGaeGOmaidaaaaa@30AC@</m:annotation></m:semantics></m:math>). Here, <it>&#956; </it>is the average expression value, and <m:math name="1471-2105-7-205-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>G</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaem4raCeabaGaeGOmaidaaaaa@30AC@</m:annotation></m:semantics></m:math> can be interpreted as the variation in gene expression. Hence, the expressions of non-differentially expressed genes are distributed symmetrically around <it>&#956;</it>, according to a normal distribution. For a comment on the normality assumption of <it>G</it><sub><it>g</it>;<it>k</it></sub>, see the end of this paragraph. In most DNA-microarray data <it>G</it><sub><it>g</it>;1 </sub>and <it>G</it><sub><it>g</it>;2 </sub>are not statistically independent <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, which can be observed by considering a plot of Cy3 versus Cy5 signals. Therefore the covariance cov(<it>G</it><sub><it>g</it>;1</sub>,<it>G</it><sub><it>g</it>;2</sub>) = <it>&#961;</it><m:math name="1471-2105-7-205-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>G</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaem4raCeabaGaeGOmaidaaaaa@30AC@</m:annotation></m:semantics></m:math> is introduced, where <it>&#961; </it>is the correlation coefficient between the signals from the two channels.</p>
            <p>For each gene we model the probability of being up-, down- and non-regulated <it>&#960;</it><sub>+</sub>, <it>&#960;</it><sub>-</sub>, and <it>&#960;</it><sub>0</sub>= 1 - <it>&#960;</it><sub>+ </sub>- <it>&#960;</it><sub>-</sub>, respectively. For up-regulated genes the log ratio is increased by 2<it>&#956;</it><sub><it>D </it></sub>and for down-regulated genes the log ratio is decreased by 2<it>&#956;</it><sub><it>D</it></sub>. The effect of regulation is modeled via <it>D</it><sub><it>g</it>;<it>k</it></sub>:</p>
            <p>
               <m:math name="1471-2105-7-205-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mrow>
                              <m:mtable>
                                 <m:mtr>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>D</m:mi>
                                             <m:mrow>
                                                <m:mi>g</m:mi>
                                                <m:mo>;</m:mo>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>D</m:mi>
                                             <m:mrow>
                                                <m:mi>g</m:mi>
                                                <m:mo>;</m:mo>
                                                <m:mn>2</m:mn>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                           </m:mrow>
                           <m:mo>)</m:mo>
                        </m:mrow>
                        <m:mo>=</m:mo>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mrow>
                              <m:mtable>
                                 <m:mtr>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>k</m:mi>
                                             <m:mn>1</m:mn>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>k</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                           </m:mrow>
                           <m:mo>)</m:mo>
                        </m:mrow>
                        <m:msub>
                           <m:mi>&#956;</m:mi>
                           <m:mi>D</m:mi>
                        </m:msub>
                        <m:mtext>,&#160;where&#160;</m:mtext>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mrow>
                              <m:msub>
                                 <m:mi>k</m:mi>
                                 <m:mn>1</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>k</m:mi>
                                 <m:mn>2</m:mn>
                              </m:msub>
                           </m:mrow>
                           <m:mo>)</m:mo>
                        </m:mrow>
                        <m:mo>=</m:mo>
                        <m:mrow>
                           <m:mo>{</m:mo>
                           <m:mrow>
                              <m:mtable columnalign="left">
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mrow>
                                             <m:mo>(</m:mo>
                                             <m:mrow>
                                                <m:mo>&#8722;</m:mo>
                                                <m:mn>1</m:mn>
                                                <m:mo>,</m:mo>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                             <m:mo>)</m:mo>
                                          </m:mrow>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>if&#160;gene&#160;</m:mtext>
                                          <m:mi>g</m:mi>
                                          <m:mtext>&#160;is&#160;downregulated;</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mrow>
                                             <m:mo>(</m:mo>
                                             <m:mrow>
                                                <m:mtext>0,0</m:mtext>
                                             </m:mrow>
                                             <m:mo>)</m:mo>
                                          </m:mrow>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>if&#160;gene&#160;</m:mtext>
                                          <m:mi>g</m:mi>
                                          <m:mtext>&#160;is&#160;nonregulated;</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mrow>
                                             <m:mo>(</m:mo>
                                             <m:mrow>
                                                <m:mtext>1,</m:mtext>
                                                <m:mo>&#8722;</m:mo>
                                                <m:mtext>1</m:mtext>
                                             </m:mrow>
                                             <m:mo>)</m:mo>
                                          </m:mrow>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>if&#160;gene&#160;</m:mtext>
                                          <m:mi>g</m:mi>
                                          <m:mtext>&#160;is&#160;upregulated</m:mtext>
                                          <m:mtext>.</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                           </m:mrow>
                        </m:mrow>
                        <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mn>1</m:mn>
                           <m:mo>)</m:mo>
                        </m:mrow>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqadaqaauaabeqaceaaaeaacqWGebardaWgaaWcbaGaem4zaCMaei4oaSJaeGymaedabeaaaOqaaiabdseaenaaBaaaleaacqWGNbWzcqGG7aWocqaIYaGmaeqaaaaaaOGaayjkaiaawMcaaiabg2da9maabmaabaqbaeqabiqaaaqaaiabdUgaRnaaBaaaleaacqaIXaqmaeqaaaGcbaGaem4AaS2aaSbaaSqaaiabikdaYaqabaaaaaGccaGLOaGaayzkaaacciGae8hVd02aaSbaaSqaaiabdseaebqabaGccqqGSaalcqqGGaaicqqG3bWDcqqGObaAcqqGLbqzcqqGYbGCcqqGLbqzcqqGGaaidaqadaqaaiabdUgaRnaaBaaaleaacqaIXaqmaeqaaOGaeiilaWIaem4AaS2aaSbaaSqaaiabikdaYaqabaaakiaawIcacaGLPaaacqGH9aqpdaGabaqaauaabaqadiaaaeaadaqadaqaaiabgkHiTiabigdaXiabcYcaSiabigdaXaGaayjkaiaawMcaaaqaaiabbMgaPjabbAgaMjabbccaGiabbEgaNjabbwgaLjabb6gaUjabbwgaLjabbccaGiabdEgaNjabbccaGiabbMgaPjabbohaZjabbccaGiabbsgaKjabb+gaVjabbEha3jabb6gaUjabbkhaYjabbwgaLjabbEgaNjabbwha1jabbYgaSjabbggaHjabbsha0jabbwgaLjabbsgaKjabbUda7aqaamaabmaabaGaeeimaaJaeeilaWIaeeimaadacaGLOaGaayzkaaaabaGaeeyAaKMaeeOzayMaeeiiaaIaee4zaCMaeeyzauMaeeOBa4MaeeyzauMaeeiiaaIaem4zaCMaeeiiaaIaeeyAaKMaee4CamNaeeiiaaIaeeOBa4Maee4Ba8MaeeOBa4MaeeOCaiNaeeyzauMaee4zaCMaeeyDauNaeeiBaWMaeeyyaeMaeeiDaqNaeeyzauMaeeizaqMaee4oaSdabaWaaeWaaeaacqqGXaqmcqqGSaalcqGHsislcqqGXaqmaiaawIcacaGLPaaaaeaacqqGPbqAcqqGMbGzcqqGGaaicqqGNbWzcqqGLbqzcqqGUbGBcqqGLbqzcqqGGaaicqWGNbWzcqqGGaaicqqGPbqAcqqGZbWCcqqGGaaicqqG1bqDcqqGWbaCcqqGYbGCcqqGLbqzcqqGNbWzcqqG1bqDcqqGSbaBcqqGHbqycqqG0baDcqqGLbqzcqqGKbazcqqGUaGlaaaacaGL7baacaWLjaGaaCzcamaabmaabaGaeGymaedacaGLOaGaayzkaaaaaa@C919@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>Although <it>&#956;</it><sub><it>D </it></sub>is a fixed number, because of the sum <m:math name="1471-2105-7-205-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>y</m:mi><m:mrow><m:mi>g</m:mi><m:mo>;</m:mo><m:mi>k</m:mi></m:mrow><m:mo>*</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG5bqEdaqhaaWcbaGaem4zaCMaei4oaSJaem4AaSgabaGaeiOkaOcaaaaa@32E4@</m:annotation></m:semantics></m:math> = <it>G</it><sub><it>g</it>;<it>k </it></sub>+ <it>D</it><sub><it>g</it>;<it>k </it></sub>that is 'measured' this layer behaves as if regulated genes get a random sized shift up or down. Note that <it>G</it><sub><it>g</it>;<it>k</it></sub>, as well as almost all other stochastic variables are modeled in <it>SIMAGE </it>as outcomes of normal distributions. Although this brings about some oversimplification, we consider this not really an issue or, in the words of Wit and McClure (2004) <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, 'the misspecification made by using a normal approximation is typically negligible'. In addition, the superposition of the various normally distributed layers does not imply that the generated expressions levels themselves are normally distributed. Furthermore, modeling of <m:math name="1471-2105-7-205-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>y</m:mi><m:mi>g</m:mi><m:mo>*</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG5bqEdaqhaaWcbaGaem4zaCgabaGaeiOkaOcaaaaa@3087@</m:annotation></m:semantics></m:math> as a combination of three (normal) densities (non-, up- and down-regulated genes) enables estimating the model parameters (see "gene expressions" below). A more extensive modeling of the gene expressions, although biologically somewhat more correct, will result in a significantly more complex estimation of parameters compared to the parameters described in this study.</p>
         </sec>
         <sec>
            <st>
               <p>Replication variation</p>
            </st>
            <p>Spotting (spot pin effects), hybridization (non-uniform distribution of the labeled probe over the slide surface), and quantization (due to slide scanner properties) introduce variation in <it>y</it><sub><it>ijkl </it></sub>in addition to the 'natural biological variation'. This natural variation is modeled as follows. When a simulated gene <it>g </it>is spotted on spot <it>ijkl</it>, random errors <it>&#949;</it><sub><it>ij</it>1<it>l </it></sub>and <it>&#949;</it><sub><it>ij</it>2<it>l </it></sub>are drawn from <it>N</it>(0,<m:math name="1471-2105-7-205-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>&#949;</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGae8xTdugabaGaeGOmaidaaaaa@3137@</m:annotation></m:semantics></m:math>) and are added to the two replicated measurements <m:math name="1471-2105-7-205-i8" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>y</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi><m:mn>1</m:mn><m:mi>l</m:mi></m:mrow><m:mo>*</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG5bqEdaqhaaWcbaGaemyAaKMaemOAaOMaeGymaeJaemiBaWgabaGaeiOkaOcaaaaa@3439@</m:annotation></m:semantics></m:math> and <m:math name="1471-2105-7-205-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>y</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi><m:mn>2</m:mn><m:mi>l</m:mi></m:mrow><m:mo>*</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG5bqEdaqhaaWcbaGaemyAaKMaemOAaOMaeGOmaiJaemiBaWgabaGaeiOkaOcaaaaa@343B@</m:annotation></m:semantics></m:math>. Each gene will be replicated <it>n</it><sub><it>rep </it></sub>times.</p>
         </sec>
         <sec>
            <st>
               <p>Background surface variation</p>
            </st>
            <p>The signal distribution over the surface of the DNA-microarray glass-slide is affected by various factors, e.g. slide surface chemistry and hybridization effects, leading to an irregular distribution of the labeled probe on the slide <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. The most simplistic way to model such gradient signals is by using a tilted plane, such that the signals on one side of the slide are higher than on the other. A more sophisticated method is that of Balagurunathan and coworkers (2002) <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> where a quadratic function is implemented. Such a function, however, is limited in the sense that it has only one local extreme (and some extremes located on the slide edges).</p>
            <p>We use a method that allows for a number of local extremes to exist. A number <it>n</it><sub><it>bg </it></sub>of bivariate normal densities with random parameter settings identifying the location and size on the slide are computed and subsequently multiplied by a random amplitude <it>I </it>(between &#177;1). Each density is computed by:</p>
            <p>
               <m:math name="1471-2105-7-205-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:msub>
                           <m:mi>f</m:mi>
                           <m:mi>m</m:mi>
                        </m:msub>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mrow>
                              <m:mi>i</m:mi>
                              <m:mo>,</m:mo>
                              <m:mi>j</m:mi>
                           </m:mrow>
                           <m:mo>)</m:mo>
                        </m:mrow>
                        <m:mo>=</m:mo>
                        <m:mfrac>
                           <m:mrow>
                              <m:msub>
                                 <m:mi>I</m:mi>
                                 <m:mi>m</m:mi>
                              </m:msub>
                           </m:mrow>
                           <m:mrow>
                              <m:mn>2</m:mn>
                              <m:mi>&#960;</m:mi>
                              <m:msubsup>
                                 <m:mi>&#963;</m:mi>
                                 <m:mrow>
                                    <m:mi>b</m:mi>
                                    <m:mi>g</m:mi>
                                    <m:mi>x</m:mi>
                                 </m:mrow>
                                 <m:mn>2</m:mn>
                              </m:msubsup>
                              <m:msubsup>
                                 <m:mi>&#963;</m:mi>
                                 <m:mrow>
                                    <m:mi>b</m:mi>
                                    <m:mi>g</m:mi>
                                    <m:mi>y</m:mi>
                                 </m:mrow>
                                 <m:mn>2</m:mn>
                              </m:msubsup>
                              <m:msqrt>
                                 <m:mrow>
                                    <m:mn>1</m:mn>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msubsup>
                                       <m:mi>&#961;</m:mi>
                                       <m:mrow>
                                          <m:mi>b</m:mi>
                                          <m:mi>g</m:mi>
                                       </m:mrow>
                                       <m:mn>2</m:mn>
                                    </m:msubsup>
                                 </m:mrow>
                              </m:msqrt>
                           </m:mrow>
                        </m:mfrac>
                        <m:msup>
                           <m:mi>e</m:mi>
                           <m:mrow>
                              <m:mrow>
                                 <m:mo>[</m:mo>
                                 <m:mrow>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mfrac>
                                       <m:mn>1</m:mn>
                                       <m:mrow>
                                          <m:mn>2</m:mn>
                                          <m:mrow>
                                             <m:mo>(</m:mo>
                                             <m:mrow>
                                                <m:mn>1</m:mn>
                                                <m:mo>&#8722;</m:mo>
                                                <m:msubsup>
                                                   <m:mi>&#961;</m:mi>
                                                   <m:mrow>
                                                      <m:mi>b</m:mi>
                                                      <m:mi>g</m:mi>
                                                   </m:mrow>
                                                   <m:mn>2</m:mn>
                                                </m:msubsup>
                                             </m:mrow>
                                             <m:mo>)</m:mo>
                                          </m:mrow>
                                       </m:mrow>
                                    </m:mfrac>
                                    <m:mrow>
                                       <m:mo>(</m:mo>
                                       <m:mrow>
                                          <m:mfrac>
                                             <m:mrow>
                                                <m:msup>
                                                   <m:mrow>
                                                      <m:mrow>
                                                         <m:mo>(</m:mo>
                                                         <m:mrow>
                                                            <m:mi>i</m:mi>
                                                            <m:mo>&#8722;</m:mo>
                                                            <m:msub>
                                                               <m:mi>&#956;</m:mi>
                                                               <m:mrow>
                                                                  <m:mi>b</m:mi>
                                                                  <m:mi>g</m:mi>
                                                                  <m:mo>,</m:mo>
                                                                  <m:mi>x</m:mi>
                                                               </m:mrow>
                                                            </m:msub>
                                                         </m:mrow>
                                                         <m:mo>)</m:mo>
                                                      </m:mrow>
                                                   </m:mrow>
                                                   <m:mn>2</m:mn>
                                                </m:msup>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:msubsup>
                                                   <m:mi>&#963;</m:mi>
                                                   <m:mrow>
                                                      <m:mi>b</m:mi>
                                                      <m:mi>g</m:mi>
                                                      <m:mo>,</m:mo>
                                                      <m:mi>x</m:mi>
                                                   </m:mrow>
                                                   <m:mn>2</m:mn>
                                                </m:msubsup>
                                             </m:mrow>
                                          </m:mfrac>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mn>2</m:mn>
                                          <m:mi>&#961;</m:mi>
                                          <m:mfrac>
                                             <m:mrow>
                                                <m:mrow>
                                                   <m:mo>(</m:mo>
                                                   <m:mrow>
                                                      <m:mi>i</m:mi>
                                                      <m:mo>&#8722;</m:mo>
                                                      <m:msub>
                                                         <m:mi>&#956;</m:mi>
                                                         <m:mrow>
                                                            <m:mi>b</m:mi>
                                                            <m:mi>g</m:mi>
                                                            <m:mo>,</m:mo>
                                                            <m:mi>x</m:mi>
                                                         </m:mrow>
                                                      </m:msub>
                                                   </m:mrow>
                                                   <m:mo>)</m:mo>
                                                </m:mrow>
                                                <m:mrow>
                                                   <m:mo>(</m:mo>
                                                   <m:mrow>
                                                      <m:mi>j</m:mi>
                                                      <m:mo>&#8722;</m:mo>
                                                      <m:msub>
                                                         <m:mi>&#956;</m:mi>
                                                         <m:mrow>
                                                            <m:mi>b</m:mi>
                                                            <m:mi>g</m:mi>
                                                            <m:mo>,</m:mo>
                                                            <m:mi>y</m:mi>
                                                         </m:mrow>
                                                      </m:msub>
                                                   </m:mrow>
                                                   <m:mo>)</m:mo>
                                                </m:mrow>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mi>&#963;</m:mi>
                                                   <m:mrow>
                                                      <m:mi>b</m:mi>
                                                      <m:mi>g</m:mi>
                                                      <m:mo>,</m:mo>
                                                      <m:mi>x</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                                <m:msub>
                                                   <m:mi>&#963;</m:mi>
                                                   <m:mrow>
                                                      <m:mi>b</m:mi>
                                                      <m:mi>g</m:mi>
                                                      <m:mo>,</m:mo>
                                                      <m:mi>y</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                             </m:mrow>
                                          </m:mfrac>
                                          <m:mo>+</m:mo>
                                          <m:mfrac>
                                             <m:mrow>
                                                <m:msup>
                                                   <m:mrow>
                                                      <m:mrow>
                                                         <m:mo>(</m:mo>
                                                         <m:mrow>
                                                            <m:mi>j</m:mi>
                                                            <m:mo>&#8722;</m:mo>
                                                            <m:msub>
                                                               <m:mi>&#956;</m:mi>
                                                               <m:mrow>
                                                                  <m:mi>b</m:mi>
                                                                  <m:mi>g</m:mi>
                                                                  <m:mo>,</m:mo>
                                                                  <m:mi>y</m:mi>
                                                               </m:mrow>
                                                            </m:msub>
                                                         </m:mrow>
                                                         <m:mo>)</m:mo>
                                                      </m:mrow>
                                                   </m:mrow>
                                                   <m:mn>2</m:mn>
                                                </m:msup>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:msubsup>
                                                   <m:mi>&#963;</m:mi>
                                                   <m:mrow>
                                                      <m:mi>b</m:mi>
                                                      <m:mi>g</m:mi>
                                                      <m:mo>,</m:mo>
                                                      <m:mi>y</m:mi>
                                                   </m:mrow>
                                                   <m:mn>2</m:mn>
                                                </m:msubsup>
                                             </m:mrow>
                                          </m:mfrac>
                                       </m:mrow>
                                       <m:mo>)</m:mo>
                                    </m:mrow>
                                 </m:mrow>
                                 <m:mo>]</m:mo>
                              </m:mrow>
                           </m:mrow>
                        </m:msup>
                        <m:mo>,</m:mo>
                        <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mn>2</m:mn>
                           <m:mo>)</m:mo>
                        </m:mrow>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGMbGzdaWgaaWcbaGaemyBa0gabeaakmaabmaabaGaemyAaKMaeiilaWIaemOAaOgacaGLOaGaayzkaaGaeyypa0ZaaSaaaeaacqWGjbqsdaWgaaWcbaGaemyBa0gabeaaaOqaaiabikdaYGGaciab=b8aWjab=n8aZnaaDaaaleaacqWGIbGycqWGNbWzcqWG4baEaeaacqaIYaGmaaGccqWFdpWCdaqhaaWcbaGaemOyaiMaem4zaCMaemyEaKhabaGaeGOmaidaaOWaaOaaaeaacqaIXaqmcqGHsislcqWFbpGCdaqhaaWcbaGaemOyaiMaem4zaCgabaGaeGOmaidaaaqabaaaaOGaemyzau2aaWbaaSqabeaadaWadaqaaiabgkHiTmaalaaabaGaeGymaedabaGaeGOmaiZaaeWaaeaacqaIXaqmcqGHsislcqWFbpGCdaqhaaadbaGaemOyaiMaem4zaCgabaGaeGOmaidaaaWccaGLOaGaayzkaaaaamaabmaabaWaaSaaaeaadaqadaqaaiabdMgaPjabgkHiTiab=X7aTnaaBaaameaacqWGIbGycqWGNbWzcqGGSaalcqWG4baEaeqaaaWccaGLOaGaayzkaaWaaWbaaWqabeaacqaIYaGmaaaaleaacqWFdpWCdaqhaaadbaGaemOyaiMaem4zaCMaeiilaWIaemiEaGhabaGaeGOmaidaaaaaliabgkHiTiabikdaYiab=f8aYnaalaaabaWaaeWaaeaacqWGPbqAcqGHsislcqWF8oqBdaWgaaadbaGaemOyaiMaem4zaCMaeiilaWIaemiEaGhabeaaaSGaayjkaiaawMcaamaabmaabaGaemOAaOMaeyOeI0Iae8hVd02aaSbaaWqaaiabdkgaIjabdEgaNjabcYcaSiabdMha5bqabaaaliaawIcacaGLPaaaaeaacqWFdpWCdaWgaaadbaGaemOyaiMaem4zaCMaeiilaWIaemiEaGhabeaaliab=n8aZnaaBaaameaacqWGIbGycqWGNbWzcqGGSaalcqWG5bqEaeqaaaaaliabgUcaRmaalaaabaWaaeWaaeaacqWGQbGAcqGHsislcqWF8oqBdaWgaaadbaGaemOyaiMaem4zaCMaeiilaWIaemyEaKhabeaaaSGaayjkaiaawMcaamaaCaaameqabaGaeGOmaidaaaWcbaGae83Wdm3aa0baaWqaaiabdkgaIjabdEgaNjabcYcaSiabdMha5bqaaiabikdaYaaaaaaaliaawIcacaGLPaaaaiaawUfacaGLDbaaaaGccqGGSaalcaWLjaGaaCzcamaabmaabaGaeGOmaidacaGLOaGaayzkaaaaaa@B5C9@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>where the parameters are provided by the user. These densities are summed, together with a linear gradient with a random tilt lower than <it>s</it>, to constitute the background surface signals (see Fig. <figr fid="F1">1B</figr>, top panel; see also the supplementary document). An additive modeling of the background effect is chosen over a multiplicative model. As to the reasons why, see the supplementary document.</p>
         </sec>
         <sec>
            <st>
               <p>Channel deviations, gene-dye interactions and spot pin deviation</p>
            </st>
            <p>The rationale behind the modeling of these three layers is the same: for each of the layer values a deviation, drawn from a normal distribution, is added to (a subset of) the Cy3 / Cy5 signals.</p>
            <p>In some actual DNA-microarray experiments, the average Cy3 signal is significantly different from the average Cy5 signal obtained for a spot; reasons are: gene-to-gene variation in dye incorporating efficiencies, global differences in Cy3 and Cy5 abundance in the DNA preparations, and bleaching of the dyes (Cy5 is inactivated faster than Cy3). This effect is incorporated in the model by a random deviation <it>C</it><sub>1 </sub>for <it>N</it>(0,<m:math name="1471-2105-7-205-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mrow><m:mi>c</m:mi><m:mi>h</m:mi><m:mi>a</m:mi><m:mi>n</m:mi><m:mi>n</m:mi><m:mi>e</m:mi><m:mi>l</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaem4yamMaemiAaGMaemyyaeMaemOBa4MaemOBa4MaemyzauMaemiBaWgabaGaeGOmaidaaaaa@3906@</m:annotation></m:semantics></m:math>) to all Cy3-measurements and another random deviation <it>C</it><sub>2 </sub>from this distribution to all Cy5-measurements.</p>
            <p>Differential incorporation of dyes in DNA occurs due to (i) the preference of reverse transcriptase for the physically smaller Cy3 label in case of direct labeling and (ii) differential hybridization because of the physically larger Cy5 label. The effects differ between genes as a consequence of DNA size and the distribution of the labeled nucleotides in the DNA molecule <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. Gene-dye interactions are embedded in the model similarly to the channel effect: for all measurements belonging to gene <it>g </it>and dye <it>d </it>a random deviation <it>X</it><sub><it>gk </it></sub>from <it>N</it>(0,<m:math name="1471-2105-7-205-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mrow><m:mi>g</m:mi><m:mo>&#215;</m:mo><m:mi>d</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaem4zaCMaey41aqRaemizaqgabaGaeGOmaidaaaaa@3454@</m:annotation></m:semantics></m:math>) is added to the signal.</p>
            <p>Spot pins show systematic deviations in the amount of probe delivered to the glass surface. The signals measured of targets spotted with one spot pin might differ significantly from those spotted with another spot pin. The <it>n</it><sub><it>pin </it></sub>drawings <it>S</it><sub><it>pin</it>(<it>ij</it>) </sub>from <it>N</it>(0,<m:math name="1471-2105-7-205-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mrow><m:mi>p</m:mi><m:mi>i</m:mi><m:mi>n</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemiCaaNaemyAaKMaemOBa4gabaGaeGOmaidaaaaa@33BE@</m:annotation></m:semantics></m:math>) are used to model these effects. Spot pin effects are equal for both channels.</p>
         </sec>
         <sec>
            <st>
               <p>Non-linearity</p>
            </st>
            <p>(<it>M</it>, <it>A</it>) plots <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> from experimentally obtained DNA-microarray data often look somewhat curved (e.g. "banana shaped") and / or tilted, generally with higher deviations for lower <it>A </it>values (see "quantization and saturation" below). The <it>SIMAGE </it>model allows for non-linearity using a transformation <it>f</it><sub><it>nl </it></sub>based on two parameters: <it>&#945;</it><sub>1</sub>, specifying the maximum amount of curvature to be allowed, and <it>&#945;</it><sub>2</sub>, which specifies the maximum amount of linear tilt. Although the transformation is relatively simple, the resulting data mimics experimental data closely enough with respect to non-linearity. The exact definition of the modeling of this layer and some examples of the resulting curvature is presented in the supplementary document.</p>
         </sec>
         <sec>
            <st>
               <p>Fishtails</p>
            </st>
            <p>A common artifact of DNA-microarray data is that genes with a lower expression tend to be noisier. This is apparent in the 'fishtail' appearance of (<it>M</it>, <it>A</it>) graphs. The main reason for this artifact is mainly that spots with a lower signal are quantified with larger errors than spots with higher signals <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Another reason could be the 'over-transformation' of data due to the log-transformation, and that a less influential power transformation would be more appropriate to obtain normally distributed gene expressions. It would, however, be extremely complicated to use another transformation, since the property of logarithms that it turns multiplicative effects into additive effects is used throughout the complete <it>SIMAGE </it>model. We have to content ourselves with this approximation.</p>
            <p>The fishtailing behavior is incorporated in <it>SIMAGE </it>by a parameter <it>&#948; </it>that inflates the log ratio for spots with an average expression <it>A</it><sub><it>ijl </it></sub>below <it>&#956; </it>by a factor <m:math name="1471-2105-7-205-i14" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mrow><m:mo>(</m:mo><m:mrow><m:mn>1</m:mn><m:mo>+</m:mo><m:mi>&#948;</m:mi><m:msubsup><m:mi>&#963;</m:mi><m:mi>M</m:mi><m:mrow><m:mo>&#8722;</m:mo><m:mn>2</m:mn></m:mrow></m:msubsup><m:msup><m:mrow><m:mrow><m:mo>(</m:mo><m:mrow><m:msub><m:mi>A</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi><m:mi>l</m:mi></m:mrow></m:msub><m:mo>&#8722;</m:mo><m:mi>&#956;</m:mi></m:mrow><m:mo>)</m:mo></m:mrow></m:mrow><m:mn>2</m:mn></m:msup></m:mrow><m:mo>)</m:mo></m:mrow></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqadaqaaiabigdaXiabgUcaRGGaciab=r7aKjab=n8aZnaaDaaaleaacqWGnbqtaeaacqGHsislcqaIYaGmaaGcdaqadaqaaiabdgeabnaaBaaaleaacqWGPbqAcqWGQbGAcqWGSbaBaeqaaOGaeyOeI0Iae8hVd0gacaGLOaGaayzkaaWaaWbaaSqabeaacqaIYaGmaaaakiaawIcacaGLPaaaaaa@4154@</m:annotation></m:semantics></m:math>. A higher value of <it>&#948; </it>will result in a stronger effect and <it>&#948; </it>= 0 indicates no fishtail effect. The parameter <m:math name="1471-2105-7-205-i15" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>M</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemyta0eabaGaeGOmaidaaaaa@30B8@</m:annotation></m:semantics></m:math> will be introduced below ("gene expressions"). Our model mimics biological replicates by randomizing the effect of <it>&#948; </it>for non-regulated genes. For each slide these genes have a probability of 1/2 of being influenced by fishtailing. In this way, the expressions of non-regulated genes are seldom distributed in such a way that they appear differentially regulated in a simulated experiment with replicated slides.</p>
         </sec>
         <sec>
            <st>
               <p>Quantization and saturation</p>
            </st>
            <p>Slide scanning introduces another non-linearity artifact. The scanning device uses 16 bits to describe signal strength, i.e. signals are always between log(2<sup>0</sup>) = 0 and log(2<sup>16</sup>) = 16. Furthermore, some DNA-microarray data scanners tend to over-measure low values, and under-measure high values <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. When no over- or under-measurement occurs, the signal transformation <it>l</it><sub>0</sub>(<it>y</it>) = min(max(0,y),16) on <it>y </it>will be used to incorporate the cutoff at log(2<sup>0</sup>) = 0 and log(2<sup>16</sup>) = 16. A maximal over- or under-measurement effect will result in the signal transformation <m:math name="1471-2105-7-205-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>l</m:mi><m:mn>1</m:mn></m:msub><m:mrow><m:mo>(</m:mo><m:mi>y</m:mi><m:mo>)</m:mo></m:mrow><m:mo>=</m:mo><m:mn>16</m:mn><m:mo>/</m:mo><m:mrow><m:mo>(</m:mo><m:mrow><m:mn>1</m:mn><m:mo>+</m:mo><m:mi>exp</m:mi><m:mo>&#8289;</m:mo><m:mrow><m:mo>(</m:mo><m:mrow><m:mn>2</m:mn><m:mo>&#8722;</m:mo><m:mstyle scriptlevel="+1"><m:mfrac><m:mi>y</m:mi><m:mn>4</m:mn></m:mfrac></m:mstyle></m:mrow><m:mo>)</m:mo></m:mrow></m:mrow><m:mo>)</m:mo></m:mrow></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGSbaBdaWgaaWcbaGaeGymaedabeaakmaabmaabaGaemyEaKhacaGLOaGaayzkaaGaeyypa0JaeGymaeJaeGOnayJaei4la8YaaeWaaeaacqaIXaqmcqGHRaWkcyGGLbqzcqGG4baEcqGGWbaCdaqadaqaaiabikdaYiabgkHiTmaaleaaleaacqWG5bqEaeaacqaI0aanaaaakiaawIcacaGLPaaaaiaawIcacaGLPaaaaaa@439B@</m:annotation></m:semantics></m:math>. This transformation inflates low signals, deflates high signals, and leaves medium signals unchanged.</p>
            <p>Via a parameter <it>w </it>(<it>w </it>between 0 and 1) the <it>SIMAGE </it>model is instructed on the severity of the bias. The transformation <it>g</it><sub><it>nl </it></sub>used is a weighted average of <it>l</it><sub>0 </sub>and <it>l</it><sub>1</sub>, namely <it>g</it><sub><it>nl </it></sub>= (1 - <it>w</it>)<it>l</it><sub>0</sub>(<it>y</it>) + <it>wl</it><sub>1</sub>(<it>y</it>). The supplementary document provides a visualization of the effect of different choices of <it>w</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Missing data</p>
            </st>
            <p>In actual DNA-microarray data measurements may be missing. Reasons are: slide surface imperfections, hybridization effects, no DNA delivered to the glass surface during the spotting process or presence of fibers or dust particles. Our simulation model implements three types of missing data (with signals set to 0): (i) line-segments mimic 'hairs', (ii) 'donuts' comprising multiple spots mimic dirty areas and (iii) missing spots. The implementation of the model allows specifying the number of occurrences of these types of missing data as well as their maximum size and shape.</p>
         </sec>
         <sec>
            <st>
               <p>The <it>SIMAGE </it>model</p>
            </st>
            <p>In the <it>SIMAGE </it>model 29 parameters have to be specified (Table <tblr tid="T2">2</tblr>), of which 6 are known constants (e.g. the number of spots in a grid). The model, which results from the components described above, is defined as follows: the simulated log expression at spot (<it>i, j</it>) of array <it>l </it>and channel <it>k </it>is denoted by <it>y</it><sub><it>ijkl </it></sub>as:</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Overview of parameters in the <it>SIMAGE </it>model. Some parameters are known (such as number of spots per grid), others should be set by the user (the flag indicates when the parameter can also be estimated from the data). Details concerning these parameters are discussed in the implementation section.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>Parameter</p>
                     </c>
                     <c ca="left">
                        <p>Description</p>
                     </c>
                     <c ca="left">
                        <p>Can be estimated</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>n</it>
                           <sub>
                              <it>row</it>
                           </sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Number of grids per row</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>n</it>
                           <sub>
                              <it>col</it>
                           </sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Number of grids per column</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <m:math name="1471-2105-7-205-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                              <m:semantics>
                                 <m:mrow>
                                    <m:msubsup>
                                       <m:mi>n</m:mi>
                                       <m:mrow>
                                          <m:mi>s</m:mi>
                                          <m:mi>p</m:mi>
                                          <m:mi>o</m:mi>
                                          <m:mi>t</m:mi>
                                       </m:mrow>
                                       <m:mn>2</m:mn>
                                    </m:msubsup>
                                 </m:mrow>
                                 <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGUbGBdaqhaaWcbaGaem4CamNaemiCaaNaem4Ba8MaemiDaqhabaGaeGOmaidaaaaa@34E0@</m:annotation>
                              </m:semantics>
                           </m:math>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Number of spots per grid</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>n</it>
                           <sub>
                              <it>slide</it>
                           </sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Number of slides</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>n</it>
                           <sub>
                              <it>rep</it>
                           </sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Number of technical replications</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>n</it>
                           <sub>
                              <it>pin</it>
                           </sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Number of spot pins</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>&#956;</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Mean expression signal</p>
                     </c>
                     <c ca="left">
                        <p>v</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>&#956;</it>
                           <sub>-, </sub>
                           <it>&#956;</it>
                           <sub>+</sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Logratio-shift due to down- and upregulation</p>
                     </c>
                     <c ca="left">
                        <p>v</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>&#960;</it><sub>-</sub>, <it>&#960;</it><sub>+</sub></p>
                     </c>
                     <c ca="left">
                        <p>Proportion of down-, up-regulated genes</p>
                     </c>
                     <c ca="left">
                        <p>v</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <m:math name="1471-2105-7-205-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
                              <m:semantics>
                                 <m:mrow>
                                    <m:msubsup>
                                       <m:mi>&#963;</m:mi>
                                       <m:mi>G</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msubsup>
                                 </m:mrow>
                                 <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaem4raCeabaGaeGOmaidaaaaa@30AC@</m:annotation>
                              </m:semantics>
                           </m:math>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Variation in gene expression</p>
                     </c>
                     <c ca="left">
                        <p>v</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>&#961;</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Correlation between Cy3 and Cy5 expression</p>
                     </c>
                     <c ca="left">
                        <p>v</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <m:math name="1471-2105-7-205-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
                              <m:semantics>
                                 <m:mrow>
                                    <m:msubsup>
                                       <m:mi>&#963;</m:mi>
                                       <m:mi>&#949;</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msubsup>
                                 </m:mrow>
                                 <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGae8xTdugabaGaeGOmaidaaaaa@3137@</m:annotation>
                              </m:semantics>
                           </m:math>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Replication variation</p>
                     </c>
                     <c ca="left">
                        <p>v</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>n</it>
                           <sub>
                              <it>bg</it>
                           </sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Number of background 'densities'</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>&#963;</it>
                           <sub>
                              <it>bg</it>
                           </sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Mean standard deviation per background density</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>s</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Maximum slope of the linear tilt</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <m:math name="1471-2105-7-205-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
                              <m:semantics>
                                 <m:mrow>
                                    <m:msubsup>
                                       <m:mi>&#963;</m:mi>
                                       <m:mrow>
                                          <m:mi>c</m:mi>
                                          <m:mi>h</m:mi>
                                          <m:mi>a</m:mi>
                                          <m:mi>n</m:mi>
                                          <m:mi>n</m:mi>
                                          <m:mi>e</m:mi>
                                          <m:mi>l</m:mi>
                                       </m:mrow>
                                       <m:mn>2</m:mn>
                                    </m:msubsup>
                                 </m:mrow>
                                 <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaem4yamMaemiAaGMaemyyaeMaemOBa4MaemOBa4MaemyzauMaemiBaWgabaGaeGOmaidaaaaa@3906@</m:annotation>
                              </m:semantics>
                           </m:math>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Channel variation</p>
                     </c>
                     <c ca="left">
                        <p>v</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <m:math name="1471-2105-7-205-i13" xmlns:m="http://www.w3.org/1998/Math/MathML">
                              <m:semantics>
                                 <m:mrow>
                                    <m:msubsup>
                                       <m:mi>&#963;</m:mi>
                                       <m:mrow>
                                          <m:mi>p</m:mi>
                                          <m:mi>i</m:mi>
                                          <m:mi>n</m:mi>
                                       </m:mrow>
                                       <m:mn>2</m:mn>
                                    </m:msubsup>
                                 </m:mrow>
                                 <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemiCaaNaemyAaKMaemOBa4gabaGaeGOmaidaaaaa@33BE@</m:annotation>
                              </m:semantics>
                           </m:math>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Spot pin variation</p>
                     </c>
                     <c ca="left">
                        <p>v</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <m:math name="1471-2105-7-205-i17" xmlns:m="http://www.w3.org/1998/Math/MathML">
                              <m:semantics>
                                 <m:mrow>
                                    <m:msubsup>
                                       <m:mi>&#963;</m:mi>
                                       <m:mi>X</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msubsup>
                                 </m:mrow>
                                 <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemiwaGfabaGaeGOmaidaaaaa@30CE@</m:annotation>
                              </m:semantics>
                           </m:math>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Gene &#215; dye variation</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>&#945;</it>
                           <sub>1</sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Non-linearity parameter 'curvature'</p>
                     </c>
                     <c ca="left">
                        <p>v</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>&#945;</it>
                           <sub>2</sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Non-linearity parameter 'tilt'</p>
                     </c>
                     <c ca="left">
                        <p>v</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>&#948;</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Fishtailing parameter</p>
                     </c>
                     <c ca="left">
                        <p>v</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>w</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Scanning device bias</p>
                     </c>
                     <c ca="left">
                        <p>v</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>n</it><sub><it>h</it></sub>, <it>n</it><sub><it>d</it></sub>, <it>n</it><sub><it>s</it></sub></p>
                     </c>
                     <c ca="left">
                        <p>Maximum number of hairs, donuts and missing spots</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>l</it><sub><it>h</it></sub>, <it>l</it><sub><it>d</it></sub></p>
                     </c>
                     <c ca="left">
                        <p>Maximum length of hairs and radius of donuts</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>
               <m:math name="1471-2105-7-205-i18" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mtable columnalign="left">
                        <m:mtr>
                           <m:mtd>
                              <m:msub>
                                 <m:mi>y</m:mi>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mi>j</m:mi>
                                    <m:mi>k</m:mi>
                                    <m:mi>l</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>=</m:mo>
                              <m:msub>
                                 <m:mi>g</m:mi>
                                 <m:mrow>
                                    <m:mi>n</m:mi>
                                    <m:mi>l</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mrow>
                                 <m:mo>(</m:mo>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>t</m:mi>
                                       <m:mi>&#948;</m:mi>
                                    </m:msub>
                                    <m:mrow>
                                       <m:mo>(</m:mo>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>f</m:mi>
                                             <m:mrow>
                                                <m:mi>n</m:mi>
                                                <m:mi>l</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mrow>
                                             <m:mo>(</m:mo>
                                             <m:mrow>
                                                <m:mi>b</m:mi>
                                                <m:msub>
                                                   <m:mi>g</m:mi>
                                                   <m:mrow>
                                                      <m:mi>i</m:mi>
                                                      <m:mi>j</m:mi>
                                                      <m:mi>l</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                                <m:mo>+</m:mo>
                                                <m:msub>
                                                   <m:mi>z</m:mi>
                                                   <m:mrow>
                                                      <m:mi>i</m:mi>
                                                      <m:mi>j</m:mi>
                                                      <m:mi>k</m:mi>
                                                      <m:mi>l</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                             </m:mrow>
                                             <m:mo>)</m:mo>
                                          </m:mrow>
                                       </m:mrow>
                                       <m:mo>)</m:mo>
                                    </m:mrow>
                                 </m:mrow>
                                 <m:mo>)</m:mo>
                              </m:mrow>
                              <m:mo>&#8901;</m:mo>
                              <m:msub>
                                 <m:mi>m</m:mi>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mi>j</m:mi>
                                    <m:mi>l</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:mtext>&#160;with</m:mtext>
                           </m:mtd>
                        </m:mtr>
                        <m:mtr>
                           <m:mtd>
                              <m:msub>
                                 <m:mi>z</m:mi>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mi>j</m:mi>
                                    <m:mi>k</m:mi>
                                    <m:mi>l</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>=</m:mo>
                              <m:msub>
                                 <m:mi>G</m:mi>
                                 <m:mrow>
                                    <m:mi>g</m:mi>
                                    <m:mi>e</m:mi>
                                    <m:mi>n</m:mi>
                                    <m:mi>e</m:mi>
                                    <m:mrow>
                                       <m:mo>(</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mi>j</m:mi>
                                          <m:mi>l</m:mi>
                                       </m:mrow>
                                       <m:mo>)</m:mo>
                                    </m:mrow>
                                    <m:mo>;</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>+</m:mo>
                              <m:msub>
                                 <m:mi>D</m:mi>
                                 <m:mrow>
                                    <m:mi>g</m:mi>
                                    <m:mi>e</m:mi>
                                    <m:mi>n</m:mi>
                                    <m:mi>e</m:mi>
                                    <m:mrow>
                                       <m:mo>(</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mi>j</m:mi>
                                          <m:mi>l</m:mi>
                                       </m:mrow>
                                       <m:mo>)</m:mo>
                                    </m:mrow>
                                    <m:mo>;</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>+</m:mo>
                              <m:msub>
                                 <m:mi>C</m:mi>
                                 <m:mi>k</m:mi>
                              </m:msub>
                              <m:mo>+</m:mo>
                              <m:msub>
                                 <m:mi>S</m:mi>
                                 <m:mrow>
                                    <m:mi>p</m:mi>
                                    <m:mi>i</m:mi>
                                    <m:mi>n</m:mi>
                                    <m:mrow>
                                       <m:mo>(</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mi>j</m:mi>
                                       </m:mrow>
                                       <m:mo>)</m:mo>
                                    </m:mrow>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>+</m:mo>
                              <m:msub>
                                 <m:mi>X</m:mi>
                                 <m:mrow>
                                    <m:mi>g</m:mi>
                                    <m:mi>e</m:mi>
                                    <m:mi>n</m:mi>
                                    <m:mi>e</m:mi>
                                    <m:mrow>
                                       <m:mo>(</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mi>j</m:mi>
                                          <m:mi>l</m:mi>
                                       </m:mrow>
                                       <m:mo>)</m:mo>
                                    </m:mrow>
                                    <m:mo>;</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>+</m:mo>
                              <m:msub>
                                 <m:mi>&#949;</m:mi>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mi>j</m:mi>
                                    <m:mi>k</m:mi>
                                    <m:mi>l</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                              <m:mrow>
                                 <m:mo>(</m:mo>
                                 <m:mn>3</m:mn>
                                 <m:mo>)</m:mo>
                              </m:mrow>
                           </m:mtd>
                        </m:mtr>
                     </m:mtable>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakqaabeqaaiabdMha5naaBaaaleaacqWGPbqAcqWGQbGAcqWGRbWAcqWGSbaBaeqaaOGaeyypa0Jaem4zaC2aaSbaaSqaaiabd6gaUjabdYgaSbqabaGcdaqadaqaaiabdsha0naaBaaaleaaiiGacqWF0oazaeqaaOWaaeWaaeaacqWGMbGzdaWgaaWcbaGaemOBa4MaemiBaWgabeaakmaabmaabaGaemOyaiMaem4zaC2aaSbaaSqaaiabdMgaPjabdQgaQjabdYgaSbqabaGccqGHRaWkcqWG6bGEdaWgaaWcbaGaemyAaKMaemOAaOMaem4AaSMaemiBaWgabeaaaOGaayjkaiaawMcaaaGaayjkaiaawMcaaaGaayjkaiaawMcaaiabgwSixlabd2gaTnaaBaaaleaacqWGPbqAcqWGQbGAcqWGSbaBaeqaaOGaeiilaWIaeeiiaaIaee4DaCNaeeyAaKMaeeiDaqNaeeiAaGgabaGaemOEaO3aaSbaaSqaaiabdMgaPjabdQgaQjabdUgaRjabdYgaSbqabaGccqGH9aqpcqWGhbWrdaWgaaWcbaGaem4zaCMaemyzauMaemOBa4Maemyzau2aaeWaaeaacqWGPbqAcqWGQbGAcqWGSbaBaiaawIcacaGLPaaacqGG7aWocqWGRbWAaeqaaOGaey4kaSIaemiraq0aaSbaaSqaaiabdEgaNjabdwgaLjabd6gaUjabdwgaLnaabmaabaGaemyAaKMaemOAaOMaemiBaWgacaGLOaGaayzkaaGaei4oaSJaem4AaSgabeaakiabgUcaRiabdoeadnaaBaaaleaacqWGRbWAaeqaaOGaey4kaSIaem4uam1aaSbaaSqaaiabdchaWjabdMgaPjabd6gaUnaabmaabaGaemyAaKMaemOAaOgacaGLOaGaayzkaaaabeaakiabgUcaRiabdIfaynaaBaaaleaacqWGNbWzcqWGLbqzcqWGUbGBcqWGLbqzdaqadaqaaiabdMgaPjabdQgaQjabdYgaSbGaayjkaiaawMcaaiabcUda7iabdUgaRbqabaGccqGHRaWkcqWF1oqzdaWgaaWcbaGaemyAaKMaemOAaOMaem4AaSMaemiBaWgabeaakiaaxMaacaWLjaWaaeWaaeaacqaIZaWmaiaawIcacaGLPaaaaaaa@B3F0@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>where</p>
            <p><it>gene</it>(<it>ijl</it>) the gene spotted at location (<it>i</it>, <it>j</it>,<it>l</it>) (see "dimensions and notation" above);</p>
            <p><it>pin</it>(<it>ij</it>) the spot pin used to spot location (<it>i</it>, <it>j</it>);</p>
            <p><it>g</it><sub><it>nl </it></sub>a transformation due to quantization and saturation;</p>
            <p><it>t</it><sub><it>&#948; </it></sub>a transformation due to 'fishtailing';</p>
            <p><it>f</it><sub><it>nl </it></sub>a transformation due to non-linearity in measurements;</p>
            <p><it>m</it><sub><it>ijkl </it></sub>equals 0 if the spot at location (<it>i</it>, <it>j</it>,<it>l</it>) is 'missing', and 1 otherwise (see "missing data" above);</p>
            <p><it>bg</it><sub><it>ijl </it></sub>the background gradient level (see "background variation" above);</p>
            <p><it>G</it><sub><it>g</it>;<it>k </it></sub>the expression level of gene <it>g </it>in channel <it>k</it>;</p>
            <p><it>D</it><sub><it>g</it>;<it>k </it></sub>the change in expression due to up-/down-regulation;</p>
            <p><it>C</it><sub><it>k </it></sub>the channel effect;</p>
            <p><it>S</it><sub><it>pin</it>(<it>ij</it>) </sub>the spot pin effect;</p>
            <p><it>X</it><sub><it>gene</it>(<it>ijl</it>);<it>k </it></sub>the gene &#215; dye interaction;</p>
            <p><it>&#949;</it><sub><it>ijkl </it></sub>the replication error.</p>
         </sec>
         <sec>
            <st>
               <p>Datasets used and estimation of the parameters</p>
            </st>
            <p>In order to estimate the parameters for a number of slides hybridized at the Department of Molecular Genetics (MolGen), DNA-microarray data from 47 <it>Lactococcus lactis </it>IL1403 slides from MolGen <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp> were used (supplementary Table T1). These datasets specify, for each spot: (i) the annotated gene names, (ii) the raw expression values for both channels, and (iii) an estimated background signal. Parameters were determined from the net intensity values of the slides (i.e. corrected for background signals).</p>
            <p>Besides MolGen slides, several slides from the public domain were analyzed. Datasets GDS69, GDS100, and GDS273 were obtained from the Gene Expression Omnibus from NCBI <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. The dataset MEXP-225 was obtained from EBI's ArrayExpress <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. In the supplementary web site <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, the exact slides used for the parameter estimations are listed.</p>
            <p>The <it>SIMAGE </it>web site provides a tool for the estimation steps described in the following sections. In order to obtain robust estimates for the slides, some slides yielding obviously outlying estimates, i.e. when <m:math name="1471-2105-7-205-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mrow><m:mi>c</m:mi><m:mi>h</m:mi><m:mi>a</m:mi><m:mi>n</m:mi><m:mi>n</m:mi><m:mi>e</m:mi><m:mi>l</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaem4yamMaemiAaGMaemyyaeMaemOBa4MaemOBa4MaemyzauMaemiBaWgabaGaeGOmaidaaaaa@3906@</m:annotation></m:semantics></m:math> was larger than <it>&#956;</it>, were discarded. Only those experiments were used for which at least 3 slides with acceptable estimates were obtained. The estimates of the MolGen "experiment" were determined by the median of the "experiment" slides, while the MolGen "validation" estimates were determined by the median of the estimates of the validation slide data <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> (supplementary Table T1). For each of the 5 experiments, DNA-microarray data obtained from the same slide batch were used to estimate the experiment-specific parameters (see Results). This was done to minimize slide batch-specific bias in parameter estimation. The median, rather than the mean, of the estimates for each parameter is used as input in <it>SIMAGE </it>simulation interface because of the nonsymmetrical distribution of the estimates.</p>
         </sec>
         <sec>
            <st>
               <p>Background gradient and densities</p>
            </st>
            <p>Spot background signals for the MolGen slides were determined as described <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Estimates for the number of background densities to be estimated, as well as their average standard deviations, were obtained by visually inspecting a number of experimentally obtained background densities. Furthermore, estimates for the average horizontal and vertical linear tilt were obtained by fitting a regression plane through the background signal distribution.</p>
         </sec>
         <sec>
            <st>
               <p>Non-linearity, quantization and saturation</p>
            </st>
            <p>A direct way to estimate the scanning device bias parameter <it>w </it>is not feasible since this parameter is confounded within the other variables in the <it>SIMAGE </it>model. One could assume absence of scanning device bias (<it>w </it>= 0). We used an alternative method to estimate this parameter (for more details see the supplementary document). The estimation of the non-linearity parameters <it>&#945;</it><sub>1 </sub>and <it>&#945;</it><sub>2 </sub>is done as follows: in the (<it>M</it>, <it>A</it>) plot for background-corrected data (hence, the net signals), a Lowess curve <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> is fitted. Then a second degree polynomial is fitted to this Lowess curve, and the parameters <it>&#945;</it><sub>1 </sub>and <it>&#945;</it><sub>2 </sub>are directly derived from the parameters of the polynomial (see the supplementary document for details).</p>
         </sec>
         <sec>
            <st>
               <p>Spot pin effects, channel effects, fishtails and replication variation</p>
            </st>
            <p>After subtracting the estimates for the background signals and the non-linearity effects from <it>y</it><sub><it>ijkl </it></sub>(the original signal) we obtain <it>z</it><sub><it>ijkl</it></sub>, where <it>z</it><sub><it>ijkl </it></sub>= <it>G</it><sub><it>gene</it>(<it>ijl</it>);<it>k </it></sub>+ <it>D</it><sub><it>gene</it>(<it>ijl</it>);<it>k </it></sub>+ <it>C</it><sub><it>k </it></sub>+ <it>S</it><sub><it>pin</it>(<it>ij</it>) </sub>+ <it>X</it><sub><it>gene</it>(<it>ijl</it>);<it>k </it></sub>+ <it>&#949;</it><sub><it>ijkl</it></sub>. The estimation of the gene layer factors <it>G </it>and <it>D </it>is discussed in the following paragraph. For all other factors, the estimation of their parameters is a matter of calculating a variance components model.</p>
            <p>The gene-dye interaction is strongest when direct labeling of RNA is applied. An analysis by ANOVA of indirectly-labeled RNA used in self hybridization slides <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> and dye-swapped replicated slides (results not shown) performed at MolGen showed that the gene-dye interaction effects did not differ significantly. Note that the factor <it>X</it><sub><it>gene</it>(<it>ijl</it>);<it>k </it></sub>can only reliably be estimated in case dye-swaps are performed with the same RNA.</p>
            <p>The fishtail parameter <it>&#948; </it>was estimated by fitting a quadratic curve through the data points (<it>A</it><sub><it>ijl</it></sub>,<it>|M</it><sub><it>ijl</it></sub><it>|</it>) with <it>A</it><sub><it>ijl </it></sub>&lt;<it>&#956; </it>(for details see the supplementary document).</p>
         </sec>
         <sec>
            <st>
               <p>Gene expressions</p>
            </st>
            <p>After correcting <it>z</it><sub><it>ijkl </it></sub>for all factors except <it>G </it>and <it>D</it>, the parameters for <it>G </it>and <it>D </it>are estimated as follows: define <m:math name="1471-2105-7-205-i19" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>Y</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>g</m:mi><m:mo>,</m:mo><m:mi>k</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGzbqwgaacamaaBaaaleaacqWGNbWzcqGGSaalcqWGRbWAaeqaaaaa@31B8@</m:annotation></m:semantics></m:math> as the average value of the <it>n</it><sub><it>rep</it></sub>replications <it>Y</it><sub><it>ijkl </it></sub>with <it>gene</it>(<it>ijl</it>) = <it>g </it>: <m:math name="1471-2105-7-205-i20" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>Y</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>g</m:mi><m:mo>;</m:mo><m:mi>k</m:mi></m:mrow></m:msub><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mrow><m:msub><m:mi>n</m:mi><m:mrow><m:mi>r</m:mi><m:mi>e</m:mi><m:mi>p</m:mi></m:mrow></m:msub></m:mrow></m:mfrac><m:mstyle displaystyle="true"><m:munder><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi><m:mi>l</m:mi><m:mo>|</m:mo><m:mi>g</m:mi><m:mi>e</m:mi><m:mi>n</m:mi><m:mi>e</m:mi><m:mrow><m:mo>(</m:mo><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi><m:mi>l</m:mi></m:mrow><m:mo>)</m:mo></m:mrow><m:mo>=</m:mo><m:mi>g</m:mi></m:mrow></m:munder><m:mrow><m:msub><m:mi>Y</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi><m:mi>l</m:mi></m:mrow></m:msub></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGzbqwgaacamaaBaaaleaacqWGNbWzcqGG7aWocqWGRbWAaeqaaOGaeyypa0ZaaSaaaeaacqaIXaqmaeaacqWGUbGBdaWgaaWcbaGaemOCaiNaemyzauMaemiCaahabeaaaaGcdaaeqbqaaiabdMfaznaaBaaaleaacqWGPbqAcqWGQbGAcqWGSbaBaeqaaaqaaiabdMgaPjabdQgaQjabdYgaSjabcYha8jabdEgaNjabdwgaLjabd6gaUjabdwgaLnaabmaabaGaemyAaKMaemOAaOMaemiBaWgacaGLOaGaayzkaaGaeyypa0Jaem4zaCgabeqdcqGHris5aaaa@543C@</m:annotation></m:semantics></m:math>. Given the distributional assumptions outlined in "gene expressions" above, standard statistical theory provides that both the average log ratio (<m:math name="1471-2105-7-205-i21" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>M</m:mi><m:mo>&#732;</m:mo></m:mover><m:mi>g</m:mi></m:msub><m:mo>=</m:mo><m:msub><m:mover accent="true"><m:mi>Y</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>g</m:mi><m:mo>;</m:mo><m:mn>1</m:mn></m:mrow></m:msub><m:mo>&#8722;</m:mo><m:msub><m:mover accent="true"><m:mi>Y</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>g</m:mi><m:mo>;</m:mo><m:mn>2</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGnbqtgaacamaaBaaaleaacqWGNbWzaeqaaOGaeyypa0JafmywaKLbaGaadaWgaaWcbaGaem4zaCMaei4oaSJaeGymaedabeaakiabgkHiTiqbdMfazzaaiaWaaSbaaSqaaiabdEgaNjabcUda7iabikdaYaqabaaaaa@3AE0@</m:annotation></m:semantics></m:math>) of the gene and the average intensity (<m:math name="1471-2105-7-205-i22" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>A</m:mi><m:mo>&#732;</m:mo></m:mover><m:mi>g</m:mi></m:msub><m:mo>=</m:mo><m:mstyle scriptlevel="+1"><m:mfrac><m:mn>1</m:mn><m:mn>2</m:mn></m:mfrac></m:mstyle><m:mrow><m:mo>(</m:mo><m:mrow><m:msub><m:mover accent="true"><m:mi>Y</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>g</m:mi><m:mo>;</m:mo><m:mn>1</m:mn></m:mrow></m:msub><m:mo>+</m:mo><m:msub><m:mover accent="true"><m:mi>Y</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>g</m:mi><m:mo>;</m:mo><m:mn>2</m:mn></m:mrow></m:msub></m:mrow><m:mo>)</m:mo></m:mrow></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGbbqqgaacamaaBaaaleaacqWGNbWzaeqaaOGaeyypa0ZaaSqaaSqaaiabigdaXaqaaiabikdaYaaakmaabmaabaGafmywaKLbaGaadaWgaaWcbaGaem4zaCMaei4oaSJaeGymaedabeaakiabgUcaRiqbdMfazzaaiaWaaSbaaSqaaiabdEgaNjabcUda7iabikdaYaqabaaakiaawIcacaGLPaaaaaa@3E58@</m:annotation></m:semantics></m:math>) of the gene are independently normally distributed, with variances <m:math name="1471-2105-7-205-i15" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>M</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemyta0eabaGaeGOmaidaaaaa@30B8@</m:annotation></m:semantics></m:math> and <m:math name="1471-2105-7-205-i23" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>A</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemyqaeeabaGaeGOmaidaaaaa@30A0@</m:annotation></m:semantics></m:math>, respectively. The distribution <m:math name="1471-2105-7-205-i24" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>A</m:mi><m:mo>&#732;</m:mo></m:mover><m:mi>g</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGbbqqgaacamaaBaaaleaacqWGNbWzaeqaaaaa@2F49@</m:annotation></m:semantics></m:math> does not depend on whether gene <it>g </it>is up-, down- or non-regulated, and the parameters <it>&#956; </it>and <m:math name="1471-2105-7-205-i23" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>A</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemyqaeeabaGaeGOmaidaaaaa@30A0@</m:annotation></m:semantics></m:math> involved, as well as the variance <m:math name="1471-2105-7-205-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>&#949;</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGae8xTdugabaGaeGOmaidaaaaa@3137@</m:annotation></m:semantics></m:math>, are estimated in a straightforward way. The mean of the distribution of <m:math name="1471-2105-7-205-i25" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>M</m:mi><m:mo>&#732;</m:mo></m:mover><m:mi>g</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGnbqtgaacamaaBaaaleaacqWGNbWzaeqaaaaa@2F61@</m:annotation></m:semantics></m:math> depends on whether the gene is up-, down- or non-regulated. The parameters <it>&#960;</it><sub>-</sub>, <it>&#960;</it><sub>+</sub>, <it>&#956;</it><sub><it>D </it></sub>and <m:math name="1471-2105-7-205-i15" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>M</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemyta0eabaGaeGOmaidaaaaa@30B8@</m:annotation></m:semantics></m:math> are estimated using the <it>EM</it>-algorithm <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> (Figure <figr fid="F2">2</figr>) with some restrictions (see the supplementary document). Note that for self-self hybridizations <it>&#960;</it><sub>- </sub>= <it>&#960;</it><sub>+ </sub>= 0 and the estimation of <m:math name="1471-2105-7-205-i15" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>M</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemyta0eabaGaeGOmaidaaaaa@30B8@</m:annotation></m:semantics></m:math> is straightforward (it is the 'ordinary' variance of the <it>M</it>-values). Furthermore, fishtailing does not affect the spots with above-average intensities (<it>A</it><sub><it>ijl </it></sub>> <it>&#956;</it>), nor the <it>A</it>-values. For all spots with average intensities higher than the overall average intensity, the <it>A</it>- and <it>M</it>-values are used to estimate parameters.</p>
            <p>The estimates for <m:math name="1471-2105-7-205-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>G</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaem4raCeabaGaeGOmaidaaaaa@30AC@</m:annotation></m:semantics></m:math> and <it>&#961; </it>(see "gene expressions" above), required for the <it>SIMAGE </it>model, are calculated from the estimates for <m:math name="1471-2105-7-205-i23" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>A</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemyqaeeabaGaeGOmaidaaaaa@30A0@</m:annotation></m:semantics></m:math> and <m:math name="1471-2105-7-205-i15" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>M</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemyta0eabaGaeGOmaidaaaaa@30B8@</m:annotation></m:semantics></m:math>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>To illustrate the use of <it>SIMAGE </it>in drawing meaningful conclusions about the design and analysis of DNA microarray experiments we show a number of examples based on DNA microarray data generated within the MolGen department. <it>SIMAGE </it>is used as follows: (i) define parameters based on real DNA microarray data that are laboratory / experiment-specific, (ii) these parameters are roughly estimated by our estimation procedure, (iii) using these estimated parameters DNA microarray data is simulated mimicking real DNA microarray data as close as possible.</p>
         <sec>
            <st>
               <p>The parameter estimation</p>
            </st>
            <p>On the basis of a number of experiments that were performed at the MolGen department, the "MolGen experiment" parameters, 100 slides were simulated using <it>SIMAGE</it>. For each simulated slide, the model parameters were estimated using the estimation web-interface. Deviations of the mean and median of the estimated parameters from the original parameters are shown in Table <tblr tid="T3">3</tblr> and Figure <figr fid="F3">3</figr>.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Estimation of parameters from the simulation of 100 DNA-microarray slides. The mentioned deviations are the number of estimated standard-deviations that the estimated mean, respectively median, lie away from the true value of the parameter.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>Parameter</p>
                     </c>
                     <c ca="left">
                        <p>Deviation (mean)</p>
                     </c>
                     <c ca="left">
                        <p>Deviation (median)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>&#956;</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0.1</p>
                     </c>
                     <c ca="left">
                        <p>0.1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>&#963;</it>
                           <sub>channel</sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>-0.3</p>
                     </c>
                     <c ca="left">
                        <p>-0.5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>&#963;</it>
                           <sub>pin</sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>1.5</p>
                     </c>
                     <c ca="left">
                        <p>1.5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>&#963;</it>
                           <sup>2</sup>
                           <sub>gene</sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>-1.6</p>
                     </c>
                     <c ca="left">
                        <p>-1.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>&#961;</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>-1.6</p>
                     </c>
                     <c ca="left">
                        <p>-1.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>&#963;</it>
                           <sub>
                              <it>&#949;</it>
                           </sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>-0.7</p>
                     </c>
                     <c ca="left">
                        <p>-0.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>|<it>&#956;</it><sub>-</sub>|, <it>&#956;</it><sub>+</sub></p>
                     </c>
                     <c ca="left">
                        <p>-1.1</p>
                     </c>
                     <c ca="left">
                        <p>-1.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>&#960;</it><sub>-</sub>, <it>&#960;</it><sub>+</sub></p>
                     </c>
                     <c ca="left">
                        <p>1.0</p>
                     </c>
                     <c ca="left">
                        <p>0.7</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Distribution of the deviations of several of the model parameters estimated from 100 simulated DNA-microarray slides</p>
               </caption>
               <text>
                  <p><b>Distribution of the deviations of several of the model parameters estimated from 100 simulated DNA-microarray slides. </b>The deviation is calculated as (estimate - true value) / (standard deviation of 100 estimates).</p>
               </text>
               <graphic file="1471-2105-7-205-3"/>
            </fig>
            <p>The estimation of <it>&#956;</it>, <it>&#963;</it><sub>channel </sub>and <it>&#963;</it><sub><it>&#949; </it></sub>are good (Table <tblr tid="T3">3</tblr>). Parameter <it>&#963;</it><sub>pin </sub>is somewhat systematically overestimated (Fig. <figr fid="F3">3</figr>). The parameters <m:math name="1471-2105-7-205-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>G</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaem4raCeabaGaeGOmaidaaaaa@30AC@</m:annotation></m:semantics></m:math> and <it>&#961; </it>tend to be systematically mildly underestimated. This is likely because of the nonsymmetric estimation of <it>&#961; </it>(the true value of <it>&#961; </it>is usually close to 1), which influences <m:math name="1471-2105-7-205-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>G</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaem4raCeabaGaeGOmaidaaaaa@30AC@</m:annotation></m:semantics></m:math> which is calculated from the estimate of <it>&#961;</it>. The performance of the estimations of <it>&#948;</it>, <it>&#945;</it><sub>1</sub>, <it>&#945;</it><sub>2 </sub>and <it>w </it>depends highly on the actual values of these parameters. For low values of these parameters, they tend to be overestimated, while they are for high values underestimated. However, the estimation is rather good. As an indication, based on 100 slides with <it>&#948;</it>, <it>&#945;</it><sub>1</sub>, <it>&#945;</it><sub>2</sub>, and <it>w </it>all zero, the maxima of the 100 estimates for these parameters are 0.01, 0.004, 0.08, and 0.16 respectively.</p>
            <p>The quality of the estimations of <it>&#956;</it><sub>-</sub>, <it>&#956;</it><sub>+</sub>, <it>&#960;</it><sub>-</sub>, <it>&#960;</it><sub>+ </sub>is highly experiment-dependent. Table <tblr tid="T3">3</tblr> and Figure <figr fid="F3">3</figr> show results for the choice of <it>&#956;</it><sub>- </sub>= -1, <it>&#956;</it><sub>+ </sub>= 1, <it>&#960;</it><sub>- </sub>= <it>&#960;</it><sub>+ </sub>= 10%, hence when the proportions of regulated genes and their average shifts in log ratio are considerable. The theory of Dempster and coworkers (1977) <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> indicates that the estimates of <it>&#956;</it><sub>-</sub>, <it>&#956;</it><sub>+</sub>, <it>&#960;</it><sub>-</sub>, <it>&#960;</it><sub>+ </sub>are unbiased: provided the correctness of our underlying statistical assumptions, no other estimators perform better. This is in concordance with the deviations listed in Table <tblr tid="T3">3</tblr>. In some cases where the true values of <it>&#956;</it><sub>-</sub>, <it>&#956;</it><sub>+</sub>, <it>&#960;</it><sub>-</sub>, and <it>&#960;</it><sub>+ </sub>are small their estimates are rather poor (Fig. <figr fid="F3">3</figr>). The user of the web-interface is suggested to 'tweak' these gene-expression estimates somewhat, if necessary.</p>
         </sec>
         <sec>
            <st>
               <p>Within-laboratory experiment-dependency of estimated parameters</p>
            </st>
            <p>In order to investigate whether parameter estimates for different experiments from the same laboratory are significantly different, two profiles were generated: (i) "real" experiments <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp> and (ii) validation experiments <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. A Bonferonni-corrected Mann-Whitney test (<it>&#945; </it>= 5%) showed that only parameters <it>&#956; </it>and <it>&#961; </it>differ between the DNA-microarray data simulated using both profiles (supplementary Fig. S2, upper panel). Within the various "real" experiments, only the parameters describing regulation (<it>&#956;</it><sub>+</sub>, <it>&#960;</it><sub>+,</sub><it>&#956;</it><sub>-</sub>, and <it>&#960;</it><sub>-</sub>) differ significantly (supplementary Fig. S2, lower panel). Parameters concerned with technical aspects of the DNA microarray spotter and scanner are, as expected, not significantly different, since the same equipment in one laboratory was used. Future studies on datasets obtained in other laboratories may implicate other parameters than those described above.</p>
         </sec>
         <sec>
            <st>
               <p>Between-laboratory experiment-dependency of estimated parameters</p>
            </st>
            <p>The model parameters were estimated from five different datasets, obtained by different laboratories and querying the mRNA levels of different organisms. The CVs of the model parameters obtained for individual datasets are, in general, lower than those obtained for the combined experiments (Fig. <figr fid="F4">4</figr>). Parameters which differ strongly between the datasets (combined CV values are higher than the CVs of the individual experiment) are: average expression (<it>&#956;</it>), tail behavior (<it>&#948;</it>), gene variance (<it>&#963;</it><sup>2</sup><sub>gene</sub>), and the general error (<it>&#963;</it><sub><it>&#949;</it></sub>). For a few datasets the estimation of the parameters is quite "noisy", e.g. the CV for the non-linearity of scanner parameter (<it>w</it>) of the GDS69 dataset (Fig. <figr fid="F4">4</figr>). ANOVA shows that the estimated parameters, with the exception of the spot pin variation <it>&#963;</it><sub>pin </sub>and the covariance <it>&#961;</it>, are strongly experiment-dependent (Fig. <figr fid="F4">4</figr>). In particular the gene variance (<it>&#963;</it><sup>2</sup><sub>gene</sub>) is strongly influenced by the experiment (<it>p</it>-value of 10<sup>-21</sup>; Fig. <figr fid="F4">4</figr>).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Experiment-dependency of the parameters of the <it>SIMAGE </it>model</p>
               </caption>
               <text>
                  <p><b>Experiment-dependency of the parameters of the <it>SIMAGE </it>model. </b>The bar-graph shows the CVs ((standard deviation / average) &#215; 100%) of the parameters, estimated from the individual datasets. The resulting CV was determined from the average estimates for each of the parameters obtained from the experiments. The <it>p</it>-value obtained by ANOVA is displayed below the parameter symbols; <it>p</it>-values below 0.05 are considered to be significant.</p>
               </text>
               <graphic file="1471-2105-7-205-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Graphical features of the simulated data</p>
            </st>
            <p>To ensure that the simulated data contains similar properties as experimentally obtained data, several aspects of the simulated slides were inspected. In supplementary Figure S3 three simulated and three "real" slides are visualized via an (<it>M</it>, <it>A</it>)-plot and by grid-based box plots. Several properties of the experimental slides are clearly present in a similar way in the simulated slides. Figure <figr fid="F5">5</figr> shows a graphical representation of the net and background spot signals of a simulated slide.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Visualization of the signals of a simulated slide</p>
               </caption>
               <text>
                  <p><b>Visualization of the signals of a simulated slide. </b>The upper picture shows a visualization of the measured expressions, while the lower picture is a visualization of the measured background signals. The areas designated as 'missing' are grey.</p>
               </text>
               <graphic file="1471-2105-7-205-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>The modeled differentially expressed genes</p>
            </st>
            <p><it>SIMAGE </it>allows modeling differentially expressed genes as is demonstrated in an <it>in silico </it>simulated experiment (Fig. <figr fid="F6">6</figr>). In almost all cases the <it>p</it>-values and ratios of the 66 modeled differentially expressed genes (the blue diamonds in Fig. <figr fid="F6">6</figr>) are significantly lower than those of the non-modeled genes (the red squares in Fig. <figr fid="F6">6</figr>). Relatively few of the known differentially expressed genes had signals that were close to the background signals: these genes get <it>p</it>-values close to 1. The inset in Figure <figr fid="F6">6</figr> clearly demonstrates the signal dependency of the <it>p</it>-values: differentially expressed genes with higher expression levels are assigned lower <it>p</it>-values. This is in concordance with the p-value distributions in "real" DNA-microarray datasets.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Distribution of <it>p</it>-values of a DNA-microarray experiment simulated by <it>SIMAGE</it></p>
               </caption>
               <text>
                  <p><b>Distribution of <it>p</it>-values of a DNA-microarray experiment simulated by <it>SIMAGE</it>. </b>Data for 2200 genes, in 6 slides with technical duplicates hybridized in dye-swaps, was simulated using the MolGen experiment profile (supplementary Table T1) with some changes: <it>&#960;</it><sub>- </sub>= 1% and <it>&#960;</it><sub>+ </sub>= 2%, <it>&#956;</it><sub>- </sub>= -2 and <it>&#956;</it><sub>+ </sub>= 2), <it>&#963;</it><sub><it>bg </it></sub>= 700, and <it>s </it>= 30 % &#215; <it>&#956;</it>. The main graph shows the resulting ratios after normalization plotted versus the <it>p</it>-value. The graph was simplified by removing genes with ratios between 2/3 and 3/2. The 66 genes for which differential expressions were modeled are depicted by blue diamonds. The remaining genes are depicted in purple squares. The small graph on the right demonstrates the reversed <it>p</it>-value dependency on the average signal for the 66 differentially expressed genes modeled. The average signal was calculated for each of the 66 genes over the maximum of 12 normalized measurements. Normalization was performed using Lowess normalization and differential expression tests were performed with the non-Bayesian Cyber-T implementation of a variant of the <it>t</it>-test [3]. The Cyber-T test provides the <it>p</it>-values, which indicate the probability that a given ratio is not differential caused by chance. Genes with less than 8 measurements were excluded from these tests and assigned a <it>p</it>-value of 1, in order to be able to present these genes in the graph.</p>
               </text>
               <graphic file="1471-2105-7-205-6"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>As various factors, of both technical and biological nature, affect the quality of DNA-microarray data, it is essential to use a proper experiment design and sophisticated statistics and bioinformatics methods to deal with these variables. Since the factors involved, as well as their relative influence on data quality, vary between DNA-microarray laboratories and differ even between experiments executed in the same laboratory, the question as to which design and analysis method is best, cannot be answered in general terms. <it>SIMAGE</it>, a model and web-implementation to simulate gene expressions, requires the specification of up to 29 parameters. It allows simulating data that resemble experimental DNA-microarray data. To determine the relative contribution of the various parameters in DNA-microarray data is a knowledgeable task. A second web-implementation is provided to easily provide rough but reasonable estimates of most of these parameters from experimental DNA-microarray datasets. There is an important educational aspect about the simulation of DNA-microarray data, which is clearly illustrated in Figure <figr fid="F1">1B</figr>: it provides clear insights in the contribution of each background layer of the model to the measured signal.</p>
         <p>Almost all parameters in the model are strongly dependent on the experiment performed (Fig. <figr fid="F4">4</figr> and supplementary Fig. S2). This holds both for biological parameters in several different experiments from the same laboratory and for technical and biological parameters in experiments from different laboratories. The experiment-dependency of the <it>&#956; </it>estimations (Fig. <figr fid="F4">4</figr>) is obvious from the fact that the average signals in the prokaryote datasets are higher than those in the eukaryote datasets. This is due to the fact that prokaryotes generally express a larger complement of their genes. The experiment-dependency of the gene variance (<it>&#963;</it><sup>2</sup><sub>gene</sub>) might also be attributed to differences in gene expression in the different organisms interrogated in the DNA-microarray experiments discussed above. The latter is also clearly reflected by the high significance of the <it>&#963;</it><sup>2</sup><sub>gene </sub>parameter obtained by ANOVA.</p>
         <p>Any method for simulation of DNA microarray data can be questioned and criticized, mainly because there is neither established theory for the relation between expression (observed) and factors involved (some can be observed, others are hidden), nor for the statistical distribution of differential expression given by various causes across genes. Choices about the statistical aspects of the data need to be made when building a simulation model such as <it>SIMAGE</it>. Different choices would lead to (slightly) different models with other 'optimal parameter estimation methods'. We have compared the distribution of simulated data (under various circumstances) with that of experimentally obtained data and adapted our model to be able to mimic experimental data as much as possible.</p>
         <p>Creating a <it>SIMAGE</it>-like model to simulate data from other than dual-dye DNA microarray platforms might be interesting for future work. Our approach, using layers to model the factors involved, could be universally applied to simulate such data. This is, however, quite a task, since each type of DNA microarray platform has its own specific properties. Affymetrix, for instance, employs single-dye chips. The model of the synthesis of oligonucleotides on the chip surface and the different signals obtained for multiple probes would have to be quite sophisticated. Another interesting approach would be to expand <it>SIMAGE </it>to incorporate gene-regulatory interactions or genes involved in documented pathways. The use of simulated gene-regulatory networks would provide a powerful tool to estimate the efficiency of network reconstruction algorithms.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>A number of models for DNA microarray data simulation have recently been developed (Table <tblr tid="T1">1</tblr>). The question how to simulate DNA microarray data in the best way is not easily and straight-forwardly answered. There are many considerations that pose a researcher wanting to simulate DNA microarray data for difficult choices. We have developed <it>SIMAGE</it>: software that simulates dual-dye DNA microarray data. The model that we employ, although more advanced than existing models, is still a simplification of reality. To ensure that the simulated DNA microarray data mimics real data as close as possible the model is "fitted" onto "real" DNA microarray data.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p><b>Project name</b>: <it>SIMAGE</it></p>
         <p><b>Project home page</b>: <url>http://bioinformatics.biol.rug.nl/websoftware/simage</url></p>
         <p><b>Operating system(s)</b>: Runs on any JavaScript enabled web-browser.</p>
         <p><b>Programming languages</b>: PHP, Pascal, R, and shell scripting.</p>
         <p><b>Other requirements</b>: Additional files and figures are contained in the supplementary web site which is accessible from the above-mentioned web site.</p>
         <p><b>License</b>: The web-resource is freely accessible. Details concerning the conditions for using <it>SIMAGE </it>are available from the above-mentioned web site.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>SvH and CA conceived the <it>SIMAGE </it>concept and methodology. SvH programmed <it>SIMAGE </it>including the web-interface and CA programmed the estimator script. CA and SvH drafted the manuscript and generated the figures and tables. RJ, JK and OK critically read and revised the final manuscript. All authors read and approved the final manuscript.</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>The gene expression parameters</p>
            </caption>
            <text>
               <p><b>The gene expression parameters</b>. These parameters (Table 2) were estimated by using the <it>EM</it>-algorithm (see "gene expressions" in the implementation section). The vertical lines constitute a stem-plot of the data. The red, green and blue curves indicate down-, not-, and up-regulated genes, respectively. The black curve is a combination of the three curves and, hence, the distribution of the logratios.</p>
            </text>
            <graphic file="1471-2105-7-205-2"/>
         </fig>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We would like to thank Gerard te Meerman, Anne de Jong, and Paul Eilers for useful discussion and suggestions.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Reproducibility of oligonucleotide microarray transcriptome analyses - An interlaboratory comparison using chemostat cultures of Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Piper</snm>
                  <fnm>MDW</fnm>
               </au>
               <au>
                  <snm>ran-Lapujade</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bro</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Regenberg</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Knudsen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pronk</snm>
                  <fnm>JT</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2002</pubdate>
            <volume>277</volume>
            <fpage>37001</fpage>
            <lpage>37008</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12121991</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Analysis of variance components in gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Delongchamp</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Hsueh</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Sistare</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Desai</snm>
                  <fnm>VG</fnm>
               </au>
               <au>
                  <snm>Fuscoe</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>1436</fpage>
            <lpage>1446</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14962916</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>A generally applicable validation scheme for the assessment of factors involved in reproducibility and quality of DNA-microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Van Hijum</snm>
                  <fnm>SAFT</fnm>
               </au>
               <au>
                  <snm>De Jong</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Baerends</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Karsens</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Kramer</snm>
                  <fnm>NE</fnm>
               </au>
               <au>
                  <snm>Larsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Den Hengst</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Albers</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Kok</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kuipers</snm>
                  <fnm>OP</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>77</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1166551</pubid>
                  <pubid idtype="pmpid" link="fulltext">15907200</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Analysis of variance for gene expression microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Kerr</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Churchill</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2000</pubdate>
            <volume>7</volume>
            <fpage>819</fpage>
            <lpage>837</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11382364</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The SIMAGE web-site</p>
            </title>
            <pubdate>2006</pubdate>
            <url>http://bioinformatics.biol.rug.nl/websoftware/simage</url>
         </bibl>
         <bibl id="B6">
            <title>
               <p>The FreePascal homepage</p>
            </title>
            <pubdate>2006</pubdate>
            <url>http://www.freepascal.org</url>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The R project</p>
            </title>
            <pubdate>2006</pubdate>
            <url>http://www.r-project.org</url>
         </bibl>
         <bibl id="B8">
            <aug>
               <au>
                  <snm>Wit</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>McClure</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Statistics for Microarrays - Design, Analysis and Inference</source>
            <publisher>Hobroken NJ, John Wiley &amp; Sons</publisher>
            <edition>first</edition>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Empirical Bayes analysis of a microarray experiment</p>
            </title>
            <aug>
               <au>
                  <snm>Efron</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Storey</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tusher</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>J Am Stat Assoc</source>
            <pubdate>2001</pubdate>
            <volume>96</volume>
            <fpage>1151</fpage>
            <lpage>1160</lpage>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The curse of normalization</p>
            </title>
            <aug>
               <au>
                  <snm>Wolkenhouer</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Moeller-Levet</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sanchez-Cabo</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Comp Func Genomics</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>375</fpage>
            <lpage>379</lpage>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Gene-specific dye bias in microarray reference designs</p>
            </title>
            <aug>
               <au>
                  <snm>Dombkowski</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Thibodeau</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Starcevic</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Novak</snm>
                  <fnm>RF</fnm>
               </au>
            </aug>
            <source>FEBS Lett</source>
            <pubdate>2004</pubdate>
            <volume>560</volume>
            <fpage>120</fpage>
            <lpage>124</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14988009</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Evaluation of the gene-specific dye bias in cDNA microarray experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Martin-Magniette</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Aubert</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cabannes</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Daudin</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>1995</fpage>
            <lpage>2000</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15691855</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Normalization for cDNA microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Dudoit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Luu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Proc SPIE</source>
            <pubdate>2001</pubdate>
            <volume>4266</volume>
            <fpage>141</fpage>
            <lpage>152</lpage>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Statistical theory of quantization</p>
            </title>
            <aug>
               <au>
                  <snm>Widrow</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Koll&#225;r</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>IEEE Trans Instrum Meas</source>
            <pubdate>1996</pubdate>
            <volume>45</volume>
            <fpage>353</fpage>
            <lpage>361</lpage>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Saturation and Quantization Reduction in Microarray Experiments using Two Scans at Different Sensitivities</p>
            </title>
            <aug>
               <au>
                  <snm>Garc&#237;a de la Nava</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Van Hijum</snm>
                  <fnm>SAFT</fnm>
               </au>
               <au>
                  <snm>Trelles</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Stat Appl Gen Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>3</volume>
            <fpage>Article 11</fpage>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Transcriptional regulation of central amino acid metabolism in Lactococcus lactis</p>
            </title>
            <aug>
               <au>
                  <snm>Larsen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <publisher>the Netherlands, University of Groningen</publisher>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Nisin-resistance in Gram-positive bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Kramer</snm>
                  <fnm>NE</fnm>
               </au>
            </aug>
            <publisher>the Netherlands, University of Groningen</publisher>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The Lactococcus lactis CodY regulon: identification of a conserved cis-regulatory element</p>
            </title>
            <aug>
               <au>
                  <snm>Den Hengst</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Van Hijum</snm>
                  <fnm>SAFT</fnm>
               </au>
               <au>
                  <snm>Geurts</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Nauta</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kok</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kuipers</snm>
                  <fnm>OP</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2005</pubdate>
            <volume>280</volume>
            <fpage>34332</fpage>
            <lpage>34342</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16040604</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>The gene expression omnibus (GEO) from NCBI</p>
            </title>
            <pubdate>2006</pubdate>
            <url>http://www.ncbi.nlm.nih.gov/geo</url>
         </bibl>
         <bibl id="B20">
            <title>
               <p>EBI databases - ArrayExpress home</p>
            </title>
            <pubdate>2006</pubdate>
            <url>http://www.ebi.ac.uk/arrayexpress</url>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Dudoit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Luu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ngai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>e15</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">100354</pubid>
                  <pubid idtype="pmpid" link="fulltext">11842121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Maximum Likelihood from Incomplete Data Via EM Algorithm</p>
            </title>
            <aug>
               <au>
                  <snm>Dempster</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Laird</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>J R Stat Soc Ser B Methodol</source>
            <pubdate>1977</pubdate>
            <volume>39</volume>
            <fpage>1</fpage>
            <lpage>38</lpage>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Characterization, modeling, and simulation of mouse microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Lalush</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>Methods of Microarray Data Analysis III</source>
            <publisher>Boston, USA, Kluwer</publisher>
            <editor>Lin S and Johnson K</editor>
            <edition>first</edition>
            <pubdate>2003</pubdate>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Simulation of cDNA microarrays via a parameterized random signal model</p>
            </title>
            <aug>
               <au>
                  <snm>Balagurunathan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Dougherty</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>YD</fnm>
               </au>
               <au>
                  <snm>Bittner</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Trent</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>J Biomed Opt</source>
            <pubdate>2002</pubdate>
            <volume>7</volume>
            <fpage>507</fpage>
            <lpage>523</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12175304</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Replicated microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Lonnstedt</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Stat Sin</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>31</fpage>
            <lpage>46</lpage>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Simulation of DNA array hybridization experiments and evaluation of critical parameters during subsequent image and data analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Wierling</snm>
                  <fnm>CK</fnm>
               </au>
               <au>
                  <snm>Steinfath</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Elge</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Schulze-Kremer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Aanstad</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lehrach</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Herwig</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>29</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">138793</pubid>
                  <pubid idtype="pmpid" link="fulltext">12390683</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Gene expression data simulator</p>
            </title>
            <pubdate>2006</pubdate>
            <url>http://bioinformatics.upmc.edu/GE2/index.html</url>
         </bibl>
      </refgrp>
   </bm>
</art>
