<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2004-6-1-p2</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Deposited research article</dochead>
      <bibl>
         <title>
            <p>A tool for comparing different statistical methods on identifying differentially expressed genes</p>
         </title>
         <aug>
            <au id="A1" ce="yes">
               <snm>Fogel</snm>
               <fnm>Paul</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A2" ca="yes" ce="yes">
               <snm>Liu</snm>
               <fnm>Li</fnm>
               <insr iid="I2"/>
               <email>Li.Liu@aventis.com</email>
            </au>
            <au id="A3">
               <snm>Dumas</snm>
               <fnm>Bruno</fnm>
               <insr iid="I3"/>
            </au>
            <au id="A4">
               <snm>Ge</snm>
               <fnm>Nanxiang</fnm>
               <insr iid="I2"/>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Paul Fogel Consultant, 4 rue Le Goff, 75005 Paris, France</p>
            </ins>
            <ins id="I2">
               <p>Biometrics and Data Management, Sanofi-Aventis, Mail Stop B-203A, PO Box 6800, 1041 Route 202-206, Bridgewater, NJ 08873, USA</p>
            </ins>
            <ins id="I3">
               <p>Yeast Genomics, Functional Genomics, Sanofi-Aventis,13 Quai Jules Guesde, 94403 Vitry sur Seine Cedex, France</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2004</pubdate>
         <volume>6</volume>
         <issue>1</issue>
         <fpage>P2</fpage>
         <url>http://genomebiology.com/2004/6/1/P2</url>
         <note>This was the first version of this article to be made available publicly.</note>
         <xrefbib>
            <pubid idtype="doi">10.1186/gb-2004-6-1-p2</pubid>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>7</day>
               <month>12</month>
               <year>2004</year>
            </date>
         </rec>
         <pub>
            <date>
               <day>8</day>
               <month>12</month>
               <year>2004</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2004</year>
         <collab>BioMed Central Ltd</collab>
      </cpyrt>
      <shorttitle>
         <p>A tool for comparing different statistical methods on identifying differentially expressed genes</p>
      </shorttitle>
      <shortabs>
         <p>The authors have developed a procedure (a visualization method and associated variability conformation rate (VCR) criterion) that allows viewing and quantifying differences between method-dependent selections.
</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Many different statistical methods have been developed to deal with two group
comparison microarray experiments. Most often, a substantial number of genes may be selected or not, depending on which method was actually used. Practical guidance on the application of these methods is therefore required. We developed a procedure based on bootstrap and a criterion to allow viewing and quantifying differences between method-dependent selections. We applied this procedure on three datasets that cover a range of possible sample sizes to compare three well known methods, namely: t-test, LPE and SAM.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Our visualization method and associated <it>variability conformation rate</it> (VCR) criterion show that standard t-test is appropriate for large sample sizes to allow accurate variance estimates. LPE borrows strength from neighboring genes to estimate the variances and is therefore more appropriate for small sample sizes whenever gene variances are similar for similar gene intensity levels. SAM has both advantages of considering gene specific variance like t-test and adjusting multiple tests by permutation based false discovery rate. However, for small sample sizes and in cases of numerous expressed genes, the distribution based on permutated datasets may not approximate the null distribution well, resulting in an inaccurate false discovery rate. Moreover, genes with low variances may be filtered because of the fudge factor.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>We proposed using VCR to assess different statistical methods available for analyzing microarray data and developed a bootstrap method - on which our criterion is based - to estimate the 2-d distribution of treated vs. control gene intensity levels, under the null hypothesis that there is no difference between the treatment and control group. The biological evaluation of selected genes according to one or another method confirmed that this criterion is indeed appropriate to help identifying the most suitable method.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010013">Methods</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data files are provided with this article: Additional data file <supplr sid="s1">1</supplr>, depicting a table showing the overlap among different methods for the yeast data; Additional data file <supplr sid="s2">2</supplr>, showing additional Figure 1; Additional data file <supplr sid="s3">3</supplr>, showing additional Figure 2; Additional data file <supplr sid="s4">4</supplr>, showing additional Figure 3; Additional data file <supplr sid="s5">5</supplr>, showing additional Figure 4.</p>
         <suppl id="s1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>A table showing the overlap among different methods for the yeast data</p>
            </caption>
            <text>
               <p>A table showing the overlap among different methods for the yeast data</p>
            </text>
            <file name="gb-2004-6-1-p2-s1.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Additional figure 1</p>
            </caption>
            <text>
               <p>Additional figure 1</p>
            </text>
            <file name="gb-2004-6-1-p2-s2.png">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>Additional figure 2</p>
            </caption>
            <text>
               <p>Additional figure 2</p>
            </text>
            <file name="gb-2004-6-1-p2-s3.png">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>Additional figure 3</p>
            </caption>
            <text>
               <p>Additional figure 3</p>
            </text>
            <file name="gb-2004-6-1-p2-s4.png">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p>Additional figure 4</p>
            </caption>
            <text>
               <p>Additional figure 4</p>
            </text>
            <file name="gb-2004-6-1-p2-s5.png">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
</art>
