<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-10-5</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>Bayesian optimal discovery procedure for simultaneous significance testing</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Cao</snm>
               <fnm>Jing</fnm>
               <insr iid="I1"/>
               <email>jcao@smu.edu</email>
            </au>
            <au id="A2">
               <snm>Xie</snm>
               <fnm>Xian-Jin</fnm>
               <insr iid="I2"/>
               <email>xian-jin.xie@utsouthwestern.edu</email>
            </au>
            <au id="A3">
               <snm>Zhang</snm>
               <fnm>Song</fnm>
               <insr iid="I2"/>
               <email>song.zhang@utsouthwestern.edu</email>
            </au>
            <au id="A4">
               <snm>Whitehurst</snm>
               <fnm>Angelique</fnm>
               <insr iid="I3"/>
               <email>Angelique.Whitehurst@UTSouthwestern.edu</email>
            </au>
            <au id="A5">
               <snm>White</snm>
               <mi>A</mi>
               <fnm>Michael</fnm>
               <insr iid="I3"/>
               <email>michael.white@utsouthwestern.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Statistical Science, Southern Methodist University, Dallas, Texas, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Clinical Sciences, University of Texas Southwestern Medical Center, Dallas, Texas, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Cell Biology, University of Texas Southwestern Medical Center, Dallas, Texas, USA</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>1</issue>
         <fpage>5</fpage>
         <url>http://www.biomedcentral.com/1471-2105/10/5</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19126217</pubid>
               <pubid idtype="doi">10.1186/1471-2105-10-5</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>14</day>
               <month>7</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>06</day>
               <month>1</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>06</day>
               <month>1</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Cao et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>In high throughput screening, such as differential gene expression screening, drug sensitivity screening, and genome-wide RNAi screening, tens of thousands of tests need to be conducted simultaneously. However, the number of replicate measurements per test is extremely small, rarely exceeding 3. Several current approaches demonstrate that test statistics with shrunken variance estimates have more power than the traditional <it>t </it>statistic.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We propose a Bayesian hierarchical model to incorporate the shrinkage concept by introducing a mixture structure on variance components. The estimates from the Bayesian model are utilized in the optimal discovery procedure (ODP) proposed by Storey in 2007, which was shown to have optimal performance in multiple significance tests. We compared the performance of the Bayesian ODP with several competing test statistics.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>We have conducted simulation studies with 2 to 6 replicates per gene. We have also included test results from two real datasets. The Bayesian ODP outperforms the other methods in our study, including the original ODP. The advantage of the Bayesian ODP becomes more pronounced when there are few replicates per test. The improvement over the original ODP arises because the Bayesian model borrows strength across genes in estimating unknown parameters. The proposed approach is computationally efficient owing to the conjugate structure of the Bayesian model. The R code to calculate the Bayesian ODP is provided (see Additional file <supplr sid="S1">1</supplr>).</p>
               <suppl id="S1">
                  <title>
                     <p>Additional file 1</p>
                  </title>
                  <text>
                     <p><b>Bayesian ODP R code</b>. This file contains the R code to calculate the posterior probability from the Bayesian model and the Bayesian ODP.</p>
                  </text>
                  <file name="1471-2105-10-5-S1.txt">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>High throughput screening (HTS) is an experimental method that is widely used in drug discovery. It allows researchers to conduct thousands or millions of biochemical or genetic tests simultaneously. Microarray experimentation is a special case of HTS. While one microarray chip can be used to test thousands of genes simultaneously, 96-well plates are typically used in HTS, each well containing one compound. Hundreds or thousands of 96-well plates are needed to test all the compounds once. The number of replicates in HTS is often smaller than that in microarray experiments, rarely exceeding 3. Nevertheless, both settings must deal with the scenario where the number of statistical comparisons far exceeds the number of biological replicates. To connect with previous methods, we will use "hits" in HTS and "differentially expressed genes" in microarray experiments interchangeably.</p>
         <p>Many statistical methods have been developed to identify differentially expressed (DE) genes in microarray experiments. There are variants of Student's <it>t </it>test statistic that conduct a test on each individual biological entity and then correct for multiple comparisons. The problem is that, with a large number of tests and a small number of replicates, the statistics are very unstable. For example, a large difference in the measurements under different conditions might be driven by an outlier. Also, a large <it>t </it>statistic might arise because of a small variance, even with a small difference in the sample means.</p>
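         <p>The last point can be illustrated with a small numerical sketch. The measurements below are hypothetical log-scale values chosen for illustration only, and the helper name <it>pooled_t</it> is not taken from any package. The sketch computes the ordinary pooled-variance two-sample <it>t </it>statistic for two genes with three replicates per condition.</p>
         <p><![CDATA[
```python
import math
from statistics import mean, variance


def pooled_t(x, y):
    """Two-sample Student t statistic with a pooled variance estimate."""
    n0, n1 = len(x), len(y)
    pooled_var = ((n0 - 1) * variance(x) + (n1 - 1) * variance(y)) / (n0 + n1 - 2)
    return (mean(y) - mean(x)) / math.sqrt(pooled_var * (1.0 / n0 + 1.0 / n1))


# Hypothetical measurements, three replicates per condition.
# Gene A: mean difference of 0.05 with a very small sample variance.
tiny_shift = pooled_t([5.00, 5.01, 4.99], [5.05, 5.06, 5.04])
# Gene B: mean difference of 2.0 with a much larger sample variance.
large_shift = pooled_t([4.0, 5.0, 6.0], [6.0, 7.0, 8.0])

print(round(tiny_shift, 2), round(large_shift, 2))
```
]]></p>
         <p>In this toy example the gene with the 0.05 difference receives a <it>t </it>statistic of about 6.1, whereas the gene with the 2.0 difference receives about 2.4, so a ranking by <it>t </it>alone would favor the smaller effect.</p>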
         <p>Cui and Churchill <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> used the average of the gene-specific variance and the pooled variance to estimate the variance component. Several alternative statistics also modify the variance estimator. The SAM <it>t </it>statistic was proposed in <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, where a suitable constant is added to gene-specific variance estimates. A shrunken <it>t </it>statistic <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> was developed with a variance estimator that borrows information across genes using the James-Stein shrinkage idea. In James-Stein estimation, the shrinkage estimate is a linear combination of the original unbiased estimator (the sample variance in this case) and a target estimate, with weights chosen to minimize a certain loss function (e.g. the mean squared error). This procedure is computationally simple, yet produces efficient estimates. Also in the framework of James-Stein shrinkage, Opgen-Rhein and Strimmer <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> proposed a "shrinkage <it>t</it>" approach, which requires no distributional assumption. In general, these analytic shrinkage estimators show powerful and robust performance in testing DE genes.</p>
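         <p>The linear-combination form of a shrinkage variance estimate can be sketched as follows. This is only an illustration under stated assumptions: the target is taken to be the average of the gene-specific sample variances, and the shrinkage weight is fixed by hand, whereas the cited methods choose the target and the weight by minimizing a loss function. The function name <it>shrink_variances</it> and the data are hypothetical.</p>
         <p><![CDATA[
```python
from statistics import mean, variance


def shrink_variances(samples, weight):
    """Pull each gene-specific sample variance toward a common target.

    samples: one list of replicate measurements per gene.
    weight:  shrinkage intensity in [0, 1], fixed by hand in this sketch
             (an assumption; the cited estimators derive it from the data).
    """
    raw = [variance(s) for s in samples]
    target = mean(raw)  # assumed target: average of the sample variances
    return [weight * target + (1.0 - weight) * v for v in raw]


# Hypothetical replicate measurements for three genes.
genes = [[5.00, 5.01, 4.99], [4.0, 5.0, 6.0], [2.0, 2.5, 3.0]]
shrunk = shrink_variances(genes, weight=0.5)
print([round(v, 4) for v in shrunk])
```
]]></p>
         <p>With a weight of 0.5, the smallest sample variance (0.0001) is raised to roughly 0.21 and the largest (1.0) is lowered to roughly 0.71, which is what stabilizes the denominator of a <it>t</it>-type statistic when there are few replicates.</p>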
         <p>From the Bayesian perspective, the introduction of a prior distribution on gene-specific variance naturally implements the shrinkage idea. Baldi and Long <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> proposed the regularized <it>t </it>statistic to replace gene-specific variance with a Bayesian estimator based on a hierarchical model. Fox and Dimmic <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> extended Baldi and Long's approach by explicitly calculating the marginal posterior distribution for the difference in mean expression levels. Lonnstedt and Speed <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> proposed an empirical Bayes approach for replicated two-color microarray experiments. Smyth <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> extended the empirical Bayes approach for general microarray experiments. Sartor <it>et al</it>. <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> further extended Smyth's method by accounting for the dependence of variance on gene expression intensity. Kendziorski <it>et al</it>. <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> considered a hierarchical gamma-gamma model to test DE genes.</p>
         <p>Lonnstedt and Britton <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> proposed full Bayesian models and compared them to several widely used frequentist and empirical Bayes methods. They found that the full Bayesian models seem to have less power in selecting DE genes. This is because the frequentist test statistics and the empirical Bayes methods, which are similar in performance, put a stronger shrinkage on variance estimates. When the number of replicates is extremely small, the shrinkage becomes more useful in stabilizing the test statistics. In light of this study, we make a simple but important modification by adding a point mass component in the variance prior. It introduces adequate shrinkage in the estimation of variance components so that the full Bayesian model could have equivalent or greater power compared to those widely used differential expression methods.</p>
         <p>The Bayesian model can be combined with a frequentist method to further enhance performance. One of the most recent developments in this area is the optimal discovery procedure (ODP) proposed by Storey <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Unlike the conventional practice of calculating a test statistic for each individual gene and then adjusting for multiple comparisons, the ODP statistic is calculated from information across genes. The method has shown significant gains in power relative to a number of leading methods. To estimate the proportion of the true nulls, Storey used an ad hoc method which is based on ranking the tests by using a univariate statistic (e.g., a <it>t </it>statistic). He also used the gene-specific sample mean and sample variance to estimate the parameters in the hypothesized null and alternative distributions. In this paper, we propose to use the posterior probability of a gene being DE to estimate the set of true nulls. This removes the need to choose a cutoff to determine the null set, and the uncertainty in the estimation is accounted for in a probabilistic fashion. Furthermore, the sample mean and variance are replaced by the posterior mean and variance of the gene expression level. The Bayesian estimates can borrow strength across genes, so they may be more reliable than the sample mean and variance, which are computed separately for each gene. Our study shows that the Bayesian ODP offers a considerable improvement over the original ODP, especially when there are few replicates per gene.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>The Bayesian model</p>
            </st>
            <p>In this section, we build a full Bayesian hierarchical model, and then we construct the Bayesian ODP statistic to identify DE genes. Let <it>x</it><sub><it>ij </it></sub>be the expression measurement from the <it>i</it>th gene on the <it>j</it>th array under the control (<it>i </it>= 1,..., <it>n </it>and <it>j </it>= 1,..., <it>n</it><sub>0<it>i</it></sub>), and <it>y</it><sub><it>ik </it></sub>be the expression measurement from the <it>i</it>th gene on the <it>k</it>th array under the treatment (<it>k </it>= 1,..., <it>n</it><sub>1<it>i</it></sub>). The replicate numbers <it>n</it><sub>0<it>i </it></sub>and <it>n</it><sub>1<it>i </it></sub>can differ among genes and between conditions, which means that the Bayesian method can deal with missing values and unbalanced experimental designs. After a logarithm transformation (or some other transformation) of the original measurements, <it>x</it><sub><it>ij </it></sub>and <it>y</it><sub><it>ik </it></sub>are modeled by normal distributions. The first level of the Bayesian model is</p>
            <p>
               <display-formula>
                  <m:math name="1471-2105-10-5-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable columnalign="left">
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>x</m:mi>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mi>j</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>|</m:mo>
                                       <m:msub>
                                          <m:mi>&#956;</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msubsup>
                                          <m:mi>&#963;</m:mi>
                                          <m:mi>i</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd columnalign="left">
                                    <m:mo>~</m:mo>
                                 </m:mtd>
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mi>N</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>&#956;</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msubsup>
                                          <m:mi>&#963;</m:mi>
                                          <m:mi>i</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mi>i</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mo>&#8943;</m:mo>
                                       <m:mo>,</m:mo>
                                       <m:mi>n</m:mi>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mi>j</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mo>&#8943;</m:mo>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>n</m:mi>
                                          <m:mrow>
                                             <m:mn>0</m:mn>
                                             <m:mi>i</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>;</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>y</m:mi>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>|</m:mo>
                                       <m:msub>
                                          <m:mi>&#956;</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>&#916;</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msubsup>
                                          <m:mi>&#963;</m:mi>
                                          <m:mi>i</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd columnalign="left">
                                    <m:mo>~</m:mo>
                                 </m:mtd>
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mi>N</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>&#956;</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>+</m:mo>
                                       <m:msub>
                                          <m:mi>&#916;</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msubsup>
                                          <m:mi>&#963;</m:mi>
                                          <m:mi>i</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mi>i</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mo>&#8943;</m:mo>
                                       <m:mo>,</m:mo>
                                       <m:mi>n</m:mi>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mi>k</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mo>&#8943;</m:mo>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>n</m:mi>
                                          <m:mrow>
                                             <m:mn>1</m:mn>
                                             <m:mi>i</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeaabiqbaaaabaGaemiEaG3aaSbaaSqaaiabdMgaPjabdQgaQbqabaGccqGG8baFcqaH8oqBdaWgaaWcbaGaemyAaKgabeaakiabcYcaSiabeo8aZnaaDaaaleaacqWGPbqAaeaacqaIYaGmaaaakeaacqGG+bGFaeaacqWGobGtcqGGOaakcqaH8oqBdaWgaaWcbaGaemyAaKgabeaakiabcYcaSiabeo8aZnaaDaaaleaacqWGPbqAaeaacqaIYaGmaaGccqGGPaqkcqGGSaalaeaacqWGPbqAcqGH9aqpcqaIXaqmcqGGSaalcqWIVlctcqGGSaalcqWGUbGBcqGGSaalaeaacqWGQbGAcqGH9aqpcqaIXaqmcqGGSaalcqWIVlctcqGGSaalcqWGUbGBdaWgaaWcbaGaeGimaaJaemyAaKgabeaakiabcUda7aqaaiabdMha5naaBaaaleaacqWGPbqAcqWGQbGAaeqaaOGaeiiFaWNaeqiVd02aaSbaaSqaaiabdMgaPbqabaGccqGGSaalcqqHuoardaWgaaWcbaGaemyAaKgabeaakiabcYcaSiabeo8aZnaaDaaaleaacqWGPbqAaeaacqaIYaGmaaaakeaacqGG+bGFaeaacqWGobGtcqGGOaakcqaH8oqBdaWgaaWcbaGaemyAaKgabeaakiabgUcaRiabfs5aenaaBaaaleaacqWGPbqAaeqaaOGaeiilaWIaeq4Wdm3aa0baaSqaaiabdMgaPbqaaiabikdaYaaakiabcMcaPiabcYcaSaqaaiabdMgaPjabg2da9iabigdaXiabcYcaSiabl+UimjabcYcaSiabd6gaUjabcYcaSaqaaiabdUgaRjabg2da9iabigdaXiabcYcaSiabl+UimjabcYcaSiabd6gaUnaaBaaaleaacqaIXaqmcqWGPbqAaeqaaOGaeiilaWcaaaaa@961D@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>&#956;</it><sub><it>i </it></sub>is the baseline expression level under the control, and &#916;<sub><it>i </it></sub>is the difference in expression levels between treatment and control. We assume that variance <inline-formula><m:math name="1471-2105-10-5-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>i</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4Wdm3aa0baaSqaaiabdMgaPbqaaiabikdaYaaaaaa@3012@</m:annotation></m:semantics></m:math></inline-formula> is the same under the two conditions for the <it>i</it>th gene.</p>
            <p>In Bayesian modeling, it is common to introduce a latent variable to indicate the expression status of the <it>i</it>th gene <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B7">7</abbr></abbrgrp>. Here we use <it>r</it><sub><it>i </it></sub>= 1/0 to denote differential/nondifferential expression for gene <it>i</it>. Specifically, we have</p>
            <p>
               <display-formula>
                  <m:math name="1471-2105-10-5-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mrow>
                              <m:mo>{</m:mo>
                              <m:mrow>
                                 <m:mtable columnalign="left">
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>&#916;</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mn>0</m:mn>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mtext>if&#160;</m:mtext>
                                             <m:msub>
                                                <m:mi>r</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mn>0</m:mn>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>&#916;</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>~</m:mo>
                                             <m:mtext>N</m:mtext>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mn>0</m:mn>
                                             <m:mo>,</m:mo>
                                             <m:msubsup>
                                                <m:mi>s</m:mi>
                                                <m:mi>&#916;</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:msubsup>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mtext>if&#160;</m:mtext>
                                             <m:msub>
                                                <m:mi>r</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mn>1.</m:mn>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWaaiqaaeaafaqaaeGacaaabaGaeuiLdq0aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpcqaIWaamcqGGSaalaeaacqqGPbqAcqqGMbGzcqqGGaaicqWGYbGCdaWgaaWcbaGaemyAaKgabeaakiabg2da9iabicdaWiabcYcaSaqaaiabfs5aenaaBaaaleaacqWGPbqAaeqaaOGaeiOFa4NaeeOta4KaeiikaGIaeGimaaJaeiilaWIaem4Cam3aa0baaSqaaiabfs5aebqaaiabikdaYaaakiabcMcaPiabcYcaSaqaaiabbMgaPjabbAgaMjabbccaGiabdkhaYnaaBaaaleaacqWGPbqAaeqaaOGaeyypa0JaeGymaeJaeiOla4caaaGaay5Eaaaaaa@53A3@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Thus &#916;<sub><it>i </it></sub>is modeled by a mixture of two components, one being a point mass at 0 for non-DE genes, and another being a normal distribution for DE genes. Hyper-parameter <inline-formula><m:math name="1471-2105-10-5-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>s</m:mi><m:mi>&#916;</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4Cam3aa0baaSqaaiabfs5aebqaaiabikdaYaaaaaa@2FC9@</m:annotation></m:semantics></m:math></inline-formula> is specified as a constant. We further assume that <it>r</it><sub><it>i </it></sub>| <it>p</it><sub><it>r </it></sub>~ <it>Bernoulli</it>(<it>p</it><sub><it>r</it></sub>), where <it>p</it><sub><it>r </it></sub>is the mixing probability.</p>
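            <p>The part of the model described so far can be read as a recipe for generating data, sketched below. This is a hedged illustration, not the authors' R code: the function name <it>simulate_gene</it>, the seed, and all numerical values are hypothetical, and both the mixing probability and the gene variance are held at fixed, hand-picked constants for simplicity.</p>
            <p><![CDATA[
```python
import random


def simulate_gene(rng, p_r, s_delta, sigma, n0, n1, mu=0.0):
    """Draw one gene from the normal likelihood and the mixture on Delta.

    r ~ Bernoulli(p_r) is the latent expression status.
    Delta = 0 if r == 0, otherwise Delta ~ N(0, s_delta^2).
    Controls:   x_j ~ N(mu, sigma^2),         j = 1..n0
    Treatments: y_k ~ N(mu + Delta, sigma^2), k = 1..n1
    sigma is a fixed constant here (an assumption made for this sketch).
    """
    r = 1 if rng.random() < p_r else 0
    delta = rng.gauss(0.0, s_delta) if r == 1 else 0.0
    x = [rng.gauss(mu, sigma) for _ in range(n0)]
    y = [rng.gauss(mu + delta, sigma) for _ in range(n1)]
    return r, delta, x, y


rng = random.Random(2009)  # arbitrary seed for reproducibility
genes = [
    simulate_gene(rng, p_r=0.1, s_delta=2.0, sigma=0.5, n0=3, n1=3)
    for _ in range(1000)
]
n_de = sum(r for r, _, _, _ in genes)
print(n_de, "of", len(genes), "simulated genes are differentially expressed")
```
]]></p>
            <p>Because <it>n0</it> and <it>n1</it> are passed per call, the same sketch accommodates unequal replicate numbers across genes and conditions.</p>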
            <p>To introduce shrinkage on the variance component, we impose a mixture structure on <inline-formula><m:math name="1471-2105-10-5-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>i</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4Wdm3aa0baaSqaaiabdMgaPbqaaiabikdaYaaaaaa@3012@</m:annotation></m:semantics></m:math></inline-formula></p>
            <p>
               <display-formula>
                  <m:math name="1471-2105-10-5-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mrow>
                              <m:mo>{</m:mo>
                              <m:mrow>
                                 <m:mtable columnalign="left">
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msubsup>
                                                <m:mi>&#963;</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:msubsup>
                                             <m:mo>=</m:mo>
                                             <m:msubsup>
                                                <m:mi>&#963;</m:mi>
                                                <m:mn>0</m:mn>
                                                <m:mn>2</m:mn>
                                             </m:msubsup>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mtext>if&#160;</m:mtext>
                                             <m:msub>
                                                <m:mi>v</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mn>0</m:mn>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msubsup>
                                                <m:mi>&#963;</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:msubsup>
                                             <m:mo>~</m:mo>
                                             <m:mtext>IG</m:mtext>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>a</m:mi>
                                                <m:mi>&#963;</m:mi>
                                             </m:msub>
                                             <m:mo>,</m:mo>
                                             <m:msub>
                                                <m:mi>b</m:mi>
                                                <m:mi>&#963;</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mtext>if&#160;</m:mtext>
                                             <m:msub>
                                                <m:mi>v</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mn>1</m:mn>
                                             <m:mo>.</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWaaiqaaeaafaqaaeGacaaabaGaeq4Wdm3aa0baaSqaaiabdMgaPbqaaiabikdaYaaakiabg2da9iabeo8aZnaaDaaaleaacqaIWaamaeaacqaIYaGmaaGccqGGSaalaeaacqqGPbqAcqqGMbGzcqqGGaaicqWG2bGDdaWgaaWcbaGaemyAaKgabeaakiabg2da9iabicdaWiabcYcaSaqaaiabeo8aZnaaDaaaleaacqWGPbqAaeaacqaIYaGmaaGccqGG+bGFcqqGjbqscqqGhbWrcqGGOaakcqWGHbqydaWgaaWcbaGaeq4WdmhabeaakiabcYcaSiabdkgaInaaBaaaleaacqaHdpWCaeqaaOGaeiykaKIaeiilaWcabaGaeeyAaKMaeeOzayMaeeiiaaIaemODay3aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpcqaIXaqmcqGGUaGlaaaacaGL7baaaaa@5BE8@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>We assume that <it>v</it><sub><it>i </it></sub>| <it>p</it><sub><it>v </it></sub>~ <it>Bernoulli</it>(<it>p</it><sub><it>v</it></sub>), where <it>p</it><sub><it>v </it></sub>serves as the mixing probability. Thus <it>v</it><sub><it>i </it></sub>= 0 indicates that gene <it>i </it>shares a common variance with some other genes, and <it>v</it><sub><it>i </it></sub>= 1 indicates that it has a gene-specific variance arising from a continuous inverse gamma distribution. We specify hyper-parameters <it>a</it><sub><it>&#963; </it></sub>and <it>b</it><sub><it>&#963; </it></sub>as constants.</p>
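            <p>The variance mixture can be sketched in the same way. This example assumes that IG(<it>a</it>, <it>b</it>) denotes an inverse gamma distribution with shape <it>a </it>and scale <it>b</it>, so that the reciprocal of the variance is gamma distributed with shape <it>a </it>and rate <it>b</it>; the text does not state the parameterization, so this is an assumption. The function name and all numerical values are hypothetical.</p>

```python
import random

def sample_variance(p_v, sigma0_sq, a_sigma, b_sigma, rng):
    """Draw (v_i, sigma_i^2) from the common/gene-specific variance mixture.

    Assumes IG(a, b) has shape a and scale b (an assumption, see lead-in).
    """
    # v_i | p_v ~ Bernoulli(p_v)
    v_i = 1 if p_v > rng.random() else 0
    if v_i == 0:
        return v_i, sigma0_sq  # gene shares the common variance sigma_0^2
    # random.gammavariate takes (shape, scale); scale = 1 / rate
    precision = rng.gammavariate(a_sigma, 1.0 / b_sigma)
    return v_i, 1.0 / precision  # gene-specific inverse gamma draw

rng = random.Random(1)
draws = [sample_variance(0.3, 1.0, 3.0, 2.0, rng) for _ in range(5000)]
```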
            <p>We complete the Bayesian model with prior specifications for parameters (<it>&#956;</it><sub><it>i</it></sub>, <inline-formula><m:math name="1471-2105-10-5-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mn>0</m:mn><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4Wdm3aa0baaSqaaiabicdaWaqaaiabikdaYaaaaaa@2FA5@</m:annotation></m:semantics></m:math></inline-formula>, <it>p</it><sub><it>r</it></sub>,<it>p</it><sub><it>v</it></sub>),</p>
            <p>
               <display-formula>
                  <m:math name="1471-2105-10-5-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#956;</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>~</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>N</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>0</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:msubsup>
                                          <m:mi>s</m:mi>
                                          <m:mi>&#956;</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msubsup>
                                          <m:mi>&#963;</m:mi>
                                          <m:mn>0</m:mn>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>~</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>I</m:mi>
                                       <m:mi>G</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>a</m:mi>
                                          <m:mn>0</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>b</m:mi>
                                          <m:mn>0</m:mn>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>p</m:mi>
                                          <m:mi>r</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>~</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>B</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mi>t</m:mi>
                                       <m:mi>a</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>a</m:mi>
                                          <m:mi>r</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>b</m:mi>
                                          <m:mi>r</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>p</m:mi>
                                          <m:mi>v</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>~</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>B</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mi>t</m:mi>
                                       <m:mi>a</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>a</m:mi>
                                          <m:mi>v</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>b</m:mi>
                                          <m:mi>v</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabqWaaaaabaGaeqiVd02aaSbaaSqaaiabdMgaPbqabaaakeaacqGG+bGFaeaacqWGobGtcqGGOaakcqaIWaamcqGGSaalcqWGZbWCdaqhaaWcbaGaeqiVd0gabaGaeGOmaidaaOGaeiykaKIaeiilaWcabaGaeq4Wdm3aa0baaSqaaiabicdaWaqaaiabikdaYaaaaOqaaiabc6ha+bqaaiabdMeajjabdEeahjabcIcaOiabdggaHnaaBaaaleaacqaIWaamaeqaaOGaeiilaWIaemOyai2aaSbaaSqaaiabicdaWaqabaGccqGGPaqkcqGGSaalaeaacqWGWbaCdaWgaaWcbaGaemOCaihabeaaaOqaaiabc6ha+bqaaiabdkeacjabdwgaLjabdsha0jabdggaHjabcIcaOiabdggaHnaaBaaaleaacqWGYbGCaeqaaOGaeiilaWIaemOyai2aaSbaaSqaaiabdkhaYbqabaGccqGGPaqkcqGGSaalaeaacqWGWbaCdaWgaaWcbaGaemODayhabeaaaOqaaiabc6ha+bqaaiabdkeacjabdwgaLjabdsha0jabdggaHjabcIcaOiabdggaHnaaBaaaleaacqWG2bGDaeqaaOGaeiilaWIaemOyai2aaSbaaSqaaiabdAha2bqabaGccqGGPaqkcqGGSaalaaaaaa@70CE@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where (<inline-formula><m:math name="1471-2105-10-5-i8" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>s</m:mi><m:mi>&#956;</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4Cam3aa0baaSqaaiabeY7aTbqaaiabikdaYaaaaaa@3019@</m:annotation></m:semantics></m:math></inline-formula>, <it>a</it><sub>0</sub>, <it>b</it><sub>0</sub>, <it>a</it><sub><it>r</it></sub>, <it>b</it><sub><it>r</it></sub>, <it>a</it><sub><it>v</it></sub>, <it>b</it><sub><it>v</it></sub>) are specified as constants.</p>
            <p>Let <it>X </it>and <it>Y </it>be the collections of expression measurements from all the genes under control and treatment, respectively. Our primary interest is <it>z</it><sub><it>i </it></sub>= <it>E</it>(<it>r</it><sub><it>i </it></sub>| <it>X</it>, <it>Y</it>). Because <it>r</it><sub><it>i </it></sub>is binary, this expectation is the marginal posterior probability that gene <it>i </it>is DE. We use <it>z</it><sub><it>i </it></sub>as the test statistic: gene <it>i </it>is flagged as DE if <it>z</it><sub><it>i </it></sub>&gt; <it>&#955;</it>, where <it>&#955; </it>is a cutoff value.</p>
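            <p>Given posterior draws of the indicators <it>r</it><sub><it>i</it></sub>, however they are obtained, <it>z</it><sub><it>i </it></sub>can be approximated by the average of the sampled indicator values. The sketch below illustrates this and the thresholding rule; the function names and the toy draws are hypothetical.</p>

```python
def posterior_de_probability(r_draws):
    """Approximate z_i = E(r_i | X, Y) by averaging sampled 0/1 indicators.

    r_draws is a list of posterior draws, each a list with one r_i per gene.
    """
    n_draws = len(r_draws)
    n_genes = len(r_draws[0])
    return [sum(draw[i] for draw in r_draws) / n_draws for i in range(n_genes)]

def flag_de(z, lam):
    """Return the indices of genes whose z_i exceeds the cutoff lam."""
    return [i for i, z_i in enumerate(z) if z_i > lam]

# Hypothetical indicator draws for 3 genes over 4 retained iterations.
r_draws = [[1, 0, 1], [1, 0, 0], [1, 0, 1], [1, 1, 1]]
z = posterior_de_probability(r_draws)  # [1.0, 0.25, 0.75]
flagged = flag_de(z, lam=0.5)          # [0, 2]
```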
            <p>Computing <it>z</it><sub><it>i </it></sub>requires integrating over all the other parameters in the joint posterior distribution, and this integral has no closed form. We therefore use a Markov chain Monte Carlo (MCMC) algorithm for posterior inference. All the full conditional distributions have standard forms (normal, inverse gamma, beta, and Bernoulli), so each can be sampled directly and the MCMC simulation runs efficiently.</p>
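            <p>As an illustration of one such standard-form conditional, the priors stated above imply that, given the current indicators, the mixing probability <it>p</it><sub><it>r </it></sub>follows a beta distribution with parameters <it>a</it><sub><it>r </it></sub>plus the number of indicators equal to 1 and <it>b</it><sub><it>r </it></sub>plus the number equal to 0. The same update applies to <it>p</it><sub><it>v</it></sub>. The sketch below shows a single conditional draw as it might appear inside an MCMC sweep; it is derived here from the stated priors rather than taken from the authors' implementation, and the function name and inputs are hypothetical.</p>

```python
import random

def draw_mixing_probability(indicators, a, b, rng):
    """Draw a mixing probability from its beta full conditional.

    With indicator_i ~ Bernoulli(p) and p ~ Beta(a, b), the conditional
    of p given the indicators is Beta(a + #ones, b + #zeros).
    """
    ones = sum(indicators)
    zeros = len(indicators) - ones
    return rng.betavariate(a + ones, b + zeros)

rng = random.Random(2)
r = [1, 0, 0, 0, 1, 0, 0, 0, 0, 0]  # hypothetical current values of r_i
p_r = draw_mixing_probability(r, a=1.0, b=1.0, rng=rng)
```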
         </sec>
         <sec>
            <st>
               <p>The Bayesian ODP</p>
            </st>
            <p>Multiple testing methods are typically based on <it>p</it>-values, each computed using information from its own hypothesis test only. Because there is often a strong biological structure among HTS tests, the measurements from different tests can be related. Storey <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> proposed the optimal discovery procedure (ODP) to construct a test statistic using information across tests. Denote the expected number of true positives as ETP and the expected number of false positives as EFP. The ODP is optimal in that it maximizes the ETP for each fixed EFP level. The method has shown significant gains in power relative to a number of current leading methods.</p>
            <p>The ODP can be outlined as follows. Suppose there are <it>n </it>tests, and test <it>i </it>has null density <it>f</it><sub><it>i </it></sub>and alternative density <it>g</it><sub><it>i</it></sub>, for <it>i </it>= 1,..., <it>n</it>. The observed data are <b>x</b><sub>1</sub>, <b>x</b><sub>2</sub>,..., <b>x</b><sub><b>n</b></sub>, where <b>x</b><sub><b>i </b></sub>corresponds to test <it>i</it>. Then the ODP test statistic is</p>
            <p>
               <display-formula>
                  <m:math name="1471-2105-10-5-i9" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>S</m:mi>
                              <m:mrow>
                                 <m:mi>O</m:mi>
                                 <m:mi>D</m:mi>
                                 <m:mi>P</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>x</m:mi>
                           </m:mstyle>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mtable>
                                    <m:mtr>
                                       <m:mtd>
                                          <m:mrow>
                                             <m:mtext>sum&#160;of&#160;probability&#160;of&#160;data&#160;</m:mtext>
                                             <m:mstyle mathvariant="bold" mathsize="normal">
                                                <m:mi>x</m:mi>
                                             </m:mstyle>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr>
                                       <m:mtd>
                                          <m:mrow>
                                             <m:mtext>under&#160;each&#160;true&#160;alternative&#160;distribution</m:mtext>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                              <m:mrow>
                                 <m:mtable>
                                    <m:mtr>
                                       <m:mtd>
                                          <m:mrow>
                                             <m:mtext>sum&#160;of&#160;probability&#160;of&#160;data&#160;</m:mtext>
                                             <m:mstyle mathvariant="bold" mathsize="normal">
                                                <m:mi>x</m:mi>
                                             </m:mstyle>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr>
                                       <m:mtd>
                                          <m:mrow>
                                             <m:mtext>under&#160;each&#160;true&#160;null&#160;distribution</m:mtext>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4uam1aaSbaaSqaaiabd+eapjabdseaejabdcfaqbqabaGccqGGOaakcqWH4baEcqGGPaqkcqGH9aqpjuaGdaWcaaqaauaabeqaceaaaeaacqqGZbWCcqqG1bqDcqqGTbqBcqqGGaaicqqGVbWBcqqGMbGzcqqGGaaicqqGWbaCcqqGYbGCcqqGVbWBcqqGIbGycqqGHbqycqqGIbGycqqGPbqAcqqGSbaBcqqGPbqAcqqG0baDcqqG5bqEcqqGGaaicqqGVbWBcqqGMbGzcqqGGaaicqqGKbazcqqGHbqycqqG0baDcqqGHbqycqqGGaaicqWH4baEaeaacqqG1bqDcqqGUbGBcqqGKbazcqqGLbqzcqqGYbGCcqqGGaaicqqGLbqzcqqGHbqycqqGJbWycqqGObaAcqqGGaaicqqG0baDcqqGYbGCcqqG1bqDcqqGLbqzcqqGGaaicqqGHbqycqqGSbaBcqqG0baDcqqGLbqzcqqGYbGCcqqGUbGBcqqGHbqycqqG0baDcqqGPbqAcqqG2bGDcqqGLbqzcqqGGaaicqqGKbazcqqGPbqAcqqGZbWCcqqG0baDcqqGYbGCcqqGPbqAcqqGIbGycqqG1bqDcqqG0baDcqqGPbqAcqqGVbWBcqqGUbGBaaaabaqbaeqabiqaaaqaaiabbohaZjabbwha1jabb2gaTjabbccaGiabb+gaVjabbAgaMjabbccaGiabbchaWjabbkhaYjabb+gaVjabbkgaIjabbggaHjabbkgaIjabbMgaPjabbYgaSjabbMgaPjabbsha0jabbMha5jabbccaGiabb+gaVjabbAgaMjabbccaGiabbsgaKjabbggaHjabbsha0jabbggaHjabbccaGiabhIha4bqaaiabbwha1jabb6gaUjabbsgaKjabbwgaLjabbkhaYjabbccaGiabbwgaLjabbggaHjabbogaJjabbIgaOjabbccaGiabbsha0jabbkhaYjabbwha1jabbwgaLjabbccaGiabb6gaUjabbwha1jabbYgaSjabbYgaSjabbccaGiabbsgaKjabbMgaPjabbohaZjabbsha0jabbkhaYjabbMgaPjabbkgaIjabbwha1jabbsha0jabbMgaPjabb+gaVjabb6gaUbaaaaGccqGGUaGlaaa@DD0F@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Because the true parameters in the null and alternative distributions are unknown, Storey <it>et al</it>. <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> proposed the canonical plug-in estimate</p>
            <p>
               <display-formula id="M1">
                  <m:math name="1471-2105-10-5-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mover accent="true">
                                 <m:mi>S</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mrow>
                                 <m:mi>O</m:mi>
                                 <m:mi>D</m:mi>
                                 <m:mi>P</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>x</m:mi>
                           </m:mstyle>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munderover>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                       <m:mi>n</m:mi>
                                    </m:munderover>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>g</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mstyle mathvariant="bold" mathsize="normal">
                                          <m:mi>x</m:mi>
                                       </m:mstyle>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munderover>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                       <m:mi>n</m:mi>
                                    </m:munderover>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>w</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>f</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mstyle mathvariant="bold" mathsize="normal">
                                          <m:mi>x</m:mi>
                                       </m:mstyle>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafm4uamLbaKaadaWgaaWcbaGaem4ta8KaemiraqKaemiuaafabeaakiabcIcaOiabhIha4jabcMcaPiabg2da9KqbaoaalaaabaWaaabCaeaacuWGNbWzgaqcamaaBaaabaGaemyAaKgabeaacqGGOaakcqWH4baEcqGGPaqkaeaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGUbGBaiabggHiLdaabaWaaabCaeaacuWG3bWDgaqcamaaBaaabaGaemyAaKgabeaacuWGMbGzgaqcamaaBaaabaGaemyAaKgabeaacqGGOaakcqWH4baEcqGGPaqkaeaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGUbGBaiabggHiLdaaaOGaeiilaWcaaa@539B@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <inline-formula><m:math name="1471-2105-10-5-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaemyAaKgabeaaaaa@2EC1@</m:annotation></m:semantics></m:math></inline-formula> and <inline-formula><m:math name="1471-2105-10-5-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>g</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafm4zaCMbaKaadaWgaaWcbaGaemyAaKgabeaaaaa@2EC3@</m:annotation></m:semantics></m:math></inline-formula> are the estimates of <it>f</it><sub><it>i </it></sub>and <it>g</it><sub><it>i</it></sub>, <inline-formula><m:math name="1471-2105-10-5-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>w</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafm4DaCNbaKaadaWgaaWcbaGaemyAaKgabeaaaaa@2EE3@</m:annotation></m:semantics></m:math></inline-formula> = 1 if <inline-formula><m:math name="1471-2105-10-5-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaemyAaKgabeaaaaa@2EC1@</m:annotation></m:semantics></m:math></inline-formula> is to be included in the denominator, and <inline-formula><m:math name="1471-2105-10-5-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>w</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafm4DaCNbaKaadaWgaaWcbaGaemyAaKgabeaaaaa@2EE3@</m:annotation></m:semantics></m:math></inline-formula> = 0 otherwise. Specifically, the authors <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> assumed that the expression measurements follow a normal distribution and proposed plugging in the constrained maximum likelihood estimates under <it>f</it><sub><it>i </it></sub>and the unconstrained maximum likelihood estimates under <it>g</it><sub><it>i</it></sub>. Under the hypothesized normal distribution, these estimates are the sample mean and sample variance. To determine the null set, Storey <it>et al</it>. suggested an ad hoc approach to estimating <it>w</it><sub><it>i</it></sub>. First, rank the tests using a univariate statistic (e.g., the <it>t </it>statistic). Second, choose a cutoff; tests whose univariate statistic falls below the cutoff are classified into the null set (<inline-formula><m:math name="1471-2105-10-5-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>w</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafm4DaCNbaKaadaWgaaWcbaGaemyAaKgabeaaaaa@2EE3@</m:annotation></m:semantics></m:math></inline-formula> = 1). The cutoff is chosen so that the proportion of statistics not exceeding it equals the proportion of true nulls estimated by the method in <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Finally, a null hypothesis is rejected if <inline-formula><m:math name="1471-2105-10-5-i14" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>S</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>O</m:mi><m:mi>D</m:mi><m:mi>P</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafm4uamLbaKaadaWgaaWcbaGaem4ta8KaemiraqKaemiuaafabeaaaaa@30A1@</m:annotation></m:semantics></m:math></inline-formula>(<b>x</b><sub><b>i</b></sub>) exceeds some cutoff chosen to attain a given EFP level.</p>
            <p>The above ad hoc approach can be improved, because it estimates the distributional parameters using only the information from each individual test. The posterior estimates from the proposed Bayesian model borrow strength across all tests, which could provide more stable estimates. We propose to use the posterior means of <it>&#956;</it><sub><it>i</it></sub>, <it>&#956;</it><sub><it>i </it></sub>+ &#916;<sub><it>i</it></sub>, and <inline-formula><m:math name="1471-2105-10-5-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>i</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4Wdm3aa0baaSqaaiabdMgaPbqaaiabikdaYaaaaaa@3012@</m:annotation></m:semantics></m:math></inline-formula> to estimate the parameters of <it>f</it><sub><it>i </it></sub>and <it>g</it><sub><it>i </it></sub>in the ODP statistic.</p>
            <p>One way to estimate <it>w</it><sub><it>i </it></sub>is to choose a cutoff on the posterior probability (<it>z</it><sub><it>i</it></sub>) of a gene being DE, i.e., <inline-formula><m:math name="1471-2105-10-5-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>w</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafm4DaCNbaKaadaWgaaWcbaGaemyAaKgabeaaaaa@2EE3@</m:annotation></m:semantics></m:math></inline-formula> = 0 if <it>z</it><sub><it>i </it></sub>is greater than the cutoff (e.g., 0.5) and <inline-formula><m:math name="1471-2105-10-5-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>w</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafm4DaCNbaKaadaWgaaWcbaGaemyAaKgabeaaaaa@2EE3@</m:annotation></m:semantics></m:math></inline-formula> = 1 otherwise. Storey <it>et al</it>. <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> suggested that <it>w</it><sub><it>i </it></sub>can be thought of as weights estimating the true status of each hypothesis, and they could take on a continuum of values. Then another option is to set <inline-formula><m:math name="1471-2105-10-5-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>w</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafm4DaCNbaKaadaWgaaWcbaGaemyAaKgabeaaaaa@2EE3@</m:annotation></m:semantics></m:math></inline-formula> = 1 - <it>z</it><sub><it>i</it></sub>, the probability of the <it>i</it>th gene being non-DE, which can also be interpreted as the probability of the <it>i</it>th null hypothesis being true. The natural introduction of the posterior probability into the ODP statistic overcomes the problem of choosing an arbitrary cutoff value. It also accommodates the uncertainty in estimating the true status of each test. In this paper, we implement this second option to construct the Bayesian ODP statistic.</p>
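            <p>The second option can be illustrated with a minimal Python sketch. It assumes that the estimated ODP statistic is the sum of the estimated alternative densities divided by the weighted sum of the estimated null densities, and that both conditions are normally distributed with a shared gene-specific variance. The function name <it>bayesian_odp</it> and its array arguments (posterior means and posterior probabilities supplied by the user) are hypothetical and are not taken from the authors' implementation.</p>
            <p><![CDATA[
```python
import numpy as np


def log_normal_pdf(x, mean, var):
    """Elementwise log density of N(mean, var)."""
    return -0.5 * (np.log(2.0 * np.pi * var) + (x - mean) ** 2 / var)


def bayesian_odp(x_ctrl, x_trt, mu, delta, sigma2, z):
    """Sketch of a weighted ODP statistic for each gene.

    x_ctrl, x_trt : (m, n_c) and (m, n_t) arrays of replicates per gene.
    mu, delta, sigma2 : length-m posterior means (hypothetical inputs).
    z : length-m posterior probabilities of a gene being DE.
    """
    m = x_ctrl.shape[0]
    w = 1.0 - z  # weight = posterior probability that the null is true
    # log_f[j, i]: log-likelihood of gene j's data under gene i's null density.
    # log_g[j, i]: the same under gene i's alternative density.
    log_f = np.empty((m, m))
    log_g = np.empty((m, m))
    for i in range(m):
        ctrl = log_normal_pdf(x_ctrl, mu[i], sigma2[i]).sum(axis=1)
        log_f[:, i] = ctrl + log_normal_pdf(x_trt, mu[i], sigma2[i]).sum(axis=1)
        log_g[:, i] = ctrl + log_normal_pdf(
            x_trt, mu[i] + delta[i], sigma2[i]).sum(axis=1)
    # Subtract a per-gene maximum before exponentiating for numerical stability.
    shift = np.maximum(log_f.max(axis=1), log_g.max(axis=1))[:, None]
    numerator = np.exp(log_g - shift).sum(axis=1)
    denominator = (w * np.exp(log_f - shift)).sum(axis=1)
    return numerator / denominator
```
]]></p>
            <p>Larger values of the returned statistic indicate stronger evidence against the null hypothesis, so genes would be ranked in decreasing order of this value.</p>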
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <p>We conducted simulation studies and analyzed two experimental datasets to assess the performance of the Bayesian ODP. We compared it with six methods for identifying DE genes: the original ODP, the posterior probability from the Bayesian mixture model, the shrunken <it>t </it><abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, Fox and Dimmic's Bayesian <it>t </it>(Fox) <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, the moderated <it>t </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, and the intensity-based moderated <it>t </it>(IBMT) <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
         <sec>
            <st>
               <p>Simulation study</p>
            </st>
            <p>We simulated data based on the parameters estimated from the HTS lung cancer dataset described below. Specifically, we used an inverse gamma distribution to model the gene variance components. Figure <figr fid="F1">1</figr> plots the empirical density curves of the observed sample variances and of the sample variances simulated from the inverse gamma model. The two curves are similar, except that the curve based on the observed sample variances has a sharper peak in the center. This difference can be accommodated by assuming that some genes have a common variance around the mean of the gene-specific variances. In the simulation, we used the inverse gamma model to generate gene-specific variances <inline-formula><m:math name="1471-2105-10-5-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>i</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4Wdm3aa0baaSqaaiabdMgaPbqaaiabikdaYaaaaaa@3012@</m:annotation></m:semantics></m:math></inline-formula>,</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>The empirical density curves of the observed sample variances and of the simulated sample variances for the lung cancer data</p>
               </caption>
               <text>
                  <p><b>The empirical density curves of the observed sample variances and of the simulated sample variances for the lung cancer data</b>.</p>
               </text>
               <graphic file="1471-2105-10-5-1"/>
            </fig>
            <p>
               <display-formula>
                  <m:math name="1471-2105-10-5-i15" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mrow>
                              <m:mo>{</m:mo>
                              <m:mrow>
                                 <m:mtable columnalign="left">
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msubsup>
                                                <m:mi>&#963;</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:msubsup>
                                             <m:mo>=</m:mo>
                                             <m:msubsup>
                                                <m:mi>&#963;</m:mi>
                                                <m:mn>0</m:mn>
                                                <m:mn>2</m:mn>
                                             </m:msubsup>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mtext>if&#160;</m:mtext>
                                             <m:msub>
                                                <m:mi>v</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mn>0</m:mn>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msubsup>
                                                <m:mi>&#963;</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:msubsup>
                                             <m:mo>~</m:mo>
                                             <m:mtext>IG</m:mtext>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mn>2.3</m:mn>
                                             <m:mo>,</m:mo>
                                             <m:mn>0.01</m:mn>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mtext>if&#160;</m:mtext>
                                             <m:msub>
                                                <m:mi>v</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mn>1</m:mn>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWaaiqaaeaafaqaaeGacaaabaGaeq4Wdm3aa0baaSqaaiabdMgaPbqaaiabikdaYaaakiabg2da9iabeo8aZnaaDaaaleaacqaIWaamaeaacqaIYaGmaaGccqGGSaalaeaacqqGPbqAcqqGMbGzcqqGGaaicqWG2bGDdaWgaaWcbaGaemyAaKgabeaakiabg2da9iabicdaWiabcYcaSaqaaiabeo8aZnaaDaaaleaacqWGPbqAaeaacqaIYaGmaaGccqGG+bGFcqqGjbqscqqGhbWrcqGGOaakcqaIYaGmcqGGUaGlcqaIZaWmcqGGSaalcqaIWaamcqGGUaGlcqaIWaamcqaIXaqmcqGGPaqkcqGGSaalaeaacqqGPbqAcqqGMbGzcqqGGaaicqWG2bGDdaWgaaWcbaGaemyAaKgabeaakiabg2da9iabigdaXiabcYcaSaaaaiaawUhaaaaa@5BD4@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where we set the common variance <inline-formula><m:math name="1471-2105-10-5-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mn>0</m:mn><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4Wdm3aa0baaSqaaiabicdaWaqaaiabikdaYaaaaaa@2FA5@</m:annotation></m:semantics></m:math></inline-formula> to be the mean of the gene-specific variances. Without loss of generality, we assumed that the mean expression level under control equals 0 (<it>&#956;</it><sub><it>i </it></sub>= 0). The difference in expression levels between treatment and control is specified as</p>
            <p>
               <display-formula>
                  <m:math name="1471-2105-10-5-i16" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mrow>
                              <m:mo>{</m:mo>
                              <m:mrow>
                                 <m:mtable columnalign="left">
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>&#916;</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mn>0</m:mn>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mtext>if&#160;</m:mtext>
                                             <m:msub>
                                                <m:mi>r</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mn>0</m:mn>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>&#916;</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>~</m:mo>
                                             <m:mtext>N</m:mtext>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mn>0</m:mn>
                                             <m:mo>,</m:mo>
                                             <m:mn>0.12</m:mn>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mtext>if&#160;</m:mtext>
                                             <m:msub>
                                                <m:mi>r</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mn>1.</m:mn>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWaaiqaaeaafaqaaeGacaaabaGaeuiLdq0aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpcqaIWaamcqGGSaalaeaacqqGPbqAcqqGMbGzcqqGGaaicqWGYbGCdaWgaaWcbaGaemyAaKgabeaakiabg2da9iabicdaWiabcYcaSaqaaiabfs5aenaaBaaaleaacqWGPbqAaeqaaOGaeiOFa4NaeeOta4KaeiikaGIaeGimaaJaeiilaWIaeGimaaJaeiOla4IaeGymaeJaeGOmaiJaeiykaKIaeiilaWcabaGaeeyAaKMaeeOzayMaeeiiaaIaemOCai3aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpcqaIXaqmcqGGUaGlaaaacaGL7baaaaa@5359@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>We conducted simulation studies with 2 to 6 replicates per gene. We considered two scenarios for each number of replicates. In Scenario 1, all gene variances are gene-specific; in Scenario 2, 80% of gene variances are gene-specific and 20% of genes have a common variance. One hundred datasets were simulated under each scenario, each containing 1000 genes, 100 of which are DE.</p>
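            <p>The simulation design above can be sketched in Python as follows. This is an illustration under stated assumptions rather than the code used for the study: IG(2.3, 0.01) is read as a shape and scale parameterization, the second argument of N(0, 0.12) is read as a variance, the common variance is taken as the mean of the simulated gene-specific variances, and the DE genes and common-variance genes are chosen at random. The function name <it>simulate_dataset</it> and its parameters are hypothetical.</p>
            <p><![CDATA[
```python
import numpy as np


def simulate_dataset(n_genes=1000, n_de=100, n_rep=3, frac_common=0.0,
                     ig_shape=2.3, ig_scale=0.01, delta_var=0.12, seed=0):
    """Sketch of one simulated dataset under the stated design.

    Assumptions (not confirmed by the text): the inverse gamma uses a
    shape/scale parameterization and delta_var is a variance.
    """
    rng = np.random.default_rng(seed)
    # If G ~ Gamma(shape, rate=b) then 1/G ~ inverse gamma(shape, scale=b).
    sigma2 = 1.0 / rng.gamma(ig_shape, 1.0 / ig_scale, size=n_genes)
    # Scenario 2: a fraction of genes share the mean of the gene-specific variances.
    n_common = int(round(frac_common * n_genes))
    if n_common > 0:
        common_idx = rng.choice(n_genes, size=n_common, replace=False)
        sigma2[common_idx] = sigma2.mean()
    # Differential expression: the shift is 0 for non-DE genes.
    is_de = np.zeros(n_genes, dtype=bool)
    is_de[rng.choice(n_genes, size=n_de, replace=False)] = True
    draws = rng.normal(0.0, np.sqrt(delta_var), size=n_genes)
    delta = np.where(is_de, draws, 0.0)
    # The control mean is 0; the treatment mean is the gene-specific shift.
    sd = np.sqrt(sigma2)[:, None]
    x_ctrl = rng.normal(0.0, sd, size=(n_genes, n_rep))
    x_trt = rng.normal(delta[:, None], sd, size=(n_genes, n_rep))
    return x_ctrl, x_trt, is_de, sigma2
```
]]></p>
            <p>Under these assumptions, Scenario 1 corresponds to <it>frac_common</it> = 0 and Scenario 2 to <it>frac_common</it> = 0.2, with <it>n_rep</it> ranging from 2 to 6 and a different seed for each of the one hundred datasets.</p>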
            <p>We used noninformative priors so that posterior inference is dominated by the information from data. Specifically, we let <inline-formula><m:math name="1471-2105-10-5-i8" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>s</m:mi><m:mi>&#956;</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4Cam3aa0baaSqaaiabeY7aTbqaaiabikdaYaaaaaa@3019@</m:annotation></m:semantics></m:math></inline-formula> = <inline-formula><m:math name="1471-2105-10-5-i14" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>S</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>O</m:mi><m:mi>D</m:mi><m:mi>P</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafm4uamLbaKaadaWgaaWcbaGaem4ta8KaemiraqKaemiuaafabeaaaaa@30A1@</m:annotation></m:semantics></m:math></inline-formula> = 1.0 where 1.0 is sufficiently large for the expression levels. To specify the hyper-parameters for the inverse gamma priors, first we set <it>a</it><sub><it>&#963; </it></sub>= <it>a</it><sub>0 </sub>= 2.0 so that the inverse gamma priors have an infinite variance. Then we let the prior means, <inline-formula><m:math name="1471-2105-10-5-i17" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mfrac><m:mrow><m:msub><m:mi>b</m:mi><m:mi>&#963;</m:mi></m:msub></m:mrow><m:mrow><m:msub><m:mi>a</m:mi><m:mi>&#963;</m:mi></m:msub><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqcfa4aaSaaaeaacqWGIbGydaWgaaqaaiabeo8aZbqabaaabaGaemyyae2aaSbaaeaacqaHdpWCaeqaaiabgkHiTiabigdaXaaaaaa@34B0@</m:annotation></m:semantics></m:math></inline-formula> and <inline-formula><m:math name="1471-2105-10-5-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mfrac><m:mrow><m:msub><m:mi>b</m:mi><m:mn>0</m:mn></m:msub></m:mrow><m:mrow><m:msub><m:mi>a</m:mi><m:mn>0</m:mn></m:msub><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqcfa4aaSaaaeaacqWGIbGydaWgaaqaaiabicdaWaqabaaabaGaemyyae2aaSbaaeaacqaIWaamaeqaaiabgkHiTiabigdaXaaaaaa@3306@</m:annotation></m:semantics></m:math></inline-formula>, equal to the average of the sample variances to solve for <it>b</it><sub><it>&#963; </it></sub>and <it>b</it><sub>0</sub>. Finally, we chose <it>a</it><sub><it>r </it></sub>= <it>b</it><sub><it>r </it></sub>= <it>a</it><sub><it>v </it></sub>= <it>b</it><sub><it>v </it></sub>= 1, which corresponds to uniform priors for <it>p</it><sub><it>r </it></sub>and <it>p</it><sub><it>v</it></sub>. The computation was done by Gibbs sampling with 11,000 cycles, the first 1,000 of which served as burn-in. We monitored two parallel chains with different starting points to assess convergence.</p>
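            <p>The choice of the inverse gamma scale parameters amounts to a one-line calculation, sketched below in Python. The sketch assumes that the sample variance of each gene is the pooled within-condition variance, which the text does not specify; the function name <it>inverse_gamma_scale</it> is hypothetical.</p>
            <p><![CDATA[
```python
import numpy as np


def inverse_gamma_scale(x_ctrl, x_trt, a=2.0):
    """Solve b / (a - 1) = average sample variance for the scale b.

    Assumption: the per-gene sample variance is the pooled
    within-condition variance of the two groups of replicates.
    """
    n_c, n_t = x_ctrl.shape[1], x_trt.shape[1]
    sum_sq = (x_ctrl.var(axis=1, ddof=1) * (n_c - 1)
              + x_trt.var(axis=1, ddof=1) * (n_t - 1))
    pooled = sum_sq / (n_c + n_t - 2)
    # The prior mean of an inverse gamma(a, b) is b / (a - 1) for a above 1.
    return (a - 1.0) * pooled.mean()
```
]]></p>
            <p>With the shape parameter fixed at 2.0, the returned scale equals the average sample variance itself.</p>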
            <p>Figures <figr fid="F2">2</figr>, <figr fid="F3">3</figr>, <figr fid="F4">4</figr>, <figr fid="F5">5</figr>, and <figr fid="F6">6</figr> plot the false discovery rate (FDR) versus the number of rejected genes with 2 to 6 replicates per gene. The top panel is under Scenario 1 and the bottom panel is under Scenario 2. In general, the two plots in each figure show a similar pattern, indicating that the true percentage of genes having a common variance does not affect the results much. The mixture model on the variance components is therefore useful even when all the variance components are gene-specific. In all the cases considered, the Bayesian ODP substantially outperforms the other methods, including the original ODP. The posterior probability performs similarly to the shrunken <it>t</it>, the moderated <it>t</it>, Fox, and IBMT. The extra shrinkage introduced by the mixture distribution on the variance components makes the full Bayesian model comparable to the shrinkage and empirical Bayes statistics.</p>
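            <p>For a simulated dataset, where the true status of each gene is known, the FDR as a function of the number of rejected genes can be computed as in the following Python sketch. The function name <it>fdr_curve</it> is hypothetical, and the sketch assumes that larger scores indicate stronger evidence of differential expression.</p>
            <p><![CDATA[
```python
import numpy as np


def fdr_curve(scores, is_de):
    """Realized FDR after rejecting the top k genes, for k = 1..m.

    scores : larger values mean stronger evidence of differential expression.
    is_de  : boolean array with the simulated truth.
    """
    order = np.argsort(-np.asarray(scores, dtype=float), kind="stable")
    # Cumulative count of non-DE genes among the top-ranked genes.
    false_pos = np.cumsum(~np.asarray(is_de, dtype=bool)[order])
    n_rejected = np.arange(1, len(order) + 1)
    return n_rejected, false_pos / n_rejected
```
]]></p>
            <p>Averaging such curves over the simulated datasets of a scenario would give one curve per method for comparison.</p>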
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>ROC curves which compare the Bayesian ODP, the original ODP, the posterior probability from the Bayesian model, the shrunken <it>t</it>, the moderated <it>t</it>, Fox, and IBMT</p>
               </caption>
               <text>
                  <p><b>ROC curves which compare the Bayesian ODP, the original ODP, the posterior probability from the Bayesian model, the shrunken <it>t</it>, the moderated <it>t</it>, Fox, and IBMT</b>. The number of replicates per gene is 2. In Scenario 1, gene variances are gene-specific; in Scenario 2, 80% of gene variances are gene-specific and 20% of genes have a common variance.</p>
               </text>
               <graphic file="1471-2105-10-5-2"/>
            </fig>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>ROC curves which compare the Bayesian ODP, the original ODP, the posterior probability from the Bayesian model, the shrunken <it>t</it>, the moderated <it>t</it>, Fox, and IBMT</p>
               </caption>
               <text>
                  <p><b>ROC curves which compare the Bayesian ODP, the original ODP, the posterior probability from the Bayesian model, the shrunken <it>t</it>, the moderated <it>t</it>, Fox, and IBMT</b>. The number of replicates per gene is 3. In Scenario 1, gene variances are gene-specific; in Scenario 2, 80% of gene variances are gene-specific and 20% of genes have a common variance.</p>
               </text>
               <graphic file="1471-2105-10-5-3"/>
            </fig>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>ROC curves which compare the Bayesian ODP, the original ODP, the posterior probability from the Bayesian model, the shrunken <it>t</it>, the moderated <it>t</it>, Fox, and IBMT</p>
               </caption>
               <text>
                  <p><b>ROC curves which compare the Bayesian ODP, the original ODP, the posterior probability from the Bayesian model, the shrunken <it>t</it>, the moderated <it>t</it>, Fox, and IBMT</b>. The number of replicates per gene is 4. In Scenario 1, gene variances are gene-specific; in Scenario 2, 80% of gene variances are gene-specific and 20% of genes have a common variance.</p>
               </text>
               <graphic file="1471-2105-10-5-4"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>ROC curves which compare the Bayesian ODP, the original ODP, the posterior probability from the Bayesian model, the shrunken <it>t</it>, the moderated <it>t</it>, Fox, and IBMT</p>
               </caption>
               <text>
                  <p><b>ROC curves which compare the Bayesian ODP, the original ODP, the posterior probability from the Bayesian model, the shrunken <it>t</it>, the moderated <it>t</it>, Fox, and IBMT</b>. The number of replicates per gene is 5. In Scenario 1, gene variances are gene-specific; in Scenario 2, 80% of gene variances are gene-specific and 20% of genes have a common variance.</p>
               </text>
               <graphic file="1471-2105-10-5-5"/>
            </fig>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>ROC curves which compare the Bayesian ODP, the original ODP, the posterior probability from the Bayesian model, the shrunken <it>t</it>, the moderated <it>t</it>, Fox, and IBMT</p>
               </caption>
               <text>
                  <p><b>ROC curves which compare the Bayesian ODP, the original ODP, the posterior probability from the Bayesian model, the shrunken <it>t</it>, the moderated <it>t</it>, Fox, and IBMT</b>. The number of replicates per gene is 6. In Scenario 1, gene variances are gene-specific; in Scenario 2, 80% of gene variances are gene-specific and 20% of genes have a common variance.</p>
               </text>
               <graphic file="1471-2105-10-5-6"/>
            </fig>
            <p>In <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, the ODP shows significant improvement over the shrunken <it>t </it>statistic. However, in our simulation study, the ODP has the worst performance with 2 replicates per gene. It performs comparably to the shrunken <it>t </it>with 3 or 4 replicates per gene, and it outperforms the shrunken <it>t </it>with 5 or 6 replicates. The reason might be that in <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> each gene was tested on a relatively large number of arrays, i.e., with six, seven, and eight replicates under three conditions, respectively. The sample mean and sample variance, which are used in the ODP statistic defined in (1), are then much more stable than those based on few replicates. As shown in <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, the fewer replicates there are, the more shrinkage is introduced in the shrunken <it>t </it>statistic. In such cases, the ODP, which uses the sample mean and variance, might be outperformed by the shrinkage method. As the number of replicates increases, the sample variance becomes more stable, the benefit of shrinkage diminishes, and the advantage of the ODP statistic emerges.</p>
            <p>The Bayesian ODP is constructed based on the ODP test statistic, which has been shown to have optimal performance in multiple significance tests <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. It also takes advantage of the parameter estimates from the Bayesian mixture model, which are more reliable than those in the original ODP. When the number of replicates is extremely small, the Bayesian ODP might therefore perform better in identifying DE genes.</p>
         </sec>
         <sec>
            <st>
               <p>Experimental datasets</p>
            </st>
            <p>In this section, we apply the Bayesian ODP to two experimental datasets. The first dataset is from a real HTS experiment. Paclitaxel and related taxanes are routinely used in the treatment of non-small cell lung cancer and other epithelial malignancies. The goal of the experiment is to identify gene targets that specifically reduce cell viability in the presence of paclitaxel. Whitehurst <it>et al</it>. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> designed an HTS experiment which combined a high throughput cell-based one-well/one-gene screening platform with an arrayed genome-wide synthetic siRNA library for systematic interrogation of the molecular underpinnings of cancer cell chemoresponsiveness. Information on the dataset can be accessed from the <it>Nature </it>website <url>http://www.nature.com/nature/journal/v446/n7137/suppinfo/nature05697.html</url>. The dataset was generated under two conditions (in the presence and absence of paclitaxel). Over 21,000 genes were measured, each with 3 replicates. The measurements are cell viability scores based on adenosine triphosphate (ATP) concentration.</p>
            <p>The raw data were normalized to internal reference control samples on each plate to allow for plate-to-plate comparisons. After ranking the genes according to the Bayesian ODP statistic, we employed the Bayesian FDR to control multiple testing errors. The posterior probability of a gene being non-DE can be interpreted as a local FDR <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. A direct estimator of FDR <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> can be computed based on the posterior probability <it>z</it><sub><it>i</it></sub>. Specifically, the posterior expected FDR is</p>
            <p>
               <display-formula>
                  <m:math name="1471-2105-10-5-i19" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable columnalign="left">
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>F</m:mi>
                                             <m:mi>D</m:mi>
                                             <m:mi>R</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                       <m:mo>=</m:mo>
                                       <m:mi>E</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>F</m:mi>
                                       <m:mi>D</m:mi>
                                       <m:mi>R</m:mi>
                                       <m:mo>|</m:mo>
                                       <m:mi>d</m:mi>
                                       <m:mi>a</m:mi>
                                       <m:mi>t</m:mi>
                                       <m:mi>a</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd columnalign="left">
                                    <m:mo>=</m:mo>
                                 </m:mtd>
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mi>E</m:mi>
                                       <m:mrow>
                                          <m:mo>(</m:mo>
                                          <m:mrow>
                                             <m:mfrac>
                                                <m:mrow>
                                                   <m:mstyle displaystyle="true">
                                                      <m:mo>&#8721;</m:mo>
                                                      <m:mrow>
                                                         <m:msub>
                                                            <m:mi>&#948;</m:mi>
                                                            <m:mi>i</m:mi>
                                                         </m:msub>
                                                         <m:mo stretchy="false">(</m:mo>
                                                         <m:mn>1</m:mn>
                                                         <m:mo>&#8722;</m:mo>
                                                         <m:msub>
                                                            <m:mi>r</m:mi>
                                                            <m:mi>i</m:mi>
                                                         </m:msub>
                                                         <m:mo stretchy="false">)</m:mo>
                                                      </m:mrow>
                                                   </m:mstyle>
                                                </m:mrow>
                                                <m:mi>D</m:mi>
                                             </m:mfrac>
                                             <m:mo>|</m:mo>
                                             <m:mi>d</m:mi>
                                             <m:mi>a</m:mi>
                                             <m:mi>t</m:mi>
                                             <m:mi>a</m:mi>
                                          </m:mrow>
                                          <m:mo>)</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow/>
                                 </m:mtd>
                                 <m:mtd columnalign="left">
                                    <m:mo>=</m:mo>
                                 </m:mtd>
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:mstyle displaystyle="true">
                                                <m:mo>&#8721;</m:mo>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>&#948;</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mn>1</m:mn>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:msub>
                                                      <m:mi>z</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                             </m:mstyle>
                                          </m:mrow>
                                          <m:mi>D</m:mi>
                                       </m:mfrac>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeaabiWaaaqaamaanaaabaGaemOrayKaemiraqKaemOuaifaaiabg2da9iabdweafjabcIcaOiabdAeagjabdseaejabdkfasjabcYha8jabdsgaKjabdggaHjabdsha0jabdggaHjabcMcaPaqaaiabg2da9aqaaiabdweafnaabmaabaqcfa4aaSaaaeaadaaeabqaaiabes7aKnaaBaaabaGaemyAaKgabeaacqGGOaakcqaIXaqmcqGHsislcqWGYbGCdaWgaaqaaiabdMgaPbqabaGaeiykaKcabeqabiabggHiLdaabaGaemiraqeaaOGaeiiFaWNaemizaqMaemyyaeMaemiDaqNaemyyaegacaGLOaGaayzkaaaabaaabaGaeyypa0dabaqcfa4aaSaaaeaadaaeabqaaiabes7aKnaaBaaabaGaemyAaKgabeaacqGGOaakcqaIXaqmcqGHsislcqWG6bGEdaWgaaqaaiabdMgaPbqabaGaeiykaKcabeqabiabggHiLdaabaGaemiraqeaaiabcYcaSaaaaaa@6491@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>D </it>is the number of total rejections, indicator <it>&#948;</it><sub><it>i </it></sub>= 1 if the <it>i</it>th gene is identified as a hit (its Bayesian ODP statistic ranks among the top <it>D</it>), and <it>&#948;</it><sub><it>i </it></sub>= 0 otherwise. Plugging in the posterior probability <it>z</it><sub><it>i</it></sub>, we obtained an estimated FDR. Controlling the Bayesian FDR at 5%, we produced a list of 363 genes identified as hits.</p>
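            <p>The selection rule above can be illustrated with a minimal sketch. It assumes that larger values of the Bayesian ODP statistic indicate stronger evidence of differential expression, that the posterior probabilities of being DE are already available from the fitted model, and that the reported list is the largest top-ranked set whose estimated FDR does not exceed the target level. The function name <it>bayesian_fdr_cutoff</it> and its arguments are hypothetical and are not part of the original FORTRAN implementation.</p>
            <p><![CDATA[
```python
import numpy as np

def bayesian_fdr_cutoff(stat, z, alpha=0.05):
    """Illustrative sketch (not the authors' code).

    stat  : ranking statistic per gene (assumed: larger = more significant)
    z     : posterior probability that each gene is DE
    alpha : target level for the posterior expected FDR

    Returns (D, hits, fdr_curve) where D is the largest number of
    rejections whose estimated FDR is at most alpha, hits are the indices
    of the selected genes, and fdr_curve[k-1] is the estimate for the top k.
    """
    stat = np.asarray(stat, dtype=float)
    z = np.asarray(z, dtype=float)
    # Rank genes from most to least significant.
    order = np.argsort(-stat, kind="stable")
    # For the top-D list, delta_i = 1 exactly for those D genes, so the
    # estimate is sum over the top D of (1 - z_i), divided by D.
    fdr_curve = np.cumsum(1.0 - z[order]) / np.arange(1, stat.size + 1)
    passing = np.nonzero(fdr_curve <= alpha)[0]
    D = int(passing[-1]) + 1 if passing.size else 0
    return D, order[:D], fdr_curve

# Toy input with made-up numbers, for illustration only.
D, hits, curve = bayesian_fdr_cutoff(
    stat=[5.0, 4.0, 3.0, 2.0, 1.0],
    z=[0.99, 0.97, 0.95, 0.60, 0.10],
    alpha=0.05,
)
print(D, hits.tolist())
```
]]></p>
            <p>Because the ranking is by the ODP statistic rather than by the posterior probability itself, the estimated FDR need not increase monotonically with the list size; taking the largest admissible list is one possible convention.</p>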
            <p>Sixty-eight genes from the list were retested using the same reagent (Dharmacon siRNA) as in the original experiment, all of which turned out to be positive, showing a remarkably high level of reproducibility. Through empirical testing, the gamma tubulin ring complex (<it>&#947;</it>TURC) is known to modulate paclitaxel sensitivity in a broad variety of non-small cell lung cancer cell lines. Thus selected genes from the complex can be considered landmark hits. The Bayesian ODP selected all seven major components of the <it>&#947;</it>TURC (TUBGCP2, TUBA8, TUBGCP5, 76P, TUBGCP3, TUBG2, TUBG1). Considering the same number of selected genes (363), the original ODP produced 4 major components of the <it>&#947;</it>TURC (TUBG1, TUBA8, TUBG2, TUBGCP2), and the other five methods produced at most 5 of the major components.</p>
            <p>Without knowing the list of truly DE genes, we could not compare the Bayesian ODP and other competing methods accurately based on the HTS lung cancer data. To overcome this problem, we used the Golden Spike data <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> to compare the Bayesian ODP with the other six methods included in the simulation study.</p>
            <p>The Golden Spike dataset includes two conditions, with 3 replicates per condition. Each array has 14,010 probesets, among which 3,866 probesets have spike-in RNAs. Among these 3,866 spike-in probesets, 2,535 probesets have equal concentrations of RNAs under the two conditions and 1,331 probesets are spiked in at different fold-change levels, ranging from 1.2 to 4-fold. Compared to other spike-in datasets, the Golden Spike dataset has a large number of probesets that are known to be DE, which makes it very popular for comparing differential expression methods.</p>
            <p>There have been criticisms of the Golden Spike dataset <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>. One of the undesirable characteristics is that the non-DE probesets have non-uniform p-value distributions. Irizarry <it>et al</it>. <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> identified a severe experimental artifact, which is that "the feature intensities for genes spiked-in to be at 1:1 ratios behave very differently from the features from non-spiked-in genes". Pearson <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> suggested that one can use the Golden Spike dataset as a valid benchmark with the 2,535 equal fold-change probesets as the true negatives instead of including the non-spiked-in probesets. As such, there are 1,331 true positives and 2,535 true negatives. Opgen-Rhein and Strimmer <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> proposed to remove the 2,535 equal fold-change probesets, leaving in total 11,475 genes, and 1,331 known DE genes. In this paper, we conducted the analysis in both cases, with the former denoted as Scenario 1 and the latter Scenario 2. We used the distribution-free weighted method (DFW) <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> as the expression summary measure.</p>
            <p>In addition to comparing the power of the seven methods given the same number of selected genes, we also compared their ability to correctly estimate the FDR. Because the null distributions of some of the test statistics (i.e., the Bayesian ODP, the original ODP, the shrunken <it>t</it>) are unknown, the Benjamini-Hochberg FDR procedure <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> cannot be applied. We estimated the FDR by permutation analysis <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B13">13</abbr></abbrgrp>. The upper panels of Figure <figr fid="F7">7</figr> and Figure <figr fid="F8">8</figr> plot the true FDR versus the number of selected genes under the two scenarios. In general, the Bayesian ODP outperforms the other methods in both scenarios. In Scenario 2, the Bayesian ODP has a 1% FDR when the total number of rejections is less than 160, while the original ODP has a zero FDR. Note that the difference is caused only by one gene that is a false positive. As the total number of rejections increases, the Bayesian ODP has a much smaller FDR than the original ODP. Fox and IBMT have the second best performance under Scenario 1 and Scenario 2, respectively. We provided the list of the first 400 genes, along with their true expression status, identified by the competing methods under each scenario in Additional files <supplr sid="S2">2</supplr> and <supplr sid="S3">3</supplr>.</p>
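            <p>When the true expression status of every probeset is known, as in a spike-in benchmark, the true FDR at each list size can be computed directly from the ranking. The following is a minimal sketch of that calculation, assuming that larger statistics are more significant; the function name <it>true_fdr_curve</it> and the toy input are hypothetical and are not taken from the analysis code used in this paper.</p>
            <p><![CDATA[
```python
import numpy as np

def true_fdr_curve(stat, is_de):
    """Illustrative sketch (not the authors' code).

    stat  : ranking statistic per probeset (assumed: larger = more significant)
    is_de : known truth, True if the probeset is truly DE

    Returns an array whose entry k-1 is the proportion of false positives
    among the k top-ranked probesets.
    """
    stat = np.asarray(stat, dtype=float)
    is_de = np.asarray(is_de, dtype=bool)
    order = np.argsort(-stat, kind="stable")      # most significant first
    false_pos = np.cumsum(~is_de[order])          # non-DE probesets selected so far
    return false_pos / np.arange(1, stat.size + 1)

# Toy input with made-up numbers, for illustration only.
curve = true_fdr_curve(stat=[9.0, 8.0, 7.0, 6.0],
                       is_de=[True, False, True, True])
print(curve)
```
]]></p>
            <p>Plotting such a curve against the list size for each method gives a comparison of the kind shown in the upper panels of the figures.</p>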
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Results from the comparison based on the Golden Spike dataset under Scenario 1 where the total number of genes to compare is 3866</p>
               </caption>
               <text>
                  <p><b>Results from the comparison based on the Golden Spike dataset under Scenario 1 where the total number of genes to compare is 3866</b>. The top figure plots the FDR versus the total number of rejected genes. The bottom figure plots the estimated FDR versus the true proportion of false positives.</p>
               </text>
               <graphic file="1471-2105-10-5-7"/>
            </fig>
            <fig id="F8">
               <title>
                  <p>Figure 8</p>
               </title>
               <caption>
                  <p>Results from the comparison based on the Golden Spike dataset under Scenario 2 where the total number of genes to compare is 11475</p>
               </caption>
               <text>
                  <p><b>Results from the comparison based on the Golden Spike dataset under Scenario 2 where the total number of genes to compare is 11475</b>. The top figure plots the FDR versus the total number of rejected genes. The bottom figure plots the estimated FDR versus the true proportion of false positives.</p>
               </text>
               <graphic file="1471-2105-10-5-8"/>
            </fig>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>List of selected DE genes under Scenario 1</b>. This file contains the list of the DE genes, along with their true expression status, identified by the different methods from the Golden Spike dataset under Scenario 1 (excluding the 10,144 non-spiked-in probesets).</p>
               </text>
               <file name="1471-2105-10-5-S2.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p><b>List of selected DE genes under Scenario 2</b>. This file contains the list of the DE genes, along with their true expression status, identified by the different methods from the Golden Spike dataset under Scenario 2 (excluding the 2,535 equal fold-change probesets).</p>
               </text>
               <file name="1471-2105-10-5-S3.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The lower panels of Figure <figr fid="F7">7</figr> and Figure <figr fid="F8">8</figr> compare the estimated FDR with the true proportion of false positives <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, from which we can assess the ability of the methods to correctly establish the statistical significance of DE genes. We did not include the posterior probability because its permutation-based FDR assessment is computationally intractable (it requires MCMC simulation on thousands of datasets, each generated by replacing a gene with a simulated null gene). All of the methods in the comparison underestimate the number of false positives, which is consistent with the results reported in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Correctly estimating FDR when the null distribution is unknown remains a challenge.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>One important feature of high throughput screening is that the number of replicates is extremely small, rarely exceeding 3. Full Bayesian hierarchical models were shown to be less competitive than some existing frequentist and empirical Bayes methods <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. This is because full Bayesian models usually employ noninformative priors which do not provide sufficient shrinkage in the estimation. In this paper, we demonstrate that the full Bayesian model can be made a competitive approach by simply adding a point mass component in the variance prior. This modification introduces adequate shrinkage which improves the performance of the full Bayesian model considerably. The Bayesian computation is efficient. It takes about 10 minutes to run the FORTRAN program on an HP laptop (Pentium(R)4 CPU 3.20 GHz, 1 GB RAM) to analyze the lung cancer data.</p>
         <p>The optimal discovery procedure (ODP) is one of the current developments in multiple testing. It has shown significant improvements over many leading methods. The full Bayesian model can be further combined with the ODP statistic. The Bayesian ODP can perform better than the original ODP, especially when there are few replicates in HTS. The Bayesian ODP employs the posterior probability of a gene being DE, which naturally accounts for the uncertainty in the estimation of the null set. The parameter estimates in the original ODP, which are the sample mean and sample variance of each individual test, are not reliable with few replicates. By replacing those with the estimates from the Bayesian model, the ODP is improved through the combined effect of shrinkage estimation and borrowing strength across tests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>JC, XX, and SZ developed the methods. JC and XX implemented and applied the methods. JC and SZ wrote the manuscript. AW and MAW provided the lung cancer HTS data and tested the analysis results.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors thank the associate editor and the reviewers for their constructive comments and suggestions, which led to substantial improvement of the manuscript. This work was partly supported by NIH grant UL1 RR024982.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Statistical tests for differential expression in cDNA microarray experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Cui</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Churchill</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>210</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">154570</pubid>
                  <pubid idtype="pmpid" link="fulltext">12702200</pubid>
                  <pubid idtype="doi">10.1186/gb-2003-4-4-210</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Significance analysis of microarrays applied to transcriptional responses to ionizing radiation</p>
            </title>
            <aug>
               <au>
                  <snm>Tusher</snm>
                  <fnm>VG</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chu</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Proceedings of the National Academy of Sciences</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>5116</fpage>
            <lpage>5121</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1073/pnas.091062498</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Improved statistical tests for differential gene expression by shrinking variance components estimates</p>
            </title>
            <aug>
               <au>
                  <snm>Cui</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Hwang</snm>
                  <fnm>JTG</fnm>
               </au>
               <au>
                  <snm>Qiu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Blades</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Churchill</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>Biostatistics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>59</fpage>
            <lpage>75</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/biostatistics/kxh018</pubid>
                  <pubid idtype="pmpid" link="fulltext">15618528</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Accurate ranking of differentially expressed genes by a distribution-free shrinkage approach</p>
            </title>
            <aug>
               <au>
                  <snm>Opgen-Rhein</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Strimmer</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Statistical Applications in Genetics and Molecular Biology</source>
            <pubdate>2007</pubdate>
            <volume>6</volume>
            <issue>1</issue>
            <fpage>9</fpage>
            <xrefbib>
               <pubid idtype="doi">10.2202/1544-6115.1252</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inference of gene changes</p>
            </title>
            <aug>
               <au>
                  <snm>Baldi</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Long</snm>
                  <fnm>AD</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <fpage>509</fpage>
            <lpage>519</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/17.6.509</pubid>
                  <pubid idtype="pmpid" link="fulltext">11395427</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>A two-sample Bayesian t-test for microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Fox</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Dimmic</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>126</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1431571</pubid>
                  <pubid idtype="pmpid" link="fulltext">16529652</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-126</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Replicated microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Lonnstedt</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Statistica Sinica</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>31</fpage>
            <lpage>46</lpage>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Linear models and empirical Bayes methods for assessing differential expression in microarray experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Smyth</snm>
                  <fnm>GK</fnm>
               </au>
            </aug>
            <source>Stat Appl Genet Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>3</volume>
            <fpage>Article3</fpage>
            <xrefbib>
               <pubid idtype="pmpid">16646809</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Intensity-based hierarchical Bayes method improves testing for differentially expressed genes in microarray experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Sartor</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Tomlinson</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Wesselkamper</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Sivaganesan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Leikauf</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Medvedovic</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>538</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1781470</pubid>
                  <pubid idtype="pmpid" link="fulltext">17177995</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-538</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>On parametric empirical Bayes methods for comparing multiple groups using replicated gene expression profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Kendziorski</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Newton</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Lan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gould</snm>
                  <fnm>MN</fnm>
               </au>
            </aug>
            <source>Statistics in Medicine</source>
            <pubdate>2003</pubdate>
            <volume>22</volume>
            <fpage>3899</fpage>
            <lpage>3914</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/sim.1548</pubid>
                  <pubid idtype="pmpid" link="fulltext">14673946</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Hierarchical Bayes models for cDNA microarray gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Lonnstedt</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Britton</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Biostatistics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>279</fpage>
            <lpage>291</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/biostatistics/kxi009</pubid>
                  <pubid idtype="pmpid" link="fulltext">15772106</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>The optimal discovery procedure: A new approach to simultaneous significance testing</p>
            </title>
            <aug>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Journal of the Royal Statistical Society, Series B</source>
            <pubdate>2007</pubdate>
            <volume>69</volume>
            <fpage>1</fpage>
            <lpage>22</lpage>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The optimal discovery procedure for large-scale significance testing, with applications to comparative microarray experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>JY</fnm>
               </au>
               <au>
                  <snm>Leek</snm>
                  <fnm>JT</fnm>
               </au>
            </aug>
            <source>Biostatistics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>414</fpage>
            <lpage>432</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/biostatistics/kxl019</pubid>
                  <pubid idtype="pmpid" link="fulltext">16928955</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>A direct approach to false discovery rate</p>
            </title>
            <aug>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Journal of the Royal Statistical Society, Series B</source>
            <pubdate>2002</pubdate>
            <volume>64</volume>
            <fpage>479</fpage>
            <lpage>498</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/1467-9868.00346</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Synthetic lethal screen identification of chemosensitizer loci in cancer cells</p>
            </title>
            <aug>
               <au>
                  <snm>Whitehurst</snm>
                  <fnm>AW</fnm>
               </au>
               <au>
                  <snm>Bodemann</snm>
                  <fnm>BO</fnm>
               </au>
               <au>
                  <snm>Cardenas</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ferguson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Girard</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Payton</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Minna</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Michnoff</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hao</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>446</volume>
            <fpage>815</fpage>
            <lpage>819</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature05697</pubid>
                  <pubid idtype="pmpid" link="fulltext">17429401</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Empirical Bayes analysis of a microarray experiment</p>
            </title>
            <aug>
               <au>
                  <snm>Efron</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Tusher</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Journal of the American Statistical Association</source>
            <pubdate>2001</pubdate>
            <volume>96</volume>
            <fpage>1151</fpage>
            <lpage>1160</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1198/016214501753382129</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Detecting differential gene expression with a semiparametric hierarchical mixture method</p>
            </title>
            <aug>
               <au>
                  <snm>Newton</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Noueiry</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sarkar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ahlquist</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Biostatistics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>155</fpage>
            <lpage>176</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/biostatistics/5.2.155</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset</p>
            </title>
            <aug>
               <au>
                  <snm>Choe</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Boutros</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Michelson</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Halfon</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>2</issue>
            <fpage>R16</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">551536</pubid>
                  <pubid idtype="pmpid" link="fulltext">15693945</pubid>
                  <pubid idtype="doi">10.1186/gb-2005-6-2-r16</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>A reanalysis of a published Affymetrix GeneChip control dataset</p>
            </title>
            <aug>
               <au>
                  <snm>Dabney</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>3</issue>
            <fpage>401</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1557755</pubid>
                  <pubid idtype="pmpid" link="fulltext">16563185</pubid>
                  <pubid idtype="doi">10.1186/gb-2006-7-3-401</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Feature-level exploration of a published Affymetrix GeneChip control dataset</p>
            </title>
            <aug>
               <au>
                  <snm>Irizarry</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Cope</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>8</issue>
            <fpage>404</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1779590</pubid>
                  <pubid idtype="pmpid" link="fulltext">16953902</pubid>
                  <pubid idtype="doi">10.1186/gb-2006-7-8-404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Putative null distributions corresponding to tests of differential expression in the Golden Spike dataset are intensity dependent</p>
            </title>
            <aug>
               <au>
                  <snm>Gaile</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Miecznikowski</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>105</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1892022</pubid>
                  <pubid idtype="pmpid" link="fulltext">17445265</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-8-105</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>A comprehensive re-analysis of the Golden Spike data: Towards a benchmark for differential expression methods</p>
            </title>
            <aug>
               <au>
                  <snm>Pearson</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>164</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2324099</pubid>
                  <pubid idtype="pmpid" link="fulltext">18366762</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-9-164</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>A distribution free summarization method for Affymetrix GeneChip arrays</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>McGee</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Scheuermann</snm>
                  <fnm>RH</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <issue>3</issue>
            <fpage>321</fpage>
            <lpage>327</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl609</pubid>
                  <pubid idtype="pmpid" link="fulltext">17148508</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Controlling the false discovery rate: a practical and powerful approach to multiple testing</p>
            </title>
            <aug>
               <au>
                  <snm>Benjamini</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hochberg</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Journal of the Royal Statistical Society, Series B</source>
            <pubdate>1995</pubdate>
            <volume>57</volume>
            <fpage>289</fpage>
            <lpage>300</lpage>
         </bibl>
      </refgrp>
   </bm>
</art>
