<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-7-87</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>Operon information improves gene expression estimation for cDNA microarrays</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Xiao</snm>
               <fnm>Guanghua</fnm>
               <insr iid="I1"/>
               <email>guanghx@biostat.umn.edu</email>
            </au>
            <au id="A2">
               <snm>Martinez-Vaz</snm>
               <fnm>Betsy</fnm>
               <insr iid="I2"/>
               <email>bzayas@biosci.cbs.umn.edu</email>
            </au>
            <au id="A3">
               <snm>Pan</snm>
               <fnm>Wei</fnm>
               <insr iid="I1"/>
               <email>weip@biostat.umn.edu</email>
            </au>
            <au id="A4">
               <snm>Khodursky</snm>
               <mi>B</mi>
               <fnm>Arkady</fnm>
               <insr iid="I2"/>
               <email>khodu001@umn.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Division of Biostatistics, School of Public Health, University of Minnesota, A460 Mayo Building, Minneapolis, MN 55455-0378, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Saint Paul, MN, 55108, USA</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>87</fpage>
         <url>http://www.biomedcentral.com/1471-2164/7/87</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16630355</pubid>
               <pubid idtype="doi">10.1186/1471-2164-7-87</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>30</day>
               <month>9</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>21</day>
               <month>4</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>21</day>
               <month>4</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Xiao et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>In prokaryotic genomes, genes are organized in operons, and the genes within an operon tend to have similar levels of expression. Because of co-transcription of genes within an operon, borrowing information from other genes within the same operon can improve the estimation of relative transcript levels; the estimation of relative levels of transcript abundances is one of the most challenging tasks in experimental genomics due to the high noise level in microarray data. Therefore, techniques that can improve such estimations, and moreover are based on sound biological premises, are expected to benefit the field of microarray data analysis</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>In this paper, we propose a hierarchical Bayesian model, which relies on borrowing information from other genes within the same operon, to improve the estimation of gene expression levels and, hence, the detection of differentially expressed genes. The simulation studies and the analysis of experiential data demonstrated that the proposed method outperformed other techniques that are routinely used to estimate transcript levels and detect differentially expressed genes, including the sample mean and SAM t statistics. The improvement became more significant as the noise level in microarray data increases.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>By borrowing information about transcriptional activity of genes within classified operons, we improved the estimation of gene expression levels and the detection of differentially expressed genes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Genome-wide monitoring of transcription by means of DNA microarrays is used to infer transcriptional and regulatory networks in living organisms. In most of microarray experiments, transcript levels of thousands of genes are measured with a relatively small number of replications, so the estimates of true expression levels from microarray data may be poor, mostly due to a small sample size. To address this problem, several statistical methods have been proposed to borrow information from other genes to improve detection of the differentially expressed ones <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. The main idea is to borrow information from other genes to estimate either the distributions of genes's expression levels or the distribution of error terms. The underlying assumption is that it should be possible to improve the estimates of expression levels of genes by borrowing information about transcriptional activity across the sets of genes that are biologically, or physically, related. In some cases, the expression levels may significantly vary across the genes, then borrowing information from unrelated genes may not improve, or even worsen, the estimates of gene expression levels <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. However, if, based on biological knowledge, we can expect that some genes are more likely to express at similar levels (i.e. co-express), then we can improve the inference by using information about the activity of those genes.</p>
         <p>An operon <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> is a set of linearly juxtaposed genes transcribed as a single mRNA; operons are commonly found in prokaryotic genomes such as <it>Escherichia coli</it>. Transcription of operons of <it>E. coli </it>has been examined, and operons have been predicted in many studies <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>, which provides background information about the <it>E. coli </it>regulatory network. The genes within the same operon usually have similar expression levels, hence show some local structure in expression profiles <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, and this fact has been successfully used in operon prediction <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>.</p>
         <p>Based on the existing information about the structure of operons, we propose a hierarchical Bayesian model which improves gene expression estimation by borrowing information from genes within the same operon. Most existing methods for detecting differently expressed genes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp> borrow information from other genes in the whole genome, while our proposed method only borrows information from other genes within the same operon, which has sound biological basis. Wren <it>et al </it><abbrgrp><abbr bid="B23">23</abbr></abbrgrp> have proposed a simulated annealing approach to adjust gene expression data by using existing microarray measurements obtained on the same organism, which effectively reduced the noise and made it possible to compare different microarray experiments. But their method relies on reference microarray experiments that cover the dynamic range of transcript abundances for most of the genes, which may be difficult to select or unavailable. Instead of using existing microarray measurements, we use existing information about the operons' structure to reduce the noise in microarray data.</p>
         <p>A more accurate estimation of transcript abundances of individual genes will improve our ability to evaluate transcriptional activity on a genome-wide scale, and hence facilitate the exploration of gene regulatory networks. Our proposed method provides a better way to estimate relative transcript levels, which is critical for distinguishing differentially expressed (DE) genes from equally expressed (EE) genes. Herein, we refer to the logarithm of a ratio of the fluorescent intensities of the test and control samples as the observed gene expression level. The genes with estimated expression levels significantly different from zero are identified as DE genes, otherwise as EE genes.</p>
         <p>Using more than 200 microarray experiments, we obtained the evidence of co-transcription of genes within <it>E. coli </it>operons on a genome-wide scale. We applied the proposed method to three simulated and one experimentally obtained data sets. The simulation studies and the real data application demonstrated that the proposed method performed better than the sample mean and the SAM t statistics in estimating the gene expression levels as well as in detecting differentially expressed genes. The improvement became more significant as the noise level in microarray data increased.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Simulation study</p>
            </st>
            <p>We carried out three simulations, with similar settings and the noise level gradually increasing from simulation 1 to 3. In simulations, we assumed that the genes within an operon are co-transcribed. Since the true expression levels in a simulated data set were known, we could calculate the mean squared errors of the estimated expression levels (summarized in Table <tblr tid="T1">1</tblr>). Without incorporating the information from operons, the sample mean of the observed expression levels would be a natural estimator of a gene's expression level. The mean squared errors of the two estimates, the posterior mean from the proposed model and the sample mean, are shown in Table <tblr tid="T1">1</tblr> for comparison. It can be easily seen that the incorporation of operon information leads to a better estimate, with a smaller mean squared error, of expression levels. When the noise level increased, the improvement from incorporating operon information also increased.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Mean squared errors for different methods</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Proposed model</p>
                     </c>
                     <c ca="center">
                        <p>Sample mean</p>
                     </c>
                     <c ca="center">
                        <p>Difference</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Setting 1</p>
                     </c>
                     <c ca="center">
                        <p>0.062</p>
                     </c>
                     <c ca="center">
                        <p>0.068</p>
                     </c>
                     <c ca="center">
                        <p>8.6%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Setting 2</p>
                     </c>
                     <c ca="center">
                        <p>0.129</p>
                     </c>
                     <c ca="center">
                        <p>0.146</p>
                     </c>
                     <c ca="center">
                        <p>12.8%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Setting 3</p>
                     </c>
                     <c ca="center">
                        <p>0.222</p>
                     </c>
                     <c ca="center">
                        <p>0.255</p>
                     </c>
                     <c ca="center">
                        <p>15.1%</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>To evaluate the performance of the proposed method in detecting DE genes, the estimated expression level of each gene was used to rank the genes, and the highly ranked genes were identified as DE genes. Since the identities of DE genes in the simulation studies were known, we compared the performances of the proposed method, sample mean, and SAM t statistics in detecting DE genes using receiver operating-characteristic (ROC) curves (Figure <figr fid="F1">1</figr>). In a ROC curve, the <it>sensitivity </it>is plotted against 1 &#8211; <it>specificity</it>. The sensitivity is denned as a fraction of true DE genes being correctly detected and the specificity is a fraction of the true EE genes being correctly identified. The ROC curves in Figure <figr fid="F1">1</figr> demonstrate that the performance of the sample mean and of the SAM t statistic were very close, and our hierarchical model, incorporating operon information, outperformed both of them. The difference in the performance became greater as the noise level increased. For example, as the specificity equals to 0.8, the sensitivities of the methods using the sample mean or SAM t were about 0.91, 0.80, and 0.68 for simulations 1,2, and 3, respectively, while the sensitivities of the proposed method were 0.95, 0.89 and 0.83, respectively.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>ROC curves of simulation settings</p>
               </caption>
               <text>
                  <p><b>ROC curves of simulation settings</b>. The Figure (A), (B), and (C) are the ROC for simulations 1,2 and 3, respectively. It shows that the sample mean and the SAM t statistic have similar performance in detecting DE genes, and our hierarchical model outperformed both of them.</p>
               </text>
               <graphic file="1471-2164-7-87-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Application to E. coli data</p>
            </st>
            <p>To verify the assumption that the genes organized in operons are co-expressed, we pooled together data from 217 microarray experiments, obtained in 53 conditions <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The distribution of pairwise correlations between expression profiles of genes in operons was greatly skewed towards positive values, with the mean correlation of 0.62 (Figure <figr fid="F2">2A</figr>). Unlike the profiles of genes organized in operons, expression profiles of randomly picked pairs of genes were not correlated; the corresponding distribution of correlation coefficients was almost symmetric around 0, with the mean correlation of 0.012 (Figure <figr fid="F2">2B</figr>). This result demonstrated the similarity of transcriptional activity of genes within operons and served as a motivation for borrowing information from other genes within the same operon.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Histogram of the correlation coefficients</p>
               </caption>
               <text>
                  <p><b>Histogram of the correlation coefficients</b>. Histogram of pairwise correlation coefficients for (A)the genes within operons and (B) random gene pairs. The correlations are calculated across experimental conditions. The correlation of genes organized in operons is much higher than that of random genes, strongly indicating the co-expression of genes within operons.</p>
               </text>
               <graphic file="1471-2164-7-87-2"/>
            </fig>
            <p>The proposed method [see <supplr sid="S1">Additional file 1</supplr>] was used to analyze differential transcriptional activity in an <it>E. coli </it>mutant lacking the <it>flhDC </it>gene, a master regulator of transcription of genes whose products mediate bacterial motility and chemotaxis [see <supplr sid="S2">Additional file 2</supplr>]. The genes were ranked by their estimated expression levels, i.e. their posterior means of <it>&#956;<sub><it>i </it></sub></it>obtained from the proposed model. For the sake of comparison, the sample mean and SAM t statistics were also used to rank the genes [see <supplr sid="S3">Additional file 3</supplr>]. Using the functional annotation from Macnab <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> as a standard, we obtained the number of false positives at different cut-off levels (total positives). The comparison revealed that ranking genes by the proposed method produced fewer false positives than the ranking based on the SAM t or sample mean statistics (Figure <figr fid="F3">3</figr>).</p>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <text>
                  <p>Operon.r &#8211; The R code used in the study.</p>
               </text>
               <file name="1471-2164-7-87-S1.R">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S2">
               <title>
                  <p>Additional File 2</p>
               </title>
               <text>
                  <p>Motility.txt &#8211; E. coli motility data set.</p>
               </text>
               <file name="1471-2164-7-87-S2.txt">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional File 3</p>
               </title>
               <text>
                  <p>Result.txt &#8211; The result for E. coli motility data, including posterior mean of proposed method, sample mean and SAM t statistics</p>
               </text>
               <file name="1471-2164-7-87-S3.txt">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Number of false positives vs. Number of total positives</p>
               </caption>
               <text>
                  <p><b>Number of false positives vs. Number of total positives</b>. For the <it>E. coli </it>motility data, the genes are ranked by using the proposed method, SAM t statistic and sample mean. The number of false positives is plotted against the number of total positives for each ranking criterion. It shows that ranking genes by proposed method has less false positives than ranking genes by SAM t or sample mean.</p>
               </text>
               <graphic file="1471-2164-7-87-3"/>
            </fig>
            <p>To find a reasonable cutoff value for differently expressed genes, we calculated the false discovery rate (FDR) <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>based on the posterior probability <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. In this experiment, we also estimated the FDR by using the functional annotation from Macnab <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> as a reference. Comparison of the estimated False Discovery Rates revealed that the estimated FDR from the posterior probability was a little lower than that derived from the annotation, which could be due to the partial incompleteness of the reference (Fig. <figr fid="F4">4</figr>). Overall, the estimated FDR from the posterior probability is close to the FDR using the reference list of genes, indicating that our method for estimating the FDR is adequate. We set the cutoff for the FDR to be 0.01, which identified the top 44 genes as DE genes. At such a cutoff, the estimated number of false negatives is 14 and the estimated false negative rate is about 0.003 (see the "Methods" section for details). The top 44 genes are listed in Table <tblr tid="T2">2</tblr>. Note that, in Table <tblr tid="T2">2</tblr>, the gene expression level is on the log scale and the FDR corresponds to a specific number of DE genes and not to each individual gene itself. According to Macnab's classification <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, 41 genes out of the 44 were expected to be differentially expressed in the <it>flhDC- </it>dependent manner, whereas the lists of 44 genes identified by using SAM t and sample mean contained only 36 and 38 expected genes, respectively.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>FDR estimation</p>
               </caption>
               <text>
                  <p><b>FDR estimation</b>. Estimate FDR by using posterior probability and the functional annotation from Macnab <it>et al</it>. The solid line is for the FDR estimate using existing functional annotation while the dashed line for using posterior probability. It indicates that estimate the FDR by using posterior probability yields reasonable result.</p>
               </text>
               <graphic file="1471-2164-7-87-4"/>
            </fig>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Top 44 genes and their estimated relative transcription levels</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="center">
                        <p>Name</p>
                     </c>
                     <c ca="center">
                        <p>B number</p>
                     </c>
                     <c ca="center">
                        <p>Operon</p>
                     </c>
                     <c ca="center">
                        <p>Verified</p>
                     </c>
                     <c ca="center">
                        <p>Estimated expression</p>
                     </c>
                     <c ca="center">
                        <p>FDR</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliC</p>
                     </c>
                     <c ca="center">
                        <p>B1923</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-4.71</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgB</p>
                     </c>
                     <c ca="center">
                        <p>B1073</p>
                     </c>
                     <c ca="center">
                        <p>flgBCDEFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.90</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgE</p>
                     </c>
                     <c ca="center">
                        <p>B1076</p>
                     </c>
                     <c ca="center">
                        <p>flgBCDEFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.82</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgL</p>
                     </c>
                     <c ca="center">
                        <p>B1083</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.81</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgF</p>
                     </c>
                     <c ca="center">
                        <p>B1077</p>
                     </c>
                     <c ca="center">
                        <p>flgBCDEFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.81</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgD</p>
                     </c>
                     <c ca="center">
                        <p>B1075</p>
                     </c>
                     <c ca="center">
                        <p>flgBCDEFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.81</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgK</p>
                     </c>
                     <c ca="center">
                        <p>B1082</p>
                     </c>
                     <c ca="center">
                        <p>flgBCDEFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.76</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgI</p>
                     </c>
                     <c ca="center">
                        <p>B1080</p>
                     </c>
                     <c ca="center">
                        <p>flgBCDEFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.76</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgJ</p>
                     </c>
                     <c ca="center">
                        <p>B1081</p>
                     </c>
                     <c ca="center">
                        <p>flgBCDEFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.75</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgH</p>
                     </c>
                     <c ca="center">
                        <p>B1079</p>
                     </c>
                     <c ca="center">
                        <p>flgBCDEFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.74</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgC</p>
                     </c>
                     <c ca="center">
                        <p>B1074</p>
                     </c>
                     <c ca="center">
                        <p>flgBCDEFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.73</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgG</p>
                     </c>
                     <c ca="center">
                        <p>B1078</p>
                     </c>
                     <c ca="center">
                        <p>flgBCDEFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.71</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliA</p>
                     </c>
                     <c ca="center">
                        <p>B1922</p>
                     </c>
                     <c ca="center">
                        <p>fliAZY</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.44</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliY</p>
                     </c>
                     <c ca="center">
                        <p>B1920</p>
                     </c>
                     <c ca="center">
                        <p>fliAZY</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.35</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliZ</p>
                     </c>
                     <c ca="center">
                        <p>B1921</p>
                     </c>
                     <c ca="center">
                        <p>fliAZY</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.32</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliD</p>
                     </c>
                     <c ca="center">
                        <p>B1924</p>
                     </c>
                     <c ca="center">
                        <p>fliDST</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.30</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliS</p>
                     </c>
                     <c ca="center">
                        <p>B1925</p>
                     </c>
                     <c ca="center">
                        <p>fliDST</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.25</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliT</p>
                     </c>
                     <c ca="center">
                        <p>B1926</p>
                     </c>
                     <c ca="center">
                        <p>fliDST</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.25</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>tap</p>
                     </c>
                     <c ca="center">
                        <p>B1885</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-3.22</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>tar</p>
                     </c>
                     <c ca="center">
                        <p>B1886</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-2.97</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>tsr</p>
                     </c>
                     <c ca="center">
                        <p>B4355</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-2.64</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fadL</p>
                     </c>
                     <c ca="center">
                        <p>B2344</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-2.37</p>
                     </c>
                     <c ca="center">
                        <p>0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliH</p>
                     </c>
                     <c ca="center">
                        <p>B1940</p>
                     </c>
                     <c ca="center">
                        <p>fliFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-2.03</p>
                     </c>
                     <c ca="center">
                        <p>0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliG</p>
                     </c>
                     <c ca="center">
                        <p>B1939</p>
                     </c>
                     <c ca="center">
                        <p>fliFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-2.01</p>
                     </c>
                     <c ca="center">
                        <p>0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliF</p>
                     </c>
                     <c ca="center">
                        <p>B1938</p>
                     </c>
                     <c ca="center">
                        <p>fliFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.99</p>
                     </c>
                     <c ca="center">
                        <p>0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliK</p>
                     </c>
                     <c ca="center">
                        <p>B1943</p>
                     </c>
                     <c ca="center">
                        <p>fliFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.99</p>
                     </c>
                     <c ca="center">
                        <p>0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliJ</p>
                     </c>
                     <c ca="center">
                        <p>B1942</p>
                     </c>
                     <c ca="center">
                        <p>fliFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.92</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgA</p>
                     </c>
                     <c ca="center">
                        <p>B1072</p>
                     </c>
                     <c ca="center">
                        <p>flgAMN</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.91</p>
                     </c>
                     <c ca="center">
                        <p>0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliI</p>
                     </c>
                     <c ca="center">
                        <p>B1941</p>
                     </c>
                     <c ca="center">
                        <p>fliFGHIJK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.89</p>
                     </c>
                     <c ca="center">
                        <p>0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgM</p>
                     </c>
                     <c ca="center">
                        <p>B1071</p>
                     </c>
                     <c ca="center">
                        <p>flgAMN</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.87</p>
                     </c>
                     <c ca="center">
                        <p>0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flgN</p>
                     </c>
                     <c ca="center">
                        <p>B1070</p>
                     </c>
                     <c ca="center">
                        <p>flgAMN</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.81</p>
                     </c>
                     <c ca="center">
                        <p>0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>flxA</p>
                     </c>
                     <c ca="center">
                        <p>B1566</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.70</p>
                     </c>
                     <c ca="center">
                        <p>0.002</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>aer</p>
                     </c>
                     <c ca="center">
                        <p>B3072</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.66</p>
                     </c>
                     <c ca="center">
                        <p>0.002</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>cheZ</p>
                     </c>
                     <c ca="center">
                        <p>B1881</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.63</p>
                     </c>
                     <c ca="center">
                        <p>0.002</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>cheR</p>
                     </c>
                     <c ca="center">
                        <p>B1884</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.60</p>
                     </c>
                     <c ca="center">
                        <p>0.003</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliM</p>
                     </c>
                     <c ca="center">
                        <p>B1945</p>
                     </c>
                     <c ca="center">
                        <p>fliMNOPQR</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.55</p>
                     </c>
                     <c ca="center">
                        <p>0.003</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>cheY</p>
                     </c>
                     <c ca="center">
                        <p>B1882</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.51</p>
                     </c>
                     <c ca="center">
                        <p>0.005</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliL</p>
                     </c>
                     <c ca="center">
                        <p>B1944</p>
                     </c>
                     <c ca="center">
                        <p>fliMNOPQR</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.50</p>
                     </c>
                     <c ca="center">
                        <p>0.005</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>fliN</p>
                     </c>
                     <c ca="center">
                        <p>B1946</p>
                     </c>
                     <c ca="center">
                        <p>fliMNOPQR</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.49</p>
                     </c>
                     <c ca="center">
                        <p>0.005</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>ycgR</p>
                     </c>
                     <c ca="center">
                        <p>B1194</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-1.44</p>
                     </c>
                     <c ca="center">
                        <p>0.007</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>cheW</p>
                     </c>
                     <c ca="center">
                        <p>B1887</p>
                     </c>
                     <c ca="center">
                        <p>motAB-cheAW</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.42</p>
                     </c>
                     <c ca="center">
                        <p>0.007</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>cheA</p>
                     </c>
                     <c ca="center">
                        <p>B1888</p>
                     </c>
                     <c ca="center">
                        <p>motAB-cheAW</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.39</p>
                     </c>
                     <c ca="center">
                        <p>0.007</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>b1904</p>
                     </c>
                     <c ca="center">
                        <p>B1904</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-1.38</p>
                     </c>
                     <c ca="center">
                        <p>0.009</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>mot A</p>
                     </c>
                     <c ca="center">
                        <p>B1890</p>
                     </c>
                     <c ca="center">
                        <p>motAB-cheAW</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-1.32</p>
                     </c>
                     <c ca="center">
                        <p>0.009</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Note: the estimated gene expression levels is log scale.</p>
               </tblfn>
            </tbl>
            <p>We examined some genes and operons in more detail, to demonstrate the advantages of borrowing information from within an operon. For example, an operon <it>argT-hisJQMP </it>contains 5 genes (argT, hisJ, hisM, hisP, hisQ) and is not expected to be differentially expressed under the examined experimental condition, according to our biological knowledge <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. But from the microarray data, the mean expression level (an average log ratio) of the gene <it>argT was </it>-2.73, which ranked 15th among all the expression levels in the <it>E. coli </it>genome. This "high expression" of the <it>argT </it>could possibly be caused by random noise in microarray measurements and/or in biological samples, since the mean expression levels of other genes within the same operon were close to 0 (those for genes hisP, hisM, hisQ, hisJ were 0.00, -0.05, 0.03, and 0.05, respectively). However, accounting for expression levels of other genes within the same operon lowered the estimate of the expression level (posterior mean of <it>&#956;<sub><it>i </it></sub></it>of the <it>argT to </it>-0.02, rank of 1456, indicating that this gene was not differentially expressed. Analysis of another operon, <it>fliDST</it>, illustrates a complimentary case. Transcription of the <it>fliDST operon </it>(containing 3 genes, <it>fliD, fliS, fliT</it>) is known to be controlled by the FlhDC <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, and thus, under the experimental condition, differential expression of genes in that operon would be expected. While the expression level of the <it>fliT</it>, estimated by the sample mean at -0.90, ranked only 65th, the estimated expression level of the gene after borrowing information from two other genes in the operon was -3.25, which ranked 19th.</p>
            <p>In general, through borrowing information, our Bayesian method worked in a way giving more consistent estimates of the expression levels for the genes of the same operon. For example, compared with using the sample mean to estimate expression levels, the Bayesian method tended to yield smaller standard deviations of the expression estimates for within-operon genes (see Figure <figr fid="F5">5</figr>).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Within-operon standard deviation of the estimated gene expressions</p>
               </caption>
               <text>
                  <p><b>Within-operon standard deviation of the estimated gene expressions</b>. Distribution of the standard deviation of the expression estimates of the genes from the same operon. The solid line is for the sample mean method while the dashed line for our proposed Bayesian model. Comparing to the sample mean method, our proposed method yields smaller within-operon standard deviations of the gene expressions.</p>
               </text>
               <graphic file="1471-2164-7-87-5"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>In this paper, we proposed and applied a hierarchical Bayesian model, to estimate relative gene expression levels and detect differentially expressed genes by borrowing expression information within operons. The performance of the proposed method was compared with that of the sample mean and SAM t statistics. Through the simulation studies, we showed that the proposed method outperformed the sample mean and the SAM t statistics in estimating gene expression levels and detecting DE genes. The proposed method was used to analyze differential expression in an <it>E. coli </it>mutant with a defect in transcription of motility/chemotaxis genes, giving results more consistent with the existing biological knowledge than those obtained by using the other statistics.</p>
         <p>A major advantage of the proposed approach is in borrowing expression information from other genes within the same operon. The approach is developed within a statistically sound Bayesian model and it offers necessary flexibility with respect to the amount of information that needs to be borrowed from other genes. By borrowing information we can obtain stabilized estimates of expression levels from rather noisy microarray data. As a result, the estimates of transcript levels within the same operon become more similar to each other, more so than without borrowing information; this is consistent with a biological fact that genes within the same operon are transcribed as a single mRNA molecule. With the proposed method, the estimated expression levels of genes in "differentially expressed" operons are consistently high, and more importantly, transcript abundances of genes in "equally expressed" operons are stabilized towards zero. In the experimentally obtained microarray data, the within operon variation was smaller than the variation among the replicate data points for the same genes, indicating that the expression levels of the genes within an operon were very similar.</p>
         <p>In our model, the ratio of parameters <it>&#964;</it><sup>2 </sup>and <graphic file="1471-2164-7-87-i1.gif"/> determines how much information comes from the observed expression of a gene and how much comes from the average expression of an operon, when estimating the expression level of an individual gene within an operon. A smaller <it>&#964;</it>, as compared to <graphic file="1471-2164-7-87-i1.gif"/>, puts more weight on the average expression of an operon. Here we assumed the same <it>&#964; </it>value for all operons, implying that all operons had similar within-the-operon variability. However, this assumption might not be realistic. In some operons and physiological conditions, the genes might express very similarly, but in others, especially under the control of internal promoters, transcription of individual genes may be more heterogeneous. In the future, we will investigate the effect of an operon-specific <it>&#964;</it>. Operonal organization of genes is common in prokaryotes and also present in some eukaryotic organisms <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>, and the proposed method can be extended to biological systems where the operonal structure is unknown. Many biological studies have demonstrated that co-expressed genes tend to cluster on the chromosome. Although the nature of this phenomenon is not quite understood, a positional clustering of co-expressed genes can be found in many eukaryotes including yeast <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>, worm <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, fly <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>, mouse <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, and human <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. These findings indicate that the genes are likely to co-transcribe with their chromosomal neighbors. In those cases, instead of borrowing information from genes in the same operon, we can borrow information from gene neighbors on the chromosome. Another extension of our method would involve incorporation of gene annotation information into the analysis of expression data. The approach would be very similar to the one described in this paper: based on biological knowledge, the genes belonging to the same functional group are more likely to be co-expressed, so we can use a hierarchical model to borrow information from and for the genes within the same functional group to improve the estimates of gene expression levels.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The information about operon structure leads to a better estimation of gene expression levels. Using simulated and experimental data sets, we have demonstrated that the proposed method performs better than the sample mean and the SAM t statistics in estimating the relative levels of transcript abundances and detecting differentially expressed genes.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>RegulonDB database and E. coli. microarray data</p>
            </st>
            <p>RegulonDB <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> is a database containing information about known operons in <it>E. coli</it>. According to the RegulonDB annotations, 1486 genes (about one third of all genes predicted in the <it>E. coli </it>genome) are organized in 600 operons.</p>
            <p>The <it>E. coli </it>data set contains results of 217 microarrays collected in 53 different experimental conditions. The fluorescent intensities of the test and control samples were measured, and the average log ratio of the intensities for each gene under the same condition was used here to represent an observed gene expression level under that condition <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
            <p>An <it>E. coli </it>motility expression data set (<abbrgrp><abbr bid="B41">41</abbr></abbrgrp> series accession number: GPL2101) was obtained in a direct pair-wise comparison between a knock-out mutant of the <it>flhDC</it>, a master regulator of the motility/chemotaxis regulon <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, and its isogenic wild type strain. Total RNA samples of a mutant <it>E. coli </it>(test samples) and an isogenic wild type <it>E. coli </it>(control samples) were labeled with red (Cy5) and green (Cy3) fluorophors. The intensities from the red and the green channels were normalized by the lowess method <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. There were 4281 genes (G = 4281) with four replicates for each gene (n = 4). Let <it>Y</it><sub><it>i</it>,<it>j </it></sub>be defined as the log ratio of the intensities between the test and control samples for gene <it>i </it>on array <it>j; </it>that is,</p>
            <p>
               <graphic file="1471-2164-7-87-i2.gif"/>
            </p>
         </sec>
         <sec>
            <st>
               <p>Hierarchical models</p>
            </st>
            <p>We propose a hierarchical Bayesian model,</p>
            <p>
               <graphic file="1471-2164-7-87-i3.gif"/>
            </p>
            <p>where, <it>Y</it><sub><it>i</it>,<it>j </it></sub>is the log ratio of gene <it>i </it>in replicate <it>j, &#956;<sub><it>i </it></sub></it>and <it>&#963;</it><sub><it>i </it></sub>are the true expression level and the standard deviation, respectively. In our method, the posterior mean of <it>&#956;<sub><it>i </it></sub></it>is used as the estimated expression level of gene <it>i</it>, while the sample mean, <graphic file="1471-2164-7-87-i4.gif"/>., is referred to as the observed expression level.</p>
            <p>As prior knowledge, we assume that if several genes belong to the same operon, in accordance with the RegulonDB annotation, then their expression levels are from a normal distribution, with the mean <it>&#955;</it><sub><it>p </it></sub>and the variance <it>&#964;</it><sup>2</sup>. Specifically,</p>
            <p>
               <graphic file="1471-2164-7-87-i5.gif"/>
            </p>
            <p>where <it>O</it><sub><it>p </it></sub>denotes operon <it>p</it>. <it>&#955;</it><sub><it>p </it></sub>represents the expression level of the operon <it>p</it>, which is the average of the mean expression levels of all genes within the operon <it>p</it>. <it>&#964;</it><sup>2 </sup>is the within operon variation, and is assumed to be the same across all operons. A non-informative prior is assigned to <it>&#955;</it><sub><it>p</it></sub>, that is <it>Pr</it>(<it>&#955;</it><sub><it>p</it></sub>) &#945; 1, to reflect the lack of prior information, <graphic file="1471-2164-7-87-i1.gif"/> and <it>&#964;</it><sup>2 </sup>have vague priors, which are inverse Gamma distributions with the shape and rate parameters equal to 0.01 and 0.01 respectively <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. If gene <it>i </it>is not in any operon, then</p>
            <p>
               <graphic file="1471-2164-7-87-i6.gif"/>
            </p>
            <p>so the posterior mean of <it>&#956;<sub><it>i </it></sub></it>is just the the sample mean <graphic file="1471-2164-7-87-i4.gif"/>; if gene <it>i </it>is in operon <it>p</it>, then the conditional distribution of <it>&#956;<sub><it>i </it></sub></it>can be derived as:</p>
            <p>
               <graphic file="1471-2164-7-87-i7.gif"/>
            </p>
            <p>where</p>
            <p>
               <graphic file="1471-2164-7-87-i8.gif"/>
            </p>
            <p>Equation (3) shows that, when borrowing information from the other genes within the same operon, the estimated expression level of the gene <it>i </it>becomes the weighted average of the observed expression level of gene <it>i </it>and the expression level of operon <it>p</it>, given that gene <it>i </it>belongs to operon <it>p</it>. The weights are inversely proportional to the variances. In this model, a key concept is to shrink the observed expression level <graphic file="1471-2164-7-87-i4.gif"/>, towards <it>&#955;</it><sub><it>p</it></sub>, the expression level of an operon, based on the knowledge of the operon structure. The degree of shrinkage is determined by the variability of <graphic file="1471-2164-7-87-i4.gif"/>. and <it>&#955;</it><sub><it>p</it></sub>. Without incorporating operon information, the estimated expression level would be close to the observed expression level, <graphic file="1471-2164-7-87-i4.gif"/> In the hierarchical model, <it>&#955;</it><sub><it>p </it></sub>represents the expression level of operon <it>p</it>, and</p>
            <p>
               <graphic file="1471-2164-7-87-i9.gif"/>
            </p>
            <p>where, <it>m</it><sub><it>p </it></sub>is the number of genes in operon <it>p</it>, and <it>i </it>&#8712; <it>O </it><sub><it>p </it></sub>denotes that gene <it>t </it>is in operon <it>p</it>. Although the posterior distribution was not available in a closed form, we could derive a closed form of the full conditional distribution, and used Markov chain Monte Carlo (MCMC) to simulate the parameters from the posterior distribution. With this closed form expression, the model could be easily coded in R [see <supplr sid="S1">Additional file 1</supplr>] for MCMC simulation using Gibbs sampling <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. The expression level of gene <it>i </it>is estimated by the posterior mean of <it>&#956;<sub><it>i</it></sub></it>, and the genes are ranked by the absolute values of the posterior means of the <it>&#956;<sub><it>i</it></sub>'</it>s. Genes with high rankings were designated as differentially expressed (DE) genes.</p>
         </sec>
         <sec>
            <st>
               <p>SAM t statistic</p>
            </st>
            <p>To evaluate the performance of the proposed method in estimating gene expression levels and identifying DE genes, we compared the proposed method to the sample mean and SAM t statistics <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B3">3</abbr></abbrgrp>. Because of its good performance, the SAM t statistic <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B3">3</abbr></abbrgrp> is widely used to rank genes and detect DE genes <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. We denote the SAM t statistic for the gene <it>i </it>as <it>Z</it><sub><it>i</it></sub>, then</p>
            <p>
               <graphic file="1471-2164-7-87-i10.gif"/>
            </p>
            <p>where <graphic file="1471-2164-7-87-i11.gif"/> and <it>S</it><sub><it>i </it></sub>are the sample mean and sample standard deviation for the gene <it>i</it>, and <it>So </it>is the 90 <it>th </it>percentile of <it>S</it><sub><it>i</it></sub>'s.</p>
         </sec>
         <sec>
            <st>
               <p>Simulation settings</p>
            </st>
            <p>We conducted three simulation studies to assess the usefulness of our method. The operon structure of the <it>E. coli </it>genome from the RegulonDB database <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> was used in simulation studies. We randomly chose 100 operons (involving about 340 genes) and assumed that genes from those operons were differentially expressed DE genes. Then we randomly picked a subset of non-operon genes to be DE genes and adjusted the total number of DE genes to 400. Let <it>&#956;<sub><it>i </it></sub></it>be the expression level of gene <it>i</it>, for <it>i = </it>1, 2,..., 4821. For DE genes, <it>&#956;<sub><it>i</it></sub>'</it>s were simulated from an equal mixture of <it>N</it>(1, 0.25<sup>2</sup>) and <it>N(&#8211;</it>1,0.25<sup>2</sup>) distributions, and genes within the same operon were from the same component of the mixture distribution. For EE genes, <it>&#956;</it><sub><it>i </it></sub>~ <it>N</it>(0, 0.25<sup>2</sup>). We simulated 4 replicates from a normal distribution for each gene, <it>Y</it><sub><it>ij </it></sub>~ <it>N</it>(&#956;<sub><it>i</it></sub>,<graphic file="1471-2164-7-87-i1.gif"/>), where <it>Y</it><sub><it>i</it>,<it>j </it></sub>was the log ratio of transcript abundances for the gene <it>i </it>on the array <it>j</it>. To provide increasing noise levels for simulations 1, 2 and 3, the &#963;<sub><it>i</it></sub>'s were simulated from the <it>uniform</it>(0.25, 0.75), <it>uniform(</it>0.5,1.0), and <it>uniform(</it>0.75,1.25), respectively.</p>
         </sec>
         <sec>
            <st>
               <p>Estimation of false positives and false negatives</p>
            </st>
            <p>Using the posterior distributions, we can evaluate the FDR for specific number of DE genes. Using <it>Pr</it>(|<it>&#956;</it><sub><it>i</it></sub>| > <it>&#948;'</it>|<it>Y</it><sub><it>i,j</it></sub>) to estimate the probability of gene <it>i </it>to be a DE gene, we can estimate the number of false positives for a cut off value <it>k </it><abbrgrp><abbr bid="B7">7</abbr></abbrgrp>:</p>
            <p>
               <graphic file="1471-2164-7-87-i12.gif"/>
            </p>
            <p>Here, the genes are ranked based on the estimated mean expression level <it>&#956;<sub><it>i</it></sub></it>. In this study, we set &#948; = 1,</p>
            <p>which corresponding to the commonly used 2-fold cutoff.</p>
            <p>The false discovery rate (FDR) for the cut off <it>k </it>can be derived as:</p>
            <p>
               <graphic file="1471-2164-7-87-i13.gif"/>
            </p>
            <p>Similarly, the number of false negatives for the cut off <it>k </it>can be calculated as:</p>
            <p>
               <graphic file="1471-2164-7-87-i14.gif"/>
            </p>
         </sec>
         <sec>
            <st>
               <p>Algorithm for Gibbs sampler</p>
            </st>
            <p>The algorithm is implemented below:</p>
            <p>
               <b>Set initial values:</b>
            </p>
            <p>
               <graphic file="1471-2164-7-87-i15.gif"/>
            </p>
            <p>
               <b>FOR t FROM 1 TO T, draws random samples:</b>
            </p>
            <p>
               <graphic file="1471-2164-7-87-i16.gif"/>
            </p>
            <sec>
               <st>
                  <p>END FOR</p>
               </st>
               <p>where <it>V</it><sub><it>i </it></sub>is the sample variance of gene <it>i</it>, and <it>V</it><sub>0 </sub>is the median of <it>V</it><sub><it>i</it></sub>'s. <it>n</it><sub><it>p </it></sub>is the number of genes in operon <it>p</it>. <it>T </it>is the total number of iteration. To diminish the effect of the initial values, we discard the results from the early iterations <it>(t &#8804; T</it><sub><it>B</it></sub>, where <it>T</it><sub><it>B </it></sub>is the burn in time). The posterior mean of <it>&#956;</it><sub><it>i</it></sub>, of gene <it>i </it>is calculated by:</p>
               <p>
                  <graphic file="1471-2164-7-87-i17.gif"/>
               </p>
               <p>In our proposed method, the expression level of gene <it>i </it>is estimated by <graphic file="1471-2164-7-87-i18.gif"/>. In the real data example, <it>T</it><sub><it>B </it></sub>and <it>T </it>are 500 and 2000, respectively.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>GX initiated the study, implemented the methods and conducted data analysis. WP participated in development of the methods and co-wrote the paper. BMV generated the <it>E. coli </it>motility data. ABK generated the <it>E. coli </it>data set containing 217 microarrays, supervised the project and co-wrote the paper. All authors contributed to the writing, read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors are grateful to the reviewers for helpful comments. GX is supported by a Merck fellowship, and BMMV was supported in part by a Ford postdoctoral fellowship. This work was supported in part by NIH grant GM066098 (ABK) and HL65462 (WP), and a University of Minnesota AHC faculty research development grant (WP and ABK).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Significance analysis of microarrays applied to the ionizing radiation response</p>
            </title>
            <aug>
               <au>
                  <snm>Tusher</snm>
                  <fnm>VG</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chu</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <issue>9</issue>
            <fpage>5116</fpage>
            <lpage>5121</lpage>
            <url>http://www.pnas.Org/cgi/content/abstract/98/9/5116</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">11309499</pubid>
                  <pubid idtype="doi">10.1073/pnas.091062498</pubid>
                  <pubid idtype="pmcid">33173</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>A Bayesian framework for the analysis of microarray expression data: regularized t -test and statistical inferences of gene changes</p>
            </title>
            <aug>
               <au>
                  <snm>Baldi</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Long</snm>
                  <fnm>AD</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <issue>6</issue>
            <fpage>509</fpage>
            <lpage>519</lpage>
            <url>http://bioinformatics.oxfordjournals.Org/cgi/content/abstract/17/6/509</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/17.6.509</pubid>
                  <pubid idtype="pmpid" link="fulltext">11395427</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Empirical Bayes analysis of a microarray experiment</p>
            </title>
            <aug>
               <au>
                  <snm>Efron</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Tishirani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Storey</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tusher</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>J Amer Statist Assoc</source>
            <pubdate>2001</pubdate>
            <volume>96</volume>
            <fpage>1151</fpage>
            <lpage>1160</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1198/016214501753382129</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>A comparative review of statistical methods for discovering dierentially expressed genes
in replicated microarray experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Pan</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <issue>4</issue>
            <fpage>546</fpage>
            <lpage>554</lpage>
            <url>http://bioinformatics.oxfordjournals.Org/cgi/content/abstract/18/4/546</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/18.4.546</pubid>
                  <pubid idtype="pmpid" link="fulltext">12016052</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Bayesian Hierarchical Model for Identifying Changes in Gene Expression from Microarray Experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Broet</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Radvanyi</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Journal of Computational Biology</source>
            <pubdate>2002</pubdate>
            <volume>9</volume>
            <issue>4</issue>
            <fpage>671</fpage>
            <lpage>683</lpage>
            <url>http://www.liebertonline.com/doi/abs/10.1089/106652702760277381</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/106652702760277381</pubid>
                  <pubid idtype="pmpid" link="fulltext">12323100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>On parametric empirical Bayes methods for comparing multiple groups using replicated gene expression profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Kendziorski</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Newton</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Lan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gould</snm>
                  <fnm>MN</fnm>
               </au>
            </aug>
            <source>Statistics in Medicine</source>
            <pubdate>2003</pubdate>
            <volume>22</volume>
            <issue>24</issue>
            <fpage>3899</fpage>
            <lpage>3914</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/sim.1548</pubid>
                  <pubid idtype="pmpid" link="fulltext">14673946</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Detecting differential gene expression with a semiparametric hierarchical mixture method</p>
            </title>
            <aug>
               <au>
                  <snm>Newton</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Noueiry</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sarkar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ahlquist</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Biostat</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>2</issue>
            <fpage>155</fpage>
            <lpage>176</lpage>
            <url>http://biostatistics.oxfordjournals.Org/cgi/content/abstract/5/2/155</url>
            <xrefbib>
               <pubid idtype="doi">10.1093/biostatistics/5.2.155</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Replicated microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Lonnstedt</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Statist Sinica</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>31</fpage>
            <lpage>46</lpage>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Bayesian Modelling of Differential Gene Expression</p>
            </title>
            <aug>
               <au>
                  <snm>Lewin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>A</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Aitman</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Biometrics</source>
            <pubdate>2005</pubdate>
            <inpress/>
            <note>
               <url>http://www.bgx.org.uk/papers.html</url>
            </note>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Screening for Differentially Expressed Genes: Are Multilevel Models Helpful?</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Parmigiani</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Caffo</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Johns Hopkins University, Dept. of Biostatistics Working Papers</source>
            <pubdate>2004</pubdate>
            <url>http://www.bepress.com/jhubiostat/paper34</url>
         </bibl>
         <bibl id="B11">
            <aug>
               <au>
                  <snm>Miller</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Reznikoff</snm>
                  <fnm>WS</fnm>
               </au>
            </aug>
            <source>The operon</source>
            <publisher>Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press</publisher>
            <pubdate>1978</pubdate>
         </bibl>
         <bibl id="B12">
            <title>
               <p>DNA microarray analysis of gene expression in response to physiological and genetic changes that affect tryptophan metabolism in Escherichia coli</p>
            </title>
            <aug>
               <au>
                  <snm>Khodursky</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Peter</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Cozzarelli</snm>
                  <fnm>NR</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Yanofsky</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <issue>22</issue>
            <fpage>12170</fpage>
            <lpage>12175</lpage>
            <url>http://www.pnas.org/cgi/content/abstract/97/22/12170</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">17313</pubid>
                  <pubid idtype="pmpid" link="fulltext">11027315</pubid>
                  <pubid idtype="doi">10.1073/pnas.220414297</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Comparative Gene Expression Profiles Following UV Exposure in Wild-Type and SOS-Deficient Escherichia coli</p>
            </title>
            <aug>
               <au>
                  <snm>Courcelle</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Khodursky</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Peter</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Hanawalt</snm>
                  <fnm>PC</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2001</pubdate>
            <volume>158</volume>
            <fpage>41</fpage>
            <lpage>64</lpage>
            <url>http://www.genetics.Org/cgi/content/full/158/l/41</url>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11333217</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Transcription unit conservation in the three domains of life: a perspective from Escherichia coli</p>
            </title>
            <aug>
               <au>
                  <snm>Moreno-Hagelsieb</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Trevino</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Perez-Rueda</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>TF</fnm>
               </au>
               <au>
                  <snm>Collado-Vides</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <issue>4</issue>
            <fpage>175</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(01)02241-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">11275307</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Operons in Escherichia coli: Genomic analyses and predictions</p>
            </title>
            <aug>
               <au>
                  <snm>Salgado</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Moreno-Hagelsieb</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>TF</fnm>
               </au>
               <au>
                  <snm>Collado-Vides</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <issue>12</issue>
            <fpage>6652</fpage>
            <lpage>6657</lpage>
            <url>http://www.pnas.org/cgi/content/abstract/97/12/6652</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">18690</pubid>
                  <pubid idtype="pmpid" link="fulltext">10823905</pubid>
                  <pubid idtype="doi">10.1073/pnas.110147297</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>A powerful non-homology method for the prediction of operons in prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Moreno-Hagelsieb</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Collado-Vides</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <issue>18 Suppl 1(NIL)</issue>
            <fpage>S329</fpage>
            <lpage>36</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12169563</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Prediction of operons in microbial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Ermolaeva</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <issue>5</issue>
            <fpage>1216</fpage>
            <lpage>21</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29727</pubid>
                  <pubid idtype="pmpid" link="fulltext">11222772</pubid>
                  <pubid idtype="doi">10.1093/nar/29.5.1216</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>A fuzzy guided genetic algorithm for operon prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Jacob</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sasikumar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nair</snm>
                  <fnm>KNR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>8</issue>
            <fpage>1403</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti156</pubid>
                  <pubid idtype="pmpid" link="fulltext">15564303</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Operon prediction without a training set</p>
            </title>
            <aug>
               <au>
                  <snm>Westover</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Buhler</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Sonnenburg</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>JI</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>7</issue>
            <fpage>880</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti123</pubid>
                  <pubid idtype="pmpid" link="fulltext">15539453</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Spatial patterns of transcriptional activity in the chromosome of Escherichia coli</p>
            </title>
            <aug>
               <au>
                  <snm>Jeong</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Ahn</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Khodursky</snm>
                  <fnm>AB</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R86</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">545777</pubid>
                  <pubid idtype="pmpid" link="fulltext">15535862</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-11-r86</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Co-expression pattern from DNA microarray experiments as a tool for operon prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Sabatti</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rohlin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Oh</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Liao</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <issue>13</issue>
            <fpage>2886</fpage>
            <lpage>93</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">117043</pubid>
                  <pubid idtype="pmpid" link="fulltext">12087173</pubid>
                  <pubid idtype="doi">10.1093/nar/gkf388</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>A Bayesian network approach to operon prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Bockhorst</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Craven</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Page</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Shavlik</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Glasner</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>10</issue>
            <fpage>1227</fpage>
            <lpage>35</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg147</pubid>
                  <pubid idtype="pmpid" link="fulltext">12835266</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Simulated annealing of microarray data reduces noise and enables cross-experimental comparisons</p>
            </title>
            <aug>
               <au>
                  <snm>Wren</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Langer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Conway</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>DNA Cell Biol</source>
            <pubdate>2004</pubdate>
            <volume>23</volume>
            <issue>10</issue>
            <fpage>695</fpage>
            <lpage>700</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/dna.2004.23.695</pubid>
                  <pubid idtype="pmpid" link="fulltext">15585127</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>A classification based framework for quantitative description of large-scale microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Sangurdekar</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Srienc</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Khodursky</snm>
                  <fnm>AB</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>4</issue>
            <fpage>R32</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2006-7-4-r32</pubid>
                  <pubid idtype="pmpid" link="fulltext">16626502</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Genetics and biogenesis of bacterial flagella</p>
            </title>
            <aug>
               <au>
                  <snm>Macnab</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>1992</pubdate>
            <issue>26(NIL)</issue>
            <fpage>131</fpage>
            <lpage>58</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.ge.26.120192.001023</pubid>
                  <pubid idtype="pmpid" link="fulltext">1482109</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Controlling the false discovery rate: A practical and powerful approach to multiple testing</p>
            </title>
            <aug>
               <au>
                  <snm>Benjamini</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hochberg</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>J R Stat Soc B</source>
            <pubdate>1995</pubdate>
            <volume>57</volume>
            <fpage>289</fpage>
            <lpage>300</lpage>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Statistical significance for genomewide studies</p>
            </title>
            <aug>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <issue>16</issue>
            <fpage>9440</fpage>
            <lpage>9445</lpage>
            <url>http://www.pnas.org/cgi/content/abstract/100/16/9440</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">170937</pubid>
                  <pubid idtype="pmpid" link="fulltext">12883005</pubid>
                  <pubid idtype="doi">10.1073/pnas.1530509100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Coexpression of neighboring genes in Caenorhabditis elegans is mostly due to operons and duplicate genes</p>
            </title>
            <aug>
               <au>
                  <snm>Lercher</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Blumenthal</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>2</issue>
            <fpage>238</fpage>
            <lpage>43</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">420373</pubid>
                  <pubid idtype="pmpid" link="fulltext">12566401</pubid>
                  <pubid idtype="doi">10.1101/gr.553803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Caenorhabditis elegans operons: form and function</p>
            </title>
            <aug>
               <au>
                  <snm>Blumenthal</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Gleason</snm>
                  <fnm>KS</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>2</issue>
            <fpage>112</fpage>
            <lpage>20</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg995</pubid>
                  <pubid idtype="pmpid" link="fulltext">12560808</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Operons in eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Blumenthal</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Brief Funct Genomic Proteomic</source>
            <pubdate>2004</pubdate>
            <volume>3</volume>
            <issue>3</issue>
            <fpage>199</fpage>
            <lpage>211</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bfgp/3.3.199</pubid>
                  <pubid idtype="pmpid" link="fulltext">15642184</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>A computational analysis of whole-genome expression data reveals chromosomal domains of gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Cohen</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Mitra</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Hughes</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>26</volume>
            <issue>2</issue>
            <fpage>183</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/79896</pubid>
                  <pubid idtype="pmpid" link="fulltext">11017073</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Regulation of adjacent yeast genes</p>
            </title>
            <aug>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tang</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>3</issue>
            <fpage>109</fpage>
            <lpage>11</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(99)01941-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">10689350</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Chromosomal clustering of muscle-expressed genes in Caenorhabditis elegans</p>
            </title>
            <aug>
               <au>
                  <snm>Roy</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Lund</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>418</volume>
            <issue>6901</issue>
            <fpage>975</fpage>
            <lpage>9</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12214599</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Large clusters of co-expressed genes in the Drosophila genome</p>
            </title>
            <aug>
               <au>
                  <snm>Boutanaev</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Kalmykova</snm>
                  <fnm>AI</fnm>
               </au>
               <au>
                  <snm>Shevelyov</snm>
                  <fnm>YY</fnm>
               </au>
               <au>
                  <snm>Nurminsky</snm>
                  <fnm>DI</fnm>
               </au>
            </aug>
            <source>Nature </source>
            <pubdate>2002</pubdate>
            <volume>420</volume>
            <issue>6916</issue>
            <fpage>666</fpage>
            <lpage>9</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01216</pubid>
                  <pubid idtype="pmpid" link="fulltext">12478293</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Evidence for large domains of similarly expressed genes in the Drosophila genome</p>
            </title>
            <aug>
               <au>
                  <snm>Spellman</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>J Biol</source>
            <pubdate>2002</pubdate>
            <volume>1</volume>
            <fpage>5</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">117248</pubid>
                  <pubid idtype="pmpid" link="fulltext">12144710</pubid>
                  <pubid idtype="doi">10.1186/1475-4924-1-5</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Genome-scale analysis of positional clustering of mouse testis-specific genes</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>BTK</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">548148</pubid>
                  <pubid idtype="pmpid" link="fulltext">15656914</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-6-7</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The human transcriptome map: clustering of highly expressed genes in chromosomal domains</p>
            </title>
            <aug>
               <au>
                  <snm>Caron</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>van Schaik</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>van der Mee</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Baas</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Riggins</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>van Sluis</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hermus</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>van Asperen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Boon</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Voute</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Heisterkamp</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>van Kampen</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Versteeg</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>291</volume>
            <issue>5507</issue>
            <fpage>1289</fpage>
            <lpage>92</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1056794</pubid>
                  <pubid idtype="pmpid" link="fulltext">11181992</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>The human transcriptome map reveals extremes in gene density, intron length, GC content, and repeat pattern for domains of highly and weakly expressed genes</p>
            </title>
            <aug>
               <au>
                  <snm>Versteeg</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>van Schaik</snm>
                  <fnm>BDC</fnm>
               </au>
               <au>
                  <snm>van Batenburg</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Roos</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Monajemi</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Caron</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bussemaker</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>van Kampen</snm>
                  <fnm>AHC</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>9</issue>
            <fpage>1998</fpage>
            <lpage>2004</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403669</pubid>
                  <pubid idtype="pmpid" link="fulltext">12915492</pubid>
                  <pubid idtype="doi">10.1101/gr.1649303</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>First comprehensive mapping of cartilage transcripts to the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Yager</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Dempsey</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Tang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Stamatiou</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chao</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Liew</snm>
                  <fnm>CC</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>2004</pubdate>
            <volume>84</volume>
            <issue>3</issue>
            <fpage>524</fpage>
            <lpage>35</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ygeno.2004.05.006</pubid>
                  <pubid idtype="pmpid" link="fulltext">15498459</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>RegulonDB (version 4.0): transcriptional regulation, operon organization and growth conditions in Escherichia coli K-12</p>
            </title>
            <aug>
               <au>
                  <snm>Salgado</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gama-Castro</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Martinez-Antonio</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Diaz-Peredo</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sanchez-Solano</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Peralta-Gil</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Garcia-Alonso</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jimenez-Jacinto</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Santos-Zavaleta</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bonavides-Martinez</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Collado-Vides</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>90001</issue>
            <fpage>D303</fpage>
            <lpage>306</lpage>
            <url>http://nar.oxfordjournals.org/cgi/content/full/32/suppLl/D303</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308874</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681419</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh140</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>NCBI Gene Expression Omnibus</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/geo/</url>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Dudoit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Luu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ngai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <issue>4</issue>
            <fpage>e15</fpage>
            <url>http://nar.oxfordjournals.Org/cgi/content/full/30/4/el5</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">100354</pubid>
                  <pubid idtype="pmpid" link="fulltext">11842121</pubid>
                  <pubid idtype="doi">10.1093/nar/30.4.e15</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <aug>
               <au>
                  <snm>Carlin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Louis</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Bayes and Empirical Bayes Methods for Data Analysis</source>
            <publisher>Boca Raton, FL: Chapman and Hall/CRC Press 2000</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Sampling Based Approaches to Calculating Marginal Densities</p>
            </title>
            <aug>
               <au>
                  <snm>Gelfand</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Journal Amer Stat Assoc</source>
            <pubdate>1990</pubdate>
            <volume>85</volume>
            <fpage>398</fpage>
            <lpage>409</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2289776</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>A case study on choosing normalization methods and test statistics for two-channel microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Xie</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Jeong</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Khodursky</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Carlin</snm>
                  <fnm>BP</fnm>
               </au>
            </aug>
            <source>Comp Fund Genom</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>432</fpage>
            <lpage>444</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/cfg.416</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
