<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1752-0509-1-S1-P10</ui>
   <ji>1752-0509</ji>
   <fm>
      <dochead>Poster presentation</dochead>
      <bibl>
         <title>
            <p>A global statistical test for improved detection of gene activity</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Bakewell</snm>
               <mi>JG</mi>
               <fnm>David</fnm>
               <insr iid="I1"/>
               <email>d.bakewell@liv.ac.uk</email>
            </au>
            <au id="A2">
               <snm>Wit</snm>
               <fnm>Ernst</fnm>
               <insr iid="I2"/>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, L69 3GJ, UK</p>
            </ins>
            <ins id="I2">
               <p>Department of Mathematics and Statistics, Lancaster University, Lancaster LA1 4YF, UK</p>
            </ins>
         </insg>
         <source>BMC Systems Biology</source>
         <supplement>
            <title>
               <p>BioSysBio 2007: Systems Biology, Bioinformatics, Synthetic Biology</p>
            </title>
            <editor>John Cumbers, Xu Gu, Jong Sze Wong</editor>
            <note>Meeting abstracts &#8211; A single PDF containing all abstracts in this Supplement is available <a href="www.biomedcentral.com/content/files/pdf/1752-0509-1-S1-full.pdf">here</a>.</note>
            <url>http://www.biomedcentral.com/content/pdf/1752-0509-1-S1-info.pdf</url>
         </supplement>
         <conference>
            <title>
               <p>BioSysBio 2007: Systems Biology, Bioinformatics and Synthetic Biology</p>
            </title>
            <location>Manchester, UK</location>
            <date-range>11&#8211;13 January 2007</date-range>
            <url>http://www.biosysbio.com</url>
         </conference>
         <issn>1752-0509</issn>
         <pubdate>2007</pubdate>
         <volume>1</volume>
         <issue>Suppl 1</issue>
         <fpage>P10</fpage>
         <url>http://www.biomedcentral.com/1752-0509/1?issue=S1</url>
         <xrefbib>
            <pubid idtype="doi">10.1186/1752-0509-1-S1-P10</pubid>
         </xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>8</day>
               <month>5</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Bakewell and Wit; licensee BioMed Central Ltd.</collab>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p/>
            </st>
            <p/>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Introduction</p>
         </st>
         <p>Interpreting gene transcription for understanding cellular processes using DNA microarray measurements presents a number of challenges to the investigator. One of these is the relatively large number of genes (mRNA transcript identities) that are measured with relatively few <it>independent </it>biological samples. The small number of independent samples limits the statistical inference that can be made about the behaviour of individual genes. Consequently, the microarray end-user is faced with bewildering lists of differentially expressed genes that typically contain false positives &#8211; a direct result of the well-known 'few samples &#8211; many genes' dimensionality problem. At the same time investigators are using microarrays to understand processes involving the collective action of a number of genes, often organized as a complex or pathway <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. To address these needs and remedy the problem of dimensionality, we consider a global, likelihood ratio (LR) test for examining the behaviour of a pre-assigned <it>group </it>of genes &#8211; rather than individual genes.</p>
         <p>This poster describes the development of the LR test for detecting low levels of activity in a group of related genes or 'pathway'. The underlying principle of the global LR test is to share variance information across related genes so that small concordant changes in expression in a group of genes can be detected more sensitively than current multiple univariate tests. The LR test takes into account only the magnitude and uncertainty of the changes, that is, it is not affected by whether the genes within the group are up- or down-regulated. Compared with other exploratory methods <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp> that find patterns of differentially expressed genes, our method is more a confirmatory method. It is particularly well suited to investigating the involvement of a particular functional group of genes in explaining the difference between two or more phenotypes.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <p>The LR method uses a two-layer hierarchy that models variation between genes within a pathway and experimental variation due to effects of sample preparation, microarray measurements and other sources. Hierarchical maximum likelihood estimation is used to determine the likelihood ratios, which are in-turn used to determine P-values (i.e. how likely the observed data would be if in fact none of the pathway genes are differentially expressed). The global maximum of the likelihood is evaluated using standard differential calculus techniques and numerical methods along similar lines described in previous work <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Further details are given in the poster. Comparisons are made with P-values evaluated using standard procedures, such as, the univariate regularised <it>t</it>-test <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> with decision rules that take into account the effect of multiple or 'sequential' testing. In the examples described in the poster, the Family Wise Error Rate is controlled by adjusted P-values calculated using the Bonferroni correction or Hochberg multiple step-down procedure <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>The P-values determined by LR and sequential <it>t</it>-test methods are compared using data from Monte Carlo (MC) simulations and a published two-channel spotted microarray experiment <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. The MC simulations generated LR and T-statistics that enabled pathways to be identified as True Positive (Tp) and False Positive (Fp) for a particular cut-off value. Counts of Tp and Fp were combined into Receiver Operating Characteristics (ROCs) similar to earlier work <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. The MC simulations show the hierarchical LR test yields lower P-values compared with the sequential <it>t</it>-test for low cutoff range, 0&#8211;5%, of interest. That is, the LR test leads to improved detection of differential gene-group expression compared to a sequential application of the <it>t</it>-test.</p>
         <p>The LR and t-test methods were also applied to data about three transforming growth factor genes implicated in the regulation of breast cancer tumorigenesis <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. The effect of sample size on the P-values for the two methods was investigated by randomly selecting samples (maximum 10) and taking combinatorial averages. Computed relationships of the P-values versus number of samples shows the LR test requires almost three times fewer independent biological samples to give the same gene-group P-value compared with a multiple comparison <it>t</it>-test.</p>
      </sec>
      <sec>
         <st>
            <p>Concluding remark</p>
         </st>
         <p>A global, likelihood ratio test is developed that shares variance information across a pre-assigned group of genes. The test is shown to be a more sensitive detector of differential gene expression than conventional techniques, such as, the multiple comparison <it>t</it>-test. This improvement is particularly advantageous for typical of microarray measurements where only a few independent biological samples are available.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>D. B. would like to thank Cancer Research U.K. and The Wellcome Trust (Project 062511) for financial support and facilities.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <aug>
               <au>
                  <snm>Lewin</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genes VII</source>
            <publisher>Oxford University Press</publisher>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Discovering molecular functions significantly related to phenotypes by combining gene expression data and biological information</p>
            </title>
            <aug>
               <au>
                  <snm>Al-Shahrour</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>D&#237;az-Uriarte</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Dopazo</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>13</issue>
            <fpage>2988</fpage>
            <lpage>2993</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti457</pubid>
                  <pubid idtype="pmpid" link="fulltext">15840702</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Iterative group analysis (iGA): a simple tool to enhance sensivity and faciltate interpretation of microarray experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Breitling</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Amtmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Herzyk</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue/>
            <fpage>34</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403636</pubid>
                  <pubid idtype="pmpid" link="fulltext">15050037</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-34</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Gotoolbox: functional analysis of gene datasets based on gene ontology</p>
            </title>
            <aug>
               <au>
                  <snm>Martin</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brun</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Remy</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Mouren</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Thieffry</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jacq</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>12</issue>
            <fpage>R101</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">545796</pubid>
                  <pubid idtype="pmpid" link="fulltext">15575967</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-12-r101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Weighted analysis of microarray gene expression using maximum-likelihood</p>
            </title>
            <aug>
               <au>
                  <snm>Bakewell</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Wit</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>6</issue>
            <fpage>723</fpage>
            <lpage>729</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti051</pubid>
                  <pubid idtype="pmpid" link="fulltext">15454412</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Statistical tests for differential expression in cDNA microarray experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Cui</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Churchill</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>4</issue>
            <fpage>210</fpage>
            <lpage/>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2003-4-4-210</pubid>
                  <pubid idtype="pmpid" link="fulltext">12702200</pubid>
                  <pubid idtype="pmcid">154570</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Multiple hypothesis testing in microarray experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Dudoit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shaffer</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Boldrick</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Working Paper Series 110</source>
            <publisher>U. C. Berkeley Division of Biostatistics</publisher>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Dudoit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Callow</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Statistica Sinica</source>
            <pubdate>2002</pubdate>
            <volume>7</volume>
            <fpage>111</fpage>
            <lpage>139</lpage>
         </bibl>
         <bibl id="B9">
            <title>
               <p>A sharper Bonferroni procedure for multiple tests of significance</p>
            </title>
            <aug>
               <au>
                  <snm>Hochberg</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Biometrika</source>
            <pubdate>1988</pubdate>
            <volume>75</volume>
            <issue>4</issue>
            <fpage>800</fpage>
            <lpage>802</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/biomet/75.4.800</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Determination of stromal signatures in breast carcinoma</p>
            </title>
            <aug>
               <au>
                  <snm>West</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Nuyten</snm>
                  <fnm>DSA</fnm>
               </au>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>TO</fnm>
               </au>
               <au>
                  <snm>Corless</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Montgomery</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hernandez-Boussard</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Goldblum</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>van de Vijver</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>van de Rijn</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>PLoS Biology</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <issue>6</issue>
            <fpage>e187</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1088973</pubid>
                  <pubid idtype="pmpid" link="fulltext">15869330</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030187</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Tgf-beta signalling in cancer &#8211; a double-edged sword</p>
            </title>
            <aug>
               <au>
                  <snm>Akhurst</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Derynck</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Trends in Cell Biology</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <issue>11</issue>
            <fpage>S44</fpage>
            <lpage>S51</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0962-8924(01)02130-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">11684442</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
