<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-7-99</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>Determination of strongly overlapping signaling activity from microarray data</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Bidaut</snm>
               <fnm>Ghislain</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>ghbidaut@pcbi.upenn.edu</email>
            </au>
            <au id="A2">
               <snm>Suhre</snm>
               <fnm>Karsten</fnm>
               <insr iid="I2"/>
               <email>Karsten.Suhre@igs.cnrs-mrs.fr</email>
            </au>
            <au id="A3">
               <snm>Claverie</snm>
               <fnm>Jean-Michel</fnm>
               <insr iid="I2"/>
               <email>Jean-Michel.Claverie@igs.cnrs-mrs.fr</email>
            </au>
            <au id="A4" ca="yes">
               <snm>Ochs</snm>
               <mi>F</mi>
               <fnm>Michael</fnm>
               <insr iid="I1"/>
               <email>m_ochs@fccc.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Fox Chase Cancer Center, 333 Cottman Avenue, Philadelphia, PA, 19111, USA</p>
            </ins>
            <ins id="I2">
               <p>Structural and Genomic Information Laboratory, UPR2589-CNRS, 13288 Marseille, France</p>
            </ins>
            <ins id="I3">
               <p>Center for Bioinformatics, Department of Genetics, University of Pennsylvania School of Medicine, 1423 Blockley Hall, 423 Guardian Drive, Philadelphia, PA 19104-6021, USA</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>99</fpage>
         <url>http://www.biomedcentral.com/1471-2105/7/99</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16507110</pubid>
               <pubid idtype="doi">10.1186/1471-2105-7-99</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>22</day>
               <month>9</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>28</day>
               <month>2</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>28</day>
               <month>2</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Bidaut et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>As numerous diseases involve errors in signal transduction, modern therapeutics often target proteins involved in cellular signaling. Interpretation of the activity of signaling pathways during disease development or therapeutic intervention would assist in drug development, design of therapy, and target identification. Microarrays provide a global measure of cellular response, however linking these responses to signaling pathways requires an analytic approach tuned to the underlying biology. An ongoing issue in pattern recognition in microarrays has been how to determine the number of patterns (or clusters) to use for data interpretation, and this is a critical issue as measures of statistical significance in gene ontology or pathways rely on proper separation of genes into groups.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Here we introduce a method relying on gene annotation coupled to decompositional analysis of global gene expression data that allows us to estimate specific activity on strongly coupled signaling pathways and, in some cases, activity of specific signaling proteins. We demonstrate the technique using the Rosetta yeast deletion mutant data set, decompositional analysis by Bayesian Decomposition, and annotation analysis using ClutrFree. We determined from measurements of gene persistence in patterns across multiple potential dimensionalities that 15 basis vectors provides the correct dimensionality for interpreting the data. Using gene ontology and data on gene regulation in the Saccharomyces Genome Database, we identified the transcriptional signatures of several cellular processes in yeast, including cell wall creation, ribosomal disruption, chemical blocking of protein synthesis, and, criticially, individual signatures of the strongly coupled mating and filamentation pathways.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>This works demonstrates that microarray data can provide downstream indicators of pathway activity either through use of gene ontology or transcription factor databases. This can be used to investigate the specificity and success of targeted therapeutics as well as to elucidate signaling activity in normal and disease processes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Many diseases develop because of errors in signaling, and newer therapeutics specifically target proteins involved in cellular signaling <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. However, these therapies are not always effective <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, and the reason for failure, whether inherent poor interaction or complex cellular response, is unknown. In order to understand the development of disease and drug resistance in these cases, the recovery of the process that led to the specific cellular malfunction must be identified. Such errors generally involve the cellular signaling networks that control cell growth, differentiation, apoptosis, and motility <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Because of the extreme underlying biological complexity of these pathways, diseases that involve errors in signaling processes arise from a myriad of different cellular malfunctions, for example in cancers <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp> and diabetes <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. It is from this complex background that functional genomics attempts to glean insight to improve our understanding of diseases.</p>
         <p>One of the major uses of microarrays has been elucidation of gene expression in cancer, often focused on refining cancer identification using computational and statistical approaches <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. In addition, the discovery of biomarkers in the form of differential levels of production of mRNA has been a focus in a number of studies <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>. The fact that determination of the mRNA levels of a single gene is easier than using an entire array has driven the shift to the use of arrays to generate potential biomarkers, so that the expression levels of these individual genes can be screened for in a more economical way (see, for example, <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>). For diabetes, microarrays have been used to elucidate gene expression in both type I and type II diseases, and customized chips targeting genes of interest have been developed <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
         <p>Many tools for statistical inference, pattern recognition, and data mining have been developed for microarray data analysis. Statistical tests include SAM <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, VERAandSAM <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, ANOVA techniques <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>, Bayesian approaches <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, and rank tests <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Pattern recognition and data mining techniques comprise both unsupervised techniques, such as hierarchical clustering <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, singular value decomposition <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, multidimensional scaling <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, Bayesian mixture models <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>, and other clustering methods <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>, and supervised techniques, such as support vector machines <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> and artificial neural networks <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, (for a review see <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>).</p>
         <p>While these techniques are useful, they have certain limitations as regards more advanced uses in the elucidation of mechanisms operating in diseased tissues. New therapeutics specifically target proteins involved in cellular signaling <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>. As noted above, these therapies are not always effective, and a method to understand the reason for their ineffectiveness is highly desirable. If the failure modes for the targeted therapeutics are understood, new therapeutics can be designed or combination therapies undertaken. In addition, to design new therapies that work alone or in combination with other therapies, an understanding of signaling networks is required. Microarray measurements can provide insight into these issues.</p>
         <p>Unfortunately, the recovery of pathway information from transcriptional data requires complex analysis, since signaling protein activity is not generally linked to the mRNA expression levels of genes encoding the signaling proteins themselves <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, nor are protein levels tightly coupled to transcript levels even in yeast <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. This makes it impossible to directly link an increase in mRNA expression of the gene encoding a signaling protein, such as the therapeutic target, with activity of the protein and therefore of the signaling pathway. Instead, an analysis must treat changes in mRNA levels as downstream indicators of activity.</p>
         <p>An important issue to resolve in order to correctly interpret patterns in microarray data is the underlying dimensionality of the data, since statistical analysis of genes in groups relies on correct separation. The dimensionality provides an estimate of the number of patterns required to explain the variation in the data not related to noise, which is equivalent to the number of basis vectors required mathematically to describe the data or the number of principal components required to span the data.</p>
         <p>We present here a new application of Bayesian Decomposition <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp> and ClutrFree <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> that estimates dimensionality by measuring the consistency of assignment of genes to patterns. With this approach, transcriptional signatures are linked to signaling activities through gene ontology <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> using the MIPS database <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> and through analysis of transcription factor activity <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. We demonstrate this technique on the Rosetta deletion mutant dataset <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>, which is a compendium of genome-wide transcription measured for 6300 genes across 300 conditions (mostly deletion mutants, but some chemical treatments). Figure <figr fid="F1">1</figr> details the workflow of our analysis. Previous studies of the compendium were performed using hierarchical clustering <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>, non-negative matrix factorization <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, and Bayesian Decomposition <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. The dimensionality of the data was estimated in various ways in these studies leading to estimates from 7 to 50 dimensions.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Data analysis flowchart</p>
            </caption>
            <text>
               <p><b>Data analysis flowchart</b>. The data was downloaded from Rosetta Inpharmatics and filtered to include only genes and experiments that showed significant variation. Bayesian Decomposition analysis generated patterns and associated gene lists for all dimensionalities between 3 and 25. ClutrFree was used to interpret these results, including use of the MIPS database of ontologies.</p>
            </text>
            <graphic file="1471-2105-7-99-1"/>
         </fig>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Dimensionality of the data</p>
            </st>
            <p>We propose a value for the Rosetta dataset dimensionality based on the average persistence calculations at each tree level made with ClutrFree using multiple Bayesian Decomposition simulations. The dimensionality has been inferred from the average persistence defined in the Methods section. As the number of basis vectors (i.e., patterns) <it>k </it>is increased, the curve shows a dramatic drop for <it>k </it>> <it>15 </it>(see Figure <figr fid="F2">2</figr>). This drop is due to the reorganization of the groups of mutants constituting the basis vectors for <it>k </it>> <it>15 </it>patterns, leading to an overly low average persistence.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>The average persistence across all dimensions</p>
               </caption>
               <text>
                  <p><b>The average persistence across all dimensions</b>. The average persistence across the dimensions is plotted for 3 to 25 dimensions. The significant drop between 15 and 16 dimensions suggests that 15 patterns provides the correct dimensionality for analysis.</p>
               </text>
               <graphic file="1471-2105-7-99-2"/>
            </fig>
            <p>The freedom to move between branches leads also to some loss of consistency in the annotations as one moves down a branch. This contrasts with the behavior of basis vectors obtained for less than <it>15 </it>patterns where biological functions split logically as the number of patterns increases. We also observed a reduction in the number of genes related to each basis vector for <it>16 </it>patterns in comparison to <it>15</it>. Also, the standard deviation across samples of the obtained vectors is significantly higher for <it>16 </it>patterns (1.7 &#215; 10<sup>-3</sup>) than for 15 patterns (7.9 &#215; l0<sup>-4</sup>) indicating that the Markov chain sampling is not as tightly constrained by the probability distribution. The behavior observed occurs because of the potential to overfit the data with <it>16 </it>basis vectors allowing the algorithm to find multiple configurations to explain the variation in the data.</p>
         </sec>
         <sec>
            <st>
               <p>Identifying patterns and functions</p>
            </st>
            <p>Bayesian Decomposition retrieves the two linked matrices: the <b>P </b>matrix (pattern matrix) groups mutants that share cellular functions, which can be deduced from the genes linked to each pattern contained in the <b>A </b>matrix. Each mutant (a column of the <b>P </b>matrix) can belong to multiple patterns, which models the fact that each mutant will have many cellular functions active. Each gene (a row of the <b>A </b>matrix) can be assigned to multiple patterns, reflecting the fact that evolution has led to genes being involved in multiple cellular processes. Interpretation of the results involves identifying cellular processes from the genes that are significantly expressed in a pattern (i.e., within a column of <b>A</b>).</p>
            <p>We proceed by using the dimensionality estimate of <it>15 </it>patterns and exploring for each pattern the genes associated with that pattern. These genes are interpreted using the MIPS ontology for yeast <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> in order to predict the cellular processes associated with a pattern. In addition, for patterns that can be linked to signaling pathways, we discuss the use of data on genes regulated by specific transcription factors and validate the results by analysis of specific key deletion mutants. For each pattern that shows enhancement of ontological terms we provide the terms, the enhancement (as defined in the methods section), and the <it>p </it>value for a hypergeometric test on the term.</p>
            <p>We summarize the results in terms of patterns previously identified in other studies using this data set, then we present the new results isolating signatures for activity of the mating and filametation pathways.</p>
         </sec>
         <sec>
            <st>
               <p>Patterns identified in previous studies</p>
            </st>
            <p>Examination of pattern 1 shows expression of the overall common minimal processes necessary for survival, with 386 annotated genes associated with this pattern at a 3<it>&#963; </it>level. Measure of enhancement, <it>e</it>, of cellular functions, reveals two highly represented functional groups: 1) groups related to protein synthesis and 2) groups related to DNA synthesis. Group 1 includes genes enhanced in Protein Targeting, Sorting and Translocation (Term 14.04, <it>e </it>= <it>1.84</it>, <it>p </it>= 0.0022), Protein Synthesis (Term 12, <it>e </it>= 1.55, <it>p </it>= 0.024), and Ribosome Biogenesis (Term 12.01, <it>e </it>= 1.60, <it>p </it>= 0.059). Group 2 includes DNA Processing (Term 10.01, <it>e </it>= 1.32, p = 0.097), DNA Recombination and DNA Repair (Term 10.01.05, <it>e </it>= 1.31, p = 0.16), and DNA Synthesis (Term 10.01.03, <it>e </it>= 1.58, p = 0.16). The <it>p</it>-values for the ontology terms remain high, due to the large number of genes associated with this pattern.</p>
            <p>This pattern, which essentially includes genes necessary for viability, contains all the mutants of the dataset, although the Ssn6&#916; mutant shows a lower level for this pattern than other mutants. As the Ssn6&#916; mutant exhibits substantially greater overall expression than any other mutant (including the Tup1&#916; mutant with the second highest level), this may reflect the high association of the Ssn6&#916; mutant seen in almost all patterns, which will have some gene overlap with this pattern.</p>
            <p>Pattern 5 contains 172 annotated genes. Highly enhanced ontologies include transport-related functions: Transported Compounds (Term 20.01, <it>e </it>= 2.02, <it>p </it>&lt; 10<sup>-4</sup>), C-compound and Carbohydrate Transport (Term 20.01.03, <it>e </it>= 2.60, <it>p </it>&lt; 3 &#215; 10<sup>-4</sup>), and Cellular Transport, Transport Facilitation and Cellular Routes (Term 20, <it>e </it>= 1.62, <it>p </it>&lt; 6 &#215; 10<sup>-4</sup>), in addition to other transport terms at <it>p </it>&lt; 10<sup>-3</sup>. The pattern contains the two deletion mutants, Ssn6&#916; and Tup1&#916;, and represents the strong response seen in the original study <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. Ssn6p and Tup1p form a system of transcriptional repression that appears to be highly conserved in eukaryotes <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. In yeast, the complex acts as a global transcriptional repressor over a large number of genes (more that 150), coordinating several cellular systems, including haploid specific genes, glucose repressible genes, and oxygen utilization genes <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. Turning off this repression leads to a large overall increase in gene expression (the overall expression in these two mutants is many fold higher than in other mutants).</p>
            <p>Pattern 7 is related to the lack of cell wall functions (Cell Wall, Term 42.01, <it>e </it>= 0.0), as 28 cell wall genes (of 32 total) are absent from this pattern, while the other four genes have multiple annotations suggesting they have roles unrelated to Cell Wall function. Enhancement is present for Protein Modification (Term 14.07, <it>e </it>= 4.7, <it>p </it>&lt; 10<sup>-4</sup>) and Fermentation (Term 02.16, <it>e </it>= 4.5, <it>p </it>&lt; 0.01). This pattern contains the mutants Gas1&#916; and Fks1&#916;, which impair cell-wall synthesis, as well as the mutant YER083c&#916;, annotated as disrupting the cell wall in the original study <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. The pattern contains other mutants disrupting ergosterol biosynthesis as well, including Erg2&#916;, She4&#916;, as well as YER044c&#916;. In addition, the pattern includes yeast treated with the drugs that are known to disrupt the cell wall, such as Tunicamycin and Glucosamine.</p>
            <p>Pattern 11 is related to ribosomal function, with enhancements in terms for Ribosome Biogenesis (Term 12.01, <it>e </it>= 6.32, <it>p </it>&lt; 4 &#215; 10<sup>-4</sup>) and Protein Synthesis (Term 12, <it>e </it>= 3,89, <it>p </it>&lt; 6 &#215; 10<sup>-3</sup>). The pattern contains 8 mutants related to ribosomal proteins, Rpll2a&#916;, Rpl27a&#916;, Rpl34a&#916;, Rpl6b&#916;, Rp18a&#916;, Rps24a&#916;, Rps24a&#916; (haploid), and Rps27b&#916;, as well as some mutants with deleted ORFs of unknown function, YOR078w&#916;, YMR269w&#916;, and YHR034c&#916;, proposed to be involved in ribosomal functions <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Patterns related to cellular signalling pathways</p>
            </st>
            <p>The two patterns that represent new insights into this data are 13 and 15, which appear related to two strongly coupled developmental pathways in yeast. Previous studies <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B53">53</abbr></abbrgrp> have identified the mating pathway transcriptional response, however this has included both the filamentation response and the mating response. It is difficult to separate these signatures, as the mating and filamentation pathways share many common elements in a MAPK cascade as shown in Figure <figr fid="F3">3</figr><abbrgrp><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr></abbrgrp>.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Yeast MAPK signaling for mating and filamentation</p>
               </caption>
               <text>
                  <p><b>Yeast MAPK signaling for mating and filamentation</b>. The strongly linked MAPK signaling pathways for mating and filamentation are shown schematically with black arrows indicating mating pathway signaling and gray arrows showing filamentation pathway signaling. The mating pathway is initiated by binding to Ste2p or Ste3p receptors, while the causative molecular trigger for filamentation is unclear. The pathways share many components.</p>
               </text>
               <graphic file="1471-2105-7-99-3"/>
            </fig>
            <p>Gene ontology (GO) was used to determine the biological function described by each pattern, with a term added specifically for transposable elements, as these are known to play a role during filamentation <abbrgrp><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>. The terms that showed enhancement are summarized in Table <tblr tid="T1">1</tblr>. The patterns show strong overlap, since many genes are shared between the mating and filamentation responses. However, the filamentation ontology term is significantly higher only in pattern 15, which also shows a strong signature of transposable element genes. Meanwhile, the GO terms for meiosis and morphogenesis (such as for budding in <it>S. cerevisiae</it>) are significantly enhanced only in pattern 13. This allows association of pattern 13 with activation of the mating pathway, and pattern 15 with activation of the filamentation pathway.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>The most enhanced gene ontology terms in patterns 13 and 15. Each term is presented together with a measure of how overrepresented it is compared to a random draw of the same number of genes. These were also confirmed to be significant by hypergeometric tests.</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c cspan="2" ca="center">
                        <p>
                           <b>Pattern 13</b>
                        </p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>
                           <b>Pattern 15</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pheromone response, mating-type determination</p>
                     </c>
                     <c ca="left">
                        <p>6.31</p>
                     </c>
                     <c ca="left">
                        <p>Transposable elements, viral and plasmid</p>
                     </c>
                     <c ca="left">
                        <p>8.42</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>development</p>
                     </c>
                     <c ca="left">
                        <p>6.09</p>
                     </c>
                     <c ca="left">
                        <p>Pheromone response, mating-type determination</p>
                     </c>
                     <c ca="left">
                        <p>7.26</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fungal/microorganism development</p>
                     </c>
                     <c ca="left">
                        <p>6.09</p>
                     </c>
                     <c ca="left">
                        <p>Transmembrane signal transduction</p>
                     </c>
                     <c ca="left">
                        <p>6.41</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mating (fertilization)</p>
                     </c>
                     <c ca="left">
                        <p>6.09</p>
                     </c>
                     <c ca="left">
                        <p>G-protein mediated signal transduction</p>
                     </c>
                     <c ca="left">
                        <p>6.10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Transmembrane signal transduction</p>
                     </c>
                     <c ca="left">
                        <p>4.98</p>
                     </c>
                     <c ca="left">
                        <p>development</p>
                     </c>
                     <c ca="left">
                        <p>5.69</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chemoperception and response</p>
                     </c>
                     <c ca="left">
                        <p>4.47</p>
                     </c>
                     <c ca="left">
                        <p>Fungal/microorganism development</p>
                     </c>
                     <c ca="left">
                        <p>5.69</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cellular sensing and response</p>
                     </c>
                     <c ca="left">
                        <p>4.25</p>
                     </c>
                     <c ca="left">
                        <p>Mating (fertilization)</p>
                     </c>
                     <c ca="left">
                        <p>5.69</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Interaction with cellular environment</p>
                     </c>
                     <c ca="left">
                        <p>2.94</p>
                     </c>
                     <c ca="left">
                        <p>Chemoperception and response</p>
                     </c>
                     <c ca="left">
                        <p>5.47</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Meiosis</p>
                     </c>
                     <c ca="left">
                        <p>2.86</p>
                     </c>
                     <c ca="left">
                        <p>Cellular sensing and response</p>
                     </c>
                     <c ca="left">
                        <p>5.21</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell growth/morphogenesis</p>
                     </c>
                     <c ca="left">
                        <p>2.77</p>
                     </c>
                     <c ca="left">
                        <p>Cellular communications</p>
                     </c>
                     <c ca="left">
                        <p>4.45</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cellular communications</p>
                     </c>
                     <c ca="left">
                        <p>2.77</p>
                     </c>
                     <c ca="left">
                        <p>Enzyme mediated signal transduction</p>
                     </c>
                     <c ca="left">
                        <p>3.81</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Development of asco- basidio- or zygospore</p>
                     </c>
                     <c ca="left">
                        <p>2.57</p>
                     </c>
                     <c ca="left">
                        <p>Interaction with cellular environment</p>
                     </c>
                     <c ca="left">
                        <p>3.61</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Enzyme mediated signal transduction</p>
                     </c>
                     <c ca="left">
                        <p>2.37</p>
                     </c>
                     <c ca="left">
                        <p>Enzyme activator</p>
                     </c>
                     <c ca="left">
                        <p>3.56</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Protein kinase cascades</p>
                     </c>
                     <c ca="left">
                        <p>2.37</p>
                     </c>
                     <c ca="left">
                        <p>Intracellular signaling</p>
                     </c>
                     <c ca="left">
                        <p>3.14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>G protein mediated signal transduction</p>
                     </c>
                     <c ca="left">
                        <p>2.37</p>
                     </c>
                     <c ca="left">
                        <p>Budding, cell polarity, filamentation</p>
                     </c>
                     <c ca="left">
                        <p>2.87</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>In addition, we analyzed the 10 genes whose expression is most strongly linked to each pattern. These are shown in Table <tblr tid="T2">2</tblr>, which summarizes which genes are known to be regulated by the transcriptional activators related to the mating (Ste12p) and filamentation (Ste12p-Tec1p complex) pathways. The results show that the top 10 genes related to pattern 13 have 9 genes of known function, with 8 related to the mating response, of which five are known to be regulated by Ste12p. For pattern 15, 7 of the top 10 genes are known to be transposable element genes, with three other genes having unknown functions. This again links pattern 13 to mating and pattern 15 to filamentation.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>The genes most strongly associated with patterns 13 and 15 in order of strength of association. For pattern 13, it is noted whether the genes are known to be regulated in the mating process, and whether the gene is known to be directly regulated by Stel2p. For pattern 15, the gene function is shown. All data is from the Saccharomyces Genome Database [73, 74].</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c cspan="3" ca="center">
                        <p>
                           <b>Pattern 13</b>
                        </p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>
                           <b>Pattern 15</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Gene</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Mating?</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ste12p</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Gene</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Function</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fig1</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>No</p>
                     </c>
                     <c ca="left">
                        <p>YCL019W</p>
                     </c>
                     <c ca="left">
                        <p>Transposable element gene</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Prm6</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>YER138C</p>
                     </c>
                     <c ca="left">
                        <p>Transposable element gene</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fus1</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>YER117C</p>
                     </c>
                     <c ca="left">
                        <p>Verified ORF, Prm5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ste2</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>YER160C</p>
                     </c>
                     <c ca="left">
                        <p>Transposable element gene</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Aga1</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>No</p>
                     </c>
                     <c ca="left">
                        <p>YJR029W</p>
                     </c>
                     <c ca="left">
                        <p>Transposable element gene</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fus3</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>YBR012W</p>
                     </c>
                     <c ca="left">
                        <p>Removed from SGD</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pes4</p>
                     </c>
                     <c ca="left">
                        <p>No</p>
                     </c>
                     <c ca="left">
                        <p>No</p>
                     </c>
                     <c ca="left">
                        <p>YML045W</p>
                     </c>
                     <c ca="left">
                        <p>Transposable element gene</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Prm1</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>YAR009C</p>
                     </c>
                     <c ca="left">
                        <p>Transposable element gene</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ORF</p>
                     </c>
                     <c ca="left">
                        <p>--</p>
                     </c>
                     <c ca="left">
                        <p>--</p>
                     </c>
                     <c ca="left">
                        <p>YJR027W</p>
                     </c>
                     <c ca="left">
                        <p>Transposable element gene</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Bar1</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>No</p>
                     </c>
                     <c ca="left">
                        <p>YLR334C</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical ORF</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>In order to validate that the patterns were actually measuring activity of the mating and filamentation pathways, we explored deletion mutants related to these pathways <abbrgrp><abbr bid="B61">61</abbr><abbr bid="B62">62</abbr></abbrgrp>. The mating response in <it>S. cerevisiae </it>is mediated via a MAPK signaling cascade initiated by binding to the Ste2p or Ste3p membrane receptors (Figure <figr fid="F3">3</figr>). The signal is transduced through Ste11p, Ste7p, and Fus3p with Ste5p serving as a scaffolding protein. The signal activates the Ste12p transcription factor, leading to transcription of mating response genes. In addition, the signal is transduced to the MAPK cascade from the membrane receptor by a G protein complex or through the Ste20p protein. Pattern 13 shows near zero signal for the deletion mutants Ste11&#916;, Ste7&#916; Fus3&#916;, Ste12&#916;, Ste5&#916;, and Ste2&#916;, while showing signal for deletion mutants of Ste20&#916; and Tec1&#916;. This is exactly as expected, with the membrane receptor, all signaling proteins in the cascade, and the transcription factor necessary to generate the transcriptional response related to the mating signal (note that the Ste3&#916; mutant is not in the data set). Ste20p is not necessary to raise the mating response, since the G-protein complex can trigger activation of Ste11p directly. For pattern 15, the response is very similar. The signal is near zero for the deletion mutants Ste2&#916;, Ste11&#916;, Ste7&#916;, and Ste12&#916;. The Fus3&#916; mutant shows a signal for pattern 15, as appropriate, while the Fus3&#916;, Kss1&#916; double deletion mutant does not. In addition, the Tec1&#916; mutant shows no signal for pattern 15, indicating that Tec1p is required for filamentation <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. Finally, the signal is greatly reduced for the Ste20&#916; deletion mutant in pattern 15 relative to pattern 13, which agrees with previous work suggesting that the filamentation pathway is more dependent on Ste20p signaling than is the mating pathway <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Microarrays and GeneChips&#8482; have become the tools of choice for the investigation of genome-wide transcription in most biological systems. The resulting data comprises noisy estimates of transcription levels for roughly 6,000 genes in yeast to more than 20,000 genes in typical mammalian studies. Numerous statistical and data mining methods have been applied to this data in order to identify individual genes showing differential expression, to identify patterns related to physiological states, and to identify groups of genes comprising biological processes. These studies generally have not focused on the estimation of cellular signaling from the data, despite the prevalence of cellular signaling in many diseases.</p>
         <p>As noted in the introduction, the recovery of signaling pathway information from transcription data requires complex analysis, since protein levels do not correlate well with mRNA levels and signaling protein activity is not generally linked to the mRNA expression levels of genes encoding the signaling proteins themselves. As such, changes in mRNA levels are limited to being a downstream indicator of activity. If a complete model for the transcription of genes, including all known transcriptional regulators and biological processes regulating transcription, was available, the inference of activity would be straightforward. Unfortunately, the network models and even gene annotations are still far from being complete. In addition, the growing evidence supporting the important role of non-coding RNAs in regulation of gene expression (including antisense transcripts and micro-RNAs, see for example <abbrgrp><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr><abbr bid="B65">65</abbr></abbrgrp>) further undermines the potential of using mRNA species as markers for proteins and their activities <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>.</p>
         <p>In order to overcome this incompleteness, we have created the method described here. We couple identification of transcriptional signatures with our Bayesian Decomposition algorithm to a consistency analysis for gene assignment to patterns determined by comparison of different dimensionalities using ClutrFree. This allows the identification of the correct dimensionality to be applied to subsequent ontology and transcription factor analyses. ClutrFree is also used to determine the ontological terms enhanced within each pattern and to obtain a list of genes tied to this pattern, which can then be linked to specific transcription factors. In this way, the biological processes associated with conditions can be identified, and inferences can be made on the activity of specific transcription factors. This then allows inference on the activity of signaling pathways, which cannot be obtained with methods previously applied to microarray data. Overall, the method requires many separate steps, each modeling an aspect of the biological system, in order to make proper inferences on signaling from the data.</p>
         <p>In the application to the Compendium data presented here, our analysis was able to extract the common features for a set of mutants that eliminated related pathways. As in previous studies, the global transcriptional repressor complex Ssn6-Tupl has been isolated in a single group. In addition, patterns for cell-wall synthesis, ribosomal function, and the global functions necessary for continued viability of yeast were isolated. In contrast to previous analyses of this data, two pathways related to the MAPK cascade were isolated, one related to mating and the other to filamentation. Once the correct dimensionality was determined, Bayesian Decomposition was able to identify transcriptional signatures unique for each pathway. The assignment was validated by an investigation of the deletion mutants known to adversely affect these pathways.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Microarray studies have been widespread in biological and medical research, often focusing on identification of genes significantly correlated with various disease states. However, many diseases arise from disruptions in cellular signaling, and in these cases gene expression only provides a downstream indicator of signaling activity. This greatly complicates the analysis. The new approach introduced here recovered signatures allowing us to make validated inferences on strongly overlapping signaling pathways.</p>
         <p>The results demonstrate that for <it>Saccharomyces cerevisiae</it>, the mating and filamentation pathways can be distinguished from transcriptional signatures determined from analysis of microarray data, despite the intrinsic high noise, confounding transcriptional activity, and tightly coupled nature of the pathways. The next step will be to apply these methods to more complex signaling networks in worms, flies, and mammals.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>The Rosetta deletion mutant data set</p>
            </st>
            <p>The Rosetta Compendium comprises 300 conditions, including 276 deletion mutants, 11 tetracycline regulated genes, and 13 drug treatments, in <it>S. cerevisiae </it>growing in rich medium <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. The data were generated from a two color cDNA microarray hybridization assay <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>, and transcriptional profiles were measured both with technical replication and biological replication (151 mutants). In parallel with the 300 experiments, 63 controls of wild-type <it>S. cerevisiae </it>were grown in identical conditions and compared against each other, permitting creation of a gene-specific error model. The data was downloaded from Rosetta Inpharmatics.</p>
         </sec>
         <sec>
            <st>
               <p>Data preprocessing</p>
            </st>
            <p>The data was filtered to retain only conditions characterized by at least a variation of 3 fold in a minimum of 2 genes. Then all genes that did not vary by 3-fold in at least 2 conditions were also removed, leading to a data matrix comprising 764 genes and 228 conditions.</p>
            <p>The data used by Bayesian Decomposition included both the mean log ratio for each data point and the uncertainty in this measurement determined by the Rosetta error model. Since Bayesian Decomposition as applied here requires positivity <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, the log ratios were converted to ratios and the uncertainties propagated to uncertainties on the ratios. Although analysis of residuals suggested that seven dimensions fit the data <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, the analysis presented here suggests that this is due to overestimation of uncertainty in the data. Bayesian Decomposition is not highly sensitive to minor misestimations of noise however, so that this should not be a problem for this analysis.</p>
         </sec>
         <sec>
            <st>
               <p>Analysis with Bayesian Decomposition</p>
            </st>
            <p>Bayesian Decomposition (BD) has been applied to multiple types of data: <it>in vivo </it>spectroscopic data <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>, medical imaging <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>, microarray data from single cell organisms <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>, mammalian model organisms <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>, humans <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>, and on phylogenomic sequence data <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. A detailed description of the algorithm <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> and a review of applications <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> have been published.</p>
            <p>Briefly, BD models the microarray data, comprising a matrix of estimates of the ratio of expression between the experimental condition and a control, as the result of the multiplication of two matrices describing behaviors across conditions (the <b>P </b>or pattern matrix) and the distribution of genes within those behaviors (the <b>A </b>or amplitude matrix). Naturally, the data, <b>D</b>, includes noise, so that the full relationship is defined by</p>
            <p>
               <m:math name="1471-2105-7-99-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:msub>
                           <m:mi>D</m:mi>
                           <m:mrow>
                              <m:mi>i</m:mi>
                              <m:mi>j</m:mi>
                           </m:mrow>
                        </m:msub>
                        <m:mo>&#8773;</m:mo>
                        <m:mstyle displaystyle="true">
                           <m:munderover>
                              <m:mo>&#8721;</m:mo>
                              <m:mrow>
                                 <m:mi>k</m:mi>
                                 <m:mo>=</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                              <m:mi>K</m:mi>
                           </m:munderover>
                           <m:mrow>
                              <m:msub>
                                 <m:mi>A</m:mi>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:msub>
                                 <m:mi>P</m:mi>
                                 <m:mrow>
                                    <m:mi>k</m:mi>
                                    <m:mi>j</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>+</m:mo>
                              <m:msub>
                                 <m:mi>&#949;</m:mi>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mi>j</m:mi>
                                 </m:mrow>
                              </m:msub>
                           </m:mrow>
                        </m:mstyle>
                        <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                        <m:mo stretchy="false">[</m:mo>
                        <m:mn>1</m:mn>
                        <m:mo stretchy="false">]</m:mo>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGebardaWgaaWcbaGaemyAaKMaemOAaOgabeaakiabgwKianaaqahabaGaemyqae0aaSbaaSqaaiabdMgaPjabdUgaRbqabaGccqWGqbaudaWgaaWcbaGaem4AaSMaemOAaOgabeaakiabgUcaRGGaciab=v7aLnaaBaaaleaacqWGPbqAcqWGQbGAaeqaaaqaaiabdUgaRjabg2da9iabigdaXaqaaiabdUealbqdcqGHris5aOGaaCzcaiaaxMaacqGGBbWwcqaIXaqmcqGGDbqxaaa@4AD1@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>where <it>D</it><sub><it>ij </it></sub>is the estimated ratio for gene <it>i </it>in mutant <it>j</it>, <it>A</it><sub><it>ik </it></sub>is the strength of gene <it>i </it>in pattern <it>k</it>, <it>P</it><sub><it>kj </it></sub>is the strength of mutant <it>j </it>in pattern <it>k</it>, and <it>&#949;</it><sub><it>ij </it></sub>is the noise for gene <it>i </it>in mutant <it>j </it>estimated by the Rosetta error model. BD estimates <b>A </b>and <b>P </b>by a Markov chain Monte Carlo (MCMC) simulation. For the fixed noise estimate, <it>&#949;</it>, <b>A </b>and <b>P </b>are inferred from the marginal probability distribution</p>
            <p><it>p </it>(<b>D </b>| <b>A,P</b>) <it>p </it>(<b>A,P</b>) &#160;&#160;&#160; [2]</p>
            <p>where <it>p</it>(<b>A,P</b>) is the prior probability and <it>p </it>(<b>D </b>| <b>A,P</b>) is given by the likelihood such that</p>
            <p>
               <m:math name="1471-2105-7-99-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>log</m:mi>
                        <m:mo>&#8289;</m:mo>
                        <m:mi>p</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mi>D</m:mi>
                        <m:mo>|</m:mo>
                        <m:mi>A</m:mi>
                        <m:mo>,</m:mo>
                        <m:mi>P</m:mi>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>=</m:mo>
                        <m:mo>&#8722;</m:mo>
                        <m:mstyle displaystyle="true">
                           <m:munder>
                              <m:mo>&#8721;</m:mo>
                              <m:mi>i</m:mi>
                           </m:munder>
                           <m:mrow>
                              <m:mstyle displaystyle="true">
                                 <m:munder>
                                    <m:mo>&#8721;</m:mo>
                                    <m:mi>j</m:mi>
                                 </m:munder>
                                 <m:mrow>
                                    <m:mrow>
                                       <m:mo>{</m:mo>
                                       <m:mrow>
                                          <m:mfrac>
                                             <m:mn>1</m:mn>
                                             <m:mrow>
                                                <m:mn>2</m:mn>
                                                <m:msubsup>
                                                   <m:mi>&#949;</m:mi>
                                                   <m:mrow>
                                                      <m:mi>i</m:mi>
                                                      <m:mi>j</m:mi>
                                                   </m:mrow>
                                                   <m:mn>2</m:mn>
                                                </m:msubsup>
                                             </m:mrow>
                                          </m:mfrac>
                                          <m:msup>
                                             <m:mrow>
                                                <m:mrow>
                                                   <m:mo>(</m:mo>
                                                   <m:mrow>
                                                      <m:msub>
                                                         <m:mi>D</m:mi>
                                                         <m:mrow>
                                                            <m:mi>i</m:mi>
                                                            <m:mi>j</m:mi>
                                                         </m:mrow>
                                                      </m:msub>
                                                      <m:mo>&#8722;</m:mo>
                                                      <m:mstyle displaystyle="true">
                                                         <m:munderover>
                                                            <m:mo>&#8721;</m:mo>
                                                            <m:mrow>
                                                               <m:mi>k</m:mi>
                                                               <m:mo>=</m:mo>
                                                               <m:mn>1</m:mn>
                                                            </m:mrow>
                                                            <m:mi>K</m:mi>
                                                         </m:munderover>
                                                         <m:mrow>
                                                            <m:msub>
                                                               <m:mi>A</m:mi>
                                                               <m:mrow>
                                                                  <m:mi>i</m:mi>
                                                                  <m:mi>k</m:mi>
                                                               </m:mrow>
                                                            </m:msub>
                                                            <m:msub>
                                                               <m:mi>P</m:mi>
                                                               <m:mrow>
                                                                  <m:mi>k</m:mi>
                                                                  <m:mi>j</m:mi>
                                                               </m:mrow>
                                                            </m:msub>
                                                         </m:mrow>
                                                      </m:mstyle>
                                                   </m:mrow>
                                                   <m:mo>)</m:mo>
                                                </m:mrow>
                                             </m:mrow>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                       </m:mrow>
                                       <m:mo>}</m:mo>
                                    </m:mrow>
                                 </m:mrow>
                              </m:mstyle>
                           </m:mrow>
                        </m:mstyle>
                        <m:mo>.</m:mo>
                        <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                        <m:mo stretchy="false">[</m:mo>
                        <m:mn>3</m:mn>
                        <m:mo stretchy="false">]</m:mo>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacyGGSbaBcqGGVbWBcqGGNbWzcqWGWbaCcqGGOaakieqacqWFebarcqGG8baFcqWFbbqqcqGGSaalcqWFqbaucqGGPaqkcqGH9aqpcqGHsisldaaeqbqaamaaqafabaWaaiWaaeaadaWcaaqaaiabigdaXaqaaiabikdaYGGaciab+v7aLnaaDaaaleaacqWGPbqAcqWGQbGAaeaacqaIYaGmaaaaaOWaaeWaaeaacqWGebardaWgaaWcbaGaemyAaKMaemOAaOgabeaakiabgkHiTmaaqahabaGaemyqae0aaSbaaSqaaiabdMgaPjabdUgaRbqabaGccqWGqbaudaWgaaWcbaGaem4AaSMaemOAaOgabeaaaeaacqWGRbWAcqGH9aqpcqaIXaqmaeaacqWGlbWsa0GaeyyeIuoaaOGaayjkaiaawMcaamaaCaaaleqabaGaeGOmaidaaaGccaGL7bGaayzFaaaaleaacqWGQbGAaeqaniabggHiLdaaleaacqWGPbqAaeqaniabggHiLdGccqGGUaGlcaWLjaGaaCzcaiabcUfaBjabiodaZiabc2faDbaa@682F@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>The prior is used here to require positivity and to minimize structure in the estimates of <b>A </b>and <b>P </b><abbrgrp><abbr bid="B46">46</abbr></abbrgrp>.</p>
            <p>The analysis with BD is similar mathematically to an analysis with singular value decomposition (SVD) or with principal component analysis (PCA), since all methods estimate two matrices that together reconstruct the data. In both PCA and SVD, orthogonality conditions force each row of <b>P </b>to be linearly independent, deriving <b>P </b>from either the data matrix (SVD) or the covariance matrix (PCA) using deterministic algorithms. Since patterns of expression related to biological processes will not generally be independent, BD uses the MCMC approach to avoid orthogonality. The resulting rows of <b>P </b>are usually easier to relate to biological processes than those from SVD or PCA.</p>
            <p>BD was run at each posited dimensionality, <it>K </it>(as in equation 1), between 3 and 25, generating a mean estimate and an uncertainty (i.e., standard deviation from samples of the posterior probability distribution) for each element of <b>A </b>and <b>P</b>. The dimensionality is equivalent to the number of patterns, as the patterns can be viewed as nonorthogonal basis vectors for <b>D</b>.</p>
         </sec>
         <sec>
            <st>
               <p>Persistence and dimensionality</p>
            </st>
            <p>The results of Bayesian Decomposition across the different estimated dimensionalities, <it>K</it>, were compared with ClutrFree in order to visualize stable basis vectors and a persistence measurement on them <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. Each independent BD analysis provides a tree level, with each pattern represented by a node (see Figure <figr fid="F4">4</figr>). The analysis with the fewest patterns is placed at the top of the tree, and additional levels of analyses are added creating a tree from fewer to larger numbers of patterns. Connections in the tree are created in a greedy way. Level <it>N+1 </it>(e.g., 4) is connected to level <it>N </it>(e.g., 3) by finding the node in level <it>N+1 </it>with the highest correlation to a node in level <it>N</it>. The correlation is given by the Pearson correlation for the strength of the assignment of mutants to a pattern (i.e., between <it>P</it><sub><it>kj </it></sub>for the nodes). These nodes are connected and removed. From the remaining nodes, the maximum correlation between a node at level <it>N+1 </it>and one at <it>N </it>is found again, and this process is repeated until only a single node remains at level <it>N+1</it>. This node is then connected to the node at level <it>N </it>that yields the highest correlation.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Relationship of patterns across dimensionalities</p>
               </caption>
               <text>
                  <p><b>Relationship of patterns across dimensionalities</b>. The results for all patterns identified in all runs of Bayesian Decomposition are summarized here. The top row shows three patterns from an analysis with 3 dimensions, while the bottom row shows 25 dimensions. The highlighted node is pattern 13 in 15 dimensions, which is the pattern identified as the mating response. Nodes are connected as described in the text using Pearson correlation measures. The numbers within the nodes are indices and have no intrinsic meaning. Each number provides the row index for <b>P </b>and column index for <b>A </b>for the analysis at that level.</p>
               </text>
               <graphic file="1471-2105-7-99-4"/>
            </fig>
            <p>Following our previous work <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, we use a measure of persistence to quantify the robustness of a pattern across the variation of the number of patterns. An example calculation is shown in Figure <figr fid="F5">5</figr>. The assignment of a mutant to a pattern (i.e., a node) is binarized based on the mean and uncertainty of the assignment of the mutant to the pattern determined by the MCMC sampling, using a requirement that the mutant be assigned to the pattern at greater than 3<it>&#963; </it>above zero. In Figure <figr fid="F5">5</figr>, it is assumed there are four mutants in a pattern, thus there are four binarized values. Then at each node, each mutant is compared for consistency in presence of the mutant in linked nodes within the tree. For example, for the highlighted node in Figure <figr fid="F5">5</figr>, the first mutant is present in the node above and the node below, so it is present in all 3 connected nodes. For the second mutant it is present in 2, and the third and fourth mutants are absent. The average persistence for the node is therefore (3+2+0+0)/4 = 1.25 as noted in Figure <figr fid="F5">5</figr>. For branches, the mutant status is only required to agree in a single branch to be counted. The average persistence at a dimension is then the average of the persistence for all nodes at that dimension (i.e., a row in Figure <figr fid="F4">4</figr>).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>A sample calculation of the average persistence for a single node</p>
               </caption>
               <text>
                  <p><b>A sample calculation of the average persistence for a single node</b>. The average persistence is calculated by comparing the persistence at each node in the tree given in Figure <figr fid="F4">4</figr>. Each assignment of each mutant (4 are shown here) to a pattern is binarized as described in the text, then the average persistence for a node is calculated by checking on the number of times the mutant assigned to the pattern occurs in the connected nodes. The mutant can occur in any branch below the node of interest to be considered as present. If it occurs in multiple child nodes at a single level, that is still treated as a single occurrence for that level. The average for a dimension is then the average of the persistence of all nodes at that level.</p>
               </text>
               <graphic file="1471-2105-7-99-5"/>
            </fig>
            <p>We assessed the dimensionality of the data using measurements of persistence. The persistence was measured for analyses from <it>3 </it>&#8211; <it>25 </it>patterns and the dimension chosen where a significant drop occurred in an otherwise slow monotonic decline, which was expected due to the branching nature of the tree. Figure <figr fid="F2">2</figr> shows the significant drop between 15 and 16 dimensions, so 15 patterns were chosen for further analysis.</p>
         </sec>
         <sec>
            <st>
               <p>Ontology and function</p>
            </st>
            <p>To assign ontological terms to the genes contained in basis vectors, we annotated our data using the gene ontologies from the Comprehensive Yeast Genome Database (CYGD) hosted at the Munich Information center for Protein Sequences (MIPS) <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B72">72</abbr></abbrgrp>. The analysis here utilizes the Functional Catalog (FunCat) format that describes each gene using a hierarchical ontological model.</p>
            <p>Similar to persistence, we defined enhancement as a measure of the over-representation, or under representation, of a gene function in a subset of the data <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. It is the ratio of the frequency of occurrence of genes annotated by a particular ontological term in the pattern to the frequency of occurrence of the same term in the whole dataset,</p>
            <p>
               <m:math name="1471-2105-7-99-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>e</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mi>t</m:mi>
                        <m:mo>,</m:mo>
                        <m:mi>p</m:mi>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>=</m:mo>
                        <m:mfrac>
                           <m:mrow>
                              <m:msub>
                                 <m:mi>g</m:mi>
                                 <m:mi>p</m:mi>
                              </m:msub>
                              <m:mo>/</m:mo>
                              <m:msub>
                                 <m:mi>n</m:mi>
                                 <m:mi>p</m:mi>
                              </m:msub>
                           </m:mrow>
                           <m:mrow>
                              <m:mi>G</m:mi>
                              <m:mo>/</m:mo>
                              <m:mi>N</m:mi>
                           </m:mrow>
                        </m:mfrac>
                        <m:mo>,</m:mo>
                        <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                        <m:mo stretchy="false">[</m:mo>
                        <m:mn>4</m:mn>
                        <m:mo stretchy="false">]</m:mo>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGLbqzcqGGOaakcqWG0baDcqGGSaalcqWGWbaCcqGGPaqkcqGH9aqpdaWcaaqaaiabdEgaNnaaBaaaleaacqWGWbaCaeqaaOGaei4la8IaemOBa42aaSbaaSqaaiabdchaWbqabaaakeaacqWGhbWrcqGGVaWlcqWGobGtaaGaeiilaWIaaCzcaiaaxMaacqGGBbWwcqaI0aancqGGDbqxaaa@441D@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>with <it>g</it><sub><it>p </it></sub>being the number of genes annotated with the term <it>t </it>in pattern <it>p</it>, <it>n</it><sub><it>p </it></sub>the total number of genes in the pattern <it>p</it>, <it>G </it>the number of genes annotated by the term <it>t </it>in the data, and <it>N </it>being the total number of genes in the dataset. In addition, we apply a hypergeometric test to estimate a p-value for each term. Function was then determined by inspection of enhanced ontological terms.</p>
         </sec>
         <sec>
            <st>
               <p>Transcription factor analysis</p>
            </st>
            <p>The genes were also analyzed for the patterns determined to be related to signaling pathways by exploration of the ten genes most strongly associated with the pattern. Each gene was analyzed using the Saccharomyces Genome Database <abbrgrp><abbr bid="B73">73</abbr></abbrgrp> to determine whether it was known to be associated with mating or filamentation processes and to determine if it was directly regulated by the Ste12p transcription factor.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>GB performed analysis with BD and ClutrFree, including identification of biological processes from gene ontology measurements. KS and JMC provided advice and guidance on development of ClutrFree for these analyses. MFO oversaw the project and did gene ontology and transcription factor analysis on patterns 13 and 15.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank the National Institutes of Health, National Library of Medicine (LM008309 to mfo) and National Cancer Institute (CCCG CA06927 supporting mfo), and the Pennsylvania Department of Health (grant to mfo). JMC and KS acknowledge the support from the R&#233;seau National des G&#233;nopoles (RNG).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>STI571: targeting BCR-ABL as therapy for CML</p>
            </title>
            <aug>
               <au>
                  <snm>Mauro</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Druker</snm>
                  <fnm>BJ</fnm>
               </au>
            </aug>
            <source>Oncologist</source>
            <pubdate>2001</pubdate>
            <volume>6</volume>
            <fpage>233</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11423669</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Trastuzumab and interleukin-2 in HER2-positive metastatic breast cancer: a pilot study</p>
            </title>
            <aug>
               <au>
                  <snm>Repka</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Chiorean</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Gay</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Herwig</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Kohl</snm>
                  <fnm>VK</fnm>
               </au>
               <au>
                  <snm>Yee</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Clin Cancer Res</source>
            <pubdate>2003</pubdate>
            <volume>9</volume>
            <fpage>2440</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12855616</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Recent advances in the management of gastrointestinal stromal tumors</p>
            </title>
            <aug>
               <au>
                  <snm>von Mehren</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Curr Oncol Rep</source>
            <pubdate>2003</pubdate>
            <volume>5</volume>
            <fpage>288</fpage>
            <lpage>94</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12781070</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Taking the study of cancer cell survival to a new dimension</p>
            </title>
            <aug>
               <au>
                  <snm>Jacks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Weinberg</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2002</pubdate>
            <volume>111</volume>
            <fpage>923</fpage>
            <lpage>5</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12507419</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Meaningful relationships: the regulation of the Ras/Raf/MEK/ERK pathway by protein interactions</p>
            </title>
            <aug>
               <au>
                  <snm>Kolch</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Biochem J</source>
            <pubdate>2000</pubdate>
            <volume>351</volume>
            <issue>Pt 2</issue>
            <fpage>289</fpage>
            <lpage>305</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1221363</pubid>
                  <pubid idtype="pmpid" link="fulltext">11023813</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Elements of Human Cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Cooper</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <publisher>Boston: Jones and Bartlett Publishers</publisher>
            <pubdate>1992</pubdate>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Molecular Biology of Cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Macdonald</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Ford</snm>
                  <fnm>CHJ</fnm>
               </au>
            </aug>
            <publisher>Oxford: BIOS Scientific Publishers, Ltd</publisher>
            <pubdate>1997</pubdate>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Emerging role of Akt kinase/protein kinase B signaling in pathophysiology of diabetes and its complications</p>
            </title>
            <aug>
               <au>
                  <snm>Zdychova</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Komers</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Physiol Res</source>
            <pubdate>2005</pubdate>
            <volume>54</volume>
            <fpage>1</fpage>
            <lpage>16</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15717836</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Insulin signaling defects in type 2 diabetes</p>
            </title>
            <aug>
               <au>
                  <snm>Leng</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Karlsson</snm>
                  <fnm>HK</fnm>
               </au>
               <au>
                  <snm>Zierath</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Rev Endocr Metab Disord</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>111</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15041786</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling</p>
            </title>
            <aug>
               <au>
                  <snm>Alizadeh</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lossos</snm>
                  <fnm>IS</fnm>
               </au>
               <au>
                  <snm>Rosenwald</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Boldrick</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Sabet</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tran</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>X</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <fpage>503</fpage>
            <lpage>11</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10676951</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Molecular classification of cancer: class discovery and class prediction by gene expression monitoring</p>
            </title>
            <aug>
               <au>
                  <snm>Golub</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Slonim</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Tamayo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huard</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gaasenbeek</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mesirov</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Coller</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Loh</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Downing</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Caligiuri</snm>
                  <fnm>MA</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>286</volume>
            <fpage>531</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10521349</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Recursive partitioning for tumor classification with gene expression microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>CY</fnm>
               </au>
               <au>
                  <snm>Singer</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Xiong</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>6730</fpage>
            <lpage>5</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">34421</pubid>
                  <pubid idtype="pmpid" link="fulltext">11381113</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Identification and validation of genes involved in the pathogenesis of colorectal cancer using cDNA microarrays and RNA interference</p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>Gaynor</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Scoggin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Verma</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Gokaslan</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Simmang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Fleming</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tavana</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Frenkel</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Becerra</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Clin Cancer Res</source>
            <pubdate>2003</pubdate>
            <volume>9</volume>
            <fpage>931</fpage>
            <lpage>46</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12631590</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Expression profiles of non- small cell lung cancers on cDNA microarrays: identification of genes for prediction of lymph-node metastasis and sensitivity to anti-cancer drugs</p>
            </title>
            <aug>
               <au>
                  <snm>Kikuchi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Daigo</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Katagiri</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tsunoda</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kakiuchi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zembutsu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Furukawa</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kawamura</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kobayashi</snm>
                  <fnm>K</fnm>
               </au>
               <etal/>
            </aug>
            <source>Oncogene</source>
            <pubdate>2003</pubdate>
            <volume>22</volume>
            <fpage>2192</fpage>
            <lpage>205</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12687021</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Gene-expression profiling in human cutaneous melanoma</p>
            </title>
            <aug>
               <au>
                  <snm>Carr</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Bittner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Trent</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Oncogene</source>
            <pubdate>2003</pubdate>
            <volume>22</volume>
            <fpage>3076</fpage>
            <lpage>80</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12789283</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Response markers and the molecular mechanisms of action of Gleevec in gastrointestinal stromal tumors</p>
            </title>
            <aug>
               <au>
                  <snm>Frolov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Chahwan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ochs</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Arnoletti</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>ZZ</fnm>
               </au>
               <au>
                  <snm>Favorova</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Fletcher</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>von Mehren</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Godwin</snm>
                  <fnm>AK</fnm>
               </au>
            </aug>
            <source>Mol Cancer Ther</source>
            <pubdate>2003</pubdate>
            <volume>2</volume>
            <fpage>699</fpage>
            <lpage>709</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12939459</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Functional genomics of the endocrine pancreas: the pancreas clone set and PancChip, new resources for diabetes research</p>
            </title>
            <aug>
               <au>
                  <snm>Scearce</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Brestelli</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>McWeeney</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Mazzarelli</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pinney</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Pizarro</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stoeckert</snm>
                  <fnm>CJ</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Clifton</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Permutt</snm>
                  <fnm>MA</fnm>
               </au>
               <etal/>
            </aug>
            <source>Diabetes</source>
            <pubdate>2002</pubdate>
            <volume>51</volume>
            <fpage>1997</fpage>
            <lpage>2004</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12086925</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Significance analysis of microarrays applied to the ionizing radiation response</p>
            </title>
            <aug>
               <au>
                  <snm>Tusher</snm>
                  <fnm>VG</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chu</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>5116</fpage>
            <lpage>21</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">33173</pubid>
                  <pubid idtype="pmpid" link="fulltext">11309499</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Testing for differentially-expressed genes by maximum-likelihood analysis of microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Thorsson</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Siegel</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Hood</snm>
                  <fnm>LE</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2000</pubdate>
            <volume>7</volume>
            <fpage>805</fpage>
            <lpage>17</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11382363</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Analysis of variance for gene expression microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Kerr</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Churchill</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2000</pubdate>
            <volume>7</volume>
            <fpage>819</fpage>
            <lpage>37</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11382364</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Statistical analysis of a gene expression microarray experiment with replication</p>
            </title>
            <aug>
               <au>
                  <snm>Kerr</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Afshari</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Bennett</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bushel</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Martinez</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Churchill</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>Statistica Sinica</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>203</fpage>
            <lpage>218</lpage>
         </bibl>
         <bibl id="B22">
            <title>
               <p>On differential variability of expression ratios: improving statistical inference about gene expression changes from microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Newton</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Kendziorski</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Richmond</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Tsui</snm>
                  <fnm>KW</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2001</pubdate>
            <volume>8</volume>
            <fpage>37</fpage>
            <lpage>52</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11339905</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>A statistical framework for expression-based molecular classification in cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Parmigiani</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Garrett</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Anbazhagan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gabrielson</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Journal of the Royal Statistical Society, B</source>
            <pubdate>2002</pubdate>
            <volume>64</volume>
            <fpage>717</fpage>
            <lpage>736</lpage>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Nonparametric methods for identifying differentially expressed genes in microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Troyanskaya</snm>
                  <fnm>OG</fnm>
               </au>
               <au>
                  <snm>Garber</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Altman</snm>
                  <fnm>RB</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <fpage>1454</fpage>
            <lpage>61</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12424116</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Cluster analysis and display of genome-wide expression patterns</p>
            </title>
            <aug>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Spellman</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>14863</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">24541</pubid>
                  <pubid idtype="pmpid" link="fulltext">9843981</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Singular value decomposition for genome-wide expression data processing and modeling</p>
            </title>
            <aug>
               <au>
                  <snm>Alter</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>10101</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">27718</pubid>
                  <pubid idtype="pmpid" link="fulltext">10963673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Gene expression profiling of alveolar rhabdomyosarcoma with cDNA microarrays</p>
            </title>
            <aug>
               <au>
                  <snm>Khan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bittner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Leighton</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Pohida</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gooden</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Trent</snm>
                  <fnm>JM</fnm>
               </au>
               <etal/>
            </aug>
            <source>Cancer Res</source>
            <pubdate>1998</pubdate>
            <volume>58</volume>
            <fpage>5009</fpage>
            <lpage>13</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9823299</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Bayesian infinite mixture model based clustering of gene expression profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Medvedovic</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sivaganesan</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <fpage>1194</fpage>
            <lpage>206</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12217911</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Bayesian mixture model based clustering of replicated microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Medvedovic</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yeung</snm>
                  <fnm>KY</fnm>
               </au>
               <au>
                  <snm>Bumgarner</snm>
                  <fnm>RE</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14871871</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering</p>
            </title>
            <aug>
               <au>
                  <snm>Gasch</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>RESEARCH0059</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">133443</pubid>
                  <pubid idtype="pmpid" link="fulltext">12429058</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Coupled two-way clustering analysis of gene microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Getz</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Domany</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>12079</fpage>
            <lpage>84</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">17297</pubid>
                  <pubid idtype="pmpid" link="fulltext">11035779</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Clustering gene expression patterns</p>
            </title>
            <aug>
               <au>
                  <snm>Ben-Dor</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shamir</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yakhini</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>1999</pubdate>
            <volume>6</volume>
            <fpage>281</fpage>
            <lpage>97</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10582567</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Exploring expression data: identification and analysis of coexpressed genes</p>
            </title>
            <aug>
               <au>
                  <snm>Heyer</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yooseph</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>1106</fpage>
            <lpage>15</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">310826</pubid>
                  <pubid idtype="pmpid" link="fulltext">10568750</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters</p>
            </title>
            <aug>
               <au>
                  <snm>Lukashin</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Fuchs</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <fpage>405</fpage>
            <lpage>14</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11331234</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Knowledge-based analysis of microarray gene expression data by using support vector machines</p>
            </title>
            <aug>
               <au>
                  <snm>Brown</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Grundy</snm>
                  <fnm>WN</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cristianini</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sugnet</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Furey</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Ares</snm>
                  <fnm>M</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>262</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">26651</pubid>
                  <pubid idtype="pmpid" link="fulltext">10618406</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks</p>
            </title>
            <aug>
               <au>
                  <snm>Khan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Ringner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Saal</snm>
                  <fnm>LH</fnm>
               </au>
               <au>
                  <snm>Ladanyi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Westermann</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Berthold</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Schwab</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Antonescu</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>C</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Med</source>
            <pubdate>2001</pubdate>
            <volume>7</volume>
            <fpage>673</fpage>
            <lpage>9</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1282521</pubid>
                  <pubid idtype="pmpid" link="fulltext">11385503</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Microarrays in cancer: research and applications</p>
            </title>
            <aug>
               <au>
                  <snm>Ochs</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Godwin</snm>
                  <fnm>AK</fnm>
               </au>
            </aug>
            <source>Biotechniques</source>
            <pubdate>2003</pubdate>
            <volume>34</volume>
            <fpage>S4</fpage>
            <lpage>S15</lpage>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Inhibitors of Ras/Raf-1 interaction identified by two-hybrid screening revert Ras-dependent transformation phenotypes in human cancer cells</p>
            </title>
            <aug>
               <au>
                  <snm>Kato-Stankiewicz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hakimi</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Zhi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Serebriiskii</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Guo</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Edamatsu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Koide</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Menon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Eckl</snm>
                  <fnm>R</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>14398</fpage>
            <lpage>403</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">137895</pubid>
                  <pubid idtype="pmpid" link="fulltext">12391290</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Raf kinase inhibitors in oncology</p>
            </title>
            <aug>
               <au>
                  <snm>Strumberg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Seeber</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Onkologie</source>
            <pubdate>2005</pubdate>
            <volume>28</volume>
            <fpage>101</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15665559</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>The Raf kinase inhibitor BAY 43-9006 reduces cellular uptake of platinum compounds and cytotoxicity in human colorectal carcinoma cell lines</p>
            </title>
            <aug>
               <au>
                  <snm>Heim</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Scharifi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zisowsky</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jaehde</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Voliotis</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Seeber</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Strumberg</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Anticancer Drugs</source>
            <pubdate>2005</pubdate>
            <volume>16</volume>
            <fpage>129</fpage>
            <lpage>36</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15655409</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Discordant protein and mRNA expression in lung adenocarcinomas</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gharib</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Misek</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Kardia</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Giordano</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>lannettoni</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Orringer</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Hanash</snm>
                  <fnm>SM</fnm>
               </au>
               <etal/>
            </aug>
            <source>Mol Cell Proteomics</source>
            <pubdate>2002</pubdate>
            <volume>1</volume>
            <fpage>304</fpage>
            <lpage>13</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12096112</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Correlation between protein and mRNA abundance in yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Gygi</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Rochon</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Franza</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Aebersold</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>1999</pubdate>
            <volume>19</volume>
            <fpage>1720</fpage>
            <lpage>30</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">83965</pubid>
                  <pubid idtype="pmpid" link="fulltext">10022859</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Complementary profiling of gene expression at the transcriptome and proteome levels in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Griffin</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Gygi</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Rist</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Eng</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hood</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Aebersold</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Mol Cell Proteomics</source>
            <pubdate>2002</pubdate>
            <volume>1</volume>
            <fpage>323</fpage>
            <lpage>33</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12096114</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Bayesian Decomposition analysis of gene expression in yeast deletion mutants</p>
            </title>
            <aug>
               <au>
                  <snm>Bidaut</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Moloshok</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Grant</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Manion</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Ochs</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Methods of Microarray Data Analysis II</source>
            <publisher>Boston: Kluwer Academic</publisher>
            <editor>Johnson K, Lin S</editor>
            <pubdate>2002</pubdate>
            <fpage>105</fpage>
            <lpage>122</lpage>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Application of Bayesian Decomposition for analysing microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Moloshok</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Klevecz</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Grant</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Manion</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Speier</snm>
                  <fnm>WFt</fnm>
               </au>
               <au>
                  <snm>Ochs</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <fpage>566</fpage>
            <lpage>75</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12016054</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Bayesian Decomposition</p>
            </title>
            <aug>
               <au>
                  <snm>Ochs</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>The Analysis of Gene Expression Data: Methods and Software</source>
            <publisher>New York: Springer Verlag</publisher>
            <editor>Parmigiani G, Garrett E, Irizarry R, Zeger S</editor>
            <pubdate>2003</pubdate>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Bayesian Decomposition classification of the Project Normal data set</p>
            </title>
            <aug>
               <au>
                  <snm>Moloshok</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Datta</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kossenkov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Ochs</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Methods of Microarray Data Analysis III</source>
            <publisher>Boston: Kluwer Academic</publisher>
            <editor>Johnson KF, LIn SM</editor>
            <pubdate>2003</pubdate>
            <fpage>211</fpage>
            <lpage>232</lpage>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Bayesian Decomposition: Analyzing microarray data within a biological context</p>
            </title>
            <aug>
               <au>
                  <snm>Ochs</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Moloshok</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Bidaut</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Toby</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Annals of the New York Academy of Sciences</source>
            <pubdate>2004</pubdate>
            <volume>1020</volume>
            <fpage>212</fpage>
            <lpage>226</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15208194</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>ClutrFree: cluster tree visualization and interpretation</p>
            </title>
            <aug>
               <au>
                  <snm>Bidaut</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ochs</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>2869</fpage>
            <lpage>71</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15145813</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium</p>
            </title>
            <aug>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>25</fpage>
            <lpage>9</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10802651</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>MIPS: analysis and annotation of proteins from whole genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Mewes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Amid</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Arnold</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Frishman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Guldener</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Mannhaupt</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Munsterkotter</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pagel</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Strack</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Stumpflen</snm>
                  <fnm>V</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>D41</fpage>
            <lpage>4</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308826</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681354</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>TRANSFAC: transcriptional regulation, from patterns to profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Matys</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Fricke</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Geffers</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gossling</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Haubrock</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hehl</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hornischer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Karas</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kel</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Kel-Margoulis</snm>
                  <fnm>OV</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>374</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165555</pubid>
                  <pubid idtype="pmpid" link="fulltext">12520026</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Functional discovery via a compendium of expression profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Hughes</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Marton</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Stoughton</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Armour</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Bennett</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Coffey</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>YD</fnm>
               </au>
               <etal/>
            </aug>
            <source>Cell</source>
            <pubdate>2000</pubdate>
            <volume>102</volume>
            <fpage>109</fpage>
            <lpage>26</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10929718</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Subsystem identification through dimensionality reduction of large-scale gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Kim</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Tidor</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>1706</fpage>
            <lpage>18</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403744</pubid>
                  <pubid idtype="pmpid" link="fulltext">12840046</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Turning genes off by Ssn6-Tupl: a conserved system of transcriptional repression in eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>AD</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>325</fpage>
            <lpage>30</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10871883</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Ssn6-Tupl is a general repressor of transcription in yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Keleher</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Redd</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Schultz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Carlson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>AD</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1992</pubdate>
            <volume>68</volume>
            <fpage>709</fpage>
            <lpage>19</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">1739976</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>A conserved protein interaction network involving the yeast MAP kinases Fus3 and Kss1</p>
            </title>
            <aug>
               <au>
                  <snm>Kusari</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Molina</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Sabbagh</snm>
                  <fnm>W</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Lau</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Bardwell</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>J Cell Biol</source>
            <pubdate>2004</pubdate>
            <volume>164</volume>
            <fpage>267</fpage>
            <lpage>77</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14734536</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>The riddle of MAP kinase signaling specificity</p>
            </title>
            <aug>
               <au>
                  <snm>Madhani</snm>
                  <fnm>HD</fnm>
               </au>
               <au>
                  <snm>Fink</snm>
                  <fnm>GR</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>151</fpage>
            <lpage>5</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9594663</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Principles of MAP kinase signaling specificity in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Schwartz</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Madhani</snm>
                  <fnm>HD</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>38</volume>
            <fpage>725</fpage>
            <lpage>48</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15568991</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Dissection of filamentous growth by transposon mutagenesis in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Mosch</snm>
                  <fnm>HU</fnm>
               </au>
               <au>
                  <snm>Fink</snm>
                  <fnm>GR</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1997</pubdate>
            <volume>145</volume>
            <fpage>671</fpage>
            <lpage>84</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9055077</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Activation of the Kss1 invasive-filamentous growth pathway induces Ty1 transcription and retrotransposition in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Morillon</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Springer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lesage</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2000</pubdate>
            <volume>20</volume>
            <fpage>5766</fpage>
            <lpage>76</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">86054</pubid>
                  <pubid idtype="pmpid" link="fulltext">10891512</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Signal transduction by MAP kinase cascades in budding yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Posas</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Takekawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Saito</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>1998</pubdate>
            <volume>1</volume>
            <fpage>175</fpage>
            <lpage>82</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10066475</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Antisense transcription in the mammalian transcriptome</p>
            </title>
            <aug>
               <au>
                  <snm>Katayama</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tomaru</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kasukawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Waki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nakanishi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nishida</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yap</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kawai</snm>
                  <fnm>J</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>309</volume>
            <fpage>1564</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16141073</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Fewer genes, more noncoding RNA</p>
            </title>
            <aug>
               <au>
                  <snm>Claverie</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>309</volume>
            <fpage>1529</fpage>
            <lpage>30</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16141064</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>The transcriptional landscape of the mammalian genome</p>
            </title>
            <aug>
               <au>
                  <snm>Carninci</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kasukawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Katayama</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gough</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Frith</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Maeda</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Oyama</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ravasi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wells</snm>
                  <fnm>C</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>309</volume>
            <fpage>1559</fpage>
            <lpage>63</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16141072</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Gene silencing in mammals by small interfering RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>McManus</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Sharp</snm>
                  <fnm>PA</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>737</fpage>
            <lpage>47</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12360232</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Quantitative monitoring of gene expression patterns with a complementary DNA microarray</p>
            </title>
            <aug>
               <au>
                  <snm>Schena</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shalon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1995</pubdate>
            <volume>270</volume>
            <fpage>467</fpage>
            <lpage>70</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7569999</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>A new method for spectral decomposition using a bilinear Bayesian approach</p>
            </title>
            <aug>
               <au>
                  <snm>Ochs</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Stoyanova</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Arias-Mendoza</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>TR</fnm>
               </au>
            </aug>
            <source>J Magn Reson</source>
            <pubdate>1999</pubdate>
            <volume>137</volume>
            <fpage>161</fpage>
            <lpage>76</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10053145</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>A Bayesian Markov chain Monte Carlo solution of the bilinear problem</p>
            </title>
            <aug>
               <au>
                  <snm>Ochs</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Stoyanova</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Rooney</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Springer</snm>
                  <fnm>CS</fnm>
                  <suf>Jr</suf>
               </au>
            </aug>
            <source>Bayesian Inference and Maximum Entropy Methods in Science and Engineering: 19th International Workshop</source>
            <publisher>Melville: American Institute of Physics</publisher>
            <editor>Rychert JT, Erickson GJ, Smith CR</editor>
            <pubdate>2001</pubdate>
            <fpage>274</fpage>
            <lpage>284</lpage>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Genes associated with prognosis in adenocarcinoma across studies at multiple institutions</p>
            </title>
            <aug>
               <au>
                  <snm>Kossenkov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bidaut</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ochs</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Methods of Microarray Data Analysis IV</source>
            <publisher>Boston: Kluwer Academic</publisher>
            <editor>Johnson K, Lin S</editor>
            <pubdate>2005</pubdate>
            <fpage>239</fpage>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Bayesian decomposition analysis of bacterial phylogenomic profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Bidaut</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Suhre</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Claverie</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Ochs</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Am J Pharmacogenomics</source>
            <pubdate>2005</pubdate>
            <volume>5</volume>
            <fpage>63</fpage>
            <lpage>70</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15727490</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>MIPS: a database for genomes and protein sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Mewes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Heumann</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kaps</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mayer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Pfeiffer</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Stocker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Frishman</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1999</pubdate>
            <volume>27</volume>
            <fpage>44</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">148093</pubid>
                  <pubid idtype="pmpid" link="fulltext">9847138</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>Saccharomyces Genome Database (SGD) provides tools to identify and analyze sequences from Saccharomyces cerevisiae and related sequences from other organisms</p>
            </title>
            <aug>
               <au>
                  <snm>Christie</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Weng</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Balakrishnan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Costanzo</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Engel</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Feierbach</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fisk</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Hirschman</snm>
                  <fnm>JE</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>D311</fpage>
            <lpage>4</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308767</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681421</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Genetic and physical maps of Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Weng</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Juvik</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Adler</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dunn</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Riles</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Mortimer</snm>
                  <fnm>RK</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>387</volume>
            <fpage>67</fpage>
            <lpage>73</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9169866</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
