<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-9-404</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Integrative bioinformatics analysis of transcriptional regulatory programs in breast cancer cells</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Niida</snm>
               <fnm>Atsushi</fnm>
               <insr iid="I1"/>
               <email>niida@iam.u-tokyo.ac.jp</email>
            </au>
            <au id="A2">
               <snm>Smith</snm>
               <mi>D</mi>
               <fnm>Andrew</fnm>
               <insr iid="I2"/>
               <email>asmith@cshl.edu</email>
            </au>
            <au id="A3">
               <snm>Imoto</snm>
               <fnm>Seiya</fnm>
               <insr iid="I3"/>
               <email>imoto@ims.u-tokyo.ac.jp</email>
            </au>
            <au id="A4">
               <snm>Tsutsumi</snm>
               <fnm>Shuichi</fnm>
               <insr iid="I4"/>
               <email>shuichi@genome.rcast.u-tokyo.ac.jp</email>
            </au>
            <au id="A5">
               <snm>Aburatani</snm>
               <fnm>Hiroyuki</fnm>
               <insr iid="I4"/>
               <email>haburata-tky@umin.ac.jp</email>
            </au>
            <au id="A6">
               <snm>Zhang</snm>
               <mi>Q</mi>
               <fnm>Michael</fnm>
               <insr iid="I2"/>
               <email>mzhang@cshl.org</email>
            </au>
            <au id="A7">
               <snm>Akiyama</snm>
               <fnm>Tetsu</fnm>
               <insr iid="I1"/>
               <email>akiyama@iam.u-tokyo.ac.jp</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Laboratory of Molecular and Genetic Information, Institute of Molecular and Cellular Biosciences, The University of Tokyo, 1-1-1, Yayoi, Bunkyo-ku, Tokyo, 110-0032, Japan</p>
            </ins>
            <ins id="I2">
               <p>Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11274, USA</p>
            </ins>
            <ins id="I3">
               <p>The Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan</p>
            </ins>
            <ins id="I4">
               <p>Genome Science Division, Research Center for Advanced Science and Technology, The University of Tokyo, 4-6-1 Komaba, Meguro, Tokyo, 153-8904, Japan</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>404</fpage>
         <url>http://www.biomedcentral.com/1471-2105/9/404</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18823535</pubid>
               <pubid idtype="doi">10.1186/1471-2105-9-404</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>17</day>
               <month>1</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>29</day>
               <month>9</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>29</day>
               <month>9</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Niida et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Microarray technology has unveiled transcriptomic differences among tumors of various phenotypes, and, especially, brought great progress in molecular understanding of phenotypic diversity of breast tumors. However, compared with the massive knowledge about the transcriptome, we have surprisingly little knowledge about regulatory mechanisms underling transcriptomic diversity.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>To gain insights into the transcriptional programs that drive tumor progression, we integrated regulatory sequence data and expression profiles of breast cancer into a Bayesian Network, and searched for <it>cis</it>-regulatory motifs statistically associated with given histological grades and prognosis. Our analysis found that motifs bound by ELK1, E2F, NRF1 and NFY are potential regulatory motifs that positively correlate with malignant progression of breast cancer.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The results suggest that these 4 motifs are principal regulatory motifs driving malignant progression of breast cancer. Our method offers a more concise description about transcriptome diversity among breast tumors with different clinical phenotypes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Deregulation of transcriptional programs leads to development and progression of cancer, and many transcription factors (TFs) have been identified as oncogenes or tumor suppressor genes <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. In the last decade, microarray technology has revolutionized cancer biology: microarray-based expression profiling studies have revealed that transcriptomes of cancer cells drastically change during carcinogenesis, and vary among different types of tumors.</p>
         <p>Among many types of cancers, breast cancer has been attracting numerous investigators armed with microarray technology. Human breast tumors are diverse in their histology, prognosis, and responsiveness to treatments. Microarray technology has unveiled transcriptomic differences among tumors of various phenotypes, and brought great progress in molecular understanding of the phenotypic diversity. For example, Perou <it>et al</it>. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> and Sorlie <it>et al</it>. <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> established that breast tumors are classified into five different phenotypic subtypes. van't Veer <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> and van de Vijver <it>et al</it>. <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> accurately divided breast cancer patients into two groups with favorable or unfavorable outcome, suggesting the potential of microarrays as a diagnostic test to select patients who would need adjuvant therapies. Many other studies have also identified gene signatures that enable us to predict distant metastasis or survival <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. However, compared with the massive knowledge about the transcriptome, we have surprisingly little knowledge about regulatory mechanisms underling transcriptomic diversity.</p>
         <p>To analyze the transcriptional regulatory programs, computational approaches that integrate regulatory sequence data with global expression profiles are essential. So far, many approaches have been developed and successfully applied to lower organisms like yeast. For finding motifs that regulate gene expressions in yeast, linear regression-based methods use the correlation between the presence of cis-regulatory motifs and expression values <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. A method employing multivariate adaptive regression spline (MARS) algorithm captured synergistic interactions between regulatory motifs and improved the prediction significantly as compared to that by the linear regression <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. A method based on Bayesian networks also successfully identified combinational gene regulation by multiple motifs in yeast promoter sequences <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. On the other hand, such challenges for gene regulation in higher eukaryotes like human are much harder owing to intrinsic complexity of their regulatory systems, and have just started <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. As for breast cancer, although a small number of studies have also tried to decode transcriptional programs in cancer cell <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>, it also remains to be tested whether transcriptional programs exist that are associated with, and potentially drive, breast tumor malignancy.</p>
         <p>In this study, we propose a new approach to decipher transcriptional programs from cancer microarray data. Our method searches for the most probable motif combination associated with clinical phenotypes such as histological grade or survival time. Our approach has two major novel features. First, extending a previous work <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, we introduce a Bayesian scoring function which can treat continuous expression values. Secondly, instead of using raw expression values, we define a "meta-expression value" based on a correlation between gene expression profiles of a gene and a clinical phenotype, and then search for motifs correlated with meta-expression values. We show that application of our method to breast cancer microarray data successfully identified <it>cis</it>-regulatory motifs which are associated with malignancy of breast cancer.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Methods Overview</p>
            </st>
            <p>To elucidate transcriptional programs in cancer cells, we used a bioinformatics method based on Bayesian networks. We integrated regulatory sequences and global expression profiling data, and searched for <it>cis</it>-regulatory motifs statistically associated with clinical annotation accompanying the expression profiling data (Fig. <figr fid="F1">1</figr>).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Schema of our method</p>
               </caption>
               <text>
                  <p><b>Schema of our method</b>. We first calculate correlations between phenotypes and expression values as meta-expression values, while preparing a sequence feature table by searching promoter sequences for <it>cis</it>-regulatory motifs. <it>Cis</it>-regulatory motif data are prepared from two different sources: already known motifs, which are downloaded from databases, and <it>de novo </it>identified motifs, which were discovered by an <it>ab initio </it>motif finder program, DME. Then, associations between sequence features and meta-expression values were inferred by structure learning of Bayesian networks.</p>
               </text>
               <graphic file="1471-2105-9-404-1"/>
            </fig>
            <p>We prepared three types of data to be integrated: regulatory sequences, regulatory motifs and expression profiling data. For regulatory sequences, we used core promoter sequences spanning 500 bp upstream and 100 bp down stream of the transcriptional start sites (TSSs). The regulatory motif data were prepared as position weight matrices (PWMs) by the following method: the known TF binding motifs were obtained from the TRANSFAC <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> and JASPAR database <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. In addition, to complement missing information of the databases, we obtained potentially novel PWMs using an <it>ab initio </it>motif finder program, Discriminating Matrix Enumerator (DME, Smith <it>et al</it>., 2005). Among similar types of motif finder programs, an exceptional feature of DME is that it identifies motifs based on relative over-representation between two sets of sequences. To obtain the <it>de novo </it>identified motif set, DME was applied to the regulatory sequences of gene groups which display highest and lowest expression values in expression value data. After reducing redundancy of these two PWM sets by clustering, the regulatory sequence of every gene was scored by each PWM. Then, the obtained scores are binarized using multiple thresholds to produce sequence features. Here, each sequence feature indicates the presence of a motif assuming one version of the multiple PWM thresholds. Prepared sequence features are collected to produce a sequence feature table. The sequence feature table is a binary matrix with its rows for genes and its columns for sequence features.</p>
            <p>For expression value data, we prepared a publicly available data set of breast cancer expression profiles <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The data set includes expression values of 16,425 genes in 252 samples and information about a phenotype of each sample including its histological grades and patient prognosis. In our analyses, in stead of using the raw expression values, we used a "meta-expression value" calculated as a kind of correlation of the raw expression values with the phenotypes (e.g. differential expression between two sample groups of different histological grades or correlation with prognosis). Hence, the expression value matrix is transformed to a vector whose element is a meta-expression value of a gene. The expression value data were divided into training data and test data with a ratio of 3:1. Only information from the training data was used in a series of searches including <it>de novo </it>motif search using DME, and the test data were used for statistical evaluation of the result.</p>
            <p>To infer associations between sequence features and the meta-expression values, our method learns parents of a single child node with methods originating from Bayesian network leaning. We assumed a two-layer network structure where sequence features regulate the meta-expression values. In this case, the structural learning indicates that the method identifies the subset of sequence features that regulates the meta-expression values of each gene. This probabilistic approach is motivated by the work of Beer and Tavazoie <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, which successfully predicted gene expression patterns from combinations of regulatory motifs in yeast. This approach can analyze nonlinear synergistic effects between regulatory motifs, which are thought to be more critical for gene regulation in higher eukaryotes. It can also incorporate flexible conditions of sequence features, such as the threshold value for PWM search. In the work of Beer and Tavazoie <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, the expression values were binarized to indicate whether each gene is assigned to an expression cluster. However, it is known that such discretization of data leads to loss of information <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Moreover, results yielded are potentially dependent on the threshold chosen in the discretization <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. To solve this problem, our analysis introduces a new scoring function, which can deal with continuous meta-expression values. When a binary sequence feature table and continuous meta-expression value data are given as the input data, the scoring function represents the posterior probability of a model that represents the dependency of the expression values and a combination of sequence features. By a greedy strategy, we searched for the most probable combination of sequence features so as to maximize the scoring function. Starting from the empty model, we iteratively added a sequence feature to the model as long as the value of the scoring function increases.</p>
         </sec>
         <sec>
            <st>
               <p>Regulatory sequence analysis</p>
            </st>
            <p>For regulatory sequence data, we prepared promoter data of 31,718 human genes from the Ensembl database (Release 40). Additionally, we also retrieved 27,967 mouse promoter sequences for comparative analysis (see below). Assuming the TSS as the start base of the gene assigned in Ensembl, a repeat-masked promoter sequence covering the 500 bp upstream and the 100 bp downstream of the TSS for each gene was extracted from the genome sequences.</p>
            <p>For regulatory motif data, we prepared PWMs. The value <it>f</it><sub><it>ib </it></sub>of a PWM represents frequency of nucleotide base <it>b </it>at the <it>i</it>-th position in a motif. The frequencies of bases in each position are normalized so that &#8721;<sub><it>b </it>&#8712; {<it>a</it>, <it>t</it>, <it>g</it>, <it>c</it>} </sub><it>f</it><sub><it>ib </it></sub>= 1. If <it>f</it><sub><it>ib </it></sub>= 0, we assigned <it>f</it><sub><it>ib </it></sub>= 0.001 to avoid errors in log calculations. We acquired a total of 495 PWMs, which consist of vertebrate 367 PWMs annotated as "good" in TRANSFAC 10.1 <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, 123 PWMs from JASPAR core <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, and 5 PWMs from existing literature <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. We then removed extremely simple or complex PWMs based on their information contents, and made a set of total 449 PWMs. Using the partition around medoids algorithm with the dissimilarity criterion based on the Kullback-Leibler divergence, the 449 PWMs are divided into 250 clusters (see Additional file <supplr sid="S1">1</supplr>). In the following analyses, we used 250 medoids of the clusters as the already known PWMs</p>
            <p>In addition to the already known PWM set, we prepared motifs appearing frequently in promoter sequences of genes with high or low values in the expression value data. For the top 500 and the bottom 500 genes for expression values in the training data, we obtained their promoter sequences (the 500 bp upstream and the 100 bp downstream of the TSS) and those of their mouse homologs. We then searched for motifs relatively overrepresented in either set of sequences using the <it>ab initio </it>motif finder program, DME. For each identified PWM, its quality was evaluated based on classification error rate calculated by the MOTIFCLASS program in CREAD package. In accordance with the classification error rates, PWMs were ranked and clustered so as to reduce redundancy (see Additional file <supplr sid="S1">1</supplr>). We used the highest ranked PWM in each cluster and added them to the <it>de novo </it>identified PWM set.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>
                     <b>supplementary methods, discussions, tables and figures.</b>
                  </p>
               </text>
               <file name="1471-2105-9-404-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>To identify TF binding motifs in promoters, we used the log odds ratio <it>L </it>between a PWM and background base frequency <inline-formula><m:math name="1471-2105-9-404-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>f</m:mi><m:mi>b</m:mi><m:mrow><m:mi>b</m:mi><m:mi>g</m:mi></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOzay2aa0baaSqaaiabdkgaIbqaaiabdkgaIjabdEgaNbaaaaa@3148@</m:annotation></m:semantics></m:math></inline-formula>. We calculated log odds ratio <it>L</it><sub><it>s </it></sub>for every subsequence of each promoter <it>s </it>(including the complementary strand), whose length is equal to the width of the motif of interest, <it>w</it>:</p>
            <p>
               <display-formula>
                  <m:math name="1471-2105-9-404-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>L</m:mi>
                              <m:mi>s</m:mi>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mo>=</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                                 <m:mi>w</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:mi>log</m:mi>
                                 <m:mo>&#8289;</m:mo>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>f</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:msub>
                                          <m:mi>b</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:msubsup>
                                          <m:mi>f</m:mi>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>b</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>b</m:mi>
                                             <m:mi>g</m:mi>
                                          </m:mrow>
                                       </m:msubsup>
                                    </m:mrow>
                                 </m:mfrac>
                              </m:mrow>
                           </m:mstyle>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemitaW0aaSbaaSqaaiabdohaZbqabaGccqGH9aqpdaaeWbqaaiGbcYgaSjabc+gaVjabcEgaNLqbaoaalaaabaGaemOzay2aaSbaaeaacqWGPbqAaeqaaiabdkgaInaaBaaabaGaemyAaKgabeaaaeaacqWGMbGzdaqhaaqaaiabdkgaInaaBaaabaGaemyAaKgabeaaaeaacqWGIbGycqWGNbWzaaaaaaWcbaGaemyAaKMaeyypa0JaeGymaedabaGaem4DaChaniabggHiLdGccqGGUaGlaaa@4921@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>In our analyses, <inline-formula><m:math name="1471-2105-9-404-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>f</m:mi><m:mi>b</m:mi><m:mrow><m:mi>b</m:mi><m:mi>g</m:mi></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOzay2aa0baaSqaaiabdkgaIbqaaiabdkgaIjabdEgaNbaaaaa@3148@</m:annotation></m:semantics></m:math></inline-formula> is the base composition of each promoter, and the maximum of <it>L</it><sub><it>s </it></sub>in a human promoter sequence was taken as the motif score <it>L</it><sup><it>human </it></sup>for the sequence. For human genes whose mouse homologs are registered in Ensembl, <it>L</it><sup><it>mouse </it></sup>is also calculated. Then, <it>L</it><sup><it>human </it></sup>and <it>L</it><sup><it>mouse </it></sup>were averaged to produce the final score <it>L</it>. We found that this incorporation of homologous regulatory information improves our results, while PWM search combined with an ordinary phylogenetic footprinting approach reduces the performance presumably owing to the loss of sensitivity. For human genes that do not have any homologs, we used <it>L</it><sup><it>human </it></sup>as <it>L</it>. We assumed that the sequence has the motif if <it>L </it>is above the <it>p</it>% highest value in the population of all sequences. For all genes, we prepared binary data indicating the presence of the motif in their promoter with <it>p </it>= 5, 10, 15, and 20. This procedure was iterated for all members of the <it>de novo </it>identified and already known PWM set to produce the sequence feature table.</p>
         </sec>
         <sec>
            <st>
               <p>Expression data analysis</p>
            </st>
            <p>Expression data <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> produced by Affymetrix GeneChips were downloaded from the Gene Expression Omnibus (GEO) database at NCBI (The GEO accession number is GSE3494). Absolute expression values of a data set were converted to the log scale and normalized so that the mean is equal to 0 and the variance is equal to 1 in each sample. The probe set IDs were converted to Ensembl gene IDs. In cases that one gene ID matches multiple probe set IDs, the probe set which shows the most variance among the samples was mapped to the gene. For in total 16,425 genes, we prepared meta-expression values for subsequent Bayesian network analysis by calculating differential expression between two sample groups or correlation with survival time as described below. The meta-expression values were also normalized so that the mean is equal to 0 and the variance is equal to 1.</p>
            <p>Since the samples are separated into two groups, we measured differential expression of each gene between the two groups based on t-statistic. To evaluate the significance of differential expression, a null distribution of the t-statistic was produced from 100 data sets with randomly permutated sample labels. Based on the null distribution, the P-value was computed by two-sided test. To correct multiple hypotheses testing, the P-values were converted to Q-values using the qvalue package of R <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
            <p>For Survival time information, we measured univariate correlation of each gene with survival time using the Cox proportional hazards regression method <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, we used the ratio of each regression coefficient to its standard error as the correlation value with poor prognosis.</p>
         </sec>
         <sec>
            <st>
               <p>Bayesian network analysis</p>
            </st>
            <p>For selecting the network structure <it>N </it>of the Bayesian network, we apply a Bayesian approach. According to Bayes' theorem, the posterior probability of the network structure, <it>p</it>(<it>N</it>|<it>D</it>), is proportional to the product of the prior probability of the network structure, <it>p</it>(<it>N</it>) and the likelihood <it>p</it>(<it>D</it>|<it>N</it>) as</p>
            <p>
               <display-formula>
                  <m:math name="1471-2105-9-404-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>p</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>N</m:mi>
                           <m:mo>|</m:mo>
                           <m:mi>D</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>N</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mi>p</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>D</m:mi>
                                 <m:mo>|</m:mo>
                                 <m:mi>N</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>D</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>&#8733;</m:mo>
                           <m:mi>p</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>N</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mi>p</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>D</m:mi>
                           <m:mo>|</m:mo>
                           <m:mi>N</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiCaaNaeiikaGIaemOta4KaeiiFaWNaemiraqKaeiykaKIaeyypa0tcfa4aaSaaaeaacqWGWbaCcqGGOaakcqWGobGtcqGGPaqkcqWGWbaCcqGGOaakcqWGebarcqGG8baFcqWGobGtcqGGPaqkaeaacqWGWbaCcqGGOaakcqWGebarcqGGPaqkaaGccqGHDisTcqWGWbaCcqGGOaakcqWGobGtcqGGPaqkcqWGWbaCcqGGOaakcqWGebarcqGG8baFcqWGobGtcqGGPaqkcqGGUaGlaaa@5154@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Based on this formula, we can infer the network structure <it>N </it>hidden behind the data <it>D</it>. In our analyses, we assumed that a network structure <it>N </it>is composed of a single child node and multiple parent nodes. The single child node has a continuous variable <it>x </it>representing a meta-expression value, and parent nodes have binary variables indicating the presence or absence of sequence features. The data <it>D </it>is composed of <it>M </it>meta-expression values and their sequence feature information. For a given data <it>D</it>, we search parent nodes, <it>i.e</it>., sequence features, for each group of meta-expressions by maximizing <it>p</it>(<it>N</it>|<it>D</it>).</p>
            <sec>
               <st>
                  <p>The likelihood</p>
               </st>
               <p>Suppose that we have gene expression profiles of <it>M </it>genes measured by a number of microarrays. The meta-expression vector, <it>x</it>, is then computed as the <it>M</it>-dimensional vector whose the <it>i</it>th element, <it>x</it><sub><it>i</it></sub>, represents the meta-expression value of the <it>i</it>th gene. We also assume that <it>S </it>is the sequence feature table whose the (<it>i</it>, <it>j</it>)th element, <it>s</it><sub><it>ij</it></sub>, takes one if the <it>i</it>th gene has the <it>j</it>th sequence feature in its promoter region, or zero otherwise. The network structure, <it>N</it>, specifies the set of sequence features as the parents of the meta-expression values. For example, if <it>N </it>specifies the two parents for the meta expression values, we then consider a three nodes Bayesian network with observations <inline-formula><m:math name="1471-2105-9-404-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mo>{</m:mo><m:mo stretchy="false">(</m:mo><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>,</m:mo><m:msub><m:mi>s</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi><m:mn>1</m:mn></m:mrow></m:msub><m:mo>,</m:mo><m:msub><m:mi>s</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi><m:mn>2</m:mn></m:mrow></m:msub><m:mo stretchy="false">)</m:mo><m:mo>:</m:mo><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn><m:mo>,</m:mo><m:mn>...</m:mn><m:mo>,</m:mo><m:mi>M</m:mi><m:mo>}</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaei4EaSNaeiikaGIaemiEaG3aaSbaaSqaaiabdMgaPbqabaGccqGGSaalcqWGZbWCdaWgaaWcbaGaemyAaKMaemOAaOMaeGymaedabeaakiabcYcaSiabdohaZnaaBaaaleaacqWGPbqAcqWGQbGAcqaIYaGmaeqaaOGaeiykaKIaeiOoaOJaemyAaKMaeyypa0JaeGymaeJaeiilaWIaeiOla4IaeiOla4IaeiOla4IaeiilaWIaemyta0KaeiyFa0haaa@49C9@</m:annotation></m:semantics></m:math></inline-formula>, where <it>j</it><sub>1</sub>, <it>j</it><sub>2 </sub>&#8712; {1,..., <it>n</it>} and <it>j</it><sub>1 </sub>&#8800; <it>j</it><sub>2</sub>. Here <it>n </it>is the number of columns in <it>S</it>, <it>i.e</it>., the number of sequence features of interest. Our structural learning of Bayesian networks is to find the optimal combination of sequence features as the parents of meta-expression values.</p>
               <p>In the problem stated above, we would like to discuss our model for meta-expressions when the networks structure is given. Since the information of sequence features take binary variables, <it>i.e</it>., 0 or 1, the parent variables can theoretically take <inline-formula><m:math name="1471-2105-9-404-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msup><m:mn>2</m:mn><m:mrow><m:msub><m:mi>n</m:mi><m:mi>p</m:mi></m:msub></m:mrow></m:msup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeGOmaiZaaWbaaSqabeaacqWGUbGBdaWgaaadbaGaemiCaahabeaaaaaaaa@2FEF@</m:annotation></m:semantics></m:math></inline-formula> patterns, where <it>n</it><sub><it>p </it></sub>is the number of parents specified by the network structure. In the above example, the network model chooses two motifs as the parents and there are four patterns, {(0, 0), (0, 1), (1, 0), (1, 1)}, that the parents can take. In practice, since it is a possible case that we cannot find all the patterns of specified parents in <it>S </it>for large <it>n</it><sub><it>p</it></sub>, we denote the number of observed patterns by <it>q </it>(&#8804; <inline-formula><m:math name="1471-2105-9-404-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msup><m:mn>2</m:mn><m:mrow><m:msub><m:mi>n</m:mi><m:mi>p</m:mi></m:msub></m:mrow></m:msup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeGOmaiZaaWbaaSqabeaacqWGUbGBdaWgaaadbaGaemiCaahabeaaaaaaaa@2FEF@</m:annotation></m:semantics></m:math></inline-formula>). Therefore, if we specified the network structure, the meta-expression values can be separated into <it>q </it>exclusive groups. That is, the parents of the meta-expressions in each group show the same pattern.</p>
               <p>More mathematically, let <b><it>s</it></b><sub><it>i </it></sub>= {<it>s</it><sub><it>i</it>1</sub>,..., <it>s</it><sub><it>i</it>, <it>n</it></sub>} be the the <it>i</it>th row of <it>S</it>. Based on the specified structure <it>N</it>, we define the subset <inline-formula><m:math name="1471-2105-9-404-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>s</m:mi><m:mi>i</m:mi></m:msub><m:mo stretchy="false">(</m:mo><m:mi>N</m:mi><m:mo stretchy="false">)</m:mo><m:mo>=</m:mo><m:mo>{</m:mo><m:msub><m:mi>s</m:mi><m:mrow><m:mi>i</m:mi><m:mo>,</m:mo><m:msub><m:mtext>p</m:mtext><m:mn>1</m:mn></m:msub></m:mrow></m:msub><m:mo>,</m:mo><m:mn>...</m:mn><m:mo>,</m:mo><m:msub><m:mi>s</m:mi><m:mrow><m:mi>i</m:mi><m:mo>,</m:mo><m:msub><m:mtext>p</m:mtext><m:mi>r</m:mi></m:msub></m:mrow></m:msub><m:mo>}</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaacbmGae83Cam3aaSbaaSqaaiabdMgaPbqabaGccqGGOaakcqWGobGtcqGGPaqkcqGH9aqpcqGG7bWEcqWGZbWCdaWgaaWcbaGaemyAaKMaeiilaWIaeeiCaa3aaSbaaWqaaiabigdaXaqabaaaleqaaOGaeiilaWIaeiOla4IaeiOla4IaeiOla4IaeiilaWIaem4Cam3aaSbaaSqaaiabdMgaPjabcYcaSiabbchaWnaaBaaameaacqWGYbGCaeqaaaWcbeaakiabc2ha9baa@4781@</m:annotation></m:semantics></m:math></inline-formula> as the parents of meta-expressions, where {p<sub>1</sub>,...,p<sub><it>r</it></sub>} &#8834; {1,..., <it>n</it>}. We then have the following decomposition:</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-9-404-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:mtable columnalign="left">
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mi>p</m:mi>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mi>D</m:mi>
                                          <m:mo>|</m:mo>
                                          <m:mi>N</m:mi>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mo>=</m:mo>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mstyle displaystyle="true">
                                             <m:munderover>
                                                <m:mo>&#8719;</m:mo>
                                                <m:mrow>
                                                   <m:mi>i</m:mi>
                                                   <m:mo>=</m:mo>
                                                   <m:mn>1</m:mn>
                                                </m:mrow>
                                                <m:mi>M</m:mi>
                                             </m:munderover>
                                             <m:mrow>
                                                <m:mi>p</m:mi>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:msub>
                                                   <m:mi>x</m:mi>
                                                   <m:mi>i</m:mi>
                                                </m:msub>
                                                <m:mo>,</m:mo>
                                                <m:msub>
                                                   <m:mi>s</m:mi>
                                                   <m:mi>i</m:mi>
                                                </m:msub>
                                                <m:mo>|</m:mo>
                                                <m:mi>N</m:mi>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                          </m:mstyle>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow/>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mo>=</m:mo>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mstyle displaystyle="true">
                                             <m:munderover>
                                                <m:mo>&#8719;</m:mo>
                                                <m:mrow>
                                                   <m:mi>i</m:mi>
                                                   <m:mo>=</m:mo>
                                                   <m:mn>1</m:mn>
                                                </m:mrow>
                                                <m:mi>M</m:mi>
                                             </m:munderover>
                                             <m:mrow>
                                                <m:mi>p</m:mi>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:msub>
                                                   <m:mi>x</m:mi>
                                                   <m:mi>i</m:mi>
                                                </m:msub>
                                                <m:mo>|</m:mo>
                                                <m:msub>
                                                   <m:mi>s</m:mi>
                                                   <m:mi>i</m:mi>
                                                </m:msub>
                                                <m:mo>,</m:mo>
                                                <m:mi>N</m:mi>
                                                <m:mo stretchy="false">)</m:mo>
                                                <m:mi>p</m:mi>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:msub>
                                                   <m:mi>s</m:mi>
                                                   <m:mi>i</m:mi>
                                                </m:msub>
                                                <m:mo>|</m:mo>
                                                <m:mi>N</m:mi>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                          </m:mstyle>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow/>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mo>&#8733;</m:mo>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mstyle displaystyle="true">
                                             <m:munderover>
                                                <m:mo>&#8719;</m:mo>
                                                <m:mrow>
                                                   <m:mi>i</m:mi>
                                                   <m:mo>=</m:mo>
                                                   <m:mn>1</m:mn>
                                                </m:mrow>
                                                <m:mi>M</m:mi>
                                             </m:munderover>
                                             <m:mrow>
                                                <m:mi>p</m:mi>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:msub>
                                                   <m:mi>x</m:mi>
                                                   <m:mi>i</m:mi>
                                                </m:msub>
                                                <m:mo>|</m:mo>
                                                <m:msub>
                                                   <m:mi>s</m:mi>
                                                   <m:mi>i</m:mi>
                                                </m:msub>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:mi>N</m:mi>
                                                <m:mo stretchy="false">)</m:mo>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                          </m:mstyle>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow/>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mo>=</m:mo>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mstyle displaystyle="true">
                                             <m:munderover>
                                                <m:mo>&#8719;</m:mo>
                                                <m:mrow>
                                                   <m:mi>k</m:mi>
                                                   <m:mo>=</m:mo>
                                                   <m:mn>1</m:mn>
                                                </m:mrow>
                                                <m:mi>q</m:mi>
                                             </m:munderover>
                                             <m:mrow>
                                                <m:mi>p</m:mi>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:msub>
                                                   <m:mi>d</m:mi>
                                                   <m:mi>k</m:mi>
                                                </m:msub>
                                                <m:mo>|</m:mo>
                                                <m:mi>p</m:mi>
                                                <m:msub>
                                                   <m:mi>a</m:mi>
                                                   <m:mi>k</m:mi>
                                                </m:msub>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                          </m:mstyle>
                                          <m:mo>,</m:mo>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeaabqWaaaaabaGaemiCaaNaeiikaGIaemiraqKaeiiFaWNaemOta4KaeiykaKcabaGaeyypa0dabaWaaebCaeaacqWGWbaCcqGGOaakcqWG4baEdaWgaaWcbaGaemyAaKgabeaakiabcYcaSGqadiab=nhaZnaaBaaaleaacqWGPbqAaeqaaOGaeiiFaWNaemOta4KaeiykaKcaleaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGnbqta0Gaey4dIunaaOqaaaqaaiabg2da9aqaamaarahabaGaemiCaaNaeiikaGIaemiEaG3aaSbaaSqaaiabdMgaPbqabaGccqGG8baFcqWFZbWCdaWgaaWcbaGaemyAaKgabeaakiabcYcaSiabd6eaojabcMcaPiabdchaWjabcIcaOiab=nhaZnaaBaaaleaacqWGPbqAaeqaaOGaeiiFaWNaemOta4KaeiykaKcaleaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGnbqta0Gaey4dIunaaOqaaaqaaiabg2Hi1cqaamaarahabaGaemiCaaNaeiikaGIaemiEaG3aaSbaaSqaaiabdMgaPbqabaGccqGG8baFcqWFZbWCdaWgaaWcbaGaemyAaKgabeaakiabcIcaOiabd6eaojabcMcaPiabcMcaPaWcbaGaemyAaKMaeyypa0JaeGymaedabaGaemyta0eaniabg+GivdaakeaaaeaacqGH9aqpaeaadaqeWbqaaiabdchaWjabcIcaOiab=rgaKnaaBaaaleaacqWGRbWAaeqaaOGaeiiFaWNae8hCaaNae8xyae2aaSbaaSqaaiabdUgaRbqabaGccqGGPaqkaSqaaiabdUgaRjabg2da9iabigdaXaqaaiabdghaXbqdcqGHpis1aOGaeiilaWcaaaaa@8ED2@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>where <b><it>pa</it></b><sub><it>k </it></sub>is the <it>k</it>th pattern of parent motifs and <b><it>d</it></b><sub><it>k </it></sub>is the set of meta-expressions that have the same sequence feature information restricted by the parent motifs. For example, if <b><it>s</it></b><sub>1</sub>(<it>N</it>) and <b><it>s</it></b><sub>2</sub>(<it>N</it>) are equal to <b><it>pa</it></b><sub>1</sub>, then <it>x</it><sub>1 </sub>and <it>x</it><sub>2 </sub>are included in <b><it>d</it></b><sub>1</sub>. Note that we assume <it>p</it>(<b><it>s</it></b><sub><it>i</it></sub>|<it>N</it>) = <it>p</it>(<b><it>s</it></b><sub><it>i</it></sub>) follows uniform distribution and is independent from the selection of network structure <it>N</it>.</p>
               <p>We next consider a statistical model for <it>p</it>(<b><it>d</it></b><sub><it>k</it></sub>|<b><it>pa</it></b><sub><it>k</it></sub>). By omitting the subscript <it>k </it>and the parent state, we denote <it>p</it>(<b><it>d</it></b><sub><it>k</it></sub>|<b><it>pa</it></b><sub><it>k</it></sub>) as <it>p</it>(<b><it>d</it></b>). Suppose that <it>M</it><sub><it>k </it></sub>meta-expression values are included in the group, <it>i.e</it>., <inline-formula><m:math name="1471-2105-9-404-i8" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>d</m:mi><m:mo>=</m:mo><m:mo>{</m:mo><m:msup><m:mi>x</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msup><m:mo>,</m:mo><m:mn>...</m:mn><m:mo>,</m:mo><m:msup><m:mi>x</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mi>M</m:mi><m:mi>k</m:mi></m:msub><m:mo stretchy="false">)</m:mo></m:mrow></m:msup><m:mo>}</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemizaqMaeyypa0Jaei4EaSNaemiEaG3aaWbaaSqabeaacqGGOaakcqaIXaqmcqGGPaqkaaGccqGGSaalcqGGUaGlcqGGUaGlcqGGUaGlcqGGSaalcqWG4baEdaahaaWcbeqaaiabcIcaOiabd2eannaaBaaameaacqWGRbWAaeqaaSGaeiykaKcaaOGaeiyFa0haaa@4006@</m:annotation></m:semantics></m:math></inline-formula>. Note that we also denote <it>M</it><sub><it>k </it></sub>as <it>M </it>hereafter. We fit a normal distribution to each element of <b><it>d </it></b>by</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-9-404-i9" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:mtable>
                                 <m:mtr>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:mi>&#981;</m:mi>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:msup>
                                             <m:mi>x</m:mi>
                                             <m:mrow>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:mi>m</m:mi>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                          </m:msup>
                                          <m:mo>|</m:mo>
                                          <m:mi>&#956;</m:mi>
                                          <m:mo>,</m:mo>
                                          <m:mi>&#964;</m:mi>
                                          <m:mo stretchy="false">)</m:mo>
                                          <m:mo>=</m:mo>
                                          <m:msqrt>
                                             <m:mrow>
                                                <m:mfrac>
                                                   <m:mi>&#964;</m:mi>
                                                   <m:mrow>
                                                      <m:mn>2</m:mn>
                                                      <m:mi>&#960;</m:mi>
                                                   </m:mrow>
                                                </m:mfrac>
                                             </m:mrow>
                                          </m:msqrt>
                                          <m:mi>exp</m:mi>
                                          <m:mo>&#8289;</m:mo>
                                          <m:mrow>
                                             <m:mo>{</m:mo>
                                             <m:mrow>
                                                <m:mo>&#8722;</m:mo>
                                                <m:mfrac>
                                                   <m:mrow>
                                                      <m:mi>&#964;</m:mi>
                                                      <m:msup>
                                                         <m:mrow>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:msup>
                                                               <m:mi>x</m:mi>
                                                               <m:mrow>
                                                                  <m:mo stretchy="false">(</m:mo>
                                                                  <m:mi>m</m:mi>
                                                                  <m:mo stretchy="false">)</m:mo>
                                                               </m:mrow>
                                                            </m:msup>
                                                            <m:mo>&#8722;</m:mo>
                                                            <m:mi>&#956;</m:mi>
                                                            <m:mo stretchy="false">)</m:mo>
                                                         </m:mrow>
                                                         <m:mn>2</m:mn>
                                                      </m:msup>
                                                   </m:mrow>
                                                   <m:mn>2</m:mn>
                                                </m:mfrac>
                                             </m:mrow>
                                             <m:mo>}</m:mo>
                                          </m:mrow>
                                          <m:mo>,</m:mo>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:mi>m</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                          <m:mo>,</m:mo>
                                          <m:mn>...</m:mn>
                                          <m:mo>,</m:mo>
                                          <m:mi>M</m:mi>
                                          <m:mo>,</m:mo>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaaiabew9aMjabcIcaOiabdIha4naaCaaaleqabaGaeiikaGIaemyBa0MaeiykaKcaaOGaeiiFaWNaeqiVd0MaeiilaWIaeqiXdqNaeiykaKIaeyypa0ZaaOaaaKqbagaadaWcaaqaaiabes8a0bqaaiabikdaYiabec8aWbaaaSqabaGccyGGLbqzcqGG4baEcqGGWbaCdaGadaqaaiabgkHiTKqbaoaalaaabaGaeqiXdqNaeiikaGIaemiEaG3aaWbaaeqabaGaeiikaGIaemyBa0MaeiykaKcaaiabgkHiTiabeY7aTjabcMcaPmaaCaaabeqaaiabikdaYaaaaeaacqaIYaGmaaaakiaawUhacaGL9baacqGGSaalaeaacqWGTbqBcqGH9aqpcqaIXaqmcqGGSaalcqGGUaGlcqGGUaGlcqGGUaGlcqGGSaalcqWGnbqtcqGGSaalaaaaaa@5FF9@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>where <it>&#981;</it>(<it>x</it>|<it>&#956;</it>, <it>&#964;</it>) is the density of normal distribution with mean <it>&#956; </it>and variance <it>&#964;</it><sup>-1</sup>. Note that <it>&#964; </it>is called precision. We assume that the joint prior density of mean and precision, <it>&#956; </it>and <it>&#964;</it>, is decomposed by</p>
               <p><it>p</it>(<it>&#956;</it>, <it>&#964;</it>) = <it>p</it>(<it>&#956;</it>|<it>&#964;</it>)<it>p</it>(<it>&#964;</it>).</p>
               <p>The conditional density of <it>&#956; </it>is set as</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-9-404-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:mi>p</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>&#956;</m:mi>
                              <m:mo>|</m:mo>
                              <m:mi>&#964;</m:mi>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>=</m:mo>
                              <m:mi>&#981;</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>&#956;</m:mi>
                              <m:mo>|</m:mo>
                              <m:msub>
                                 <m:mi>&#956;</m:mi>
                                 <m:mn>0</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>&#955;</m:mi>
                                 <m:mn>0</m:mn>
                              </m:msub>
                              <m:mi>&#964;</m:mi>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>=</m:mo>
                              <m:msqrt>
                                 <m:mrow>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>&#955;</m:mi>
                                             <m:mn>0</m:mn>
                                          </m:msub>
                                          <m:mi>&#964;</m:mi>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:mn>2</m:mn>
                                          <m:mi>&#960;</m:mi>
                                       </m:mrow>
                                    </m:mfrac>
                                 </m:mrow>
                              </m:msqrt>
                              <m:mi>exp</m:mi>
                              <m:mo>&#8289;</m:mo>
                              <m:mrow>
                                 <m:mo>{</m:mo>
                                 <m:mrow>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>&#955;</m:mi>
                                             <m:mn>0</m:mn>
                                          </m:msub>
                                          <m:mi>&#964;</m:mi>
                                          <m:msup>
                                             <m:mrow>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:mi>&#956;</m:mi>
                                                <m:mo>&#8722;</m:mo>
                                                <m:msub>
                                                   <m:mi>&#956;</m:mi>
                                                   <m:mn>0</m:mn>
                                                </m:msub>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                       </m:mrow>
                                       <m:mn>2</m:mn>
                                    </m:mfrac>
                                 </m:mrow>
                                 <m:mo>}</m:mo>
                              </m:mrow>
                              <m:mo>,</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiCaaNaeiikaGIaeqiVd0MaeiiFaWNaeqiXdqNaeiykaKIaeyypa0Jaeqy1dyMaeiikaGIaeqiVd0MaeiiFaWNaeqiVd02aaSbaaSqaaiabicdaWaqabaGccqGGSaalcqaH7oaBdaWgaaWcbaGaeGimaadabeaakiabes8a0jabcMcaPiabg2da9maakaaajuaGbaWaaSaaaeaacqaH7oaBdaWgaaqaaiabicdaWaqabaGaeqiXdqhabaGaeGOmaiJaeqiWdahaaaWcbeaakiGbcwgaLjabcIha4jabcchaWnaacmaabaGaeyOeI0scfa4aaSaaaeaacqaH7oaBdaWgaaqaaiabicdaWaqabaGaeqiXdqNaeiikaGIaeqiVd0MaeyOeI0IaeqiVd02aaSbaaeaacqaIWaamaeqaaiabcMcaPmaaCaaabeqaaiabikdaYaaaaeaacqaIYaGmaaaakiaawUhacaGL9baacqGGSaalaaa@63C8@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>where <it>&#956;</it><sub>0 </sub>and <it>&#955;</it><sub>0 </sub>are hyperparameters. The marginal distribution of the precision, <it>&#964;</it>, is set by the density of gamma distribution with hyperparemeters, <it>&#945;</it><sub>0 </sub>and <it>&#946;</it><sub>0</sub>, and given by</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-9-404-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:mi>p</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>&#964;</m:mi>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>=</m:mo>
                              <m:mi>g</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>&#964;</m:mi>
                              <m:mo>|</m:mo>
                              <m:msub>
                                 <m:mi>&#945;</m:mi>
                                 <m:mn>0</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>&#946;</m:mi>
                                 <m:mn>0</m:mn>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>=</m:mo>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:msubsup>
                                       <m:mi>&#946;</m:mi>
                                       <m:mn>0</m:mn>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>&#945;</m:mi>
                                             <m:mn>0</m:mn>
                                          </m:msub>
                                       </m:mrow>
                                    </m:msubsup>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mi>&#915;</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:msub>
                                       <m:mi>&#945;</m:mi>
                                       <m:mn>0</m:mn>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                              </m:mfrac>
                              <m:msup>
                                 <m:mi>&#964;</m:mi>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>&#945;</m:mi>
                                       <m:mn>0</m:mn>
                                    </m:msub>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                              </m:msup>
                              <m:mi>exp</m:mi>
                              <m:mo>&#8289;</m:mo>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mo>&#8722;</m:mo>
                              <m:msub>
                                 <m:mi>&#946;</m:mi>
                                 <m:mn>0</m:mn>
                              </m:msub>
                              <m:mi>&#964;</m:mi>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>.</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiCaaNaeiikaGIaeqiXdqNaeiykaKIaeyypa0Jaem4zaCMaeiikaGIaeqiXdqNaeiiFaWNaeqySde2aaSbaaSqaaiabicdaWaqabaGccqGGSaalcqaHYoGydaWgaaWcbaGaeGimaadabeaakiabcMcaPiabg2da9KqbaoaalaaabaGaeqOSdi2aa0baaeaacqaIWaamaeaacqaHXoqydaWgaaqaaiabicdaWaqabaaaaaqaaiabfo5ahjabcIcaOiabeg7aHnaaBaaabaGaeGimaadabeaacqGGPaqkaaGccqaHepaDdaahaaWcbeqaaiabeg7aHnaaBaaameaacqaIWaamaeqaaSGaeyOeI0IaeGymaedaaOGagiyzauMaeiiEaGNaeiiCaaNaeiikaGIaeyOeI0IaeqOSdi2aaSbaaSqaaiabicdaWaqabaGccqaHepaDcqGGPaqkcqGGUaGlaaa@5E73@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>In this setting, <it>p</it>(<it>&#956;</it>, <it>&#964;</it>) is the density of normal-gamma distribution with hyperparameters, <it>&#956;</it><sub>0</sub>, <it>&#955;</it><sub>0</sub>, <it>&#945;</it><sub>0 </sub>and <it>&#946;</it><sub>0</sub>. Hence, the marginal likelihood <it>p</it>(<b><it>d</it></b>) is given by</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-9-404-i12" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:mtable columnalign="left">
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mi>p</m:mi>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mi>d</m:mi>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mo>=</m:mo>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mstyle displaystyle="true">
                                             <m:mrow>
                                                <m:msubsup>
                                                   <m:mo>&#8747;</m:mo>
                                                   <m:mrow>
                                                      <m:mi>&#964;</m:mi>
                                                      <m:mo>=</m:mo>
                                                      <m:mn>0</m:mn>
                                                   </m:mrow>
                                                   <m:mi>&#8734;</m:mi>
                                                </m:msubsup>
                                                <m:mrow>
                                                   <m:mstyle displaystyle="true">
                                                      <m:mrow>
                                                         <m:msubsup>
                                                            <m:mo>&#8747;</m:mo>
                                                            <m:mrow>
                                                               <m:mi>&#956;</m:mi>
                                                               <m:mo>=</m:mo>
                                                               <m:mo>&#8722;</m:mo>
                                                               <m:mi>&#8734;</m:mi>
                                                            </m:mrow>
                                                            <m:mi>&#8734;</m:mi>
                                                         </m:msubsup>
                                                         <m:mrow>
                                                            <m:mi>p</m:mi>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:mi>d</m:mi>
                                                            <m:mo>|</m:mo>
                                                            <m:mi>&#956;</m:mi>
                                                            <m:mo>,</m:mo>
                                                            <m:mi>&#964;</m:mi>
                                                            <m:mo stretchy="false">)</m:mo>
                                                            <m:mi>p</m:mi>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:mi>&#956;</m:mi>
                                                            <m:mo>,</m:mo>
                                                            <m:mi>&#964;</m:mi>
                                                            <m:mo stretchy="false">)</m:mo>
                                                            <m:mi>d</m:mi>
                                                            <m:mi>&#956;</m:mi>
                                                            <m:mi>d</m:mi>
                                                            <m:mi>&#964;</m:mi>
                                                         </m:mrow>
                                                      </m:mrow>
                                                   </m:mstyle>
                                                </m:mrow>
                                             </m:mrow>
                                          </m:mstyle>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow/>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mo>=</m:mo>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mstyle displaystyle="true">
                                             <m:mrow>
                                                <m:msubsup>
                                                   <m:mo>&#8747;</m:mo>
                                                   <m:mrow>
                                                      <m:mi>&#964;</m:mi>
                                                      <m:mo>=</m:mo>
                                                      <m:mn>0</m:mn>
                                                   </m:mrow>
                                                   <m:mi>&#8734;</m:mi>
                                                </m:msubsup>
                                                <m:mrow>
                                                   <m:mstyle displaystyle="true">
                                                      <m:mrow>
                                                         <m:msubsup>
                                                            <m:mo>&#8747;</m:mo>
                                                            <m:mrow>
                                                               <m:mi>&#956;</m:mi>
                                                               <m:mo>=</m:mo>
                                                               <m:mo>&#8722;</m:mo>
                                                               <m:mi>&#8734;</m:mi>
                                                            </m:mrow>
                                                            <m:mi>&#8734;</m:mi>
                                                         </m:msubsup>
                                                         <m:mrow>
                                                            <m:mrow>
                                                               <m:mo>{</m:mo>
                                                               <m:mrow>
                                                                  <m:mstyle displaystyle="true">
                                                                     <m:munderover>
                                                                        <m:mo>&#8719;</m:mo>
                                                                        <m:mrow>
                                                                           <m:mi>m</m:mi>
                                                                           <m:mo>=</m:mo>
                                                                           <m:mn>1</m:mn>
                                                                        </m:mrow>
                                                                        <m:mi>M</m:mi>
                                                                     </m:munderover>
                                                                     <m:mrow>
                                                                        <m:mi>&#981;</m:mi>
                                                                        <m:mo stretchy="false">(</m:mo>
                                                                        <m:msup>
                                                                           <m:mi>x</m:mi>
                                                                           <m:mrow>
                                                                              <m:mo stretchy="false">(</m:mo>
                                                                              <m:mi>m</m:mi>
                                                                              <m:mo stretchy="false">)</m:mo>
                                                                           </m:mrow>
                                                                        </m:msup>
                                                                        <m:mo>|</m:mo>
                                                                        <m:mi>&#956;</m:mi>
                                                                        <m:mo>,</m:mo>
                                                                        <m:mi>&#964;</m:mi>
                                                                        <m:mo stretchy="false">)</m:mo>
                                                                     </m:mrow>
                                                                  </m:mstyle>
                                                               </m:mrow>
                                                               <m:mo>}</m:mo>
                                                            </m:mrow>
                                                            <m:mi>&#981;</m:mi>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:mi>&#956;</m:mi>
                                                            <m:mo>|</m:mo>
                                                            <m:msub>
                                                               <m:mi>&#956;</m:mi>
                                                               <m:mn>0</m:mn>
                                                            </m:msub>
                                                            <m:mo>,</m:mo>
                                                            <m:msub>
                                                               <m:mi>&#955;</m:mi>
                                                               <m:mn>0</m:mn>
                                                            </m:msub>
                                                            <m:mi>&#964;</m:mi>
                                                            <m:mo stretchy="false">)</m:mo>
                                                            <m:mi>g</m:mi>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:mi>&#964;</m:mi>
                                                            <m:mo>|</m:mo>
                                                            <m:msub>
                                                               <m:mi>&#945;</m:mi>
                                                               <m:mn>0</m:mn>
                                                            </m:msub>
                                                            <m:mo>,</m:mo>
                                                            <m:msub>
                                                               <m:mi>&#946;</m:mi>
                                                               <m:mn>0</m:mn>
                                                            </m:msub>
                                                            <m:mo stretchy="false">)</m:mo>
                                                            <m:mi>d</m:mi>
                                                            <m:mi>&#956;</m:mi>
                                                            <m:mi>d</m:mi>
                                                            <m:mi>&#964;</m:mi>
                                                            <m:mo>.</m:mo>
                                                         </m:mrow>
                                                      </m:mrow>
                                                   </m:mstyle>
                                                </m:mrow>
                                             </m:mrow>
                                          </m:mstyle>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeaabiWaaaqaaiabdchaWjabcIcaOGqadiab=rgaKjabcMcaPaqaaiabg2da9aqaamaapedabaWaa8qmaeaacqWGWbaCcqGGOaakcqWFKbazcqGG8baFcqaH8oqBcqGGSaalcqaHepaDcqGGPaqkcqWGWbaCcqGGOaakcqaH8oqBcqGGSaalcqaHepaDcqGGPaqkcqWGKbazcqaH8oqBcqWGKbazcqaHepaDaSqaaiabeY7aTjabg2da9iabgkHiTiabg6HiLcqaaiabg6HiLcqdcqGHRiI8aaWcbaGaeqiXdqNaeyypa0JaeGimaadabaGaeyOhIukaniabgUIiYdaakeaaaeaacqGH9aqpaeaadaWdXaqaamaapedabaWaaiWaaeaadaqeWbqaaiabew9aMjabcIcaOiabdIha4naaCaaaleqabaGaeiikaGIaemyBa0MaeiykaKcaaOGaeiiFaWNaeqiVd0MaeiilaWIaeqiXdqNaeiykaKcaleaacqWGTbqBcqGH9aqpcqaIXaqmaeaacqWGnbqta0Gaey4dIunaaOGaay5Eaiaaw2haaiabew9aMjabcIcaOiabeY7aTjabcYha8jabeY7aTnaaBaaaleaacqaIWaamaeqaaOGaeiilaWIaeq4UdW2aaSbaaSqaaiabicdaWaqabaGccqaHepaDcqGGPaqkcqWGNbWzcqGGOaakcqaHepaDcqGG8baFcqaHXoqydaWgaaWcbaGaeGimaadabeaakiabcYcaSiabek7aInaaBaaaleaacqaIWaamaeqaaOGaeiykaKIaemizaqMaeqiVd0MaemizaqMaeqiXdqNaeiOla4caleaacqaH8oqBcqGH9aqpcqGHsislcqGHEisPaeaacqGHEisPa0Gaey4kIipaaSqaaiabes8a0jabg2da9iabicdaWaqaaiabg6HiLcqdcqGHRiI8aaaaaaa@A463@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>Since the normal-gamma distribution is a conjugate prior of normal distribution model, the integral in the marginal likelihood can analytically be calculated. Hence, by putting</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-9-404-i13" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:mtable columnalign="left">
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mover accent="true">
                                          <m:mi>x</m:mi>
                                          <m:mo>&#175;</m:mo>
                                       </m:mover>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mo>=</m:mo>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mfrac>
                                             <m:mn>1</m:mn>
                                             <m:mi>M</m:mi>
                                          </m:mfrac>
                                          <m:mstyle displaystyle="true">
                                             <m:munderover>
                                                <m:mo>&#8721;</m:mo>
                                                <m:mrow>
                                                   <m:mi>m</m:mi>
                                                   <m:mo>=</m:mo>
                                                   <m:mn>1</m:mn>
                                                </m:mrow>
                                                <m:mi>M</m:mi>
                                             </m:munderover>
                                             <m:mrow>
                                                <m:msup>
                                                   <m:mi>x</m:mi>
                                                   <m:mrow>
                                                      <m:mo stretchy="false">(</m:mo>
                                                      <m:mi>m</m:mi>
                                                      <m:mo stretchy="false">)</m:mo>
                                                   </m:mrow>
                                                </m:msup>
                                             </m:mrow>
                                          </m:mstyle>
                                          <m:mo>,</m:mo>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>&#955;</m:mi>
                                             <m:mn>1</m:mn>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mo>=</m:mo>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>&#955;</m:mi>
                                             <m:mn>0</m:mn>
                                          </m:msub>
                                          <m:mo>+</m:mo>
                                          <m:mi>M</m:mi>
                                          <m:mo>,</m:mo>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>&#956;</m:mi>
                                             <m:mn>1</m:mn>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mo>=</m:mo>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mfrac>
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mi>&#955;</m:mi>
                                                   <m:mn>0</m:mn>
                                                </m:msub>
                                                <m:msub>
                                                   <m:mi>&#956;</m:mi>
                                                   <m:mn>0</m:mn>
                                                </m:msub>
                                                <m:mo>+</m:mo>
                                                <m:mi>M</m:mi>
                                                <m:mover accent="true">
                                                   <m:mi>x</m:mi>
                                                   <m:mo>&#175;</m:mo>
                                                </m:mover>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mi>&#955;</m:mi>
                                                   <m:mn>1</m:mn>
                                                </m:msub>
                                             </m:mrow>
                                          </m:mfrac>
                                          <m:mo>,</m:mo>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>&#945;</m:mi>
                                             <m:mn>1</m:mn>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mo>=</m:mo>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>&#945;</m:mi>
                                             <m:mn>0</m:mn>
                                          </m:msub>
                                          <m:mo>+</m:mo>
                                          <m:mfrac>
                                             <m:mi>M</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:mfrac>
                                          <m:mo>,</m:mo>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>&#946;</m:mi>
                                             <m:mn>1</m:mn>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mo>=</m:mo>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>&#946;</m:mi>
                                             <m:mn>0</m:mn>
                                          </m:msub>
                                          <m:mo>+</m:mo>
                                          <m:mfrac>
                                             <m:mn>1</m:mn>
                                             <m:mn>2</m:mn>
                                          </m:mfrac>
                                          <m:mstyle displaystyle="true">
                                             <m:munderover>
                                                <m:mo>&#8721;</m:mo>
                                                <m:mrow>
                                                   <m:mi>m</m:mi>
                                                   <m:mo>=</m:mo>
                                                   <m:mn>1</m:mn>
                                                </m:mrow>
                                                <m:mi>M</m:mi>
                                             </m:munderover>
                                             <m:mrow>
                                                <m:msup>
                                                   <m:mrow>
                                                      <m:mo stretchy="false">(</m:mo>
                                                      <m:msup>
                                                         <m:mi>x</m:mi>
                                                         <m:mrow>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:mi>m</m:mi>
                                                            <m:mo stretchy="false">)</m:mo>
                                                         </m:mrow>
                                                      </m:msup>
                                                      <m:mo>&#8722;</m:mo>
                                                      <m:mover accent="true">
                                                         <m:mi>x</m:mi>
                                                         <m:mo>&#175;</m:mo>
                                                      </m:mover>
                                                      <m:mo stretchy="false">)</m:mo>
                                                   </m:mrow>
                                                   <m:mn>2</m:mn>
                                                </m:msup>
                                                <m:mo>+</m:mo>
                                                <m:mfrac>
                                                   <m:mrow>
                                                      <m:mi>M</m:mi>
                                                      <m:msub>
                                                         <m:mi>&#955;</m:mi>
                                                         <m:mn>0</m:mn>
                                                      </m:msub>
                                                      <m:msup>
                                                         <m:mrow>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:mover accent="true">
                                                               <m:mi>x</m:mi>
                                                               <m:mo>&#175;</m:mo>
                                                            </m:mover>
                                                            <m:mo>&#8722;</m:mo>
                                                            <m:msub>
                                                               <m:mi>&#956;</m:mi>
                                                               <m:mn>0</m:mn>
                                                            </m:msub>
                                                            <m:mo stretchy="false">)</m:mo>
                                                         </m:mrow>
                                                         <m:mn>2</m:mn>
                                                      </m:msup>
                                                   </m:mrow>
                                                   <m:mrow>
                                                      <m:mn>2</m:mn>
                                                      <m:msub>
                                                         <m:mi>&#955;</m:mi>
                                                         <m:mn>1</m:mn>
                                                      </m:msub>
                                                   </m:mrow>
                                                </m:mfrac>
                                                <m:mo>,</m:mo>
                                             </m:mrow>
                                          </m:mstyle>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeaabuWaaaaabaGafmiEaGNbaebaaeaacqGH9aqpaeaajuaGdaWcaaqaaiabigdaXaqaaiabd2eanbaakmaaqahabaGaemiEaG3aaWbaaSqabeaacqGGOaakcqWGTbqBcqGGPaqkaaaabaGaemyBa0Maeyypa0JaeGymaedabaGaemyta0eaniabggHiLdGccqGGSaalaeaacqaH7oaBdaWgaaWcbaGaeGymaedabeaaaOqaaiabg2da9aqaaiabeU7aSnaaBaaaleaacqaIWaamaeqaaOGaey4kaSIaemyta0KaeiilaWcabaGaeqiVd02aaSbaaSqaaiabigdaXaqabaaakeaacqGH9aqpaKqbagaadaWcaaqaaiabeU7aSnaaBaaabaGaeGimaadabeaacqaH8oqBdaWgaaqaaiabicdaWaqabaGaey4kaSIaemyta0KafmiEaGNbaebaaeaacqaH7oaBdaWgaaqaaiabigdaXaqabaaaaiabcYcaSaGcbaGaeqySde2aaSbaaSqaaiabigdaXaqabaaakeaacqGH9aqpaeaacqaHXoqydaWgaaWcbaGaeGimaadabeaakiabgUcaRKqbaoaalaaabaGaemyta0eabaGaeGOmaidaaiabcYcaSaGcbaGaeqOSdi2aaSbaaSqaaiabigdaXaqabaaakeaacqGH9aqpaeaacqaHYoGydaWgaaWcbaGaeGimaadabeaakiabgUcaRKqbaoaalaaabaGaeGymaedabaGaeGOmaidaaOWaaabCaeaacqGGOaakcqWG4baEdaahaaWcbeqaaiabcIcaOiabd2gaTjabcMcaPaaakiabgkHiTiqbdIha4zaaraGaeiykaKYaaWbaaSqabeaacqaIYaGmaaGccqGHRaWkjuaGdaWcaaqaaiabd2eanjabeU7aSnaaBaaabaGaeGimaadabeaacqGGOaakcuWG4baEgaqeaiabgkHiTiabeY7aTnaaBaaabaGaeGimaadabeaacqGGPaqkdaahaaqabeaacqaIYaGmaaaabaGaeGOmaiJaeq4UdW2aaSbaaeaacqaIXaqmaeqaaaaacqGGSaalaSqaaiabd2gaTjabg2da9iabigdaXaqaaiabd2eanbqdcqGHris5aaaaaaa@90C2@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>we then have</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-9-404-i14" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:mi>p</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>d</m:mi>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>=</m:mo>
                              <m:mfrac>
                                 <m:mn>1</m:mn>
                                 <m:mrow>
                                    <m:msup>
                                       <m:mrow>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mn>2</m:mn>
                                          <m:mi>&#960;</m:mi>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:mi>M</m:mi>
                                          <m:mo>/</m:mo>
                                          <m:mn>2</m:mn>
                                       </m:mrow>
                                    </m:msup>
                                 </m:mrow>
                              </m:mfrac>
                              <m:mo>&#8901;</m:mo>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:mi>&#915;</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:msub>
                                       <m:mi>&#945;</m:mi>
                                       <m:mn>1</m:mn>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mi>&#915;</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:msub>
                                       <m:mi>&#945;</m:mi>
                                       <m:mn>0</m:mn>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                              </m:mfrac>
                              <m:mo>&#8901;</m:mo>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:msubsup>
                                       <m:mi>&#946;</m:mi>
                                       <m:mn>0</m:mn>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mi>&#945;</m:mi>
                                             <m:mn>0</m:mn>
                                          </m:msup>
                                       </m:mrow>
                                    </m:msubsup>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:msubsup>
                                       <m:mi>&#946;</m:mi>
                                       <m:mn>1</m:mn>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>&#945;</m:mi>
                                             <m:mn>1</m:mn>
                                          </m:msub>
                                       </m:mrow>
                                    </m:msubsup>
                                 </m:mrow>
                              </m:mfrac>
                              <m:mo>&#8901;</m:mo>
                              <m:msup>
                                 <m:mrow>
                                    <m:mrow>
                                       <m:mo>(</m:mo>
                                       <m:mrow>
                                          <m:mfrac>
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mi>&#955;</m:mi>
                                                   <m:mn>0</m:mn>
                                                </m:msub>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mi>&#955;</m:mi>
                                                   <m:mn>1</m:mn>
                                                </m:msub>
                                             </m:mrow>
                                          </m:mfrac>
                                       </m:mrow>
                                       <m:mo>)</m:mo>
                                    </m:mrow>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mn>1</m:mn>
                                    <m:mo>/</m:mo>
                                    <m:mn>2</m:mn>
                                 </m:mrow>
                              </m:msup>
                              <m:mo>.</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiCaaNaeiikaGccbmGae8hzaqMaeiykaKIaeyypa0tcfa4aaSaaaeaacqaIXaqmaeaacqGGOaakcqaIYaGmcqaHapaCcqGGPaqkdaahaaqabeaacqWGnbqtcqGGVaWlcqaIYaGmaaaaaOGaeyyXICDcfa4aaSaaaeaacqqHtoWrcqGGOaakcqaHXoqydaWgaaqaaiabigdaXaqabaGaeiykaKcabaGaeu4KdCKaeiikaGIaeqySde2aaSbaaeaacqaIWaamaeqaaiabcMcaPaaakiabgwSixNqbaoaalaaabaGaeqOSdi2aa0baaeaacqaIWaamaeaacqaHXoqydaahaaqabeaacqaIWaamaaaaaaqaaiabek7aInaaDaaabaGaeGymaedabaGaeqySde2aaSbaaeaacqaIXaqmaeqaaaaaaaGccqGHflY1daqadaqcfayaamaalaaabaGaeq4UdW2aaSbaaeaacqaIWaamaeqaaaqaaiabeU7aSnaaBaaabaGaeGymaedabeaaaaaakiaawIcacaGLPaaadaahaaWcbeqcfayaaiabigdaXiabc+caViabikdaYaaakiabc6caUaaa@6560@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>The details of this calculation are shown in Additional file <supplr sid="S1">1</supplr>. Hence, the marginal likelihood, <it>p</it>(<it>D</it>|<it>N</it>), is obtained as the function of the hyperparameters {<it>&#956;</it><sub>0<it>j</it></sub>, <it>&#955;</it><sub>0<it>j</it></sub>, <it>&#945;</it><sub>0<it>j</it></sub>, <it>&#946;</it><sub>0<it>j</it></sub>} and is given by</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-9-404-i15" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:mi>p</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>D</m:mi>
                              <m:mo>|</m:mo>
                              <m:mi>N</m:mi>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>=</m:mo>
                              <m:mstyle displaystyle="true">
                                 <m:munderover>
                                    <m:mo>&#8719;</m:mo>
                                    <m:mrow>
                                       <m:mi>k</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                    <m:mi>q</m:mi>
                                 </m:munderover>
                                 <m:mrow>
                                    <m:mfrac>
                                       <m:mn>1</m:mn>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mrow>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:mn>2</m:mn>
                                                <m:mi>&#960;</m:mi>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mi>M</m:mi>
                                                   <m:mi>k</m:mi>
                                                </m:msub>
                                                <m:mo>/</m:mo>
                                                <m:mn>2</m:mn>
                                             </m:mrow>
                                          </m:msup>
                                       </m:mrow>
                                    </m:mfrac>
                                 </m:mrow>
                              </m:mstyle>
                              <m:mo>&#8901;</m:mo>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:mi>&#915;</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:msub>
                                       <m:mi>&#945;</m:mi>
                                       <m:mrow>
                                          <m:mn>1</m:mn>
                                          <m:mi>k</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mi>&#915;</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:msub>
                                       <m:mi>&#945;</m:mi>
                                       <m:mrow>
                                          <m:mn>0</m:mn>
                                          <m:mi>k</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                              </m:mfrac>
                              <m:mo>&#8901;</m:mo>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:msubsup>
                                       <m:mi>&#946;</m:mi>
                                       <m:mrow>
                                          <m:mn>0</m:mn>
                                          <m:mi>k</m:mi>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mi>&#945;</m:mi>
                                             <m:mrow>
                                                <m:mn>0</m:mn>
                                                <m:mi>k</m:mi>
                                             </m:mrow>
                                          </m:msup>
                                       </m:mrow>
                                    </m:msubsup>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:msubsup>
                                       <m:mi>&#946;</m:mi>
                                       <m:mrow>
                                          <m:mn>1</m:mn>
                                          <m:mi>k</m:mi>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>&#945;</m:mi>
                                             <m:mrow>
                                                <m:mn>1</m:mn>
                                                <m:mi>k</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:msubsup>
                                 </m:mrow>
                              </m:mfrac>
                              <m:mo>&#8901;</m:mo>
                              <m:msup>
                                 <m:mrow>
                                    <m:mrow>
                                       <m:mo>(</m:mo>
                                       <m:mrow>
                                          <m:mfrac>
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mi>&#955;</m:mi>
                                                   <m:mrow>
                                                      <m:mn>0</m:mn>
                                                      <m:mi>k</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mi>&#955;</m:mi>
                                                   <m:mrow>
                                                      <m:mn>1</m:mn>
                                                      <m:mi>k</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                             </m:mrow>
                                          </m:mfrac>
                                       </m:mrow>
                                       <m:mo>)</m:mo>
                                    </m:mrow>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mn>1</m:mn>
                                    <m:mo>/</m:mo>
                                    <m:mn>2</m:mn>
                                 </m:mrow>
                              </m:msup>
                              <m:mo>.</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiCaaNaeiikaGIaemiraqKaeiiFaWNaemOta4KaeiykaKIaeyypa0ZaaebCaeaajuaGdaWcaaqaaiabigdaXaqaaiabcIcaOiabikdaYiabec8aWjabcMcaPmaaCaaabeqaaiabd2eannaaBaaabaGaem4AaSgabeaacqGGVaWlcqaIYaGmaaaaaaWcbaGaem4AaSMaeyypa0JaeGymaedabaGaemyCaehaniabg+GivdGccqGHflY1juaGdaWcaaqaaiabfo5ahjabcIcaOiabeg7aHnaaBaaabaGaeGymaeJaem4AaSgabeaacqGGPaqkaeaacqqHtoWrcqGGOaakcqaHXoqydaWgaaqaaiabicdaWiabdUgaRbqabaGaeiykaKcaaOGaeyyXICDcfa4aaSaaaeaacqaHYoGydaqhaaqaaiabicdaWiabdUgaRbqaaiabeg7aHnaaCaaabeqaaiabicdaWiabdUgaRbaaaaaabaGaeqOSdi2aa0baaeaacqaIXaqmcqWGRbWAaeaacqaHXoqydaWgaaqaaiabigdaXiabdUgaRbqabaaaaaaakiabgwSixpaabmaajuaGbaWaaSaaaeaacqaH7oaBdaWgaaqaaiabicdaWiabdUgaRbqabaaabaGaeq4UdW2aaSbaaeaacqaIXaqmcqWGRbWAaeqaaaaaaOGaayjkaiaawMcaamaaCaaaleqajuaGbaGaeGymaeJaei4la8IaeGOmaidaaOGaeiOla4caaa@7B26@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>In our analysis, we set <it>&#956;</it><sub>0<it>k </it></sub>= 0, <it>&#955;</it><sub>0<it>k </it></sub>= 10, <it>&#945;</it><sub>0<it>k </it></sub>= 9/2 and <it>&#946;</it><sub>0<it>k </it></sub>= 10/2 for all <it>k</it>.</p>
            </sec>
            <sec>
               <st>
                  <p>The prior probability</p>
               </st>
               <p>To avoid overfitting to the training data, the prior probability of the network <it>p</it>(<it>N</it>) was specified so as to penalize complex networks:</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-9-404-i16" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:mi>p</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>N</m:mi>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>=</m:mo>
                              <m:mi>c</m:mi>
                              <m:msup>
                                 <m:mi>K</m:mi>
                                 <m:mrow>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msub>
                                       <m:mi>n</m:mi>
                                       <m:mi>p</m:mi>
                                    </m:msub>
                                 </m:mrow>
                              </m:msup>
                              <m:mo>,</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiCaaNaeiikaGIaemOta4KaeiykaKIaeyypa0Jaem4yamMaem4saS0aaWbaaSqabeaacqGHsislcqWGUbGBdaWgaaadbaGaemiCaahabeaaaaGccqGGSaalaaa@38D6@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>where <it>c </it>is a constant that makes &#8721;<it>p</it>(<it>N</it>) = 1, <it>K </it>is a parameter that specifies how strongly complexity is penalized, and <it>n</it><sub><it>p </it></sub>is the number of parent nodes in the network. As <it>K </it>decreases, the networks grow larger, and the number of parent nodes increases. Initially this increase in complexity reflects actual combinational regulation. However, after exceeding a point, false positive increase gradually owing to overfitting to the training data. To optimize the value of <it>K</it>, we performed preliminary runs with <it>K </it>= 10, 15, 20, 25, 30. We checked P-values for the training data, and chose <it>K </it>= 20 because it allows sufficient sensitivity and a minimum of false positives.</p>
            </sec>
            <sec>
               <st>
                  <p>Search algorithm</p>
               </st>
               <p>To search for the most probable parent nodes based on the scoring function <it>p</it>(<it>N</it>)<it>p</it>(<it>D</it>|<it>N</it>), we took greedy search strategy. We started from structure without any edge between the child node and the parent node candidates and iteratively added an edge from a parent node candidate. For each iterative cycle, we calculated the score of <it>p</it>(<it>N</it>)<it>p</it>(<it>D</it>|<it>N</it>) for every case where the edge from the each parent node candidate was added, and the maximizer of them was added to the structure. The cycle repeated until no more edge increases the score. To speed up the search, we utilized clustering of parent node candidates (see Additional file <supplr sid="S1">1</supplr>).</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Transcriptional programs correlating with histological grades</p>
            </st>
            <p>Focusing on transcriptional regulatory programs that control histological diversity, we searched for <it>cis</it>-regulatory motifs associated with histological grades. Histological grading in breast cancer seeks to integrate measurements of cellular differentiation and replicative potential into a composite score that quantifies the aggressive behavior of a tumor. The most studied and widely used method is the Elston-Ellis modified Scarff, Bloom, Richardson grading system, also known as the Nottingham Grading System <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. The Nottingham Grading System is based on a microscopic evaluation of morphologic and cytologic features of tumor cells, including degree of tubule formation, nuclear pleomorphism, and mitotic count. The sum of these scores stratifies breast tumors into grade 1 (G1; well-differentiated), grade 2 (G2; moderately differentiated), and grade 3 (G3; poorly diferentiated, highly proliferative) malignancies. It has been well known that the grade of breast cancer is a powerful indicator of disease recurrence and patient death. Untreated patients with G1 disease have a ~95% 5-year survival rate whereas those with G2 and G3 malignancy have survival rates at 5 years of ~75% and ~50%, respectively. Comparison between global expression profiles of tumor cells of different grades also revealed distinct expression patterns, especially between G1 and G3 groups <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.</p>
            <p>For each gene in the global expression profile data, we calculated the degree of differential expression between two sample groups (67 G1 and 54 G3 samples). We then applied our method to the differential expression value to search for correlating motifs. The results were evaluated in two ways. First, reproducibility of the result was assessed by bootstrap analysis. Structure learning of a Bayesian network was repeated 30 times using bootstrap samples from the training dataset. We found that V$ELK1_02, V$E2F1_Q4_01, V$NRF1_Q6 and JSP$NF_Y were reproducibly selected by the bootstrap analysis (Figure <figr fid="F2">2</figr>). Here, IDs starting from "V$", "JSP$" and "DME$" motifs denote motifs from the TRANSFAC database, the JASPAR database and our DME analysis, respectively. For V$ELK1_02, highly similar motifs sampled by DME also reproducibly appeared. Although we present here results based on one training-test set partition, for checking robustness of biological findings, we applied our method to different training-test set partitions. We confirmed that almost the same results were obtained with different training-test set partitions. Secondly, statistical significance was evaluated for each of the sequence features reproducibly selected by the bootstrap analysis. We assessed difference of expression values between two gene groups with and without each sequence feature, using Wilcoxon rank sum test for the training and test data. It should be noted that, because the P-values calculated using the training data is not subject to multiple testing corrections, it can potentially achieve low values by overfitting to the training data. Hence, we must use the P-values calculated using the test data to accurately evaluate statistical significance. The results from the Wilcoxon rank sum tests suggest that sequence features that are most significantly associated with the histological grades are V$ELK1_02(20) V$E2F1_Q4_01(10), V$NRF1_Q6(10) and JSP$NF_Y(10) (The IDs are followed by values of the threshold parameter for motif searches in parentheses). P-values were also calculated for these four sequence features as a combination. We split genes into 16 groups based on combinations of the presence and absence of the 4 sequence feature, and evaluated difference of expression value distributions among the gene groups using Kruskal-Wallis test. Our calculation shows that the combination of these four sequence features scores highly significant a P-value of 1.33 &#215; 10<sup>-15 </sup>for the test data. Analyses using independent data sets and prediction based on the MAP-value also confirmed these results (see Additional file <supplr sid="S1">1</supplr>).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Sequence features associated with differential expression between G1 and G3 breast tumors</p>
               </caption>
               <text>
                  <p>
                     <b>Sequence features associated with differential expression between G1 and G3 breast tumors.</b>
                  </p>
               </text>
               <graphic file="1471-2105-9-404-2"/>
            </fig>
            <p>We next investigated how differential expression between G1 and G3 tumors depends on these four sequence features. We divided genes into 16 groups based on patterns of these four sequence features, and differences in distribution of their expression values were examined (see Supplementary Table <tblr tid="T1">1</tblr> in Additional file <supplr sid="S1">1</supplr>). The box plots in Figure <figr fid="F3">3</figr> summarize the results. For clarity, gene groups of similar distributions were gathered to form one group. These results indicate that these sequence features are additively associated with upregulation of gene expression in G3 populations.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Dependency of differential expression between G1 and G3 breast tumors on sequence features</p>
               </caption>
               <text>
                  <p><b>Dependency of differential expression between G1 and G3 breast tumors on sequence features</b>. Genes are divided into five groups based on patterns of four sequence features, V$ELK1_02(20), V$E2F1_Q4_01(10), V$NRF1_Q6(10) and JSP$NF_Y(10) (the left red boxes indicates the presence of sequence features). The distributions of their differential expression values between G1 and G3 are displayed using box plots.</p>
               </text>
               <graphic file="1471-2105-9-404-3"/>
            </fig>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Motif associated with histological grades or prognosis identified based on independent datasets</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><sup><it>a</it></sup>Motif ID</p>
                     </c>
                     <c ca="center">
                        <p><sup><it>b</it></sup>Reproducibility</p>
                     </c>
                     <c ca="center">
                        <p><sup><it>c</it></sup>P value for training data</p>
                     </c>
                     <c ca="center">
                        <p><sup><it>d</it></sup>P value for test data</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>JSP$NF_Y(20)</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>3.1 &#215; 10<sup>-10</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.000158</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>V$NRF1_Q6(10)</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>3.09 &#215; 10<sup>-14</sup></p>
                     </c>
                     <c ca="center">
                        <p>6.02 &#215; 10<sup>-7</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>motifs associated with histological grades based on the data by Sotiriou <it>et al</it>.</p>
                     </c>
                     <c ca="center">
                        <p>V$ELK1_02(20)</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>9.25 &#215; 10<sup>-26</sup></p>
                     </c>
                     <c ca="center">
                        <p>1.41 &#215; 10<sup>-6</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DME$CTTCCGSYN(5)</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>5.71 &#215; 10<sup>-14</sup></p>
                     </c>
                     <c ca="center">
                        <p>6.82 &#215; 10<sup>-5</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>V$E2F1_Q4_01(5)</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>5.71 &#215; 10<sup>-15</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.002372</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>JSP$NF_Y(10)</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>2.46 &#215; 10<sup>-14</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.011049</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DME$RMSYSSARGCGC(5)</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>4.02 &#215; 10<sup>-5</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.063412</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>V$ELK1_02(10)</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>2.03 &#215; 10<sup>-16</sup></p>
                     </c>
                     <c ca="center">
                        <p>2.08 &#215; 10<sup>-7</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>motifs associated with prognosis based on the data by Sotiriou <it>et al</it>.</p>
                     </c>
                     <c ca="center">
                        <p>DME$YYYGSGCMYGCG(5)</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>1.65 &#215; 10<sup>-9</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.008054</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>V$E2F1_Q4_01(10)</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>1.05 &#215; 10<sup>-17</sup></p>
                     </c>
                     <c ca="center">
                        <p>2.37 &#215; 10<sup>-5</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>V$IRF_Q6_01(10)</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>2.06 &#215; 10<sup>-8</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.000152</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DME$NMSTTCYKSYR(5)</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>0.000669</p>
                     </c>
                     <c ca="center">
                        <p>0.084446</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>V$NRF1_Q6(20)</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>9.02 &#215; 10<sup>-22</sup></p>
                     </c>
                     <c ca="center">
                        <p>1.31 &#215; 10<sup>-6</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>JSP$NF_Y(20)</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="center">
                        <p>5.93 &#215; 10<sup>-8</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.01116</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>motifs associated with histological grades based on the data by Pawitan <it>et al</it>.</p>
                     </c>
                     <c ca="center">
                        <p>V$E2F1_Q4_01(5)</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>6.56 &#215; 10<sup>-7</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.049423</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DME$RCRKGCGCAVN(5)</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>5.71 &#215; 10<sup>-8</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.060899</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>V$E2F1_Q4_01(15)</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>9.59 &#215; 10<sup>-6</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.017285</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>V$ELK1_02(20)</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>1.26 &#215; 10<sup>-27</sup></p>
                     </c>
                     <c ca="center">
                        <p>6.13 &#215; 10<sup>-12</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>V$NRF1_Q6(15)</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>9.2 &#215; 10<sup>-23</sup></p>
                     </c>
                     <c ca="center">
                        <p>3.89 &#215; 10<sup>-7</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>motifs associated with prognosis based on the data by Pawitan <it>et al</it>.</p>
                     </c>
                     <c ca="center">
                        <p>V$NRF1_Q6(20)</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>4.31 &#215; 10<sup>-22</sup></p>
                     </c>
                     <c ca="center">
                        <p>2.49 &#215; 10<sup>-7</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>V$ELK1_02(15)</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>1.63 &#215; 10<sup>-25</sup></p>
                     </c>
                     <c ca="center">
                        <p>6.41 &#215; 10<sup>-11</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DME$RCGCHKGCGY(5)</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>3.23 &#215; 10<sup>-20</sup></p>
                     </c>
                     <c ca="center">
                        <p>4.8 &#215; 10<sup>-6</sup></p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup><it>a</it></sup>IDs starting from "V$", "JSP$", and "DME$" Motifs denote motifs from the TRANSFAC database, the JASPAR database, and our DME analysis, respectively, followed by values of the threshold parameter for motif searches in parentheses.</p>
                  <p><sup><it>b</it></sup>The number of appearances of sequence feature in 30 searches with bootstrap resampling.</p>
                  <p><sup><it>cd</it></sup>P values calculated by Wilcoxon rank sum tests for training and test data, respectively.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Transcriptional programs correlating with prognosis</p>
            </st>
            <p>We also examined regulatory programs associated with prognosis, a more direct measure of tumor malignancy. For each gene, correlation values with survival time were calculated using Cox regression models <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Then, we searched for <it>cis</it>-regulatory motifs associated with the correlation values using our method. Our analysis selected V$ELK1_02(10), V$E2F1_Q4_01(5), V$NRF1_Q6(15) and JSP$NF_Y(10) as sequence features positively associated with prognosis, similarly to the analysis for histological grade (Figure <figr fid="F4">4</figr>, Supplementary Table 2 in Additional file <supplr sid="S1">1</supplr>, and Figure <figr fid="F5">5</figr>). A P-value for a combination of these four motifs was calculated as 7.17 &#215; 10<sup>-12 </sup>for the test data.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Sequence features associated with the correlation value calculated for breast cancer prognosis</p>
               </caption>
               <text>
                  <p>
                     <b>Sequence features associated with the correlation value calculated for breast cancer prognosis.</b>
                  </p>
               </text>
               <graphic file="1471-2105-9-404-4"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Dependency of the correlation value with breast cancer prognosis on sequence features</p>
               </caption>
               <text>
                  <p><b>Dependency of the correlation value with breast cancer prognosis on sequence features</b>. Genes are divided into five groups based on patterns of four sequence features, V$ELK1_02(5), V$E2F1_Q4_01(10), V$NRF1_Q6(15) and JSP$NF_Y(10) (the left red boxes indicates the presence of sequence features). The distributions of their correlation value with breast cancer prognosis are displayed using box plots.</p>
               </text>
               <graphic file="1471-2105-9-404-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Robustness of our biological findings</p>
            </st>
            <p>To confirm robustness of our biological findings, we analyze independent data published by Sotiriou <it>et al</it>. <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>(189 samples &#215; 12466 genes) and Pawitan <it>et al</it>. <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>(159 samples &#215; 16425 genes). Similar to the results obtained in the above analyses, we found that the binding motifs of E2F, ELK1, NRF1 and NFY show significant correlation of histological grades and prognosis (Table <tblr tid="T1">1</tblr>), indicating the robustness of our findings. Taken together, we conclude that <it>cis</it>-regulatory motifs bound by these 4 TFs are principal motifs associated with breast cancer malignancy.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>To decode transcriptional program in breast cancer, we developed a novel approach employing a new Bayesian scoring function and meta-expression value. Combining promoter sequence and expression data, we searched for <it>cis</it>-regulatory motifs correlated with histological grade and prognosis.</p>
         <p>As motif sets to be searched, we prepared known motifs from databases, and <it>de novo </it>motifs identified by a motif discovery program, DME. As motifs correlated with malignancy, we identified the ELK1 binding motif as well as a highly similar <it>de novo </it>one, demonstrating success in our approach. Judging from statistical evaluations, the known motif shows better performance than the <it>de novo </it>one. Further improvement of the motif finder program will enable us to identify <it>de novo </it>motifs of higher quality. Our method introduced a new Bayesian approach, which can deal with multiple sequence features and a continuous meta-expression value. Compared to previous methods, our method more efficiently analyzes motif combination without thresholding meta-expression values (see Additional file <supplr sid="S1">1</supplr>). It should be noted that found motif combinations are no guarantee of a true synergistic, cooperative interaction of the related TFs; further studies remain to be done for analysis of motif interactions. Utilization of meta-expression values is also a novel feature of our method. Although we focused on histological grade and prognosis of breast cancer in this study, our approach can easily be extended to analyze other pathologies and other clinical variables. In addition to these features, we found that our method is robust on the data complexity; we found that our method leads to essentially the same result for grade-associated motifs even if we use only half of the patient data (see Supplementary Table 6 in Additional file <supplr sid="S1">1</supplr>).</p>
         <p>Our analysis identified <it>cis</it>-regulatory motifs bound by ELK1, E2F1, NRF1 and NFY as principal motifs associated with breast cancer malignancy. ELK1 is a member of the ETS transcription factor family. Because the ETS family of transcription factors binds to similar motifs with a central core sequence GGA(A/T), ELK1 binding motifs are potentially bounded by other ETS family members. It has been reported that many of them are downstream nuclear targets of Ras-MAP kinase signaling, and the deregulation of the ETS genes results in malignant transformation and tumor progression. Several ETS genes are rearranged in human leukemia and Ewing tumor to generate chimeric oncoproteins. Furthermore, the aberrant expression of several ETS genes is often observed in various types of human malignant tumors <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Many of the ETS family transcription factors are upregulated in the G3 population: ETV7(<it>Q </it>= 7.79 &#215; 10<sup>-5</sup>), ELF4(<it>Q </it>= 0.00182), ELF5(<it>Q </it>= 0.0270), GABPA(<it>Q </it>= 0.0301), SPIB(<it>Q </it>= 0.0344), ELF3(<it>Q </it>= 0.0383), ETV4(<it>Q </it>= 0.0386) and ETS1(<it>Q </it>= 0.0468). A recent study based on integrative bioinformatics also suggests that a ETS-directed transcriptional program is involved in malignant progression of prostate cancer <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Further integrative studies are required to examine whether ETS-directed transcriptional programs contributes to malignancy in various types of tumors.</p>
         <p>The E2F family includes transcription factors which form heterodimer complexes with DP proteins and recognize a common motif <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. The E2F family of proteins is known to be a master regulator of the cell cycle. The association of the E2F motif with G3 is therefore consistent with the fact that the histological grading criteria include the mitotic index and that G3 tumors are defined as highly proliferative. We also observed that most of the E2F family members and two DP genes are significantly upregulated in G3 tumors: E2F8(<it>Q </it>&lt; 10<sup>-6</sup>), E2F3(<it>Q </it>&lt; 10<sup>-6</sup>), E2F1(<it>Q </it>&lt; 10<sup>-6</sup>), E2F6(<it>Q </it>= 3.14 &#215; 10<sup>-5</sup>), E2F5(<it>Q </it>= 0.0219), DP2(<it>Q </it>= 0.00167) and DP1(<it>Q </it>= 0.0111).</p>
         <p>NFR1 has been reported to induce nuclear-encoded mitochondrial genes and increase mitochondrial respiratory capacity <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Though no clear function of NRF1 in cancer cells has been reported, our finding that the NRF1-binding motif correlates with tumor malignancy may reflect hypermetabolism in aggressive tumors. It has also been reported that NRF1 collaborates with E2F family members to regulate genes involved in cellular proliferations <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>.</p>
         <p>The NFY-binding motif, the CCAAT box, is one of the first identified and most common elements in eukaryotic promoters. On the other hand, elucidation of regulatory networks involving NFY motifs has been hampered by their generality. Our result raises the possibility that NFY-binding motif functions malignant breast cancers cooperatively with other factors. In fact, a previous study reported that NFY and E2F functionally interact to regulate cell cycle genes <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
         <p>Although we successfully identified above regulatory motifs, we failed to identify the motifs bounded by transcription factors that are thought to be more critically associated with breast cancer malignancy, including the estrogen receptor and p53. One reason for this failure is that, since the number of target genes varies between transcriptional regulators, our method "skims off" only strong signals from motifs bound by regulators having a sufficient number of target genes. However, a more likely reason is that our method focuses on only proximal regulatory sequences. Each TFs has a positional preference: some TFs bind mainly proximal promoters around the TSSs while others can act on distal enhancer sequences. Recent comprehensive ChIP analyses have clearly shown that the estrogen receptor and p53 have a broad range of positional preference <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. Computational predictions <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and genome-wide experiments <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp> have just started to produce distal regulatory sequence data; incorporation of such information will solve this problem.</p>
         <p>In cancer cells, genetic and epigenetic alterations also have great impact on gene expression at the mRNA level. Currently, comprehensive data of genomic copy number <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> and epigenetic status <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> are also accumulating. One of the next important challenges will be to incorporate them and decompose gene expression signals from different molecular mechanisms.</p>
         <p>Considering the exploding availability of genome-wide experimental data, we can be optimistic that the integrative bioinformatics approach will circumvent these limitations in the near feature. Future work will focus on further refinement of our approach toward a deeper understanding of transcriptional programs in cancer cells.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>In this study, we introduced a new approach to analyze cancer microarray data. While many studies have focused on correlation between gene expression and a clinical phenotype, our method associates <it>cis</it>-regulatory motifs with clinical phenotypes. This approach offers a more concise description of transcriptome diversity among samples with different clinical phenotypes. Using this method, we demonstrated that <it>cis</it>-regulatory motifs bound by ELK1, E2F, NRF1 and NFY are most significantly associated with breast cancer malignancy. Our data suggest that they are principal regulatory motifs driving breast cancer malignant progression.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>AN, TA, ST, and HA designed research; AN performed research; AN, ADS and MQZ contributed new analytic tools; AN, TA, and SI wrote the paper.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>Computation time was provided by the Super Computer System, Human Genome Center, Institute of Medical Science, University of Tokyo. This work was supported by Grants-in-Aid for Scientific Research on Priority Areas. A. N. was supported by Research Fellowships from Japan Society for the Promotion of Science for Young Scientist.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Transcription factors as targets for cancer therapy</p>
            </title>
            <aug>
               <au>
                  <snm>Darnell</snm>
                  <fnm>JE</fnm>
                  <suf>Jr</suf>
               </au>
            </aug>
            <source>Nat Rev Cancer</source>
            <pubdate>2002</pubdate>
            <volume>2</volume>
            <fpage>740</fpage>
            <lpage>749</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrc906</pubid>
                  <pubid idtype="pmpid" link="fulltext">12360277</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Molecular portraits of human breast tumours</p>
            </title>
            <aug>
               <au>
                  <snm>Perou</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Sorlie</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Rijn</snm>
                  <mnm>van de</mnm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jeffrey</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Rees</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Pollack</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Ross</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Johnsen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Akslen</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Fluge</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Pergamenschikov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>SX</fnm>
               </au>
               <au>
                  <snm>Lonning</snm>
                  <fnm>PE</fnm>
               </au>
               <au>
                  <snm>Borresen-Dale</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>406</volume>
            <fpage>747</fpage>
            <lpage>752</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35021093</pubid>
                  <pubid idtype="pmpid" link="fulltext">10963602</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications</p>
            </title>
            <aug>
               <au>
                  <snm>Sorlie</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Perou</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Aas</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Geisler</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Johnsen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hastie</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Rijn</snm>
                  <mnm>van de</mnm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jeffrey</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Thorsen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Quist</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Matese</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Eystein Lonning</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Borresen-Dale</snm>
                  <fnm>AL</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>10869</fpage>
            <lpage>10874</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">58566</pubid>
                  <pubid idtype="pmpid" link="fulltext">11553815</pubid>
                  <pubid idtype="doi">10.1073/pnas.191367098</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Gene expression profiling predicts clinical outcome of breast cancer</p>
            </title>
            <aug>
               <au>
                  <snm>van't Veer</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Vijver</snm>
                  <mnm>van de</mnm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>YD</fnm>
               </au>
               <au>
                  <snm>Hart</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Mao</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Peterse</snm>
                  <fnm>HL</fnm>
               </au>
               <au>
                  <snm>Kooy</snm>
                  <mnm>van der</mnm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Marton</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Witteveen</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Kerkhoven</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Linsley</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Bernards</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Friend</snm>
                  <fnm>SH</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>415</volume>
            <fpage>530</fpage>
            <lpage>536</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/415530a</pubid>
                  <pubid idtype="pmpid" link="fulltext">11823860</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>A gene-expression signature as a predictor of survival in breast cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Vijver</snm>
                  <mnm>van de</mnm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>YD</fnm>
               </au>
               <au>
                  <snm>van't Veer</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hart</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Voskuil</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Peterse</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Marton</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Parrish</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Atsma</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Witteveen</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Glas</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Delahaye</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Velde</snm>
                  <mnm>van der</mnm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bartelink</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Rodenhuis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rutgers</snm>
                  <fnm>ET</fnm>
               </au>
               <au>
                  <snm>Friend</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Bernards</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>N Engl J Med</source>
            <pubdate>2002</pubdate>
            <volume>347</volume>
            <fpage>1999</fpage>
            <lpage>2009</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1056/NEJMoa021967</pubid>
                  <pubid idtype="pmpid" link="fulltext">12490681</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Klijn</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sieuwerts</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Look</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Talantov</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Timmermans</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Meijer-van Gelder</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jatkoe</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Berns</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Atkins</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Foekens</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Lancet</source>
            <pubdate>2005</pubdate>
            <volume>365</volume>
            <fpage>671</fpage>
            <lpage>679</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15721472</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival</p>
            </title>
            <aug>
               <au>
                  <snm>Miller</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Smeds</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>George</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vega</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>Vergara</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ploner</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pawitan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Klaar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>ET</fnm>
               </au>
               <au>
                  <snm>Bergh</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>13550</fpage>
            <lpage>13555</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1197273</pubid>
                  <pubid idtype="pmpid" link="fulltext">16141321</pubid>
                  <pubid idtype="doi">10.1073/pnas.0506230102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Robustness, scalability, and integration of a wound-response gene expression signature in predicting breast cancer survival</p>
            </title>
            <aug>
               <au>
                  <snm>Chang</snm>
                  <fnm>HY</fnm>
               </au>
               <au>
                  <snm>Nuyten</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Sneddon</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Hastie</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sorlie</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>YD</fnm>
               </au>
               <au>
                  <snm>van't Veer</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Bartelink</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Rijn</snm>
                  <mnm>van de</mnm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Vijver</snm>
                  <mnm>van de</mnm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>3738</fpage>
            <lpage>3743</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">548329</pubid>
                  <pubid idtype="pmpid" link="fulltext">15701700</pubid>
                  <pubid idtype="doi">10.1073/pnas.0409462102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Regulatory element detection using correlation with expression</p>
            </title>
            <aug>
               <au>
                  <snm>Bussemaker</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Siggia</snm>
                  <fnm>ED</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2001</pubdate>
            <volume>27</volume>
            <fpage>167</fpage>
            <lpage>171</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/84792</pubid>
                  <pubid idtype="pmpid" link="fulltext">11175784</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Integrating regulatory motif discovery and genome-wide expression analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Conlon</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>XS</fnm>
               </au>
               <au>
                  <snm>Lieb</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>3339</fpage>
            <lpage>3344</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">152294</pubid>
                  <pubid idtype="pmpid" link="fulltext">12626739</pubid>
                  <pubid idtype="doi">10.1073/pnas.0630591100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Interacting models of cooperative gene regulation</p>
            </title>
            <aug>
               <au>
                  <snm>Das</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Banerjee</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>MQ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>16234</fpage>
            <lpage>16239</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">528978</pubid>
                  <pubid idtype="pmpid" link="fulltext">15534222</pubid>
                  <pubid idtype="doi">10.1073/pnas.0407365101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Predicting gene expression from sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Beer</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Tavazoie</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2004</pubdate>
            <volume>117</volume>
            <fpage>185</fpage>
            <lpage>198</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(04)00304-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">15084257</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Adaptively inferring human transcriptional subnetworks</p>
            </title>
            <aug>
               <au>
                  <snm>Das</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nahle</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>MQ</fnm>
               </au>
            </aug>
            <source>Mol Syst Biol</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>0029</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1681499</pubid>
                  <pubid idtype="pmpid" link="fulltext">16760900</pubid>
                  <pubid idtype="doi">10.1038/msb4100067</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>DNA motifs in human and mouse proximal promoters predict tissue-specific expression</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Sumazin</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Xuan</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>MQ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>6275</fpage>
            <lpage>6280</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1458868</pubid>
                  <pubid idtype="pmpid" link="fulltext">16606849</pubid>
                  <pubid idtype="doi">10.1073/pnas.0508169103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Elucidating the altered transcriptional programs in breast cancer using independent component analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Teschendorff</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Journee</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Absil</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Sepulchre</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Caldas</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <fpage>e161</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1950343</pubid>
                  <pubid idtype="pmpid" link="fulltext">17708679</pubid>
                  <pubid idtype="doi">10.1371/journal.pcbi.0030161</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Transcriptional networks inferred from molecular signatures of breast cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Tongbai</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Idelman</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Nordgard</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Cui</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Haggerty</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Chanock</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Borresen-Dale</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Livingston</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Shaunessy</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chiang</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Kristensen</snm>
                  <fnm>VN</fnm>
               </au>
               <au>
                  <snm>Bilke</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gardner</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Am J Pathol</source>
            <pubdate>2008</pubdate>
            <volume>172</volume>
            <fpage>495</fpage>
            <lpage>509</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2312359</pubid>
                  <pubid idtype="pmpid" link="fulltext">18187569</pubid>
                  <pubid idtype="doi">10.2353/ajpath.2008.061079</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Matys</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Kel-Margoulis</snm>
                  <fnm>OV</fnm>
               </au>
               <au>
                  <snm>Fricke</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Liebich</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Land</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Barre-Dirrie</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Reuter</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Chekmenev</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Krull</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hornischer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Voss</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Stegmaier</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lewicki-Potapov</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Saxel</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kel</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Wingender</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>D108</fpage>
            <lpage>D110</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347505</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381825</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj143</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>A new generation of JASPAR, the open-access repository for transcription factor binding site profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Vlieghe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Sandelin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>De Bleser</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Vleminckx</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>van Roy</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>D95</fpage>
            <lpage>D97</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347477</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381983</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj115</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Using Bayesian networks to analyze expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Friedman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Linial</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nachman</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pe'er</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2000</pubdate>
            <volume>7</volume>
            <fpage>601</fpage>
            <lpage>620</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/106652700750050961</pubid>
                  <pubid idtype="pmpid" link="fulltext">11108481</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Effects of threshold choice on biological conclusions reached during analysis of gene expression by DNA microarrays</p>
            </title>
            <aug>
               <au>
                  <snm>Pan</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Lih</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Cohen</snm>
                  <fnm>SN</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>8961</fpage>
            <lpage>8965</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1149502</pubid>
                  <pubid idtype="pmpid" link="fulltext">15951424</pubid>
                  <pubid idtype="doi">10.1073/pnas.0502674102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The Oct4 and Nanog transcription network regulates pluripotency in mouse embryonic stem cells</p>
            </title>
            <aug>
               <au>
                  <snm>Loh</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Chew</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Vega</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Bourque</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>George</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Leong</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>KY</fnm>
               </au>
               <au>
                  <snm>Sung</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>XD</fnm>
               </au>
               <au>
                  <snm>Chiu</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Lipovich</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kuznetsov</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Robson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Stanton</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Ruan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Lim</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>HH</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2006</pubdate>
            <volume>38</volume>
            <fpage>431</fpage>
            <lpage>440</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1760</pubid>
                  <pubid idtype="pmpid" link="fulltext">16518401</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Genome-wide prediction of mammalian enhancers based on analysis of transcription-factor binding affinity</p>
            </title>
            <aug>
               <au>
                  <snm>Hallikas</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Palin</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sinjushina</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Rautiainen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Partanen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ukkonen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Taipale</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>124</volume>
            <fpage>47</fpage>
            <lpage>59</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2005.10.042</pubid>
                  <pubid idtype="pmpid" link="fulltext">16413481</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Statistical significance for genomewide studies</p>
            </title>
            <aug>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>9440</fpage>
            <lpage>9445</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">170937</pubid>
                  <pubid idtype="pmpid" link="fulltext">12883005</pubid>
                  <pubid idtype="doi">10.1073/pnas.1530509100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Regression Models and Life-tables</p>
            </title>
            <aug>
               <au>
                  <snm>Cox</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>J R Stat Soc Ser B</source>
            <pubdate>1972</pubdate>
            <volume>34</volume>
            <fpage>187</fpage>
            <lpage>220</lpage>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: experience from a large study with long-term follow-up</p>
            </title>
            <aug>
               <au>
                  <snm>Elston</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Ellis</snm>
                  <fnm>IO</fnm>
               </au>
            </aug>
            <source>Histopathology</source>
            <pubdate>1991</pubdate>
            <volume>19</volume>
            <fpage>403</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2559.1991.tb00229.x</pubid>
                  <pubid idtype="pmpid">1757079</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Gene expression profiles of human breast cancer progression</p>
            </title>
            <aug>
               <au>
                  <snm>Ma</snm>
                  <fnm>XJ</fnm>
               </au>
               <au>
                  <snm>Salunga</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tuggle</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Gaudet</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Enright</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>McQuary</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Payette</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Pistone</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stecker</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>YX</fnm>
               </au>
               <au>
                  <snm>Varnholt</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gadd</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chatfield</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kessler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Baer</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Erlander</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Sgroi</snm>
                  <fnm>DC</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>5974</fpage>
            <lpage>5979</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">156311</pubid>
                  <pubid idtype="pmpid" link="fulltext">12714683</pubid>
                  <pubid idtype="doi">10.1073/pnas.0931261100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis</p>
            </title>
            <aug>
               <au>
                  <snm>Sotiriou</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wirapati</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Loi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fox</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Smeds</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nordgren</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Farmer</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Praz</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Haibe-Kains</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Desmedt</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Larsimont</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cardoso</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Peterse</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nuyten</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Buyse</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Vijver</snm>
                  <mnm>Van de</mnm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Bergh</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Piccart</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Delorenzi</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Natl Cancer Inst</source>
            <pubdate>2006</pubdate>
            <volume>98</volume>
            <fpage>262</fpage>
            <lpage>272</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16478745</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Gene expression profiling spares early breast cancer patients from adjuvant therapy: derived and validated in two population-based cohorts</p>
            </title>
            <aug>
               <au>
                  <snm>Pawitan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Bjohle</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Amler</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Borg</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Egyhazi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Holmberg</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Klaar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>ET</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Nordgren</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Ploner</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sandelin</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shaw</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Smeds</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Skoog</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wedren</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bergh</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Breast Cancer Res</source>
            <pubdate>2005</pubdate>
            <volume>7</volume>
            <fpage>R953</fpage>
            <lpage>R964</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1410752</pubid>
                  <pubid idtype="pmpid" link="fulltext">16280042</pubid>
                  <pubid idtype="doi">10.1186/bcr1325</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>The ETS-domain transcription factor family</p>
            </title>
            <aug>
               <au>
                  <snm>Sharrocks</snm>
                  <fnm>AD</fnm>
               </au>
            </aug>
            <source>Nat Rev Mol Cell Biol</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>827</fpage>
            <lpage>837</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35099076</pubid>
                  <pubid idtype="pmpid" link="fulltext">11715049</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Integrative molecular concept modeling of prostate cancer progression</p>
            </title>
            <aug>
               <au>
                  <snm>Tomlins</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Mehra</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Rhodes</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Cao</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Dhanasekaran</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Kalyana-Sundaram</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Pienta</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Shah</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Chinnaiyan</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2007</pubdate>
            <volume>39</volume>
            <fpage>41</fpage>
            <lpage>51</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1935</pubid>
                  <pubid idtype="pmpid" link="fulltext">17173048</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Sibling rivalry in the E2F family</p>
            </title>
            <aug>
               <au>
                  <snm>Trimarchi</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Lees</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Nat Rev Mol Cell Biol</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>11</fpage>
            <lpage>20</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrm714</pubid>
                  <pubid idtype="pmpid" link="fulltext">11823794</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Nuclear control of respiratory gene expression in mammalian cells</p>
            </title>
            <aug>
               <au>
                  <snm>Scarpulla</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>J Cell Biochem</source>
            <pubdate>2006</pubdate>
            <volume>97</volume>
            <fpage>673</fpage>
            <lpage>683</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/jcb.20743</pubid>
                  <pubid idtype="pmpid" link="fulltext">16329141</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>A common set of gene regulatory networks links metabolism and growth inhibition</p>
            </title>
            <aug>
               <au>
                  <snm>Cam</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Balciunaite</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Blais</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Spektor</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Scarpulla</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kluger</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Dynlacht</snm>
                  <fnm>BD</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2004</pubdate>
            <volume>16</volume>
            <fpage>399</fpage>
            <lpage>411</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molcel.2004.09.037</pubid>
                  <pubid idtype="pmpid" link="fulltext">15525513</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>E2Fs link the control of G1/S and G2/M transcription</p>
            </title>
            <aug>
               <au>
                  <snm>Zhu</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Giangrande</snm>
                  <fnm>PH</fnm>
               </au>
               <au>
                  <snm>Nevins</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>2004</pubdate>
            <volume>23</volume>
            <fpage>4615</fpage>
            <lpage>4626</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">533046</pubid>
                  <pubid idtype="pmpid" link="fulltext">15510213</pubid>
                  <pubid idtype="doi">10.1038/sj.emboj.7600459</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Genome-wide analysis of estrogen receptor binding sites</p>
            </title>
            <aug>
               <au>
                  <snm>Carroll</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Song</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Geistlinger</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Eeckhoute</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Brodsky</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Keeton</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Fertuck</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Bekiranov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sementchenko</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Fox</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Silver</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Gingeras</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>XS</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Carroll</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>XS</fnm>
               </au>
               <au>
                  <snm>Brodsky</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Szary</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Eeckhoute</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shao</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Hestermann</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Geistlinger</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Fox</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Silver</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2006</pubdate>
            <volume>38</volume>
            <fpage>1289</fpage>
            <lpage>1297</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1901</pubid>
                  <pubid idtype="pmpid" link="fulltext">17013392</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>A global map of p53 transcription-factor binding sites in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Wei</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Vega</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>Chiu</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Shahab</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yong</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Fu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Weng</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>XD</fnm>
               </au>
               <au>
                  <snm>Chew</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>YL</fnm>
               </au>
               <au>
                  <snm>Kuznetsov</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Sung</snm>
                  <fnm>WK</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Lim</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>ET</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Ruan</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>124</volume>
            <fpage>207</fpage>
            <lpage>219</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2005.10.043</pubid>
                  <pubid idtype="pmpid" link="fulltext">16413492</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Genome-wide mapping of DNase hypersensitive sites using massively parallel signature sequencing (MPSS)</p>
            </title>
            <aug>
               <au>
                  <snm>Crawford</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Holt</snm>
                  <fnm>IE</fnm>
               </au>
               <au>
                  <snm>Whittle</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Tai</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Margulies</snm>
                  <fnm>EH</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Bernat</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Ginsburg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Luo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Vasicek</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Wolfsberg</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>FS</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>123</fpage>
            <lpage>131</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1356136</pubid>
                  <pubid idtype="pmpid" link="fulltext">16344561</pubid>
                  <pubid idtype="doi">10.1101/gr.4074106</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>In vivo enhancer analysis of human conserved non-coding sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Pennacchio</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Ahituv</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Moses</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Prabhakar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nobrega</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Shoukry</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Minovitsky</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Holt</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Plajzer-Frick</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Akiyama</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>De Val</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Afzal</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Black</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Couronne</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Visel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>444</volume>
            <fpage>499</fpage>
            <lpage>502</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature05295</pubid>
                  <pubid idtype="pmpid" link="fulltext">17086198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Array comparative genomic hybridization and its applications in cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Pinkel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Albertson</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <issue>Suppl</issue>
            <fpage>S11</fpage>
            <lpage>S17</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1569</pubid>
                  <pubid idtype="pmpid" link="fulltext">15920524</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Mapping of genetic and epigenetic regulatory networks using microarrays</p>
            </title>
            <aug>
               <au>
                  <snm>van Steensel</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <issue>Suppl</issue>
            <fpage>S18</fpage>
            <lpage>S24</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1559</pubid>
                  <pubid idtype="pmpid" link="fulltext">15920525</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
