<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2172-8-25</ui>
   <ji>1471-2172</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>Gene expression trees in lymphoid development</p>
         </title>
         <aug>
            <au id="A1" ca="yes" ce="yes">
               <snm>Costa</snm>
               <mi>G</mi>
               <fnm>Ivan</fnm>
               <insr iid="I1"/>
               <email>ivan.filho@molgen.mpg.de</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Roepcke</snm>
               <fnm>Stefan</fnm>
               <insr iid="I1"/>
               <email>stefan.roepcke@molgen.mpg.de</email>
            </au>
            <au id="A3">
               <snm>Schliep</snm>
               <fnm>Alexander</fnm>
               <insr iid="I1"/>
               <email>alexander.schliep@molgen.mpg.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany</p>
            </ins>
         </insg>
         <source>BMC Immunology</source>
         <issn>1471-2172</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>25</fpage>
         <url>http://www.biomedcentral.com/1471-2172/8/25</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17925013</pubid>
               <pubid idtype="doi">10.1186/1471-2172-8-25</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>18</day>
               <month>5</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>09</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>09</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Costa et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The regulatory processes that govern cell proliferation and differentiation are central to developmental biology. Particularly well studied in this respect is the lymphoid system due to its importance for basic biology and for clinical applications. Gene expression measured in lymphoid cells in several distinguishable developmental stages helps in the elucidation of underlying molecular processes, which change gradually over time and lock cells in either the B cell, T cell or Natural Killer cell lineages. Large-scale analysis of these <it>gene expression trees </it>requires computational support for tasks ranging from visualization, querying, and finding clusters of similar genes, to answering detailed questions about the functional roles of individual genes.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We present the first statistical framework designed to analyze gene expression data collected in the course of lymphoid development, using clusters of co-expressed genes together with additional heterogeneous data. We introduce dependence trees for continuous variates, which naturally model the inherent dependencies during the differentiation process as gene expression trees. Several trees are combined in a mixture model to allow inference of potentially overlapping clusters of co-expressed genes. Additionally, we predict microRNA targets.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Computational results for several data sets from the lymphoid system demonstrate the relevance of our framework. We recover well-known biological facts and identify promising novel regulatory elements of genes and their functional assignments. The implementation of our method (licensed under the GPL) is available at <url>http://algorithmics.molgen.mpg.de/Supplements/ExpLym/</url>.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The study of gene regulatory mechanisms controlling cell proliferation and differentiation is central in developmental biology. Because all hematopoietic cells are easily obtained as individual cells, and due to high clinical interest, the development of lymphocytes is particularly well-studied <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. In mammals, all blood cells develop from pluri-potent, self-renewing hematopoietic stem cells (pHSC) of the bone marrow. In the classical model, these pHSC differentiate into common myelo-erythroid progenitors and common lymphoid progenitors <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. The latter give rise to all cells of the adaptive immune system, including T, B and natural killer cells, which are the focus of our work.</p>
         <p>Lymphocytes are well characterized; they can be purified by fluorescence activated cell sorting (FACS) exploiting the large variety of cell surface antigens, which appear in a specific order during differentiation as the result of a linear sequence of genomic rearrangements at the T and B cell receptor loci <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Based on this, lineage-specific expression and roles of transcription factors have been studied extensively <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B6">6</abbr></abbrgrp>. It has been shown, for example, that Gata3 is required for CD4 T cell maturation and that Runx3 silences the CD4 gene in CD8 T cells. Very recently, a new class of regulatory RNAs, microRNAs, has been identified as being involved in lymphocyte cell development <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>.</p>
         <p>Several groups <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp> have combined FACS-mediated cell sorting and mRNA expression profiling to derive a more comprehensive picture of the lymphocytes in distinguishable developmental stages. Our interest focuses on these patterns of gene expression in the distinct stages of the developmental tree, the <it>developmental profiles </it>of genes; see Fig. <figr fid="F1">1</figr> for a developmental tree. Observing such patterns, the first natural question to ask is whether further genes exhibit the same developmental profile; for example, whether other genes are co-expressed with Gata3. It is reasonable to assume that genes with a prescribed pattern of expression, such as "up-regulated in proliferating cells", might be relevant for specific functions of cells in a particular stage of differentiation. Clearly, not all relevant developmental profiles are known beforehand, so clustering is the next logical step. Clustering allows us to divide genes into groups of similar developmental profiles. Some of these groups will be irrelevant (for example, genes expressed in all stages), while others will differ in distinct branches of the developmental tree and thus indicate relevance for differentiation. Once the gamut of developmental profiles is determined, further questions can be addressed with statistical methods: which regulatory effects might cause differentiation, which subgroups of developmental stages share regulatory patterns, or at which developmental stage the difference in expression between two groups is largest.</p>
         <p>Prior work in this context relies on classical clustering methods, such as self-organizing maps <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp> or hierarchical clustering <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, or on performing tests of differential expression between cell types of interest <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Further studies concentrated on small-scale data, where selected genes are used to infer regulatory networks. One such study applied a state-space model to infer networks of T cell activation <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Troncale and colleagues adopted Petri Nets to model and infer regulatory networks of early pHSC development <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, while Basso and colleagues proposed a novel algorithm for a similar task <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Schematic view of lymphocyte cell development</p>
            </caption>
            <text>
               <p><b>Schematic view of lymphocyte cell development</b>. Developmental stages are depicted as nodes and arrows indicate transition from one stage to another, i.e. specialization. Self-renewing hematopoietic stem cells give rise to T cells in the thymus (green), B cells in the bone marrow (blue) and natural killer cells (NK) via intermediate stages. DN stands for CD4-/CD8-double negative cells, DPL for CD4+/CD8+ double positive large cells, and DPS for CD4+/CD8+ double positive small cells. Cell surface antigens and rearrangement events are partially annotated. The expression data sets investigated in this paper are marked as follows: green ovals for TCell, blue ovals for BCell, and pink boxes for LymphoidTree. We do not investigate developmental stages and transitions depicted in grey.</p>
            </text>
            <graphic file="1471-2172-8-25-1"/>
         </fig>
         <p>Classical clustering relies on distance functions between developmental profiles, such as correlation or Euclidean distance, which neglect the dependence structure of the developmental tree (Fig. <figr fid="F1">1</figr>). As a matter of fact, the clustering result does not change if one permutes all the variables. Biology suggests, however, that the very sequence of changes does matter, as this exact sequence of events is what takes a cell from pluri-potent to, say, mature B-cell. Thus we propose dependence tree models (see <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> for the discrete-variate version) to model expression during the course of development. Our model assumes that the dependence of gene expression between subsequent stages is the most relevant one for the identification of co-expressed genes. We assume that gene expression has been measured for a sufficient number of stages, in particular those relevant for differentiation processes, and that the cell population in a particular stage is sufficiently pure. Any disagreement between reality and these assumptions is subsumed as noise, which our method handles successfully on simulated data. If we considered all pairwise dependencies between developmental stages, our model would be equivalent to a multivariate Gaussian distribution with full covariance matrix. Due to their complexity, the estimation of such models is prone to over-fitting <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. The dependence tree model represents a tradeoff between methods assuming independence between variables, such as <it>k</it>-means and hierarchical clustering, and complex models, such as multivariate Gaussians; this tradeoff makes estimation more robust.</p>
         <p>With one such tree we can find genes with a specified developmental profile, for example similar to the developmental profile of <it>Gata</it>3, by ranking genes in order of decreasing likelihood under the tree. To cluster developmental profiles we combine several trees with the same topology but with distinct parameters in a classical mixture model <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>; tree topologies are taken from the biological literature. Thus we obtain a robust and flexible statistical model for clustering genome-wide mRNA expression data sets, which takes the inherent dependencies between developmental stages explicitly into account. The resulting clusters of genes sharing similar developmental expression profiles are well-suited for a subsequent search for common regulators such as transcription factors or microRNAs.</p>
         <p>Our choice of model class is motivated by the successful application of mixtures of complex statistical models to the analysis of mRNA expression time-courses. There, models that take temporal dependencies into account, such as Splines <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>, Autoregressive models <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> or Hidden Markov models <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, outperform simpler models, which assume independence of the variables, for example <it>k</it>-means, self-organizing maps or hierarchical clustering.</p>
         <p>For discrete variates, dependence trees were first proposed by Chow and Liu <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, who showed that efficient computation is possible. Mixtures of trees were first proposed and applied in image recognition problems <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, where more efficient versions of the structure learning algorithm for sparse data sets became necessary. In bioinformatics, mixtures of trees were applied to infer mutation events in HIV strains <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. We present an extension of dependence trees to continuous variates, which requires modifications to the densities, and provide a framework for robust clustering based on mixtures. To the best of our knowledge, there is no prior work on genome-scale mRNA expression analysis in which the developmental tree structure is taken into account. Both the biological application and our approach of combining tree models with mixture estimation for this purpose are novel. However, the main methodological ingredients are well-established. Our statistical framework allows us to identify clusters of genes with similar developmental profiles. In developing lymphoid cells, we detect interesting groups of genes not found using standard techniques such as self-organizing maps <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Results on simulated data show the conditions under which our method has a technical advantage. From our clustering results we can identify plausible regulatory roles of microRNAs known to be involved in hematopoiesis. We provide a graphical user interface and a web database of clustering results; see <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> for implementations, a tutorial on how to use the tools, and a web database with the results presented below. Our findings suggest that our framework is well-suited for analysis of genome-wide expression data from detailed cell development studies.</p>
      </sec>
      <sec>
         <st>
            <p>Results/Discussion</p>
         </st>
         <p>In the next two sections, we describe the dependence trees and how they are combined in a mixture to find groups of developmental profiles. Subsequently, we present the results of applying our method to three lymphoid cell datasets. In the last subsection, we analyze the groups of genes found by our mixture of dependence trees (MixDTrees) for common patterns of microRNA binding sites, in order to gain insights into the regulatory function of microRNAs.</p>
         <sec>
            <st>
               <p>Dependence trees</p>
            </st>
            <p>The main assumption behind the dependence trees (DTree) is that expression levels of a particular developmental stage depend primarily on expression levels of the immediately preceding stage. For example, for the tree in Fig. <figr fid="F2">2</figr>, we can approximate the joint probability density function (pdf) of four random variables (<it>X</it><sub><it>A</it></sub>, <it>X</it><sub><it>B</it></sub>, <it>X</it><sub><it>C</it></sub>, <it>X</it><sub><it>D</it></sub>) by</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Example of a simple developmental tree and a cluster of developmental profiles</p>
               </caption>
               <text>
                  <p><b>Example of a simple developmental tree and a cluster of developmental profiles</b>. On the left, we depict a simple developmental tree, where arrows represent dependencies between variables. Above each tree variable, we depict a distribution related to it. On the right, we display the gene expression values (<it>y</it>-axis) in the distinct developmental stages (<it>x</it>-axis). Each line corresponds to the developmental profile of a given gene along a particular path of the tree on the left, as in a time-course plot. Distinct paths have distinct colors, in correspondence with the tree on the left. In this particular example, the path A, B, C is shown in green and the path B, D in red. By superimposing the lines corresponding to paths B to C and B to D, we can contrast the differences in expression values of genes in these two alternative differentiation pathways.</p>
               </text>
               <graphic file="1471-2172-8-25-2"/>
            </fig>
            <p>
               <display-formula id="M1"><it>p </it>[<it>X</it><sub><it>A</it></sub>, <it>X</it><sub><it>B</it></sub>, <it>X</it><sub><it>C</it></sub>, <it>X</it><sub><it>D</it></sub>] &#8776; <it>p </it>[<it>X</it><sub><it>A</it></sub>]<it>p </it>[<it>X</it><sub><it>B</it></sub>|<it>X</it><sub><it>A</it></sub>]<it>p </it>[<it>X</it><sub><it>C</it></sub>|<it>X</it><sub><it>B</it></sub>]<it>p </it>[<it>X</it><sub><it>D</it></sub>|<it>X</it><sub><it>B</it></sub>].</display-formula>
            </p>
            <p>In other words, we condition the probability of a given variable on its immediate predecessor, in accordance with the tree structure shown in Fig. <figr fid="F2">2</figr>. The figure also depicts a cluster of hypothetical genes with similar developmental profiles (Fig. <figr fid="F2">2</figr>, right). The genes display average expression in stage A, up-regulation in stage B, down-regulation in stage C and up-regulation in stage D. Furthermore, the genes have clearly distinct expression intensities, but similar relative expression changes. Genes strongly over-expressed in B are also strongly under-expressed in C and strongly expressed in D. These dependencies are reflected in the correlation between these stages. For example, A and B (or B and D) are positively correlated, and stages B and C are negatively correlated. A statistical model for such developmental profiles has to include these dependencies between subsequent stages, as provided by dependence trees. Let <it>X </it>= (<it>X</it><sub>1</sub>, ..., <it>X</it><sub><it>u</it></sub>, ..., <it>X</it><sub><it>L</it></sub>) be an <it>L</it>-dimensional continuous random vector, where the variable <it>X</it><sub><it>u </it></sub>denotes the expression values of the developmental stage <it>u</it>, and let <it>x </it>= (<it>x</it><sub>1</sub>, ..., <it>x</it><sub><it>L</it></sub>) denote a realization of <it>X </it>representing a developmental profile of a gene. We represent a tree by its predecessor or parent map, pa: {1, ..., <it>L</it>} &#8594; {1, ..., <it>L</it>}, for which we assume without loss of generality that pa(1) = 1 and 1 &#8804; pa(<it>u</it>) &lt; <it>u </it>for <it>u </it>&gt; 1. Then we can write the probability density function (pdf) of a DTree as</p>
            <p>
               <display-formula id="M2">
                  <m:math name="1471-2172-8-25-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>p</m:mi>
                           <m:mo stretchy="false">[</m:mo>
                           <m:mi>x</m:mi>
                           <m:mo>|</m:mo>
                           <m:mi>&#952;</m:mi>
                           <m:mo stretchy="false">]</m:mo>
                           <m:mo>=</m:mo>
                           <m:mi>p</m:mi>
                           <m:mo stretchy="false">[</m:mo>
                           <m:msub>
                              <m:mi>x</m:mi>
                              <m:mn>1</m:mn>
                           </m:msub>
                           <m:mo>|</m:mo>
                           <m:msub>
                              <m:mi>&#964;</m:mi>
                              <m:mn>1</m:mn>
                           </m:msub>
                           <m:mo stretchy="false">]</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8719;</m:mo>
                                 <m:mrow>
                                    <m:mi>u</m:mi>
                                    <m:mo>=</m:mo>
                                    <m:mn>2</m:mn>
                                 </m:mrow>
                                 <m:mi>L</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mo stretchy="false">[</m:mo>
                                 <m:msub>
                                    <m:mi>x</m:mi>
                                    <m:mi>u</m:mi>
                                 </m:msub>
                                 <m:mo>|</m:mo>
                                 <m:msub>
                                    <m:mi>x</m:mi>
                                    <m:mrow>
                                       <m:mtext>pa</m:mtext>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>u</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>,</m:mo>
                                 <m:msub>
                                    <m:mi>&#964;</m:mi>
                                    <m:mi>u</m:mi>
                                 </m:msub>
                                 <m:mo stretchy="false">]</m:mo>
                                 <m:mo>.</m:mo>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGWbaCcqGGBbWwcqWG4baEcqGG8baFiiGacqWF4oqCcqGGDbqxcqGH9aqpcqWGWbaCcqGGBbWwcqWG4baEdaWgaaWcbaGaeGymaedabeaakiabcYha8jab=r8a0naaBaaaleaacqaIXaqmaeqaaOGaeiyxa01aaebCaeaacqWGWbaCcqGGBbWwcqWG4baEdaWgaaWcbaGaemyDauhabeaakiabcYha8jabdIha4naaBaaaleaacqqGWbaCcqqGHbqycqGGOaakcqWG1bqDcqGGPaqkaeqaaOGaeiilaWIae8hXdq3aaSbaaSqaaiabdwha1bqabaGccqGGDbqxcqGGUaGlaSqaaiabdwha1jabg2da9iabikdaYaqaaiabdYeambqdcqGHpis1aaaa@5D38@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>We denote the model parameters by <it>&#952; </it>= (<it>&#964;</it><sub>1</sub>, ..., <it>&#964;</it><sub><it>u</it></sub>, ..., <it>&#964;</it><sub><it>L</it></sub>) and the DTree by the tuple (<it>X</it>, pa, <it>&#952;</it>). Note that a DTree can also be viewed as an approximation of the joint distribution of an <it>L</it>-dimensional continuous random vector by a product of <it>L </it>- 1 second order distributions <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
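            <p>As an illustration, the parent map and the resulting factorization can be sketched in a few lines of Python. This is only a minimal sketch under stated assumptions: the function name <it>factor_pairs</it>, the dictionary encoding of pa, and the numbering of stages A, B, C, D of Fig. <figr fid="F2">2</figr> as 1 to 4 are hypothetical choices made for this example and are not taken from our implementation.</p>

```python
def factor_pairs(pa):
    """List the (stage, parent) pairs of the non-root factors of a DTree.

    pa is a 1-indexed parent map stored as a dict (an assumed encoding);
    pa[1] == 1 marks the root, and every other parent must precede its child.
    """
    if pa[1] != 1:
        raise ValueError("stage 1 must be the root, i.e. pa(1) = 1")
    pairs = []
    for u in sorted(pa):
        if u == 1:
            continue  # the root contributes the unconditional factor p[x_1]
        if pa[u] not in range(1, u):
            raise ValueError(f"parent of stage {u} must precede it")
        pairs.append((u, pa[u]))
    return pairs


# Tree of Fig. 2 with stages A, B, C, D numbered 1 to 4 (hypothetical numbering).
fig2_pa = {1: 1, 2: 1, 3: 2, 4: 2}
print(factor_pairs(fig2_pa))  # [(2, 1), (3, 2), (4, 2)]
```

            <p>For the four-stage tree this yields <it>L </it>- 1 = 3 pairwise factors, one per arrow of the tree, in addition to the unconditional factor for the root.</p>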
            <p>We use conditional Gaussian density functions <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> as conditional densities, denoted by <it>p </it>[<it>x</it><sub><it>u</it></sub>|<it>x</it><sub>pa(<it>u</it>)</sub>, <it>&#964;</it><sub><it>u</it></sub>] in Eq. 2. Hence, for a given developmental profile <it>x </it>and a non-root developmental stage <it>u </it>with pa(<it>u</it>) = <it>v</it>, the pdf takes the form</p>
            <p>
               <display-formula id="M3">
                  <m:math name="1471-2172-8-25-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>p</m:mi>
                           <m:mo stretchy="false">[</m:mo>
                           <m:msub>
                              <m:mi>x</m:mi>
                              <m:mi>u</m:mi>
                           </m:msub>
                           <m:mo>|</m:mo>
                           <m:msub>
                              <m:mi>x</m:mi>
                              <m:mi>v</m:mi>
                           </m:msub>
                           <m:mo>,</m:mo>
                           <m:msub>
                              <m:mi>&#964;</m:mi>
                              <m:mi>u</m:mi>
                           </m:msub>
                           <m:mo stretchy="false">]</m:mo>
                           <m:mo>=</m:mo>
                           <m:msup>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msqrt>
                                    <m:mrow>
                                       <m:mn>2</m:mn>
                                       <m:mi>&#960;</m:mi>
                                    </m:mrow>
                                 </m:msqrt>
                                 <m:msub>
                                    <m:mi>&#963;</m:mi>
                                    <m:mrow>
                                       <m:mi>u</m:mi>
                                       <m:mo>|</m:mo>
                                       <m:mi>v</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msup>
                           <m:mi>exp</m:mi>
                           <m:mo>&#8289;</m:mo>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>x</m:mi>
                                                <m:mi>u</m:mi>
                                             </m:msub>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>&#956;</m:mi>
                                                <m:mi>u</m:mi>
                                             </m:msub>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>w</m:mi>
                                                <m:mrow>
                                                   <m:mi>u</m:mi>
                                                   <m:mo>|</m:mo>
                                                   <m:mi>v</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>x</m:mi>
                                                <m:mi>v</m:mi>
                                             </m:msub>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>&#956;</m:mi>
                                                <m:mi>v</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msup>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mn>2</m:mn>
                                       <m:msubsup>
                                          <m:mi>&#963;</m:mi>
                                          <m:mrow>
                                             <m:mi>u</m:mi>
                                             <m:mo>|</m:mo>
                                             <m:mi>v</m:mi>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                    </m:mrow>
                                 </m:mfrac>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGWbaCcqGGBbWwcqWG4baEdaWgaaWcbaGaemyDauhabeaakiabcYha8jabdIha4naaBaaaleaacqWG2bGDaeqaaOGaeiilaWccciGae8hXdq3aaSbaaSqaaiabdwha1bqabaGccqGGDbqxcqGH9aqpcqGGOaakdaGcaaqaaiabikdaYiab=b8aWbWcbeaakiab=n8aZnaaBaaaleaacqWG1bqDcqGG8baFcqWG2bGDaeqaaOGaeiykaKYaaWbaaSqabeaacqGHsislcqaIXaqmaaGccyGGLbqzcqGG4baEcqGGWbaCdaqadaqaamaalaaabaGaeyOeI0IaeiikaGIaemiEaG3aaSbaaSqaaiabdwha1bqabaGccqGHsislcqWF8oqBdaWgaaWcbaGaemyDauhabeaakiabgkHiTiabdEha3naaBaaaleaacqWG1bqDcqGG8baFcqWG2bGDaeqaaOGaeiikaGIaemiEaG3aaSbaaSqaaiabdAha2bqabaGccqGHsislcqWF8oqBdaWgaaWcbaGaemODayhabeaakiabcMcaPiabcMcaPmaaCaaaleqabaGaeGOmaidaaaGcbaGaeGOmaiJae83Wdm3aa0baaSqaaiabdwha1jabcYha8jabdAha2bqaaiabikdaYaaaaaaakiaawIcacaGLPaaacqGGSaalaaa@74E7@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>&#964;</it><sub><it>u </it></sub>= (<it>&#956;</it><sub><it>u</it></sub>, <it>w</it><sub><it>u</it>|<it>v</it></sub>, <inline-formula><m:math name="1471-2172-8-25-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mrow><m:mi>u</m:mi><m:mo>|</m:mo><m:mi>v</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemyDauNaeiiFaWNaemODayhabaGaeGOmaidaaaaa@33FD@</m:annotation></m:semantics></m:math></inline-formula>) are the parameters for one conditional density in the model.</p>
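            <p>As a minimal sketch (not the authors' implementation), the conditional density above can be evaluated directly from its definition. The function and argument names are illustrative assumptions; <it>var_uv</it> stands for the conditional variance and <it>w_uv</it> for the weight <it>w</it><sub><it>u</it>|<it>v</it></sub>.</p>

```python
import math


def conditional_gaussian_pdf(x_u, x_v, mu_u, mu_v, w_uv, var_uv):
    """Density of x_u given the parent value x_v (hypothetical helper).

    Implements (sqrt(2*pi) * sigma)^-1 * exp(-r^2 / (2 * sigma^2)),
    where r is the residual of x_u around the linear prediction from x_v.
    """
    # Residual around the mean shifted by the parent's deviation from its mean.
    residual = x_u - mu_u - w_uv * (x_v - mu_v)
    normaliser = 1.0 / math.sqrt(2.0 * math.pi * var_uv)
    return normaliser * math.exp(-residual ** 2 / (2.0 * var_uv))
```

            <p>With <it>w_uv</it> set to zero the parent value has no influence and the function reduces to a univariate normal density.</p>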
            <p>For a given expression data set consisting of measurements for <it>N </it>genes at <it>L </it>developmental stages, let <it>x</it><sub><it>i </it></sub>= (<it>x</it><sub><it>i</it>1</sub>, ..., <it>x</it><sub><it>iu</it></sub>, ..., <it>x</it><sub><it>iL</it></sub>) be the developmental profile of gene <it>i</it>, and <it>x</it><sub><it>iu </it></sub>be the expression value of gene <it>i </it>at developmental stage <it>u</it>, for 1 &#8804; <it>i </it>&#8804; <it>N </it>and 1 &#8804; <it>u </it>&#8804; <it>L</it>. As derived in the Protocol (Additional data file <supplr sid="S1">1</supplr>), the maximum likelihood estimates (MLEs) for the parameters of the conditional Gaussian are</p>
            <suppl id="S1">
               <title>
                  <p>Additional data file 1</p>
               </title>
               <text>
                  <p><b>Protocol</b>. This file contains information on software implementations, derivations of estimation formulas and additional experiments with simulated data.</p>
               </text>
               <file name="1471-2172-8-25-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>
               <display-formula id="M4">
                  <m:math name="1471-2172-8-25-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mover accent="true">
                                 <m:mi>&#956;</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mi>u</m:mi>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mo>=</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                                 <m:mi>N</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>x</m:mi>
                                    <m:mrow>
                                       <m:mi>i</m:mi>
                                       <m:mi>u</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>/</m:mo>
                                 <m:mi>N</m:mi>
                              </m:mrow>
                           </m:mstyle>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWF8oqBgaqcamaaBaaaleaacqWG1bqDaeqaaOGaeyypa0JaeiikaGYaaabCaeaacqWG4baEdaWgaaWcbaGaemyAaKMaemyDauhabeaakiabcMcaPiabc+caViabd6eaobWcbaGaemyAaKMaeyypa0JaeGymaedabaGaemOta4eaniabggHiLdGccqGGSaalaaa@4104@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>
               <display-formula id="M5">
                  <m:math name="1471-2172-8-25-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mover accent="true">
                                 <m:mi>w</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mrow>
                                 <m:mi>u</m:mi>
                                 <m:mo>|</m:mo>
                                 <m:mi>v</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>&#963;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:mi>u</m:mi>
                                       <m:mi>v</m:mi>
                                    </m:mrow>
                                 </m:msub>
                              </m:mrow>
                              <m:mrow>
                                 <m:msubsup>
                                    <m:mover accent="true">
                                       <m:mi>&#963;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mi>v</m:mi>
                                    <m:mn>2</m:mn>
                                 </m:msubsup>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>,</m:mo>
                           <m:mtext>&#160;and</m:mtext>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG3bWDgaqcamaaBaaaleaacqWG1bqDcqGG8baFcqWG2bGDaeqaaOGaeyypa0ZaaSaaaeaaiiGacuWFdpWCgaqcamaaBaaaleaacqWG1bqDcqWG2bGDaeqaaaGcbaGaf83WdmNbaKaadaqhaaWcbaGaemODayhabaGaeGOmaidaaaaakiabcYcaSiabbccaGiabbggaHjabb6gaUjabbsgaKbaa@42ED@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>
               <display-formula id="M6">
                  <m:math name="1471-2172-8-25-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msubsup>
                              <m:mover accent="true">
                                 <m:mi>&#963;</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mrow>
                                 <m:mi>u</m:mi>
                                 <m:mo>|</m:mo>
                                 <m:mi>v</m:mi>
                              </m:mrow>
                              <m:mn>2</m:mn>
                           </m:msubsup>
                           <m:mo>=</m:mo>
                           <m:msubsup>
                              <m:mover accent="true">
                                 <m:mi>&#963;</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mi>u</m:mi>
                              <m:mn>2</m:mn>
                           </m:msubsup>
                           <m:mo>&#8722;</m:mo>
                           <m:msubsup>
                              <m:mover accent="true">
                                 <m:mi>w</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mrow>
                                 <m:mi>u</m:mi>
                                 <m:mo>|</m:mo>
                                 <m:mi>v</m:mi>
                              </m:mrow>
                              <m:mn>2</m:mn>
                           </m:msubsup>
                           <m:msubsup>
                              <m:mover accent="true">
                                 <m:mi>&#963;</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mi>v</m:mi>
                              <m:mn>2</m:mn>
                           </m:msubsup>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaqcamaaDaaaleaacqWG1bqDcqGG8baFcqWG2bGDaeaacqaIYaGmaaGccqGH9aqpcuWFdpWCgaqcamaaDaaaleaacqWG1bqDaeaacqaIYaGmaaGccqGHsislcuWG3bWDgaqcamaaDaaaleaacqWG1bqDcqGG8baFcqWG2bGDaeaacqaIYaGmaaGccuWFdpWCgaqcamaaDaaaleaacqWG2bGDaeaacqaIYaGmaaGccqGGUaGlaaa@46DC@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>These terms can be computed from the sufficient statistics</p>
            <p>
               <display-formula id="M7">
                  <m:math name="1471-2172-8-25-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msubsup>
                              <m:mover accent="true">
                                 <m:mi>&#963;</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mi>u</m:mi>
                              <m:mn>2</m:mn>
                           </m:msubsup>
                           <m:mo>=</m:mo>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mo>=</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                                 <m:mi>N</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:msup>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>x</m:mi>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mi>u</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>&#956;</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mi>u</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                    <m:mn>2</m:mn>
                                 </m:msup>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>/</m:mo>
                                 <m:mi>N</m:mi>
                              </m:mrow>
                           </m:mstyle>
                           <m:mo>,</m:mo>
                           <m:mtext>&#160;and</m:mtext>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaqcamaaDaaaleaacqWG1bqDaeaacqaIYaGmaaGccqGH9aqpcqGGOaakdaaeWbqaaiabcIcaOiabdIha4naaBaaaleaacqWGPbqAcqWG1bqDaeqaaOGaeyOeI0Iaf8hVd0MbaKaadaWgaaWcbaGaemyDauhabeaakiabcMcaPmaaCaaaleqabaGaeGOmaidaaOGaeiykaKIaei4la8IaemOta4ealeaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGobGta0GaeyyeIuoakiabcYcaSiabbccaGiabbggaHjabb6gaUjabbsgaKbaa@4DF8@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>
               <display-formula id="M8">
                  <m:math name="1471-2172-8-25-i8" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mover accent="true">
                                 <m:mi>&#963;</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mrow>
                                 <m:mi>u</m:mi>
                                 <m:mi>v</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mo>=</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                                 <m:mi>N</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mi>x</m:mi>
                                    <m:mrow>
                                       <m:mi>i</m:mi>
                                       <m:mi>u</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>&#956;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mi>u</m:mi>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mi>x</m:mi>
                                    <m:mrow>
                                       <m:mi>i</m:mi>
                                       <m:mi>v</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>&#956;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mi>v</m:mi>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>/</m:mo>
                                 <m:mi>N</m:mi>
                              </m:mrow>
                           </m:mstyle>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaqcamaaBaaaleaacqWG1bqDcqWG2bGDaeqaaOGaeyypa0JaeiikaGYaaabCaeaacqGGOaakcqWG4baEdaWgaaWcbaGaemyAaKMaemyDauhabeaakiabgkHiTiqb=X7aTzaajaWaaSbaaSqaaiabdwha1bqabaGccqGGPaqkcqGGOaakcqWG4baEdaWgaaWcbaGaemyAaKMaemODayhabeaakiabgkHiTiqb=X7aTzaajaWaaSbaaSqaaiabdAha2bqabaGccqGGPaqkcqGGPaqkcqGGVaWlcqWGobGtaSqaaiabdMgaPjabg2da9iabigdaXaqaaiabd6eaobqdcqGHris5aOGaeiOla4caaa@531D@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
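            <p>The estimates above can be sketched in a few lines of code. This is an illustrative example under stated assumptions rather than the published software: the function name is hypothetical, the two input lists hold the expression values of all <it>N </it>genes at stages <it>u </it>and <it>v</it>, and the variance at stage <it>v </it>is assumed to be non-zero.</p>

```python
def fit_conditional_gaussian(x_u, x_v):
    """Return (mu_u, w_uv, var_uv) estimated from paired stage values.

    x_u, x_v: equal-length sequences of expression values for stage u
    and its parent stage v (hypothetical inputs).
    """
    n = len(x_u)
    mu_u = sum(x_u) / n
    mu_v = sum(x_v) / n
    # Sufficient statistics: variances and covariance, all divided by N.
    var_u = sum((a - mu_u) ** 2 for a in x_u) / n
    var_v = sum((b - mu_v) ** 2 for b in x_v) / n
    cov_uv = sum((a - mu_u) * (b - mu_v) for a, b in zip(x_u, x_v)) / n
    # Regression weight and residual (conditional) variance.
    w_uv = cov_uv / var_v
    var_uv = var_u - w_uv ** 2 * var_v
    return mu_u, w_uv, var_uv
```

            <p>For perfectly linear data such as <it>x</it><sub><it>u </it></sub>= 2<it>x</it><sub><it>v </it></sub>+ 1, the sketch returns a weight of 2 and a conditional variance of 0.</p>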
            <p>The conditional normal distribution can be seen as estimating a linear fit between <it>X</it><sub><it>u </it></sub>and <it>X</it><sub><it>v</it></sub>, where <it>w</it><sub><it>u</it>|<it>v </it></sub>&gt; 0 indicates a positive linear correlation and <it>w</it><sub><it>u</it>|<it>v </it></sub>&lt; 0 a negative linear correlation between the variables; <it>w</it><sub><it>u</it>|<it>v </it></sub>= 0 if the variables are independent. Furthermore, <it>w</it><sub><it>u</it>|<it>v </it></sub>and <inline-formula><m:math name="1471-2172-8-25-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mrow><m:mi>u</m:mi><m:mo>|</m:mo><m:mi>v</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemyDauNaeiiFaWNaemODayhabaGaeGOmaidaaaaa@33FD@</m:annotation></m:semantics></m:math></inline-formula> are related because the better the linear fit, the smaller the variance. For the special case of the root (recall pa(1) = 1), <it>w</it><sub>1|1 </sub>is set to zero, and the conditional density is effectively a univariate normal. In total, the model has 3<it>L </it>&#8722; 1 free parameters.</p>
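            <p>The relation between the weight and the conditional variance can be made explicit by substituting the estimate of the weight into the estimate of the conditional variance. The following worked step is a sketch; the sample correlation coefficient, written here as a hatted rho, is a symbol introduced only for this illustration.</p>

```latex
\hat{\sigma}_{u|v}^{2}
  = \hat{\sigma}_{u}^{2} - \hat{w}_{u|v}^{2}\,\hat{\sigma}_{v}^{2}
  = \hat{\sigma}_{u}^{2} - \frac{\hat{\sigma}_{uv}^{\,2}}{\hat{\sigma}_{v}^{2}}
  = \hat{\sigma}_{u}^{2}\left(1 - \hat{\rho}_{uv}^{\,2}\right),
\qquad
\hat{\rho}_{uv} = \frac{\hat{\sigma}_{uv}}{\hat{\sigma}_{u}\,\hat{\sigma}_{v}}
```

            <p>A stronger linear relation (a correlation closer to 1 in absolute value) therefore leaves less residual variance. The parameter count follows in the same way: the root contributes two free parameters (mean and variance, its weight being fixed at zero) and each of the remaining <it>L </it>&#8722; 1 stages contributes three, giving 2 + 3(<it>L </it>&#8722; 1) = 3<it>L </it>&#8722; 1.</p>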
            <p>A simple but useful application is to query the developmental profiles in a data set with a tree model. By defining the model parameters interactively, we can compute the likelihood (Eq. 2) of all expression profiles <it>x</it><sub><it>i</it></sub>, rank them accordingly, and list the <it>m </it>most likely profiles (see <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> for the tool description and tutorial). This interactive tool allows biological experts to find genes following a developmental profile of interest.</p>
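            <p>The query procedure can be sketched as follows. This is not the tool described in the cited tutorial but a minimal illustration, assuming that the likelihood of a profile factorises into one conditional Gaussian per stage along the tree. All names are hypothetical, stages are indexed from 0 (so the root satisfies parent[0] = 0), and log-likelihoods are used for numerical stability, which leaves the ranking unchanged.</p>

```python
import math


def log_likelihood(profile, parent, params):
    """Log-likelihood of one profile under a tree of conditional Gaussians.

    parent[u] is the parent stage of u (the root is its own parent);
    params[u] = (mu_u, w_u, var_u), with w_u = 0 for the root.
    """
    total = 0.0
    for u, (mu_u, w_u, var_u) in enumerate(params):
        v = parent[u]
        mu_v = params[v][0]
        residual = profile[u] - mu_u - w_u * (profile[v] - mu_v)
        total += -0.5 * math.log(2.0 * math.pi * var_u)
        total -= residual ** 2 / (2.0 * var_u)
    return total


def top_profiles(profiles, parent, params, m):
    """Indices of the m most likely profiles, most likely first."""
    ranked = sorted(
        range(len(profiles)),
        key=lambda i: log_likelihood(profiles[i], parent, params),
        reverse=True,
    )
    return ranked[:m]


# Toy query on a chain of three stages; all values are made up.
parent = [0, 0, 1]
params = [(0.0, 0.0, 0.1), (1.0, 2.0, 0.1), (-1.0, -0.5, 0.1)]
profiles = [[0.0, 1.0, -1.0], [5.0, 5.0, 5.0], [0.1, 0.9, -0.9]]
best = top_profiles(profiles, parent, params, 2)
```

            <p>In this toy query the first profile matches the model means exactly and is ranked first, followed by the third profile, which deviates only slightly.</p>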
            <p>Returning to the example in Fig. <figr fid="F2">2</figr>, the model estimates given the tree and developmental profiles are</p>
            <p>
               <display-formula>
                  <m:math name="1471-2172-8-25-i9" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#964;</m:mi>
                                          <m:mi>A</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>=</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>&#956;</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mi>A</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>w</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mi>A</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msubsup>
                                          <m:mover accent="true">
                                             <m:mi>&#963;</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mi>A</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>=</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mn>0.01</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>0</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>0.02</m:mn>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#964;</m:mi>
                                          <m:mi>B</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>=</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>&#956;</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mi>B</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>w</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mrow>
                                             <m:mi>B</m:mi>
                                             <m:mo>|</m:mo>
                                             <m:mi>A</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msubsup>
                                          <m:mover accent="true">
                                             <m:mi>&#963;</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mrow>
                                             <m:mi>B</m:mi>
                                             <m:mo>|</m:mo>
                                             <m:mi>A</m:mi>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>=</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>0.97</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>2.2</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>0.02</m:mn>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#964;</m:mi>
                                          <m:mi>C</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>=</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>&#956;</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mi>C</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>w</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mrow>
                                             <m:mi>C</m:mi>
                                             <m:mo>|</m:mo>
                                             <m:mi>B</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msubsup>
                                          <m:mover accent="true">
                                             <m:mi>&#963;</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mrow>
                                             <m:mi>C</m:mi>
                                             <m:mo>|</m:mo>
                                             <m:mi>B</m:mi>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>=</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mn>0.99</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mn>0.3</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>0.01</m:mn>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>,</m:mo>
                                       <m:mtext>&#160;and</m:mtext>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#964;</m:mi>
                                          <m:mi>D</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>=</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>&#956;</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mi>D</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>w</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mrow>
                                             <m:mi>D</m:mi>
                                             <m:mo>|</m:mo>
                                             <m:mi>B</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msubsup>
                                          <m:mover accent="true">
                                             <m:mi>&#963;</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mrow>
                                             <m:mi>D</m:mi>
                                             <m:mo>|</m:mo>
                                             <m:mi>B</m:mi>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>=</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>0.45</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>0.53</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>0.01</m:mn>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>.</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaafaqadeabfaaaaaqaaGGaciab=r8a0naaBaaaleaacqWGbbqqaeqaaaGcbaGaeyypa0dabaGaeiikaGIaf8hVd0MbaKaadaWgaaWcbaGaemyqaeeabeaakiabcYcaSiqbdEha3zaajaWaaSbaaSqaaiabdgeabbqabaGccqGGSaalcuWFdpWCgaqcamaaDaaaleaacqWGbbqqaeaacqaIYaGmaaGccqGGPaqkaeaacqGH9aqpaeaacqGGOaakcqGHsislcqaIWaamcqGGUaGlcqaIWaamcqaIXaqmcqGGSaalcqaIWaamcqGGSaalcqaIWaamcqGGUaGlcqaIWaamcqaIYaGmcqGGPaqkcqGGSaalaeaacqWFepaDdaWgaaWcbaGaemOqaieabeaaaOqaaiabg2da9aqaaiabcIcaOiqb=X7aTzaajaWaaSbaaSqaaiabdkeacbqabaGccqGGSaalcuWG3bWDgaqcamaaBaaaleaacqWGcbGqcqGG8baFcqWGbbqqaeqaaOGaeiilaWIaf83WdmNbaKaadaqhaaWcbaGaemOqaiKaeiiFaWNaemyqaeeabaGaeGOmaidaaOGaeiykaKcabaGaeyypa0dabaGaeiikaGIaeGimaaJaeiOla4IaeGyoaKJaeG4naCJaeiilaWIaeGOmaiJaeiOla4IaeGOmaiJaeiilaWIaeGimaaJaeiOla4IaeGimaaJaeGOmaiJaeiykaKIaeiilaWcabaGae8hXdq3aaSbaaSqaaiabdoeadbqabaaakeaacqGH9aqpaeaacqGGOaakcuWF8oqBgaqcamaaBaaaleaacqWGdbWqaeqaaOGaeiilaWIafm4DaCNbaKaadaWgaaWcbaGaem4qamKaeiiFaWNaemOqaieabeaakiabcYcaSiqb=n8aZzaajaWaa0baaSqaaiabdoeadjabcYha8jabdkeacbqaaiabikdaYaaakiabcMcaPaqaaiabg2da9aqaaiabcIcaOiabgkHiTiabicdaWiabc6caUiabiMda5iabiMda5iabcYcaSiabgkHiTiabicdaWiabc6caUiabiodaZiabcYcaSiabicdaWiabc6caUiabicdaWiabigdaXiabcMcaPiabcYcaSiabbccaGiabbggaHjabb6gaUjabbsgaKbqaaiab=r8a0naaBaaaleaacqWGebaraeqaaaGcbaGaeyypa0dabaGaeiikaGIaf8hVd0MbaKaadaWgaaWcbaGaemiraqeabeaakiabcYcaSiqbdEha3zaajaWaaSbaaSqaaiabdseaejabcYha8jabdkeacbqabaGccqGGSaalcuWFdpWCgaqcamaaDaaaleaacqWGebarcqGG8baFcqWGcbGqaeaacqaIYaGmaaGccqGGPaqkaeaacqGH9aqpaeaacqGGOaakcqaIWaamcqGGUaGlcqaI0aancqaI1aqncqGGSaalcqaIWaamcqGGUaGlcqaI1aqncqaIZaWmcqGGSaalcqaIWaamcqGGUaGlcqaIWaamcqaIXaqmcqGGPaqkcqGGUaGlaaaaaa@C668@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>As expected, <it>w</it><sub><it>B</it>|<it>A </it></sub>and <it>w</it><sub><it>D</it>|<it>B </it></sub>are positive, indicating a positive linear dependence between these variables. On the other hand, <it>w</it><sub><it>C</it>|<it>B </it></sub>is negative.</p>
         </sec>
         <sec>
            <st>
               <p>Mixtures of dependence trees</p>
            </st>
            <p>In order to find clusters of co-expressed genes, we combine several dependence trees (DTree) in a mixture. Each DTree is a representation of a cluster or group of genes with similar developmental profiles; that is, each DTree models distinct patterns of gene expression in the course of development (see Fig. <figr fid="F3">3</figr> for an example). The differentiation of cells is conveniently represented as a developmental tree and the structure or topology of this tree is well-known for most data sets under investigation. Consequently, all trees in a mixture share the same topology. A mixture of dependence trees accommodates overlapping clusters while reflecting the inherent dependencies between stages. Throughout this paper we refer to the presented method as well as to the resulting model as MixDTrees.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Example of a mixture of four dependence trees with the topology defined in Fig. 2</p>
               </caption>
               <text>
                  <p><b>Example of a mixture of four dependence trees with the topology defined in Fig. 2</b>. Each of the trees models distinct developmental profiles found in an example data set. Furthermore, clusters may have distinct sizes proportional to their <it>&#945;</it><sub><it>i</it></sub>'s. Note also that clusters need not have distinct expression values in branching stages. For example, stages <it>C </it>and <it>D </it>have similar expression values in clusters 3 and 4. This can be interpreted as the genes being equally expressed in the two alternative lineages.</p>
               </text>
               <graphic file="1471-2172-8-25-3"/>
            </fig>
            <p>More formally, we combine a set of <it>K </it>DTrees in a mixture model <inline-formula><m:math name="1471-2172-8-25-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>f</m:mi><m:mo stretchy="false">(</m:mo><m:mi>x</m:mi><m:mo>|</m:mo><m:mi>&#920;</m:mi><m:mo stretchy="false">)</m:mo><m:mo>=</m:mo><m:mstyle displaystyle="true"><m:msubsup><m:mo>&#8721;</m:mo><m:mrow><m:mi>k</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>K</m:mi></m:msubsup><m:mrow><m:msub><m:mi>&#945;</m:mi><m:mi>k</m:mi></m:msub><m:mi>p</m:mi><m:mo stretchy="false">[</m:mo><m:mi>x</m:mi><m:mo>|</m:mo><m:msub><m:mi>&#952;</m:mi><m:mi>k</m:mi></m:msub><m:mo stretchy="false">]</m:mo></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGMbGzcqGGOaakcqWG4baEcqGG8baFcqqHyoqucqGGPaqkcqGH9aqpdaaeWaqaaGGaciab=f7aHnaaBaaaleaacqWGRbWAaeqaaOGaemiCaaNaei4waSLaemiEaGNaeiiFaWNae8hUde3aaSbaaSqaaiabdUgaRbqabaGccqGGDbqxaSqaaiabdUgaRjabg2da9iabigdaXaqaaiabdUealbqdcqGHris5aaaa@4902@</m:annotation></m:semantics></m:math></inline-formula>, where &#920; = (<it>&#952;</it><sub>1</sub>, ..., <it>&#952;</it><sub><it>K</it></sub>, <it>&#945;</it><sub>1</sub>, ..., <it>&#945;</it><sub><it>K</it></sub>), <it>&#952;</it><sub><it>k </it></sub>denotes the parameters of the <it>k</it>-th DTree and <it>&#945;</it><sub><it>k </it></sub>is proportional to the number of developmental profiles assigned to the <it>k</it>-th Dtree; as usual <it>&#945;</it><sub><it>k </it></sub>&#8805; 0 and <inline-formula><m:math name="1471-2172-8-25-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mstyle displaystyle="true"><m:msubsup><m:mo>&#8721;</m:mo><m:mrow><m:mi>k</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>K</m:mi></m:msubsup><m:mrow><m:msub><m:mi mathvariant="script">&#945;</m:mi><m:mi>k</m:mi></m:msub><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaaeWaqaaGGaciab=f7aHnaaBaaaleaacqWGRbWAaeqaaOGaeyypa0JaeGymaedaleaacqWGRbWAcqGH9aqpcqaIXaqmaeaacqWGlbWsa0GaeyyeIuoaaaa@3853@</m:annotation></m:semantics></m:math></inline-formula>. To avoid over-fitting of the tree models, in particular for components with low component priors <it>&#945;</it><sub><it>k</it></sub>&#8211;that is, a small number of assigned genes&#8211;we propose a maximum-a-posteriori (MAP) approach, which regularizes the estimates from Eq. 5 and Eq. 6. Given this preferable characteristic, MAP estimates are used in all MixDTrees experiments, unless otherwise stated. Note also, the parameters of the mixture are estimated with the Expectation-Maximization (EM) algorithm <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> (see Methods section for EM and MAP details).</p>
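            <p>The mixture density and the posterior cluster memberships computed in the E-step of EM can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation used in this work: it assumes that each stage <it>u</it> with parent <it>v</it> is Gaussian with mean mu_u + w_u|v (x_v - mu_v) and variance sigma2_u|v, and all function names and numeric values are hypothetical.</p>

```python
import math


def normal_logpdf(x, mean, var):
    """Log-density of a univariate Gaussian."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def dtree_logpdf(x, parent, params):
    """Log-density of one profile x under a single dependence tree.

    parent[u] is the parent stage of u (None for the root) and
    params[u] = (mu, w, var). The conditional mean used here is an
    assumed parameterization: mu_u + w_u|v * (x_v - mu_v).
    """
    total = 0.0
    for u, (mu, w, var) in params.items():
        v = parent[u]
        mean = mu if v is None else mu + w * (x[v] - params[v][0])
        total += normal_logpdf(x[u], mean, var)
    return total


def responsibilities(x, parent, components, alphas):
    """Posterior probability of each tree given x (E-step of EM)."""
    logs = [math.log(a) + dtree_logpdf(x, parent, theta)
            for a, theta in zip(alphas, components)]
    top = max(logs)  # subtract the maximum for numerical stability
    weights = [math.exp(value - top) for value in logs]
    norm = sum(weights)
    return [weight / norm for weight in weights]


# Hypothetical topology A -> B, B -> C, B -> D and two illustrative trees.
parent = {"A": None, "B": "A", "C": "B", "D": "B"}
tree_1 = {"A": (0.0, 0.0, 0.02), "B": (1.0, 2.0, 0.02),
          "C": (-1.0, -0.3, 0.01), "D": (0.5, 0.5, 0.01)}
tree_2 = {stage: (0.0, 0.0, 0.05) for stage in parent}
profile = {"A": 0.0, "B": 1.0, "C": -1.0, "D": 0.5}
posterior = responsibilities(profile, parent, [tree_1, tree_2], [0.5, 0.5])
```

            <p>In this sketch the example profile coincides with the means of the first tree, so almost all posterior mass is assigned to that component.</p>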
            <p>As stated in the introduction, the problem approached here is closely related to gene expression time-course analysis. There is a vast amount of literature on models and clustering methods suitable for time-courses <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. Lately, attention has been given to the fact that these time-courses usually have few time points <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>, a characteristic previously ignored. This aspect is also essential to our application, since the number of distinguishable developmental stages is usually small, for example at most seven in our data sets. Note that a single chain of subsequent development stages, such as the stages of B-cell differentiation in Fig. <figr fid="F1">1</figr>, is by definition a tree. While dependence trees are indeed also suitable for time-courses, the complex dependency structures necessary due to branching of the developmental tree into distinct lineages prevent the use of time-course models, as there is no effective way of incorporating the necessary extensions into these models <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B22">22</abbr></abbrgrp>. In the context of mixtures, our method represents an alternative to the parameterization of the covariance matrix of a mixture of multivariate Gaussians <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. With MLE, the dependence tree model essentially imputes zeros in the covariance matrix, reducing the number of parameters to the order of <it>L</it>. If we considered all the covariances between observations for <it>L </it>developmental stages, it would be straightforward to represent the data distribution by an <it>L</it>-variate Gaussian model with full covariance matrix. 
However, the estimates for the <it>L</it><sup>2 </sup>parameters are often unreliable even for small values of <it>L </it>and the parameter estimation is prone to over-fit to outliers often found in noisy and scarce data. In fact, mixtures of Gaussians with full covariance matrix were outperformed by simpler parameterizations of the covariance matrices in the context of gene expression time courses <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
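            <p>The difference in model complexity can be made concrete by counting free parameters per mixture component. The following Python sketch uses one counting convention that is assumed here (a mean and a variance for the root, plus a mean, a weight and a conditional variance for every other stage); the function names are hypothetical.</p>

```python
def n_params_dtree(n_stages):
    """Dependence tree: 2 parameters for the root, 3 for every other stage."""
    return 2 + 3 * (n_stages - 1)


def n_params_diag(n_stages):
    """Gaussian with diagonal covariance: one mean and one variance per stage."""
    return 2 * n_stages


def n_params_full(n_stages):
    """Gaussian with full covariance: means plus a symmetric covariance matrix."""
    return n_stages + n_stages * (n_stages + 1) // 2


# Parameter counts for small numbers of developmental stages.
counts = {n: (n_params_diag(n), n_params_dtree(n), n_params_full(n))
          for n in (5, 7, 20)}
```

            <p>The tree model grows linearly in the number of stages, whereas the full covariance model grows quadratically, so the gap widens as more stages are measured.</p>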
         </sec>
         <sec>
            <st>
               <p>Application in lymphocyte cell development</p>
            </st>
            <p>We apply our method to obtain MixDTrees for the data sets TCell, BCell, LymphoidTree, and SIM (see the Methods section for details) and compare our clustering results to previous work. Our data are complemented with information from OMIM <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, the Gene Ontology database <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> and the literature. For TCell and BCell, we use the same number of clusters as Hoffmann and colleagues (20) <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B35">35</abbr></abbrgrp>, and for LymphoidTree we apply the BIC criterion <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> (see Fig. S4 in Additional data file <supplr sid="S2">2</supplr>), which also resulted in an optimal choice of 20 clusters. As discussed in the Dependence trees section, a simple way to check for similarities in the expression between developmental stages is to compute the correlation matrix of the data set at hand (see the Mixtures of dependence trees estimation section).</p>
            <suppl id="S2">
               <title>
                  <p>Additional data file 2</p>
               </title>
               <text>
                  <p><b>Supplementary Figures</b>. Figures 1, 2 and 3 contain all cluster results from MixDTrees on BCell, TCell and LymphoidTree, and Figure 4 contains BIC results from LymphoidTree. Figures 5 and 6 contain comparisons between microRNA enrichment with MixDTrees-MAP and SOM in TCell and BCell, Figures 7 and 8 depict the empirical cumulative distribution function (cdf) of microRNA enrichment <it>p</it>-values from TCell and BCell, and Figures 9 and 10 contain comparisons between microRNA enrichment with MixDTrees-MAP and MixDTrees-MLE in TCell and BCell. Figure 11 describes the cluster size distribution of clustering results in TCell and BCell.</p>
               </text>
               <file name="1471-2172-8-25-S2.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>T cell development (TCell)</p>
            </st>
            <p>TCell is a gene expression data set from seven differentiation stages of T cell development (see the Methods section and Fig. <figr fid="F1">1</figr> for details). The only branch in this tree is the final differentiation of DPS precursors into CD4 single positive SP4 cells and CD8 single positive SP8 cells. Most clusters show a distinctive pattern of differential expression along the developmental path but do not differ between SP4 and SP8 cells (clusters 4, 7, 11, 13, 14, 15, 16, 19, and 20). The most drastic changes occur at the DPL stage, in which the cells are proliferating and subsequently start to rearrange the TCR<it>&#945;</it>-locus. This is also reflected in the overall correlation matrix (Table S1 in Additional data file <supplr sid="S3">3</supplr>). Although the expression values of all neighboring stages are positively correlated, the correlation between the DPL stage and the DPS stage is much smaller than the correlations among the double negative stages, all of which are relatively highly correlated. The correlation matrix suggests that SP4 and SP8 cells are more similar to each other than to their precursor DPS cells, which is expected since the two types of mature T cells share many cellular functions <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. The largest differences with respect to SP4 and SP8 are found in clusters 5 and 18 (Fig. <figr fid="F4">4</figr>). In cluster 5, cell-cycle genes are clearly enriched. In contrast, cluster 18 mainly contains regulatory proteins involved in transcription and signaling (see Fig. <figr fid="F4">4</figr>).</p>
            <suppl id="S3">
               <title>
                  <p>Additional data file 3</p>
               </title>
               <text>
                  <p><b>Supplementary Tables</b>. Tables 1, 2 and 3 contain correlation matrices from BCell, TCell and LymphoidTree datasets; Tables 4, 5 and 6 contain enriched microRNA and gene targets from SOM results on TCell and BCell and from MixDTrees-MAP results on LymphoidTree; Tables 7, 8 and 9 contain microRNA enrichment <it>p</it>-values for BCell, TCell and LymphoidTree on MixDTrees-MAP results; Tables 10 and 11 contain microRNA enrichment <it>p</it>-values for BCell and TCell on SOM results; Tables 12 and 13 contain the contingency tables comparing clusters from MixDTrees-MAP and SOM with BCell and TCell datasets; and Tables 14 and 15 contain the contingency tables comparing clusters from MixDTrees-MAP and MixDTrees-MLE with BCell and TCell datasets.</p>
               </text>
               <file name="1471-2172-8-25-S3.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Selected clusters from MixDTrees on TCell</p>
               </caption>
               <text>
                  <p><b>Selected clusters from MixDTrees on TCell</b>. We depict clusters 5, 8 and 18 found in TCell, with expression values on the y-axis and cell types on the x-axis. Lines corresponding to developmental profile values between stages DN2, DN3, DN4, DPL, DPS and SP4 are in green and between DPS and SP8 in red.</p>
               </text>
               <graphic file="1471-2172-8-25-4"/>
            </fig>
            <p>Hoffmann and colleagues used self-organizing maps (SOM) to cluster the expression profiles <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B35">35</abbr></abbrgrp>. From now on, we refer to Hoffmann and colleagues' results simply as SOM. In our analysis we observe clusters with similar developmental profiles, which we define as the average over the gene expression profiles of a cluster. As expected, there is not a one-to-one relationship between the two clusterings. While the single gene profiles are similar since we used analogous normalization and filtering procedures (see the Methods section), the actual gene clusterings differ (see Table S12 in Additional data file <supplr sid="S3">3</supplr>). An objective assessment of clustering quality on developmental data is impossible due to the lack of benchmarking data. Furthermore, there is no agreement in the literature on a methodology to validate clustering results <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. In order to demonstrate that our method is able to extract additional biological information, we concentrate our discussion on clusters of distinct developmental profiles that could not be detected by SOM <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. For such a cluster we assign functions to genes using the GO term annotation and complementary literature. Ideally, the functions of all genes of the cluster would match the cellular processes of the particular developmental stage at which these genes are over-expressed. Additionally, if some of these genes are of unknown function, then the developmental profile can help to generate hypotheses about their functional role. In our analysis we find that genes of cluster 8 are over-expressed in DN3 and DN4 cells, a developmental profile that has not been previously discovered (Fig. <figr fid="F4">4</figr>). With SOM, the genes of this cluster are dispersed over two clusters (see Table S12 in Additional data file <supplr sid="S3">3</supplr>). 
Out of the 30 genes of cluster 8, seven are related to vesicle transport or to the Golgi/ER system. Additionally, we find five cell-cycle related genes, three involved in mitochondrial function, and seven genes of other functions, which are mainly involved in signaling. These findings agree with the functions of DN3 and DN4 cells, which are the transport of precursor receptor molecules to the cell surface membrane and the initiation of proliferation. This demonstrates that our method is able to identify functionally relevant gene sets even if the expression changes are not as large as, for example, those at the DPL stage. The complete results, including gene expression plots and analyses of GO-term and microRNA enrichment, can be found in our web database <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>B cell development (BCell)</p>
            </st>
            <p>In a similar approach to the TCell study, we investigated gene expression for five consecutive stages during B cell development (see the Methods section and <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp> for details). The correlation matrix of BCell suggests dependencies between gene expression values of successive stages, with the largest correlation between pre-BI and large pre-BII cells and between immature and mature B cells (see Table S2 in Additional data file <supplr sid="S3">3</supplr>). When we compare, as in the TCell set, our clustering results to those of Hoffmann and colleagues <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, we observe similar average developmental profiles, although the contingency table indicates differences in the cluster compositions (Table S13 in Additional data file <supplr sid="S3">3</supplr>). Clusters 3, 5 and 6, for example, contain genes that are up-regulated in pre-BI and large pre-BII cells and down-regulated in later developmental stages (Fig. <figr fid="F5">5</figr>). Consistent with the phenotype of these cells, the functions assigned to the genes of these clusters are mainly related to proliferation. GO categories that are associated with mitosis, cell-cycle and chromatin remodeling are clearly overrepresented in these clusters (see our web database <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Selected clusters from MixDTrees on BCell</p>
               </caption>
               <text>
                  <p><b>Selected clusters from MixDTrees on BCell</b>. We depict clusters 3, 5, 6 and 20 found in BCell, with expression values on the y-axis and cell types on the x-axis. Lines corresponding to developmental profile values between all stages are in red.</p>
               </text>
               <graphic file="1471-2172-8-25-5"/>
            </fig>
            <p>Cluster 20 shows an average developmental profile that was not detected with SOM <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. The genes of this cluster are down-regulated in pre-BI cells, in which the first rearrangement of the <it>D</it><sup><it>H </it></sup>and <it>J</it><sup><it>H </it></sup>segments on the <it>H </it>chain loci has taken place, and up-regulated in all the following developmental stages (Fig. <figr fid="F5">5</figr>). With SOM <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, these 23 genes are found distributed over the four clusters 11, 13, 14 and 17 (Table S12 in Additional data file <supplr sid="S3">3</supplr>). The most evident common function of many cluster 20 genes is the regulation of survival and apoptosis during B cell development. The gene products <it>Nfkbia</it>, <it>Traf5 </it>and the Src-family protein tyrosine kinases <it>Lyn </it>and <it>Syk </it>are known regulators of NF-kappa B activity, which in turn has been found to be involved in B cell fate decision and survival <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>. Similarly, Kruppel-like factor 2 (<it>Klf2</it>) protects cells against TNF-alpha induced apoptosis <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Furthermore, <it>Icam-2 </it>and <it>Rhoh</it>, whose encoding genes are two other members of cluster 20, regulate the adhesiveness of primary B cells depending on their activation state and protect them from apoptosis <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B42">42</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Lymphoid tree (LymphoidTree)</p>
            </st>
            <p>LymphoidTree combines data sets of several studies <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>, and the resulting tree contains expression measurements from lymphoid cells of six developmental stages, namely hematopoietic stem cells, pro-B, pre-B, and immature B cells, mature SP4 T cells, and natural killer (NK) cells. This integration of data is possible because the studies were carried out on the same array platform. Although the developmental tree is far less detailed compared to TCell and BCell, we still gain insights on differences between the cell lineages. As expected, the correlation matrix shows that the expression patterns of the three B cell stages are more highly correlated with each other than with the expression patterns of different lineages. Moreover, the overall expression of SP4 cells and NK cells is positively correlated. The resulting clusters provide a basis to hypothesize about early developmental decisions and suggest target genes for further investigations. For example, cluster 11 contains genes that are strongly up-regulated in NK cells, weakly induced in the SP4 cells and not expressed in the precursor B cells (Fig. <figr fid="F6">6</figr>). Many of the cluster 11 genes are well known to be expressed in NK cells, for example the cell surface receptor genes <it>Cd244</it>, <it>Klra1</it>, and <it>Crtam </it><abbrgrp><abbr bid="B33">33</abbr><abbr bid="B43">43</abbr></abbrgrp>. Among the lesser known genes is the one that codes for the Pu.1 related transcription factor SpiC, which has already been found to be temporarily expressed during B cell development <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. In contrast, cluster 19 contains genes that are up-regulated in SP4 cells and in all B cell precursors but not in NK cells (Fig. <figr fid="F6">6</figr>). 
Important functions during B and T cell maturation are reflected by genes in this cluster, such as the Bruton's tyrosine kinase gene <it>Btk</it>, the transcription factor <it>Pou2af1</it>, which is involved in immunoglobulin gene regulation, and the DNA repair genes <it>Trp53bp1 </it>and <it>Pnkp </it><abbrgrp><abbr bid="B33">33</abbr></abbrgrp>.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Selected clusters from MixDTrees on LymphoidTree</p>
               </caption>
               <text>
                  <p><b>Selected clusters from MixDTrees on LymphoidTree</b>. We depict clusters 11 and 19 found in LymphoidTree, with expression values on the y-axis and cell types on the x-axis. Lines corresponding to developmental profile values between stages HSC, pro-B, pre-B and immature B cell are in red, between HSC and NK cells in blue, and between HSC and SP4 cells in green.</p>
               </text>
               <graphic file="1471-2172-8-25-6"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Simulated data (SIM)</p>
            </st>
            <p>We demonstrate with simulated data that our novel method outperforms established methods, such as SOM, <it>k</it>-means and mixture of Gaussians, when inferring tree components in complex mixtures for varying levels of dependence between the individual variates. The dependence is reflected in the magnitude of <it>w</it><sub><it>u</it>|<it>v</it>, <it>k </it></sub>(Eq. 5) of a tree. By sampling these parameters from different intervals, [-<it>&#949;</it>, <it>&#949; </it>], [-0.5, 0.5], [-1, 1], [-1.0, -0.5] &#8746; [0.5, 1] and [-1, -1 + <it>&#949;</it>] &#8746; [1 &#8211; <it>&#949;</it>, 1], we can create mixtures with components ranging from independent models to highly dependent ones. We generate a data set for each sampled mixture. We use MixDTrees, mixture of Gaussians, <it>k</it>-means and SOM to compute clusters, which we compare to the classes used in data generation to compute the specificity and sensitivity of the clustering solutions. Method performance is evaluated with a paired <it>t</it>-test. Details are given in the Methods section.</p>
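            <p>The data generation described above can be sketched in Python as follows. This is an illustration under stated assumptions rather than the simulation protocol of this study: the weights are drawn uniformly from a union of equal-length intervals, profiles are drawn stage by stage from root to leaves with an assumed conditional mean mu_u + w_u|v (x_v - mu_v), and the topology, means, variances and function names are hypothetical.</p>

```python
import math
import random


def sample_weight(rng, intervals):
    """Draw a dependence weight from a union of equal-length intervals."""
    low, high = rng.choice(intervals)
    return rng.uniform(low, high)


def sample_profile(rng, order, parent, params):
    """Draw one expression profile from a dependence tree.

    order lists the stages with every parent before its children;
    params[u] = (mu, w, var) and parent[u] is None for the root.
    """
    x = {}
    for u in order:
        mu, w, var = params[u]
        v = parent[u]
        mean = mu if v is None else mu + w * (x[v] - params[v][0])
        x[u] = rng.gauss(mean, math.sqrt(var))
    return x


rng = random.Random(0)  # fixed seed so the sketch is reproducible
order = ["A", "B", "C", "D"]
parent = {"A": None, "B": "A", "C": "B", "D": "B"}
intervals = [(-1.0, -0.5), (0.5, 1.0)]  # corresponds to setting (4)

# Illustrative component: random means, weights from the chosen intervals.
params = {"A": (rng.uniform(-1.0, 1.0), 0.0, 0.02)}
for stage in order[1:]:
    params[stage] = (rng.uniform(-1.0, 1.0),
                     sample_weight(rng, intervals), 0.02)

profiles = [sample_profile(rng, order, parent, params) for _ in range(5)]
```

            <p>Repeating the last step for several components and concatenating the profiles, together with their component labels, would give a labeled data set against which a clustering can be scored.</p>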
            <p>We observe that MixDTrees with MAP estimates (MixDTrees-MAP) has a higher specificity and sensitivity than <it>k</it>-means and SOM in all experimental settings (Fig. <figr fid="F7">7</figr> top) (<it>p</it>-value below 0.005). In the (almost) independent case (<it>w</it><sub><it>u</it>|<it>v</it>, <it>k </it></sub>&#8712; [-<it>&#949;</it>, <it>&#949;</it>]), this is not expected, since the data agrees well with the assumptions of <it>k</it>-means and SOM. This also explains the large standard deviations of MixDTrees-MAP in that case. As expected, MixDTrees-MAP clearly improves the cluster recovery in settings with pronounced dependence structure, while the performance of <it>k</it>-means and SOM deteriorates slightly. In comparison to other mixture model methods (Fig. <figr fid="F7">7</figr> bottom), MixDTrees-MAP also obtains a significantly higher specificity and sensitivity in almost all experimental settings. The mixture of Gaussians with diagonal covariance matrices performs well in the independent case (1), which meets the model assumptions, but it has poor results in experiments with higher dependence (<it>p</it>-values below 0.05 for settings 3, 4 and 5). The mixture of Gaussians with full covariance matrix (MG-Full) has a reasonable sensitivity in all settings, but very poor specificity (<it>p</it>-value below 0.05 in settings 3, 4 and 5 for sensitivity and in all settings for specificity). The reason for these results is that MG-Full tends to populate some clusters with few data points, a problem known as spurious local maxima <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Note that we use a MAP estimate of MG-Full to mitigate this problem. Even though there are other methods for detection of spurious local maxima in MG-Full, which could lead to better specificity, these would require extensions of the EM method and would consequently slow convergence <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. 
On the other hand, MixDTrees, which has a lower computational running time than MG-Full, achieves good results without the need of any extension. MixDTrees with MLE estimates (MixDTrees-MLE) has good overall performance, but is outperformed by MixDTrees-MAP in all cases except experimental settings 1 and 5 (<it>p</it>-value below 0.05 for settings 2, 3 and 4). In experimental setting 5, where data is highly dependent by definition, both methods work similarly well. Nevertheless, such high dependency would hardly be found in real data sets, since noise in the data obfuscates dependencies between variables. Additionally, we performed further experiments with simulated data to evaluate the robustness of the method with respect to noise (see Additional data file <supplr sid="S1">1</supplr>). There, MixDTrees-MAP maintains good sensitivity and specificity of cluster recovery even for high noise levels.</p>
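            <p>Sensitivity and specificity of a clustering against known classes can be computed by counting pairs of objects. The Python sketch below uses one pair-counting convention that is assumed here, namely sensitivity = TP / (TP + FN) and specificity = TP / (TP + FP), where a true positive is a pair that shares both class and cluster; the exact definitions used in this study are given in the Methods section, and the function names and labels are hypothetical.</p>

```python
from itertools import combinations


def pair_counts(classes, clusters):
    """Count object pairs by agreement between class and cluster labels."""
    tp = fp = fn = tn = 0
    for i, j in combinations(range(len(classes)), 2):
        same_class = classes[i] == classes[j]
        same_cluster = clusters[i] == clusters[j]
        if same_class and same_cluster:
            tp += 1  # correctly co-clustered pair
        elif same_cluster:
            fp += 1  # co-clustered, but from different classes
        elif same_class:
            fn += 1  # same class, but split across clusters
        else:
            tn += 1  # correctly separated pair
    return tp, fp, fn, tn


def sensitivity(tp, fp, fn, tn):
    """Fraction of same-class pairs that were placed in one cluster."""
    return tp / (tp + fn)


def specificity(tp, fp, fn, tn):
    """Fraction of co-clustered pairs that share a class (assumed convention)."""
    return tp / (tp + fp)


# Toy labels: one object of class 0 is wrongly placed in cluster 1.
true_classes = [0, 0, 0, 1, 1, 1]
found_clusters = [0, 0, 1, 1, 1, 1]
counts = pair_counts(true_classes, found_clusters)
```

            <p>Under this convention, a method that creates many small clusters loses sensitivity because same-class pairs are split, whereas a method that merges classes loses specificity.</p>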
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Results of SIM</p>
               </caption>
               <text>
                  <p><b>Results of SIM</b>. We display the mean sensitivity (left plots) and mean specificity (right plots) against five experimental settings: (1) <it>w</it><sub><it>u</it>|<it>v</it>, <it>k </it></sub>&#8712; [-<it>&#949;</it>, <it>&#949; </it>] (independent data), (2) <it>w</it><sub><it>u</it>|<it>v</it>, <it>k </it></sub>&#8712; [-0.5, 0.5], (3) <it>w</it><sub><it>u</it>|<it>v</it>, <it>k </it></sub>&#8712; [-1, 1], (4) <it>w</it><sub><it>u</it>|<it>v</it>, <it>k </it></sub>&#8712; [-1.0, -0.5] &#8746; [0.5, 1] and (5) <it>w</it><sub><it>u</it>|<it>v</it>, <it>k </it></sub>&#8712; [-1, -1 + <it>&#949;</it>] &#8746; [1 &#8211; <it>&#949;</it>, 1]. The dependence increases with the setting number. In the top plots, <it>k</it>-means results are displayed in blue, SOM in green and the mixture of dependence trees with MAP estimation (MixDTrees-MAP) in red. In the bottom plots, the mixture of Gaussians with full covariance matrices is displayed in yellow, the mixture of Gaussians with diagonal covariance matrices in purple, the mixture of dependence trees with MLE estimation (MixDTrees-MLE) in light blue and the mixture of dependence trees with MAP estimation (MixDTrees-MAP) in red.</p>
               </text>
               <graphic file="1471-2172-8-25-7"/>
            </fig>
            <p>These results indicate that MixDTrees outperforms SOM and <it>k</it>-means in all simulated settings. In relation to other mixture models, MixDTrees represents a good tradeoff between a complex model class, such as the multivariate Gaussian with full covariance matrices, and the simple Gaussian with diagonal covariance matrices. Furthermore, MAP estimates of the MixDTrees represent a more robust alternative to their MLE counterpart.</p>
         </sec>
         <sec>
            <st>
               <p>MicroRNA target discovery</p>
            </st>
            <p>LympMIR contains a set of 17 microRNAs that are potentially involved in lymphocyte cell development (for details see Methods section). It has been proposed that microRNAs bind target mRNAs specifically via base pairing, which subsequently leads to interference with the translational machinery or to mRNA degradation, and that they can thus control whole groups of genes simultaneously <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Recent microarray studies have demonstrated that microRNA expression correlates negatively with mRNA target expression in a tissue-specific manner <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>.</p>
            <p>Having identified a cluster of co-expressed genes during lymphoid development, we ask whether a certain microRNA could be a potential regulator of this cluster (see Fig. <figr fid="F8">8</figr>). For this task we first obtain lists of potential target genes for each microRNA from the miRBase Targets database <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, which contains predictions made by sequence-based methods. Given our clustering results, we use the statistic of the Chi-Square Test <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> to obtain a list of microRNAs whose potential targets are overrepresented in a cluster. This approach is analogous to finding Gene Ontology <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> terms overrepresented in a cluster of genes. Given a set of <it>n </it>genes, we count the number <it>c </it>of genes in a given cluster, the number <it>t </it>of genes identified as targets for a given microRNA and the number <it>h </it>of genes that are both in the cluster and targets of the microRNA. The resulting <it>p</it>-value reflects the statistical significance of observing a count <it>h</it>, given <it>n</it>, <it>c </it>and <it>t</it>. A lower <it>p</it>-value indicates a higher "microRNA enrichment" and, consequently, a better result. By choosing a <it>p</it>-value cutoff, we can construct a list of enriched microRNAs for each cluster as well as a list of target genes related to the enriched microRNAs. Note that the statistics for microRNA binding are not well developed; intricate dependencies introduced by sequence similarities among the microRNAs and the target genes exist and complicate the analysis. As we also consider a manually selected set of microRNAs, we choose a somewhat relaxed <it>p</it>-value cutoff, forgoing multiple testing corrections <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, and follow it with a careful biological evaluation. 
For the following discussions we restrict our result set to clusters that contain at least four target genes in total.</p>
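            <p>The enrichment computation described above can be sketched as follows. This is a minimal illustration rather than the implementation used in this study: it assumes a Pearson chi-square statistic on the 2 x 2 table (in cluster or not, target or not) with one degree of freedom and no continuity correction, neither of which is specified in the text. The function name <it>chi_square_enrichment</it> and the returned overrepresentation flag are hypothetical.</p>
            <p><![CDATA[
```python
import math


def chi_square_enrichment(n, c, t, h):
    """Chi-square test for overlap between a cluster and a microRNA target list.

    n: total number of genes, c: genes in the cluster,
    t: predicted targets of the microRNA, h: genes that are both.
    Returns (statistic, p_value, overrepresented).
    Assumption: plain Pearson statistic, 1 degree of freedom, no correction.
    """
    in_target = h                  # in cluster, target
    in_other = c - h               # in cluster, not a target
    out_target = t - h             # outside cluster, target
    out_other = n - c - t + h      # outside cluster, not a target
    if min(in_target, in_other, out_target, out_other) < 0:
        raise ValueError("counts n, c, t, h are inconsistent")
    margins = (c, n - c, t, n - t)
    if 0 in margins:
        raise ValueError("degenerate table: a row or column total is zero")

    # Closed form of the Pearson statistic for a 2 x 2 table.
    cross = in_target * out_other - in_other * out_target
    statistic = n * cross ** 2 / (margins[0] * margins[1] * margins[2] * margins[3])

    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2.0))

    # The statistic is symmetric, so check the direction of the deviation.
    overrepresented = h > c * t / n
    return statistic, p_value, overrepresented
```
]]></p>
            <p>Because the statistic does not distinguish over- from underrepresentation, the sketch reports the direction separately; a microRNA would be listed for a cluster only if the <it>p</it>-value is below the chosen cutoff and the flag is set.</p>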
            <fig id="F8">
               <title>
                  <p>Figure 8</p>
               </title>
               <caption>
                  <p>Strategy to identify enriched microRNAs</p>
               </caption>
               <text>
                  <p><b>Strategy to identify enriched microRNAs</b>. Strategy to identify microRNAs and their target genes overrepresented in groups of co-expressed genes (indicated left) as part of a post-transcriptional regulatory mechanism. In the middle, mRNAs clustered according to our mixture results are depicted, and potential microRNA binding sites in their 3'UTRs are symbolized.</p>
               </text>
               <graphic file="1471-2172-8-25-8"/>
            </fig>
            <p>In summary, in TCell our target prediction scheme detects significant enrichment for eleven out of the 17 initial microRNAs in four out of the 20 clusters (Table <tblr tid="T1">1</tblr>). In these four clusters we detect in total 35 candidate target genes, which is a considerable reduction of the set of 229 targets that have been predicted by sequence-based methods alone <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. For BCell these numbers are, respectively, eleven out of the 17 microRNAs, four out of the 20 clusters, and 29 out of the 273 predicted targets (Table <tblr tid="T1">1</tblr>). In particular, we find the five microRNA families miR-15, miR-181, miR-221, miR-26, and miR-142-3p to be enriched in both TCell and BCell by our criterion. See Table S6 in Additional data file <supplr sid="S3">3</supplr> for microRNA enrichment in LymphoidTree and Tables S7, S8 and S9 for <it>p</it>-values of microRNA enrichment of all data sets. As mentioned earlier, the BCell clusters 3, 5, and 6 show a similar expression profile. We find that cluster 5 of the results of the TCell set overlaps substantially with clusters 3 and 5 of BCell (Table <tblr tid="T1">1</tblr>). In TCell cluster 5 we find miR-15a, miR-181a, miR-26a, miR-24, and miR-221 as potential regulators and 20 potential target genes, seven of which are also present among the 18 BCell candidate genes of clusters 3 and 5. The developmental profiles of the clusters of both lineages show strikingly analogous phenotypical features, namely up-regulation in the proliferating large cell populations (DN4, DPL, large pre-BII) and from then on strict down-regulation. In TCell cluster 5 there are eight target genes and in the BCell clusters 3 and 5 there are nine target genes that are known to be involved in DNA metabolism, cell cycle and mitosis (Table <tblr tid="T1">1</tblr>). 
This suggests a regulatory role for the identified microRNAs in reducing the transcript levels of genes that are important for cell proliferation. This is supported by the fact that a similar role for microRNAs was found in Drosophila germline stem cells <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>List of LympMIR enriched in the clusters from MixDTrees on data sets TCell and BCell</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>Cluster ID</p>
                     </c>
                     <c ca="left">
                        <p>MicroRNA</p>
                     </c>
                     <c ca="left">
                        <p>Target Genes</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TCell 3</p>
                     </c>
                     <c ca="left">
                        <p>miR-222</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Elovl6, Nme1, Rcn1, Rps3</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TCell 5</p>
                     </c>
                     <c ca="left">
                        <p>miR-15a<sup>1</sup>, miR-181a<sup>2</sup>, miR-221<sup>3</sup>,</p>
                     </c>
                     <c ca="left">
                        <p><it>2410015N17Rik</it><sup>4</sup>, <it>Alad</it><sup>1,4</sup>, <it>Atpif1</it><sup>1,5</sup>, <b><it>Aurkb</it></b><sup>2</sup>, <b><it>Cdc25a</it></b><sup>1</sup>, <b><it>Chek1</it></b><sup>1</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>miR-24<sup>4</sup>, miR-26a<sup>5</sup></p>
                     </c>
                     <c ca="left">
                        <p><b><it>Cks1b</it></b><sup>2,4</sup>, <b><it>Cks2</it></b><sup>5</sup>, <it>Eed</it><sup>2</sup>, <b><it>H2afx</it></b><sup>4</sup>, <it>Kpnb1</it><sup>3</sup>, <b><it>Mcm5</it></b><sup>3</sup>, <it>Nasp</it><sup>3,5</sup>, <it>Pex7</it><sup>2</sup>, <it>Psmd12</it><sup>2</sup>, <it>Ranbp5</it><sup>2</sup>, <it>Rars</it><sup>1</sup>, <b><it>Tk1</it></b><sup>3</sup>, <it>Trip13</it><sup>1</sup>, <it>Uchl5</it><sup>5</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TCell 10</p>
                     </c>
                     <c ca="left">
                        <p>miR-142-3p<sup>6</sup>, miR-150<sup>7</sup></p>
                     </c>
                     <c ca="left">
                        <p><it>Gfi1</it><sup>6</sup>, <it>Marcks</it><sup>6</sup>, <it>Msh6</it><sup>6</sup>, <it>Pp11r</it><sup>7</sup>, <it>Psmc1</it><sup>6,7</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TCell 11</p>
                     </c>
                     <c ca="left">
                        <p>miR-146<sup>8</sup>, miR-16<sup>9</sup>, miR-181b<sup>10</sup></p>
                     </c>
                     <c ca="left">
                        <p><it>Atp1b3</it><sup>10</sup>, <it>Ipo4</it><sup>9</sup>, <it>Klhdc2</it><sup>10</sup>, <it>Mrpl30</it><sup>8</sup>, <it>Orc5l</it><sup>8</sup>, <it>Tuba4</it><sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BCell 3</p>
                     </c>
                     <c ca="left">
                        <p>miR-181b<sup>1</sup>, miR-181c<sup>2</sup>, miR-26a<sup>3</sup></p>
                     </c>
                     <c ca="left">
                        <p><it>Atpif1</it><sup>3</sup>, <b><it>Aurkb</it></b><sup>1,2</sup>, <it>Cbx1</it><sup>3</sup>, <b><it>Cdc45l</it></b><sup>2</sup>, <b><it>Cks1b</it></b><sup>1,2</sup>, <b><it>Cks2</it></b><sup>3</sup>, <it>Cox5a</it><sup>3</sup>, <it>Hmgb2</it><sup>1,2</sup>, <it>Melk</it><sup>1,2</sup>, <b><it>Ttk</it></b><sup>1,2</sup>, <it>Uchl5</it><sup>3</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BCell 5</p>
                     </c>
                     <c ca="left">
                        <p>miR-15a<sup>4</sup>, miR-15b<sup>5</sup>, miR-221<sup>6</sup>,</p>
                     </c>
                     <c ca="left">
                        <p><b><it>Cdca4</it></b><sup>4,5</sup>, <b><it>Chek1</it></b><sup>4,5</sup>, <b><it>Mcm4</it></b><sup>7</sup>, <it>Nasp</it><sup>6</sup>, <it>Nfyb</it><sup>6</sup>, <b><it>Smc4l1</it></b><sup>7</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>miR-223<sup>7</sup></p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tuba2</it>
                           <sup>4,5,7</sup>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BCell 6</p>
                     </c>
                     <c ca="left">
                        <p>miR-155<sup>8</sup>, miR-191<sup>9</sup></p>
                     </c>
                     <c ca="left">
                        <p><b><it>Ctps</it></b><sup>9</sup>, <it>Ddx1</it><sup>8</sup>, <it>Hint1</it><sup>9</sup>, <b><it>Mcm2</it></b><sup>8</sup>, <it>Phf17</it><sup>8</sup>, <it>Prdx4</it><sup>9</sup>, <it>Snrpd1</it><sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BCell 19</p>
                     </c>
                     <c ca="left">
                        <p>miR-142-3p<sup>14</sup>, miR-342<sup>15</sup></p>
                     </c>
                     <c ca="left">
                        <p><it>2410002F23Rik</it><sup>14</sup>, <it>H2-Eb1</it><sup>14</sup>, <it>Ltb</it><sup>15</sup>, <it>Tap2</it><sup>14,15</sup></p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>We display the cluster and data set ID, the list of microRNAs and the list of target genes, with <it>p</it>-values <it>&lt;</it>0.05 and at least four target genes per cluster. Genes involved in cell proliferation or DNA repair are depicted in bold. The indices indicate to which microRNA a gene is related when there is more than one enriched microRNA in a cluster.</p>
               </tblfn>
            </tbl>
            <p>At the individual gene level we identify some candidate microRNA targets for further detailed analysis: the three known genes (<it>H2-Eb1</it>, <it>Ltb</it>, <it>Tap2</it>) of BCell cluster 19 are all involved in antigen presentation by MHC class II molecules <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B54">54</abbr></abbrgrp>. In the context of the cell cycle, <it>Chek1 </it>(clusters TCell 5 and BCell 5) and <it>Cdc25a </it>(cluster TCell 5) are important for the transition between G1/S and G2/M phases <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>.</p>
            <p>Furthermore, both genes are candidate targets of the same microRNA, miR-15a, which is related to apoptosis in chronic lymphoid leukemia cells <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. Another interesting gene codes for the nuclear factor Y (<it>Nfyb</it>; cluster BCell 5), which regulates <it>Hoxb4 </it><abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, <it>Cdc34 </it><abbrgrp><abbr bid="B58">58</abbr></abbrgrp> and the major histocompatibility complex in mice <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. These are all important genes for lymphoid development. The mRNA of the growth factor independence-1 transcription factor (<it>Gfi1</it>; cluster TCell 10) is a potential target of miR-142-3p with a function in the restriction of cell proliferation and maintenance of the functional integrity of lymphocyte cells <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. Moreover, <it>Gfi1 </it>is implicated in the transition from CD4/CD8 double negative to double positive T cells <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>.</p>
            <p>In order to relate our approach to <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>, we also perform a microRNA enrichment analysis with the results of SOM (see Table S4 and S5 in the Additional data file <supplr sid="S3">3</supplr>). In TCell there is little overlap between the microRNA targets, with the exception of SOM cluster 6, which is a subset of the target genes from cluster 5 from MixDTrees. We also compare the <it>p</it>-values obtained by both methods in a procedure similar to the one performed in <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. For TCell, MixDTrees results in lower <it>p</it>-values in nine out of 14 microRNAs (see Fig. S5 in Additional data file <supplr sid="S2">2</supplr>). In BCell, gene targets found with SOM are partially a subset of the ones encountered with MixDTrees; 14 out of 24 target genes in BCell SOM are also detected by MixDTrees (Table S5 in the Additional data file <supplr sid="S3">3</supplr>). For BCell (Fig. S6 in Additional data file <supplr sid="S2">2</supplr>), MixDTrees obtains lower <it>p</it>-values in 8 out of 14 microRNAs. Even though SOM obtains lower <it>p</it>-values for microRNAs found to be enriched with both methods, MixDTrees detects seven enriched microRNAs not significantly enriched in SOM. An inspection of the cumulative distribution function of these <it>p</it>-values also reinforces the view that MixDTrees is more sensitive in detecting enriched microRNAs than SOM in BCell (Fig. S8 in Additional data file <supplr sid="S2">2</supplr>). Overall, the results suggest a higher sensitivity of MixDTrees-MAP in finding groups of microRNA targets sharing similar expression patterns compared to SOM. Additionally, we performed a microRNA enrichment <it>p</it>-value comparison between MixDTrees-MAP and MixDTrees-MLE for both data sets (see Additional data file <supplr sid="S2">2</supplr> Fig. S9 and S10). 
For TCell, MixDTrees-MAP achieves a higher enrichment for nine out of 14 microRNAs, while for BCell it does so for six out of 13 microRNAs. In summary, clusters computed according to MAP have an increased enrichment for TCell and a slightly lowered enrichment for BCell. A manual inspection of the contingency table comparing the clusters from MAP and MLE (Additional data file <supplr sid="S3">3</supplr> Table S15) and of the cluster size distributions (Additional data file <supplr sid="S2">2</supplr> Fig. S11) shows that MixDTrees-MLE has a tendency to produce spurious, small clusters as a result of over-fitting, a known disadvantage of MLE estimates <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Note that the resulting <it>p</it>-values decrease drastically as a function of the cluster size, making a clustering which joins clusters appear preferable. Enrichment analysis should be used cautiously to compare clusterings if the cluster size distributions are not similar, as is the case for the MLE results. This and the results on simulated data support our preference for MixDTrees-MAP over MixDTrees-MLE.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The regulatory processes behind cell proliferation and differentiation are of central interest to developmental biologists and clinicians alike and are frequently the focus of large-scale studies to investigate gene expression along paths of differentiation. To make full use of this data in a principled manner we present a novel statistical framework which models gene expression in the course of development. By combining the dependence trees in a classical mixture model, we facilitate interactive querying and visualization of data and, more importantly, the detection of possibly overlapping clusters of co-expressed genes, which provide a basis for the identification of key players in the regulatory mechanism and their mode of action.</p>
         <p>In particular, we detect interesting groups of genes not found by classical clustering methods such as SOM. By incorporating microRNA binding data, we show how to identify complex regulatory relationships. Compared to an analysis based only on sequence, we predict a manageable number of plausible microRNA targets. Moreover, our method offers some insights into the biological role of predicted microRNAs, by the inspection of the developmental profiles of gene targets associated with one microRNA. A comparison with SOM indicates that our approach is more sensitive for finding co-expressed genes on which the same microRNA can have a regulatory effect.</p>
         <p>Extensions to accommodate further types of data are straightforward. Binding sites of transcription factors can be analyzed in complete analogy to the microRNA analysis. If expression levels of microRNAs in the developmental stages investigated in TCell or BCell were available, we could incorporate a target prediction framework <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. Furthermore, we can simply apply established techniques <abbrgrp><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr><abbr bid="B65">65</abbr><abbr bid="B66">66</abbr></abbrgrp> to extend our mixture model to integrate heterogeneous data (sequence information, protein interaction, genotype and phenotype data), and semi-supervised extensions to mixture estimation can be applied to make use of biological knowledge about functional similarities and regulatory relationships <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B67">67</abbr><abbr bid="B68">68</abbr></abbrgrp>. This is highly relevant because, in contrast to the automated inference of regulatory networks, the identification of regulatory modules is actually feasible <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>. Once a statistical model is obtained, further detailed questions about the significance of differences, or the most likely stage at which differentiation occurs, can be easily answered.</p>
         <p>Fascinating extensions are possible, even when one only considers gene expression data and the basic method. None of the currently publicly available data sets offers both a tree with a large number of branches and a detailed view of all, in particular early, development stages (<abbrgrp><abbr bid="B70">70</abbr></abbrgrp> concentrates on mature and immature cells in final development stages); combining data from several microarray platforms suffers from the usual problems. Hence, we concentrate on two smaller but detailed studies covering several stages of T cell and B cell development <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>, and a tree containing three lineages of lymphoid cells. Note that in the latter several cell types of intermediary development stages are not measured. Nevertheless, our analysis indicates that our method takes advantage of the tree structure information in detecting relevant differences of gene expression in these lineages. This also reinforces the importance of the creation of expression compendia, such as the one in <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>, where many intermediary stages of differentiation of the developmental tree are also present. Such data will be of great value as computational methods <it>can </it>exploit characteristics intrinsic to cell development.</p>
         <p>Lastly, developmental biologists are still redrawing developmental trees with the discovery of new intermediary stages and "alternative" paths of development <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>; a particular developmental stage might also be formed by a mixture of distinct cell types that are not yet well characterized. As an example of an alternative path, there is evidence that DN1 T cells can originate not only from the lymphoid progenitor as depicted in Fig. <figr fid="F1">1</figr>, but also from the earlier multipotent progenitor cells <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. It is an exciting prospect to infer branches and stages of a developmental tree from gene expression data, ideally per functional module. This structure learning (see <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> for discrete data) can be incorporated in the EM-based parameter estimation. In conclusion, our results suggest that the mixture of dependence trees provides a natural and powerful representation of developmental gene expression data. Furthermore, our results reinforce the importance of creating detailed and heterogeneous data sets to help elucidate the regulatory mechanisms of development.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Data</p>
            </st>
            <p>Our work concentrates on two detailed studies covering several stages of B and T cell development <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp> and a tree containing three lineages of lymphoid cells <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. All gene expression data sets analyzed are deposited at the Gene Expression Omnibus <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. Their accession entries are: GDS44 and GDS52 for BCell, GDS237 and GDS257 for TCell, and GDS1077 (HSC), GSE2227 (B cells) and GDS828 (NK and SP4) for the LymphoidTree data. Final normalized and filtered data sets are found in <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Furthermore, we also use simulated data sets in order to evaluate the method. Finally, we describe a set of microRNAs that are used in our study.</p>
            <sec>
               <st>
                  <p>T cell development (TCell)</p>
               </st>
               <p>This data set contains measurements of gene expression during the development of T cells in mouse <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Based on cell surface markers, seven stages have been distinguished: CD4 and CD8 double negatives (DN2, DN3, DN4), large double positives (DPL), small double positives (DPS), single positive CD4 (SP4) and single positive CD8 (SP8) (see Fig. <figr fid="F1">1</figr> for the corresponding tree, and the original publication for details <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>). Affymetrix MU11k chips with four or five replicates were used to measure the expression levels of 13,104 mouse genes. We perform variance stabilization <abbrgrp><abbr bid="B72">72</abbr></abbrgrp> on all chips, and compute the median values of replicates. To facilitate comparisons, we restrict the set to the same list of 1318 differentially expressed genes that was used by Hoffmann and colleagues <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Furthermore, we normalize the expression levels separately for each gene to mean zero and standard deviation one, as is routine in gene expression analysis. Finally, we map each probe set to a gene symbol if it exists in the respective chip platform annotation provided by the GEO database <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>. The final data set is provided in Additional data file <supplr sid="S4">4</supplr>.</p>
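               <p>The replicate summarization and per-gene normalization steps can be illustrated by the short sketch below. It is a sketch under stated assumptions, not the pipeline used here: variance stabilization is taken as already applied, the population standard deviation is used (the text does not say whether the population or the sample estimate is meant), and the function names are hypothetical.</p>
               <p><![CDATA[
```python
import statistics


def median_of_replicates(replicates):
    """Summarize the replicate measurements of one gene at one stage."""
    return statistics.median(replicates)


def standardize_profile(profile):
    """Scale one gene's profile across stages to mean 0 and standard deviation 1.

    Assumption: population standard deviation (statistics.pstdev).
    """
    mean = statistics.fmean(profile)
    sd = statistics.pstdev(profile)
    if sd == 0:
        raise ValueError("constant profile cannot be standardized")
    return [(value - mean) / sd for value in profile]


def preprocess_gene(replicates_per_stage):
    """replicates_per_stage: one list of replicate values per developmental stage."""
    profile = [median_of_replicates(reps) for reps in replicates_per_stage]
    return standardize_profile(profile)
```
]]></p>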
               <suppl id="S4">
                  <title>
                     <p>Additional data file 4</p>
                  </title>
                  <text>
                     <p><b>TCell Dataset</b>. Data set after filtering and normalization procedures. The second column indicates the cluster assignment found by the MixDTrees.</p>
                  </text>
                  <file name="1471-2172-8-25-S4.txt">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
            <sec>
               <st>
                  <p>B cell development (BCell)</p>
               </st>
               <p>This data set contains expression levels of five consecutive stages of the B cell lineage: Pre-BI, large Pre-BII, small Pre-BII, immature, and mature B cells <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. This study was also conducted on Affymetrix MU11k chips, and we pre-process the data exactly as described for TCell. The final data set is provided in Additional data file <supplr sid="S5">5</supplr>.</p>
               <suppl id="S5">
                  <title>
                     <p>Additional data file 5</p>
                  </title>
                  <text>
                     <p><b>BCell Dataset</b>. Data set after filtering and normalization procedures. The second column indicates the cluster assignment found by the MixDTrees.</p>
                  </text>
                  <file name="1471-2172-8-25-S5.txt">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
            <sec>
               <st>
                  <p>Lymphoid tree (LymphoidTree)</p>
               </st>
               <p>We combine the data of the wild-type control measurements of three studies <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp> based on the Affymetrix U74 platform to obtain a development tree with distinct lymphoid lineages. This results in expression values of a hematopoietic stem cell (pHSC) from <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, of Natural Killer cells (NK) and of SP4 cells from <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, and of three B cell stages from <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, which are pro-B, pre-B and immature B cells. We pre-process the data exactly as described for TCell. Additionally, we remove genes which do not display a 2-fold change in expression at least once. The final data set is provided in Additional data file <supplr sid="S6">6</supplr>.</p>
               <suppl id="S6">
                  <title>
                     <p>Additional data file 6</p>
                  </title>
                  <text>
                     <p><b>LymphoidTree Dataset</b>. Data set after filtering and normalization procedures. The second column indicates the cluster assignment found by the MixDTrees.</p>
                  </text>
                  <file name="1471-2172-8-25-S6.txt">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
            <sec>
               <st>
                  <p>Simulated data (SIM)</p>
               </st>
               <p>We use MixDTrees with random parameterizations to generate simulated data. For the tree structure given in Fig. <figr fid="F2">2</figr>, we randomly chose the <it>&#956;</it><sub><it>u</it>|<it>v</it>, <it>k </it></sub>from the range [-1.5, 1.5] and <inline-formula><m:math name="1471-2172-8-25-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mrow><m:mi>u</m:mi><m:mo>|</m:mo><m:mi>v</m:mi><m:mo>,</m:mo><m:mi>k</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemyDauNaeiiFaWNaemODayNaeiilaWIaem4AaSgabaGaeGOmaidaaaaa@363C@</m:annotation></m:semantics></m:math></inline-formula> from [0, 1]. We create five experimental settings to inspect the performance of the method in the presence of distinct levels of dependence. For these five settings, we sample <it>w</it><sub><it>u</it>|<it>v</it>, <it>k </it></sub>uniformly from [-<it>&#949;</it>, <it>&#949; </it>] (independent data), [-0.5, 0.5], [-1, 1], [-1.0, -0.5] &#8746;[0.5, 1] and [-1, -1 + <it>&#949; </it>] &#8746; [1 &#8211; <it>&#949;</it>, 1] (tree dependent data) respectively, where <it>&#949; </it>= 0.01. We chose <it>K </it>= 5 and mixture coefficients equal to <it>&#945; </it>= (0.1, 0.15, 0.2, 0.2, 0.35). For each experimental setting, we generate ten such mixtures, and sample 500 development profiles from each (see Additional data file <supplr sid="S1">1</supplr> for more results on simulated data and Additional data file <supplr sid="S7">7</supplr> for datasets). To evaluate the results we compare the class information from the data generation to compute sensitivity, <inline-formula><m:math name="1471-2172-8-25-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mfrac><m:mrow><m:mo>#</m:mo><m:mi>T</m:mi><m:mi>P</m:mi></m:mrow><m:mrow><m:mo>#</m:mo><m:mi>T</m:mi><m:mi>P</m:mi><m:mo>+</m:mo><m:mo>#</m:mo><m:mi>F</m:mi><m:mi>N</m:mi></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaWcaaqaaiabcocaJiabdsfaujabdcfaqbqaaiabcocaJiabdsfaujabdcfaqjabgUcaRiabcocaJiabdAeagjabd6eaobaaaaa@36F6@</m:annotation></m:semantics></m:math></inline-formula>, and specificity, <inline-formula><m:math name="1471-2172-8-25-i14" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mfrac><m:mrow><m:mo>#</m:mo><m:mi>T</m:mi><m:mi>P</m:mi></m:mrow><m:mrow><m:mo>#</m:mo><m:mi>T</m:mi><m:mi>P</m:mi><m:mo>+</m:mo><m:mo>#</m:mo><m:mi>F</m:mi><m:mi>P</m:mi></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaWcaaqaaiabcocaJiabdsfaujabdcfaqbqaaiabcocaJiabdsfaujabdcfaqjabgUcaRiabcocaJiabdAeagjabdcfaqbaaaaa@36FA@</m:annotation></m:semantics></m:math></inline-formula>, where, for a given clustering result and the class information, <it>TP </it>denotes the number of pairs of objects in the same cluster and same class. The remaining three types of pairs are counted as <it>FP </it>(same cluster, distinct class), <it>TN </it>(distinct cluster and class) and <it>FN </it>(distinct cluster, same class). For each method, we compute the sensitivity and specificity on all 10 data sets of an experimental setting and take the mean (see Fig. <figr fid="F7">7</figr>). To compare MixDTrees-MAP with other methods, we apply a one-tailed paired <it>t</it>-test to evaluate the null hypothesis that the two methods have the same mean specificity (or sensitivity) in a given experimental setting. Low <it>p</it>-values indicate that the equal-means hypothesis was rejected and that the mean specificity (or sensitivity) was significantly higher for MixDTrees-MAP. For brevity, in the Simulated data section, we simply state "MixDTrees-MAP had a higher sensitivity than method X (<it>p</it>-value below 0.05)" when the null hypothesis is rejected.</p>
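               <p>The pair-counting evaluation described above can be sketched as follows. This is a minimal illustration, not the code used for the reported results; the function names <it>pair_counts</it> and <it>sensitivity_specificity</it> are assumptions introduced here, and the specificity follows the definition given in this section, #TP/(#TP + #FP).</p>
               <p><![CDATA[
```python
from itertools import combinations


def pair_counts(clusters, classes):
    """Count TP, FP, TN and FN over all unordered pairs of objects.

    clusters[i] is the cluster label and classes[i] the true class of object i.
    """
    tp = fp = tn = fn = 0
    for i, j in combinations(range(len(clusters)), 2):
        same_cluster = clusters[i] == clusters[j]
        same_class = classes[i] == classes[j]
        if same_cluster and same_class:
            tp += 1  # same cluster, same class
        elif same_cluster:
            fp += 1  # same cluster, distinct class
        elif same_class:
            fn += 1  # distinct cluster, same class
        else:
            tn += 1  # distinct cluster and class
    return tp, fp, tn, fn


def sensitivity_specificity(clusters, classes):
    """Return (sensitivity, specificity) as defined in the text."""
    tp, fp, tn, fn = pair_counts(clusters, classes)
    sensitivity = tp / (tp + fn) if tp + fn else float("nan")
    specificity = tp / (tp + fp) if tp + fp else float("nan")
    return sensitivity, specificity
```
]]></p>
               <p>The loop visits every pair once, so its cost grows quadratically with the number of objects; for 500 profiles per mixture this is about 125,000 pairs.</p>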
               <suppl id="S7">
                  <title>
                     <p>Additional data file 7</p>
                  </title>
                  <text>
                     <p><b>SIM Datasets</b>. Data sets from simulated MixDTrees. See readme.txt for file descriptions. The first column indicates the true label of the sample.</p>
                  </text>
                  <file name="1471-2172-8-25-S7.zip">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
            <sec>
               <st>
                  <p>Lymphoid development related microRNAs (LympMIR)</p>
               </st>
               <p>We collect 17 microRNAs that have been found to be involved in lymphoid development or to be at least differentially expressed between distinguishable lymphocyte cell types <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B56">56</abbr><abbr bid="B74">74</abbr></abbrgrp>: mmu-miR-24, mmu-miR-26a, mmu-miR-142-3p, mmu-miR-146, mmu-miR-150, mmu-miR-155, mmu-miR-181a, mmu-miR-181b, mmu-miR-181c, mmu-miR-191, mmu-miR-221, mmu-miR-222, mmu-miR-223 and mmu-miR-342. Additionally, we include mmu-miR-15a, mmu-miR-15b, and mmu-miR-16 because, according to recent papers, they participate in the regulation of cell proliferation and apoptosis <abbrgrp><abbr bid="B75">75</abbr><abbr bid="B76">76</abbr></abbrgrp>. Since we refer exclusively to mouse microRNAs in this work, the species prefix mmu is omitted throughout the text. The lists of candidate targets of these microRNAs were obtained from the miRBase Targets database <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> (Version 2.0), which uses the miRanda algorithm <abbrgrp><abbr bid="B77">77</abbr></abbrgrp> to search for possible microRNA binding sites in the gene sequences.</p>
            </sec>
            <sec>
               <st>
                  <p>Mixtures of dependence trees estimation</p>
               </st>
               <p>We combine <it>K </it>DTrees in a mixture <inline-formula><m:math name="1471-2172-8-25-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>f</m:mi><m:mo stretchy="false">(</m:mo><m:mi>x</m:mi><m:mo>|</m:mo><m:mi>&#920;</m:mi><m:mo stretchy="false">)</m:mo><m:mo>=</m:mo><m:mstyle displaystyle="true"><m:msubsup><m:mo>&#8721;</m:mo><m:mrow><m:mi>k</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>K</m:mi></m:msubsup><m:mrow><m:msub><m:mi mathvariant="script">&#945;</m:mi><m:mi>k</m:mi></m:msub><m:mi>p</m:mi><m:mo stretchy="false">[</m:mo><m:mi>x</m:mi><m:mo>|</m:mo><m:msub><m:mi>&#952;</m:mi><m:mi>k</m:mi></m:msub><m:mo stretchy="false">]</m:mo></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGMbGzcqGGOaakcqWG4baEcqGG8baFcqqHyoqucqGGPaqkcqGH9aqpdaaeWaqaaGGaciab=f7aHnaaBaaaleaacqWGRbWAaeqaaOGaemiCaaNaei4waSLaemiEaGNaeiiFaWNae8hUde3aaSbaaSqaaiabdUgaRbqabaGccqGGDbqxaSqaaiabdUgaRjabg2da9iabigdaXaqaaiabdUealbqdcqGHris5aaaa@4902@</m:annotation></m:semantics></m:math></inline-formula>, where &#920; = (<it>&#952;</it><sub>1</sub>, ..., <it>&#952;</it><sub><it>K</it></sub>, <it>&#945;</it><sub>1</sub>, ..., <it>&#945;</it><sub><it>K</it></sub>), <it>&#952;</it><sub><it>k </it></sub>denotes the parameter set of the <it>k</it>-th DTree and <it>&#945;</it><sub><it>k </it></sub>&#8805; 0, <inline-formula><m:math name="1471-2172-8-25-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mstyle displaystyle="true"><m:msubsup><m:mo>&#8721;</m:mo><m:mrow><m:mi>k</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>K</m:mi></m:msubsup><m:mrow><m:msub><m:mi>&#945;</m:mi><m:mi>k</m:mi></m:msub><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaaeWaqaaGGaciab=f7aHnaaBaaaleaacqWGRbWAaeqaaOGaeyypa0JaeGymaedaleaacqWGRbWAcqGH9aqpcqaIXaqmaeaacqWGlbWsa0GaeyyeIuoaaaa@3853@</m:annotation></m:semantics></m:math></inline-formula>, are the mixture weights or component priors. By introducing a discrete hidden variable <it>Y </it>= {<it>y</it><sub><it>i</it></sub>} for 1 &#8804; <it>i </it>&#8804; <it>N</it>, which indicates which DTree generated which developmental profile <it>x</it><sub><it>i</it></sub>, we can formulate a complete log-likelihood function and estimate the parameters with the Expectation-Maximization (EM) algorithm <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Given an initial parameterization &#920;<sup>0</sup>, EM iterates two steps: first estimating the posterior probabilities <inline-formula><m:math name="1471-2172-8-25-i15" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">[</m:mo><m:msub><m:mi>y</m:mi><m:mi>i</m:mi></m:msub><m:mo>=</m:mo><m:mi>k</m:mi><m:mo>|</m:mo><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>,</m:mo><m:msubsup><m:mi>&#952;</m:mi><m:mi>k</m:mi><m:mi>m</m:mi></m:msubsup><m:mo stretchy="false">]</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGqbaucqGGBbWwcqWG5bqEdaWgaaWcbaGaemyAaKgabeaakiabg2da9iabdUgaRjabcYha8jabdIha4naaBaaaleaacqWGPbqAaeqaaOGaeiilaWccciGae8hUde3aa0baaSqaaiabdUgaRbqaaiabd2gaTbaakiabc2faDbaa@3FE6@</m:annotation></m:semantics></m:math></inline-formula> (E-step), and second computing the maximum-likelihood parameters &#920;<sup><it>m </it>+ 1 </sup>(M-step), as defined in Eq. 4, Eq. 5 and Eq. 6. We refer the reader to <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> for details of the EM-algorithm.</p>
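               <p>As an illustration of the E-step, the posterior probabilities can be computed from the per-component log-densities and the mixture weights. The sketch below is generic for any mixture model and is not taken from our implementation; the function name <it>e_step</it> and the array layout are assumptions, all mixture weights are assumed to be strictly positive, and the DTree densities themselves are taken as given.</p>
               <p><![CDATA[
```python
import numpy as np


def e_step(log_dens, alpha):
    """Posterior probabilities of the hidden variable for a generic mixture.

    log_dens: array of shape (N, K), log p[x_i | theta_k] for each profile i
              and component k (assumed to be computed elsewhere).
    alpha:    array of shape (K,), positive mixture weights that sum to one.
    Returns the (N, K) posteriors P[y_i = k | x_i] and the log-likelihood.
    """
    log_dens = np.asarray(log_dens, dtype=float)
    alpha = np.asarray(alpha, dtype=float)
    log_joint = log_dens + np.log(alpha)            # log(alpha_k * p[x_i | theta_k])
    shift = log_joint.max(axis=1, keepdims=True)    # subtract row maxima for stability
    joint = np.exp(log_joint - shift)
    row_sum = joint.sum(axis=1, keepdims=True)
    posteriors = joint / row_sum                    # each row sums to one
    log_lik = float(np.sum(shift + np.log(row_sum)))
    return posteriors, log_lik
```
]]></p>
               <p>Working with log-densities and subtracting the row maximum before exponentiating avoids numerical underflow when a profile is very unlikely under every component.</p>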
               <p>To avoid over-fitting the models, in particular for components with low component priors <it>&#945;</it><sub><it>k</it></sub> (that is, components with a small number of assigned genes), we propose a maximum-a-posteriori (MAP) approach. We assume that <it>w</it><sub><it>u</it>|<it>v, k </it></sub>~ <it>N</it>(0, <it>&#945;</it><sub><it>k</it></sub><it>&#946;</it><sub><it>u</it>|<it>v, k</it></sub>, <inline-formula><m:math name="1471-2172-8-25-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mrow><m:mi>u</m:mi><m:mo>|</m:mo><m:mi>k</m:mi></m:mrow><m:mrow><m:mo>&#8722;</m:mo><m:mn>2</m:mn></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemyDauNaeiiFaWNaem4AaSgabaGaeyOeI0IaeGOmaidaaaaa@34D4@</m:annotation></m:semantics></m:math></inline-formula>) <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>. Consequently, the estimates take the form</p>
               <p>
                  <display-formula id="M9">
                     <m:math name="1471-2172-8-25-i17" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>w</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mrow>
                                    <m:mi>u</m:mi>
                                    <m:mo>|</m:mo>
                                    <m:mi>v</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>=</m:mo>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mover accent="true">
                                          <m:mi>&#963;</m:mi>
                                          <m:mo>^</m:mo>
                                       </m:mover>
                                       <m:mrow>
                                          <m:mi>u</m:mi>
                                          <m:mi>v</m:mi>
                                          <m:mo>|</m:mo>
                                          <m:mi>k</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:msubsup>
                                       <m:mover accent="true">
                                          <m:mi>&#963;</m:mi>
                                          <m:mo>^</m:mo>
                                       </m:mover>
                                       <m:mrow>
                                          <m:mi>u</m:mi>
                                          <m:mo>|</m:mo>
                                          <m:mi>k</m:mi>
                                       </m:mrow>
                                       <m:mn>2</m:mn>
                                    </m:msubsup>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mn>1</m:mn>
                                    <m:mo>+</m:mo>
                                    <m:msubsup>
                                       <m:mi>&#946;</m:mi>
                                       <m:mrow>
                                          <m:mi>u</m:mi>
                                          <m:mo>|</m:mo>
                                          <m:mi>v</m:mi>
                                          <m:mo>,</m:mo>
                                          <m:mi>k</m:mi>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                    </m:msubsup>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                              </m:mfrac>
                              <m:mo>,</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG3bWDgaqcamaaBaaaleaacqWG1bqDcqGG8baFcqWG2bGDcqGGSaalcqWGRbWAaeqaaOGaeyypa0ZaaSaaaeaaiiGacuWFdpWCgaqcamaaBaaaleaacqWG1bqDcqWG2bGDcqGG8baFcqWGRbWAaeqaaaGcbaGaf83WdmNbaKaadaqhaaWcbaGaemyDauNaeiiFaWNaem4AaSgabaGaeGOmaidaaOGaeiikaGIaeGymaeJaey4kaSIae8NSdi2aa0baaSqaaiabdwha1jabcYha8jabdAha2jabcYcaSiabdUgaRbqaaiabgkHiTiabigdaXaaakiabcMcaPaaacqGGSaalaaa@5401@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>
                  <display-formula id="M10">
                     <m:math name="1471-2172-8-25-i18" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:msubsup>
                                 <m:mover accent="true">
                                    <m:mi>&#963;</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mrow>
                                    <m:mi>u</m:mi>
                                    <m:mo>|</m:mo>
                                    <m:mi>v</m:mi>
                                 </m:mrow>
                                 <m:mn>2</m:mn>
                              </m:msubsup>
                              <m:mo>=</m:mo>
                              <m:msubsup>
                                 <m:mover accent="true">
                                    <m:mi>&#963;</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mi>u</m:mi>
                                 <m:mn>2</m:mn>
                              </m:msubsup>
                              <m:mo>&#8722;</m:mo>
                              <m:msubsup>
                                 <m:mover accent="true">
                                    <m:mi>w</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mrow>
                                    <m:mi>u</m:mi>
                                    <m:mo>|</m:mo>
                                    <m:mi>v</m:mi>
                                 </m:mrow>
                                 <m:mn>2</m:mn>
                              </m:msubsup>
                              <m:msubsup>
                                 <m:mover accent="true">
                                    <m:mi>&#963;</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mi>v</m:mi>
                                 <m:mn>2</m:mn>
                              </m:msubsup>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mn>1</m:mn>
                              <m:mo>&#8722;</m:mo>
                              <m:msubsup>
                                 <m:mi>&#946;</m:mi>
                                 <m:mrow>
                                    <m:mi>u</m:mi>
                                    <m:mo>|</m:mo>
                                    <m:mi>v</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                              </m:msubsup>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>.</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaqcamaaDaaaleaacqWG1bqDcqGG8baFcqWG2bGDaeaacqaIYaGmaaGccqGH9aqpcuWFdpWCgaqcamaaDaaaleaacqWG1bqDaeaacqaIYaGmaaGccqGHsislcuWG3bWDgaqcamaaDaaaleaacqWG1bqDcqGG8baFcqWG2bGDaeaacqaIYaGmaaGccuWFdpWCgaqcamaaDaaaleaacqWG2bGDaeaacqaIYaGmaaGccqGGOaakcqaIXaqmcqGHsislcqWFYoGydaqhaaWcbaGaemyDauNaeiiFaWNaemODayNaeiilaWIaem4AaSgabaGaeyOeI0IaeGymaedaaOGaeiykaKIaeiOla4caaa@54C2@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>For simplicity, the formulas in the Dependence tree section omit the index <it>k</it>, which identifies a tree within a given mixture. See the Protocol in Additional data file <supplr sid="S1">1</supplr> for the exact MLE and MAP formulas in the mixture context. When <it>&#946; </it>&#8594; &#8734;, we obtain a non-informative prior, for which the MAP and MLE estimates are equal. As <it>&#946; </it>&#8594; 0, <it>w </it>&#8594; 0 and we have a univariate Gaussian. As in <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>, we use an empirical Bayes approach to estimate the value of the hyper-parameter <it>&#946;</it><sub><it>u</it>|<it>v</it>, <it>k </it></sub>as</p>
               <p>
                  <display-formula id="M11">
                     <m:math name="1471-2172-8-25-i19" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>&#946;</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mrow>
                                    <m:mi>u</m:mi>
                                    <m:mo>|</m:mo>
                                    <m:mi>v</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>=</m:mo>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:mstyle displaystyle="true">
                                       <m:msubsup>
                                          <m:mo>&#8721;</m:mo>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mo>=</m:mo>
                                             <m:mn>1</m:mn>
                                          </m:mrow>
                                          <m:mi>N</m:mi>
                                       </m:msubsup>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>r</m:mi>
                                             <m:mrow>
                                                <m:mi>i</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mstyle>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:msubsup>
                                             <m:mover accent="true">
                                                <m:mi>&#963;</m:mi>
                                                <m:mo>^</m:mo>
                                             </m:mover>
                                             <m:mrow>
                                                <m:mi>u</m:mi>
                                                <m:mo>|</m:mo>
                                                <m:mi>k</m:mi>
                                             </m:mrow>
                                             <m:mn>2</m:mn>
                                          </m:msubsup>
                                          <m:msubsup>
                                             <m:mover accent="true">
                                                <m:mi>&#963;</m:mi>
                                                <m:mo>^</m:mo>
                                             </m:mover>
                                             <m:mrow>
                                                <m:mi>v</m:mi>
                                                <m:mo>|</m:mo>
                                                <m:mi>k</m:mi>
                                             </m:mrow>
                                             <m:mn>2</m:mn>
                                          </m:msubsup>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msubsup>
                                             <m:mover accent="true">
                                                <m:mi>&#963;</m:mi>
                                                <m:mo>^</m:mo>
                                             </m:mover>
                                             <m:mrow>
                                                <m:mi>u</m:mi>
                                                <m:mi>v</m:mi>
                                                <m:mo>|</m:mo>
                                                <m:mi>k</m:mi>
                                             </m:mrow>
                                             <m:mn>2</m:mn>
                                          </m:msubsup>
                                       </m:mrow>
                                    </m:mfrac>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                              </m:mfrac>
                              <m:mo>,</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFYoGygaqcamaaBaaaleaacqWG1bqDcqGG8baFcqWG2bGDcqGGSaalcqWGRbWAaeqaaOGaeyypa0ZaaSaaaeaadaaeWaqaaiabdkhaYnaaBaaaleaacqWGPbqAcqWGRbWAaeqaaaqaaiabdMgaPjabg2da9iabigdaXaqaaiabd6eaobqdcqGHris5aaGcbaWaaSaaaeaacuWFdpWCgaqcamaaDaaaleaacqWG1bqDcqGG8baFcqWGRbWAaeaacqaIYaGmaaGccuWFdpWCgaqcamaaDaaaleaacqWG2bGDcqGG8baFcqWGRbWAaeaacqaIYaGmaaaakeaacuWFdpWCgaqcamaaDaaaleaacqWG1bqDcqWG2bGDcqGG8baFcqWGRbWAaeaacqaIYaGmaaaaaOGaeyOeI0IaeGymaedaaiabcYcaSaaa@5B40@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>where <it>r</it><sub><it>ik </it></sub>is equal to the posterior probability <it>P </it>[<it>y</it><sub><it>i </it></sub>= <it>k</it>|<it>x</it><sub><it>i</it></sub>, <it>&#952;</it><sub><it>k</it></sub>] calculated in the E step. This term can be interpreted as the inverse of the linearity evidence. It penalizes components with low responsibilities and larger variances, enforcing lower <it>w</it><sub><it>u</it>|<it>v</it>, <it>k </it></sub>values (see Protocol in Additional data file <supplr sid="S1">1</supplr> for derivations of all formulas).</p>
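               <p>To make the behaviour of the hyper-parameter estimate concrete, note that the ratio of variances to squared covariance in its denominator is the inverse of the squared correlation, so the estimate equals the summed responsibilities times the squared correlation divided by one minus the squared correlation. The sketch below evaluates this equivalent form together with the factor 1/(1 + 1/<it>&#946;</it>) by which the MAP estimate of <it>w</it> is shrunk. The function names are assumptions made for illustration, and the variance and covariance estimates of the component are taken as given inputs.</p>
               <p><![CDATA[
```python
import math


def beta_hat(resp_sum, var_u, var_v, cov_uv):
    """Empirical Bayes estimate of beta for one edge (u | v) of one component.

    resp_sum is the sum of the responsibilities r_ik over all profiles i;
    var_u, var_v and cov_uv are the variance and covariance estimates of
    the component. Uses resp_sum * rho^2 / (1 - rho^2), which is
    algebraically equal to resp_sum / (var_u * var_v / cov_uv^2 - 1).
    """
    rho_sq = cov_uv ** 2 / (var_u * var_v)  # squared correlation
    if rho_sq >= 1.0:
        return math.inf                     # perfectly linear: non-informative prior
    return resp_sum * rho_sq / (1.0 - rho_sq)


def shrinkage_factor(beta):
    """Factor 1 / (1 + 1/beta) applied to the estimate of w."""
    if beta == 0.0:
        return 0.0                          # w is shrunk to zero
    if math.isinf(beta):
        return 1.0                          # MAP and MLE estimates coincide
    return 1.0 / (1.0 + 1.0 / beta)
```
]]></p>
               <p>For example, with summed responsibilities of 50 and a squared correlation of 0.25, the estimate is 50/3 and the shrinkage factor is 50/53, whereas summed responsibilities of 5 at the same correlation give a factor of 5/8. Components with few assigned genes are therefore regularized more strongly.</p>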
               <p>The last step after the mixture estimation is the assignment of genes to groups. Each gene <it>i </it>is assigned to the component with the highest posterior probability, that is, <it>y</it><sub><it>i </it></sub>= <it>argmax</it><sub>1 &#8804; <it>k </it>&#8804; <it>K</it></sub>(<it>r</it><sub><it>ik</it></sub>). Note that more refined assignment schemes <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> (i.e., decoding a mixture), which increase the robustness of the clustering method, can also be used.</p>
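               <p>The assignment rule amounts to a row-wise maximum over the matrix of posteriors. A minimal sketch, assuming the posteriors are stored in an array with one row per gene and one column per component (the function name <it>assign_genes</it> is introduced here for illustration only):</p>
               <p><![CDATA[
```python
import numpy as np


def assign_genes(posteriors):
    """Return, for each gene, the 1-based index of the component with the
    highest posterior probability r_ik. Ties go to the lowest index."""
    posteriors = np.asarray(posteriors, dtype=float)
    return np.argmax(posteriors, axis=1) + 1
```
]]></p>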
            </sec>
         </sec>
         <sec>
            <st>
               <p>Application in lymphoid development</p>
            </st>
            <p>We perform the following steps on each of the sets TCell, BCell, LymphoidTree, and SIM. The mixture estimation method is initialized with <it>K </it>random DTrees, which are obtained by drawing a value uniformly from [0, 1], independently for each <it>r</it><sub><it>ik</it></sub>, and estimating the individual models from these values. Subsequently, we train the mixture model using the EM-algorithm and MAP estimates. To reduce the effect of the initialization, all estimations are repeated 15 times, and the one with the highest likelihood is selected (a similar procedure is applied for <it>k</it>-means and SOM). The implementation of our method (licensed under the GPL) and MS Windows binaries are available at <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. A web database, generated with our MixDTrees Report tool, with the results of all analyses described in this article is also available there.</p>
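            <p>The restart procedure can be sketched as a loop around a single estimation run. The code below is only an outline under stated assumptions: <it>fit</it> is a hypothetical callable that takes the initial values of <it>r</it><sub><it>ik </it></sub>and returns a trained model together with its log-likelihood, and the names <it>random_responsibilities</it> and <it>best_of_restarts</it> are not part of our software. The random values are used as drawn, without normalizing them over the components.</p>
            <p><![CDATA[
```python
import numpy as np


def random_responsibilities(n_genes, n_components, rng):
    """Draw each r_ik independently and uniformly from [0, 1]."""
    return rng.uniform(0.0, 1.0, size=(n_genes, n_components))


def best_of_restarts(fit, n_genes, n_components, restarts=15, seed=0):
    """Run `fit` from several random initializations and keep the best run.

    `fit` is a hypothetical callable: it receives the initial r_ik matrix and
    returns a pair (model, log_likelihood).
    """
    rng = np.random.default_rng(seed)
    best_model, best_ll = None, -np.inf
    for _ in range(restarts):
        model, ll = fit(random_responsibilities(n_genes, n_components, rng))
        if ll > best_ll:  # keep the run with the highest likelihood
            best_model, best_ll = model, ll
    return best_model, best_ll
```
]]></p>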
            <p>On TCell and BCell, we used the SOM results as given by <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. For SOM experiments on SIM data, we used the default parameters of the implementation <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, which uses a set of heuristics to select the values. Furthermore, we clustered the SOM nodes with <it>k</it>-means, as is common practice <abbrgrp><abbr bid="B79">79</abbr></abbrgrp>. To facilitate the comparison between our clustering results and the clusters of the original work, we reorder our clusters accordingly. Dependence between developmental stages is measured as the correlation between variables. Given two stages <it>X</it><sub><it>u </it></sub>and <it>X</it><sub><it>v</it></sub>, the correlation is defined as</p>
            <p>
               <display-formula id="M12">
                  <m:math name="1471-2172-8-25-i20" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>&#961;</m:mi>
                              <m:mrow>
                                 <m:mi>u</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mi>v</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>&#963;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:mi>u</m:mi>
                                       <m:mi>v</m:mi>
                                    </m:mrow>
                                 </m:msub>
                              </m:mrow>
                              <m:mrow>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>&#963;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mi>u</m:mi>
                                 </m:msub>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>&#963;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mi>v</m:mi>
                                 </m:msub>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFbpGCdaWgaaWcbaGaemyDauNaeiilaWIaemODayhabeaakiabg2da9maalaaabaGaf83WdmNbaKaadaWgaaWcbaGaemyDauNaemODayhabeaaaOqaaiqb=n8aZzaajaWaaSbaaSqaaiabdwha1bqabaGccuWFdpWCgaqcamaaBaaaleaacqWG2bGDaeqaaaaakiabcYcaSaaa@4043@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where -1 &#8804; <it>&#961;</it><sub><it>u</it>, <it>v </it></sub>&#8804; 1 and <it>&#961;</it><sub><it>u</it>, <it>v </it></sub>= 0 indicates the absence of linear dependence between the variables.</p>
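            <p>For a single set of profiles, this correlation can be computed directly from the expression values at the two stages. The sketch below assumes unweighted estimates over all profiles (no responsibilities) and non-constant inputs; the function name <it>stage_correlation</it> is introduced here for illustration.</p>
            <p><![CDATA[
```python
import numpy as np


def stage_correlation(x_u, x_v):
    """Correlation between two developmental stages: the covariance estimate
    divided by the product of the two standard deviation estimates."""
    x_u = np.asarray(x_u, dtype=float)
    x_v = np.asarray(x_v, dtype=float)
    cov_uv = np.mean((x_u - x_u.mean()) * (x_v - x_v.mean()))
    return cov_uv / (x_u.std() * x_v.std())
```
]]></p>
            <p>The result agrees with the off-diagonal entry of the matrix returned by <it>numpy.corrcoef</it>, since the normalization constants of the covariance and the standard deviations cancel.</p>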
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>BCell &#8211; B cell development data</p>
         <p>DTree &#8211; dependence tree</p>
         <p>DN &#8211; CD4-/CD8- double negative cells</p>
         <p>DPL &#8211; CD4+/CD8+ double positive large cells</p>
         <p>DPS &#8211; CD4+/CD8+ double positive small cells</p>
         <p>FACS &#8211; fluorescence activated cell sorting</p>
         <p>LympMIR &#8211; hematopoiesis related microRNAs data</p>
         <p>LymphoidTree &#8211; lymphoid tree data</p>
         <p>MAP &#8211; maximum-a-posteriori</p>
         <p>MLE &#8211; maximum likelihood estimates</p>
         <p>MixDTrees &#8211; mixtures of dependence trees</p>
         <p>MixDTrees-MAP &#8211; mixtures of dependence trees with MAP estimates</p>
         <p>MixDTrees-MLE &#8211; mixtures of dependence trees with MLE estimates</p>
         <p>NK &#8211; natural killer cells</p>
         <p>pHSC &#8211; pluri-potent, self-renewing hematopoietic stem cells</p>
         <p>SIM &#8211; simulated data</p>
         <p>SOM &#8211; self-organizing maps</p>
         <p>SP4 &#8211; single positive CD4</p>
         <p>SP8 &#8211; single positive CD8</p>
         <p>TCell &#8211; T cell development data</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>IC implemented the approach and performed the experiments. IC and SR evaluated the results. IC, SR and AS designed this study and wrote the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Fritz Melchers and Roland Krause (MPI for Infection Biology, Berlin) for helpful discussions, encouragement, and valuable comments on the manuscript. We also thank Christoph Hafemeister for his work on the software, and Benjamin Georgi and Ruben Schilling for revising the manuscript. The first author acknowledges funding from CNPq (Brazil)/DAAD.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Transcriptional networks in developing and mature B cells</p>
            </title>
            <aug>
               <au>
                  <snm>Matthias</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rolink</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Nat Rev Immunol</source>
            <pubdate>2005</pubdate>
            <volume>5</volume>
            <issue>6</issue>
            <fpage>497</fpage>
            <lpage>508</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15928681</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Molecular genetics of T cell development</p>
            </title>
            <aug>
               <au>
                  <snm>Rothenberg</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Taghon</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Annu Rev Immunol</source>
            <pubdate>2005</pubdate>
            <volume>23</volume>
            <fpage>601</fpage>
            <lpage>649</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15771582</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>From stem cell to T cell: one route or many?</p>
            </title>
            <aug>
               <au>
                  <snm>Bhandoola</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sambandam</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Rev Immunol</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>117</fpage>
            <lpage>126</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16491136</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Rules for gene usage inferred from a comparison of large-scale gene expression profiles of T and B lymphocyte development</p>
            </title>
            <aug>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bruno</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Seidl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Rolink</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Melchers</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>J Immunol</source>
            <pubdate>2003</pubdate>
            <volume>170</volume>
            <issue>3</issue>
            <fpage>1339</fpage>
            <lpage>1353</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12538694</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Changes in gene expression profiles in developing B cells of murine bone marrow</p>
            </title>
            <aug>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Seidl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Neeb</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rolink</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Melchers</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>98</fpage>
            <lpage>111</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">155249</pubid>
                  <pubid idtype="pmpid" link="fulltext">11779835</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Regulatory coding of lymphoid lineage choice by hematopoietic transcription factors</p>
            </title>
            <aug>
               <au>
                  <snm>Warren</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Rothenberg</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Curr Opin Immunol</source>
            <pubdate>2003</pubdate>
            <volume>15</volume>
            <issue>2</issue>
            <fpage>166</fpage>
            <lpage>175</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12633666</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>MicroRNAs modulate hematopoietic lineage differentiation</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>CZ</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lodish</snm>
                  <fnm>HF</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>303</volume>
            <issue>5654</issue>
            <fpage>83</fpage>
            <lpage>86</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14657504</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>MicroRNA profiling of the murine hematopoietic system</p>
            </title>
            <aug>
               <au>
                  <snm>Monticelli</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ansel</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Xiao</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Socci</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Krichevsky</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Thai</snm>
                  <fnm>TH</fnm>
               </au>
               <au>
                  <snm>Rajewsky</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Marks</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rajewsky</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kosik</snm>
                  <fnm>KS</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>8</issue>
            <fpage>R71</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1273638</pubid>
                  <pubid idtype="pmpid" link="fulltext">16086853</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Hematopoietic-specific microRNA expression in human cells</p>
            </title>
            <aug>
               <au>
                  <snm>Ramkissoon</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Mainwaring</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Ogasawara</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Keyvanfar</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>McCoy</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Sloand</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Kajigaya</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>NS</fnm>
               </au>
            </aug>
            <source>Leuk Res</source>
            <pubdate>2005</pubdate>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16226311</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Uncovering regulatory pathways that affect hematopoietic stem cell function using 'genetical genomics'</p>
            </title>
            <aug>
               <au>
                  <snm>Bystrykh</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Weersing</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dontje</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pletcher</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Wiltshire</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>AI</fnm>
               </au>
               <au>
                  <snm>Vellenga</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Manly</snm>
                  <fnm>KF</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Chesler</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Alberts</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Jansen</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Cooke</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>de Haan</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <issue>3</issue>
            <fpage>225</fpage>
            <lpage>232</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15711547</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Natural killer cells distinguish innocuous and destructive forms of pancreatic islet autoimmunity</p>
            </title>
            <aug>
               <au>
                  <snm>Poirot</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Benoist</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mathis</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <issue>21</issue>
            <fpage>8102</fpage>
            <lpage>8107</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419564</pubid>
                  <pubid idtype="pmpid" link="fulltext">15141080</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Basal immunoglobulin signaling actively maintains developmental stage in immature B cells</p>
            </title>
            <aug>
               <au>
                  <snm>Tze</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>Schram</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Lam</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Hogquist</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Hippen</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shinton</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Otipoby</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Rodine</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Vegoe</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Kraus</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hardy</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Schlissel</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Rajewsky</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Behrens</snm>
                  <fnm>TW</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <issue>3</issue>
            <fpage>e82</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1059451</pubid>
                  <pubid idtype="pmpid" link="fulltext">15752064</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Modeling T-cell activation using gene expression profiling and state-space models</p>
            </title>
            <aug>
               <au>
                  <snm>Rangel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Angus</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ghahramani</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Lioumi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sotheran</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Gaiba</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wild</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Falciani</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>9</issue>
            <fpage>1361</fpage>
            <lpage>1372</lpage>
            <url>http://bioinformatics.oxfordjournals.org/cgi/content/abstract/20/9/1361</url>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14962938</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Modeling and simulation with Hybrid Functional Petri Nets of the role of interleukin-6 in human early haematopoiesis</p>
            </title>
            <aug>
               <au>
                  <snm>Troncale</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tahi</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Campard</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Vannier</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Guespin</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Pac Symp Biocomput</source>
            <pubdate>2006</pubdate>
            <volume/>
            <fpage>427</fpage>
            <lpage>438</lpage>
            <xrefbib>
               <pubid idtype="pmpid">17094258</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Reverse engineering of regulatory networks in human B cells</p>
            </title>
            <aug>
               <au>
                  <snm>Basso</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Margolin</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Stolovitzky</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Klein</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Dalla-Favera</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Califano</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <issue>4</issue>
            <fpage>382</fpage>
            <lpage>390</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15778709</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Approximating discrete probability distributions with dependence trees</p>
            </title>
            <aug>
               <au>
                  <snm>Chow</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>IEEE Trans Info Theory</source>
            <pubdate>1968</pubdate>
            <volume>14</volume>
            <issue>3</issue>
            <fpage>462</fpage>
            <lpage>467</lpage>
         </bibl>
         <bibl id="B17">
            <title>
               <p>A mixture model-based approach to the clustering of microarray expression data</p>
            </title>
            <aug>
               <au>
                  <snm>McLachlan</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Bean</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Peel</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <issue>3</issue>
            <fpage>413</fpage>
            <lpage>422</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11934740</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Model-based clustering and data transformations for gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Yeung</snm>
                  <fnm>KY</fnm>
               </au>
               <au>
                  <snm>Fraley</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Murua</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Raftery</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Ruzzo</snm>
                  <fnm>WL</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <issue>10</issue>
            <fpage>977</fpage>
            <lpage>987</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11673243</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Continuous representations of time-series gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gerber</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Gifford</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Jaakkola</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2003</pubdate>
            <volume>10</volume>
            <issue>3&#8211;4</issue>
            <fpage>341</fpage>
            <lpage>356</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12935332</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Clustering of time-course gene expression data using a mixed-effects model with B-splines</p>
            </title>
            <aug>
               <au>
                  <snm>Luan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>4</issue>
            <fpage>474</fpage>
            <lpage>482</lpage>
            <url>http://bioinformatics.oxfordjournals.org/cgi/content/abstract/19/4/474</url>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12611802</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Cluster analysis of gene expression dynamics</p>
            </title>
            <aug>
               <au>
                  <snm>Ramoni</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Sebastiani</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kohane</snm>
                  <fnm>IS</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <issue>14</issue>
            <fpage>9121</fpage>
            <lpage>9126</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">123104</pubid>
                  <pubid idtype="pmpid" link="fulltext">12082179</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Analyzing Gene Expression Time-Courses</p>
            </title>
            <aug>
               <au>
                  <snm>Schliep</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Costa</snm>
                  <fnm>IG</fnm>
               </au>
               <au>
                  <snm>Steinhoff</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schonhuth</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>IEEE/ACM Trans Comput Biol Bioinform</source>
            <pubdate>2005</pubdate>
            <volume>2</volume>
            <issue>3</issue>
            <fpage>179</fpage>
            <lpage>193</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17044182</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Learning with mixtures of trees</p>
            </title>
            <aug>
               <au>
                  <snm>Meila</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jordan</snm>
                  <fnm>MI</fnm>
               </au>
            </aug>
            <source>J Mach Learn Res</source>
            <pubdate>2001</pubdate>
            <volume>1</volume>
            <fpage>1</fpage>
            <lpage>48</lpage>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Learning multiple evolutionary pathways from cross-sectional data</p>
            </title>
            <aug>
               <au>
                  <snm>Beerenwinkel</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Rahnenfuhrer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Daumer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kaiser</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Selbig</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lengauer</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>RECOMB '04: Proceedings of the eighth annual international conference on Research in computational molecular biology</source>
            <publisher>New York, NY, USA: ACM Press</publisher>
            <pubdate>2004</pubdate>
            <fpage>36</fpage>
            <lpage>44</lpage>
         </bibl>
         <bibl id="B25">
            <title>
               <p>SOM Toolbox for Matlab</p>
            </title>
            <aug>
               <au>
                  <snm>Vesanto</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Himberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Alhoniemi</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Parhankangas</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Tech rep</source>
            <pubdate>2000</pubdate>
            <url>http://citeseer.ist.psu.edu/vesanto00som.html</url>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Supplementary Material</p>
            </title>
            <url>http://algorithmics.molgen.mpg.de/Supplements/ExpLym/</url>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Local computations with probabilities on graphical structures and their application to expert systems</p>
            </title>
            <aug>
               <au>
                  <snm>Lauritzen</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Spiegelhalter</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>J R Stat Soc B</source>
            <pubdate>1988</pubdate>
            <volume>50</volume>
            <fpage>157</fpage>
            <lpage>224</lpage>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Maximum likelihood from incomplete data via the EM algorithm</p>
            </title>
            <aug>
               <au>
                  <snm>Dempster</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Laird</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J R Stat Soc B</source>
            <pubdate>1977</pubdate>
            <volume>39</volume>
            <fpage>1</fpage>
            <lpage>38</lpage>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Analyzing time series gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>16</issue>
            <fpage>2493</fpage>
            <lpage>2503</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15130923</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Clustering gene-expression data with repeated measurements</p>
            </title>
            <aug>
               <au>
                  <snm>Yeung</snm>
                  <fnm>KY</fnm>
               </au>
               <au>
                  <snm>Medvedovic</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bumgarner</snm>
                  <fnm>RE</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>5</issue>
            <fpage>R34</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">156590</pubid>
                  <pubid idtype="pmpid" link="fulltext">12734014</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Clustering short time series gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Ernst</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nau</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>suppl 1</issue>
            <fpage>i159</fpage>
            <lpage>i168</lpage>
            <url>http://bioinformatics.oxfordjournals.org/cgi/content/abstract/21/suppl_1/i159</url>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15961453</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Fuzzy clustering of short time-series and unevenly distributed sampling points</p>
            </title>
            <aug>
               <au>
                  <snm>Moller-Levet</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Klawonn</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cho</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wolkenhauer</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Advances in Intelligent Data Analysis V, Lecture Notes in Computer Science</source>
            <publisher>Springer Verlag</publisher>
            <pubdate>2003</pubdate>
            <volume>2810</volume>
            <fpage>330</fpage>
            <lpage>340</lpage>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Online Mendelian Inheritance in Man, OMIM (TM)</p>
            </title>
            <aug>
               <au>
                  <cnm>Johns Hopkins University, Baltimore, MD</cnm>
               </au>
            </aug>
            <source>World Wide Web</source>
            <pubdate>2006</pubdate>
            <url>http://www.ncbi.nlm.nih.gov/omim/</url>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Gene Ontology: tool for the unification of biology</p>
            </title>
            <aug>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>25</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10802651</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>A genomic view of lymphocyte development</p>
            </title>
            <aug>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Melchers</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Curr Opin Immunol</source>
            <pubdate>2003</pubdate>
            <volume>15</volume>
            <issue>3</issue>
            <fpage>239</fpage>
            <lpage>245</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12787746</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <aug>
               <au>
                  <snm>McLachlan</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Peel</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Finite Mixture Models</source>
            <publisher>Wiley Series in Probability and Statistics, Wiley, New York</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B37">
            <title>
               <p>How does gene expression clustering work?</p>
            </title>
            <aug>
               <au>
                  <snm>D'haeseleer</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2005</pubdate>
            <volume>23</volume>
            <issue>12</issue>
            <fpage>1499</fpage>
            <lpage>1501</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16333293</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Tumor necrosis factor receptor-associated factor (TRAF) 5 and TRAF2 are involved in CD30-mediated NFkappaB activation</p>
            </title>
            <aug>
               <au>
                  <snm>Aizawa</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nakano</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Ishida</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Horie</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nagai</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ito</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yagita</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Okumura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Inoue</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1997</pubdate>
            <volume>272</volume>
            <issue>4</issue>
            <fpage>2042</fpage>
            <lpage>2045</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8999898</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Targeted disruption of Traf5 gene causes defects in CD40- and CD27-mediated lymphocyte activation</p>
            </title>
            <aug>
               <au>
                  <snm>Nakano</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sakon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Koseki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Takemori</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tada</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Matsumoto</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Munechika</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sakai</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Shirasawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Akiba</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kobata</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Santee</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Ware</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Rennert</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Taniguchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yagita</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Okumura</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <issue>17</issue>
            <fpage>9803</fpage>
            <lpage>9808</lpage>
            <url>http://www.pnas.org/cgi/content/abstract/96/17/9803</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">22291</pubid>
                  <pubid idtype="pmpid" link="fulltext">10449775</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>The IkappaB-NF-kappaB signaling module: temporal control and selective gene activation</p>
            </title>
            <aug>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Levchenko</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Baltimore</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <issue>5596</issue>
            <fpage>1241</fpage>
            <lpage>1245</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12424381</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>B-lymphocyte quiescence, tolerance and activation as viewed by global gene expression profiling on microarrays</p>
            </title>
            <aug>
               <au>
                  <snm>Glynne</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ghandour</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Rayner</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mack</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Goodnow</snm>
                  <fnm>CC</fnm>
               </au>
            </aug>
            <source>Immunol Rev</source>
            <pubdate>2000</pubdate>
            <volume>176</volume>
            <fpage>216</fpage>
            <lpage>246</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11043780</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Activation of the PKB/AKT pathway by ICAM-2</p>
            </title>
            <aug>
               <au>
                  <snm>Perez</snm>
                  <fnm>OD</fnm>
               </au>
               <au>
                  <snm>Kinoshita</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hitoshi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Payan</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Kitamura</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nolan</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Lorens</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Immunity</source>
            <pubdate>2002</pubdate>
            <volume>16</volume>
            <fpage>51</fpage>
            <lpage>65</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11825565</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>The tumor suppressor TSLC1/NECL-2 triggers NK-cell and CD8+ T-cell responses through the cell-surface receptor CRTAM</p>
            </title>
            <aug>
               <au>
                  <snm>Boles</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Barchet</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Diacovo</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cella</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Colonna</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Blood</source>
            <pubdate>2005</pubdate>
            <volume>106</volume>
            <issue>3</issue>
            <fpage>779</fpage>
            <lpage>786</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15811952</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Genomic structure of mouse SPI-C and genomic structure and expression pattern of human SPI-C</p>
            </title>
            <aug>
               <au>
                  <snm>Carlsson</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hjalmarsson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Liberg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Persson</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Leanderson</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2002</pubdate>
            <volume>299</volume>
            <issue>1&#8211;2</issue>
            <fpage>271</fpage>
            <lpage>278</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12459275</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>MicroRNAs: genomics, biogenesis, mechanism, and function</p>
            </title>
            <aug>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2004</pubdate>
            <volume>116</volume>
            <issue>2</issue>
            <fpage>281</fpage>
            <lpage>297</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14744438</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Differential repression of alternative transcripts: a screen for miRNA targets</p>
            </title>
            <aug>
               <au>
                  <snm>Legendre</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ritchie</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Gautheret</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <issue>5</issue>
            <fpage>e43</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1458965</pubid>
                  <pubid idtype="pmpid" link="fulltext">16699595</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Microarray analysis shows that some microRNAs downregulate large numbers of target mRNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Lim</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Lau</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Garrett-Engele</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Grimson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schelter</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Castle</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Linsley</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>433</volume>
            <issue>7027</issue>
            <fpage>769</fpage>
            <lpage>773</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15685193</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Cell-type-specific signatures of microRNAs on target mRNA expression</p>
            </title>
            <aug>
               <au>
                  <snm>Sood</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Krek</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zavolan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Macino</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Rajewsky</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <issue>8</issue>
            <fpage>2746</fpage>
            <lpage>2751</lpage>
            <url>http://www.pnas.org/cgi/content/abstract/103/8/2746</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1413820</pubid>
                  <pubid idtype="pmpid" link="fulltext">16477010</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>miRBase: microRNA sequences, targets and gene nomenclature</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Grocock</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>van Dongen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Enright</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>Database issue</issue>
            <fpage>D140</fpage>
            <lpage>D144</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347474</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381832</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <aug>
               <au>
                  <snm>Sokal</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Rohlf</snm>
                  <fnm>FJ</fnm>
               </au>
            </aug>
            <source>Biometry</source>
            <publisher>New York: W. H. Freeman and Company</publisher>
            <pubdate>1995</pubdate>
         </bibl>
         <bibl id="B51">
            <title>
               <p>GOstat: find statistically overrepresented Gene Ontologies within a group of genes</p>
            </title>
            <aug>
               <au>
                  <snm>Beissbarth</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>9</issue>
            <fpage>1464</fpage>
            <lpage>1465</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14962934</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <aug>
               <au>
                  <snm>Westfall</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Resampling-Based Multiple Testing: Examples and Methods for p-Value Adjustment</source>
            <publisher>Wiley-Interscience</publisher>
            <pubdate>1993</pubdate>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Stem cell division is regulated by the microRNA pathway</p>
            </title>
            <aug>
               <au>
                  <snm>Hatfield</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Shcherbata</snm>
                  <fnm>HR</fnm>
               </au>
               <au>
                  <snm>Fischer</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Nakahara</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Carthew</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Ruohola-Baker</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>435</volume>
            <issue>7044</issue>
            <fpage>974</fpage>
            <lpage>978</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15944714</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Polymorphism in a second ABC transporter gene located within the class II region of the human major histocompatibility complex</p>
            </title>
            <aug>
               <au>
                  <snm>Powis</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Mockridge</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Kelly</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kerr</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Glynne</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gileadi</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Beck</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Trowsdale</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1992</pubdate>
            <volume>89</volume>
            <issue>4</issue>
            <fpage>1463</fpage>
            <lpage>1467</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">48471</pubid>
                  <pubid idtype="pmpid" link="fulltext">1741401</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Degradation of Cdc25A by beta-TrCP during S phase and in response to DNA damage</p>
            </title>
            <aug>
               <au>
                  <snm>Busino</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Donzelli</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chiesa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Guardavaccaro</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ganoth</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Dorrello</snm>
                  <fnm>NV</fnm>
               </au>
               <au>
                  <snm>Hershko</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pagano</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Draetta</snm>
                  <fnm>GF</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>426</volume>
            <issue>6962</issue>
            <fpage>87</fpage>
            <lpage>91</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14603323</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>miR-15 and miR-16 induce apoptosis by targeting BCL2</p>
            </title>
            <aug>
               <au>
                  <snm>Cimmino</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Calin</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Fabbri</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Iorio</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Ferracin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shimizu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wojcik</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Aqeilan</snm>
                  <fnm>RI</fnm>
               </au>
               <au>
                  <snm>Zupo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dono</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rassenti</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Alder</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Volinia</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Kipps</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Negrini</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Croce</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>39</issue>
            <fpage>13944</fpage>
            <lpage>13949</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1236577</pubid>
                  <pubid idtype="pmpid" link="fulltext">16166262</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Spatially specific expression of Hoxb4 is dependent on the ubiquitous transcription factor NFY</p>
            </title>
            <aug>
               <au>
                  <snm>Gilthorpe</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vandromme</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brend</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Gutman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Summerbell</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Totty</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Rigby</snm>
                  <fnm>PWJ</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>2002</pubdate>
            <volume>129</volume>
            <issue>16</issue>
            <fpage>3887</fpage>
            <lpage>3899</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12135926</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>A nuclear factor Y (NFY) site positively regulates the human CD34 stem cell gene</p>
            </title>
            <aug>
               <au>
                  <snm>Radomska</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Satterthwaite</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Taranenko</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Narravula</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Krause</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Tenen</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Blood</source>
            <pubdate>1999</pubdate>
            <volume>94</volume>
            <issue>11</issue>
            <fpage>3772</fpage>
            <lpage>3780</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10572091</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Transcriptional scaffold: CIITA interacts with NF-Y, RFX, and CREB to cause stereospecific regulation of the class II major histocompatibility complex promoter</p>
            </title>
            <aug>
               <au>
                  <snm>Zhu</snm>
                  <fnm>XS</fnm>
               </au>
               <au>
                  <snm>Linhoff</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chin</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Maity</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Ting</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2000</pubdate>
            <volume>20</volume>
            <issue>16</issue>
            <fpage>6051</fpage>
            <lpage>6061</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">86081</pubid>
                  <pubid idtype="pmpid" link="fulltext">10913187</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>The growth factor independence-1 transcription factor: New functions and new insights</p>
            </title>
            <aug>
               <au>
                  <snm>Kazanjian</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gross</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Grimes</snm>
                  <fnm>HL</fnm>
               </au>
            </aug>
            <source>Crit Rev Oncol Hematol</source>
            <pubdate>2006</pubdate>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16716599</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Evidence implicating Gfi-1 and Pim-1 in pre-T-cell differentiation steps associated with beta-selection</p>
            </title>
            <aug>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Karsunky</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Rodel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zevnik</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Elsasser</snm>
                  <fnm>HP</fnm>
               </au>
               <au>
                  <snm>Moroy</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>1998</pubdate>
            <volume>17</volume>
            <issue>18</issue>
            <fpage>5349</fpage>
            <lpage>5359</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1170861</pubid>
                  <pubid idtype="pmpid" link="fulltext">9736613</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Detecting MicroRNA Targets by Linking Sequence, MicroRNA and Gene Expression Data</p>
            </title>
            <aug>
               <au>
                  <snm>Huang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Frey</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Lect Notes Comput Sci</source>
            <pubdate>2006</pubdate>
            <volume>3909</volume>
            <fpage>114</fpage>
            <lpage>129</lpage>
            <url>http://www.springerlink.com/openurl.asp?genre=article&amp;id=doi:10.1007/11732990_11</url>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Genome-wide discovery of transcriptional modules from DNA sequence and gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Yelensky</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Koller</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>Suppl 1</issue>
            <fpage>i273</fpage>
            <lpage>i282</lpage>
            <url>http://bioinformatics.oxfordjournals.org/cgi/content/abstract/19/suppl_1/i273</url>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12855470</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Discovering molecular pathways from protein interaction and gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Koller</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>Suppl 1</issue>
            <fpage>i264</fpage>
            <lpage>i271</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12855469</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Shapira</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Regev</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pe'er</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Koller</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Friedman</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2003</pubdate>
            <volume>34</volume>
            <issue>2</issue>
            <fpage>166</fpage>
            <lpage>176</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12740579</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Time Series Analysis of Gene Expression and Location Data</p>
            </title>
            <aug>
               <au>
                  <snm>Yeang</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Jaakkola</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Third IEEE Symposium on BioInformatics and BioEngineering (BIBE'03)</source>
            <pubdate>2003</pubdate>
            <fpage>305</fpage>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Semi-supervised methods to predict patient survival from gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Bair</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2004</pubdate>
            <volume>2</volume>
            <issue>4</issue>
            <fpage>E108</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">387275</pubid>
                  <pubid idtype="pmpid" link="fulltext">15094809</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Semi-supervised learning via penalized mixture model with application to microarray sample classification</p>
            </title>
            <aug>
               <au>
                  <snm>Pan</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hebbel</snm>
                  <fnm>RP</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>19</issue>
            <fpage>2388</fpage>
            <lpage>2395</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16870935</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>Inferring cellular networks using probabilistic graphical models</p>
            </title>
            <aug>
               <au>
                  <snm>Friedman</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>303</volume>
            <issue>5659</issue>
            <fpage>799</fpage>
            <lpage>805</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14764868</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Gene expression microarrays: glimpses of the immunological genome</p>
            </title>
            <aug>
               <au>
                  <snm>Hyatt</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Melamed</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Seguritan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Laplace</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Poirot</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zucchelli</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Obst</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Matos</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Venanzi</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Goldrath</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nguyen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Luckey</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yamagata</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Herman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mathis</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Benoist</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nat Immunol</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>7</issue>
            <fpage>686</fpage>
            <lpage>691</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16785882</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Gene Expression Omnibus</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/projects/geo/</url>
         </bibl>
         <bibl id="B72">
            <title>
               <p>Variance stabilization applied to microarray data calibration and to the quantification of differential expression</p>
            </title>
            <aug>
               <au>
                  <snm>Huber</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>von Heydebreck</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sultmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Poustka</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Vingron</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <issue>Suppl 1</issue>
            <fpage>S96</fpage>
            <lpage>S104</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12169536</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>Gene Expression Omnibus: NCBI gene expression and hybridization array data repository</p>
            </title>
            <aug>
               <au>
                  <snm>Edgar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Domrachev</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lash</snm>
                  <fnm>AE</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>207</fpage>
            <lpage>210</lpage>
            <url>http://nar.oxfordjournals.org/cgi/content/abstract/30/1/207</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99122</pubid>
                  <pubid idtype="pmpid" link="fulltext">11752295</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>MicroRNAs 221 and 222 inhibit normal erythropoiesis and erythroleukemic cell growth via kit receptor down-modulation</p>
            </title>
            <aug>
               <au>
                  <snm>Felli</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Fontana</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Pelosi</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Botta</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bonci</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Facchiano</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Liuzzi</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Lulli</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Morsilli</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Santoro</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Valtieri</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Calin</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Sorrentino</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Croce</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Peschle</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>50</issue>
            <fpage>18081</fpage>
            <lpage>18086</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1312381</pubid>
                  <pubid idtype="pmpid" link="fulltext">16330772</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <title>
               <p>MicroRNA profiling reveals distinct signatures in B cell chronic lymphocytic leukemias</p>
            </title>
            <aug>
               <au>
                  <snm>Calin</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Sevignani</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ferracin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Felli</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Dumitru</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Shimizu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cimmino</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zupo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dono</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dell'Aquila</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Alder</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Rassenti</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kipps</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Bullrich</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Negrini</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Croce</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <issue>32</issue>
            <fpage>11755</fpage>
            <lpage>11760</lpage>
            <url>http://www.pnas.org/cgi/content/abstract/101/32/11755</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">511048</pubid>
                  <pubid idtype="pmpid" link="fulltext">15284443</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B76">
            <title>
               <p>miRNAs, cancer, and stem cell division</p>
            </title>
            <aug>
               <au>
                  <snm>Croce</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Calin</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2005</pubdate>
            <volume>122</volume>
            <fpage>6</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16009126</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>MicroRNA targets in Drosophila</p>
            </title>
            <aug>
               <au>
                  <snm>Enright</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>John</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gaul</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Tuschl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Marks</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>5</volume>
            <fpage>R1</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">395733</pubid>
                  <pubid idtype="pmpid" link="fulltext">14709173</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B78">
            <title>
               <p>Bayesian Linear Regression</p>
            </title>
            <aug>
               <au>
                  <snm>Minka</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Tech rep</source>
            <publisher>MIT</publisher>
            <pubdate>2001</pubdate>
         </bibl>
         <bibl id="B79">
            <title>
               <p>Clustering of the Self-Organizing Map</p>
            </title>
            <aug>
               <au>
                  <snm>Vesanto</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Alhoniemi</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>IEEE Transactions on Neural Networks</source>
            <pubdate>2000</pubdate>
            <volume>11</volume>
            <issue>3</issue>
            <fpage>586</fpage>
            <url>http://citeseer.ist.psu.edu/article/vesanto00clustering.html</url>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18249787</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
