<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-9-75</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Rank-based edge reconstruction for scale-free genetic regulatory networks</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Chen</snm>
               <fnm>Guanrao</fnm>
               <insr iid="I1"/>
               <email>gchen4@uic.edu</email>
            </au>
            <au id="A2">
               <snm>Larsen</snm>
               <fnm>Peter</fnm>
               <insr iid="I2"/>
               <email>plarsen@uic.edu</email>
            </au>
            <au id="A3">
               <snm>Almasri</snm>
               <fnm>Eyad</fnm>
               <insr iid="I3"/>
               <email>ealmas1@uic.edu</email>
            </au>
            <au id="A4" ca="yes">
               <snm>Dai</snm>
               <fnm>Yang</fnm>
               <insr iid="I3"/>
               <email>yangdai@uic.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Computer Science (MC152), University of Illinois at Chicago, 851 South Morgan Street, Chicago, IL 60607, USA</p>
            </ins>
            <ins id="I2">
               <p>Core Genomics Laboratory, Research Resource Center (MC937), University of Illinois at Chicago, 835 South Wolcott Avenue, Chicago, IL 60612, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Bioengineering (MC063), University of Illinois at Chicago, 851 South Morgan Street, Chicago, IL 60607, USA</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>75</fpage>
         <url>http://www.biomedcentral.com/1471-2105/9/75</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18237422</pubid>
               <pubid idtype="doi">10.1186/1471-2105-9-75</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>19</day>
               <month>3</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>31</day>
               <month>1</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>31</day>
               <month>1</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Chen et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The reconstruction of genetic regulatory networks from microarray gene expression data has been a challenging task in bioinformatics. Various approaches to this problem have been proposed, however, they do not take into account the topological characteristics of the targeted networks while reconstructing them.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>In this study, an algorithm that explores the scale-free topology of networks was proposed based on the modification of a rank-based algorithm for network reconstruction. The new algorithm was evaluated with the use of both simulated and microarray gene expression data. The results demonstrated that the proposed algorithm outperforms the original rank-based algorithm. In addition, in comparison with the Bayesian Network approach, the results show that the proposed algorithm gives much better recovery of the underlying network when sample size is much smaller relative to the number of genes.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The proposed algorithm is expected to be useful in the reconstruction of biological networks whose degree distributions follow the scale-free topology.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The reconstruction of genetic regulatory networks based on microarray gene expression data is one of the most challenging tasks in bioinformatics. The genetic regulatory relationship considered here will be restricted to what might be observed in a microarray experiment: a change in the expression of a regulator gene modulates the expression of a target gene mainly via protein-DNA interactions besides other types of interactions, such as protein-protein interaction. Various approaches have been proposed to this problem, such as Boolean Network and Bayesian Network approaches <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>, differential equations and steady-state models <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>, and other statistical and probabilistic methods <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. Each method has its own strengths and weakness <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, however, very few has been considered superior to the others mainly because of the intrinsically noisy property of the data, 'the curse of dimensionality', and the unknown 'true' underlying networks. Various scoring metrics and searching heuristics were proposed in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> within the Bayesian Network (BN) framework. It was shown that a large amount of data is required in order to have a good recovery of the underlying network. This requirement is easily satisfied in a simulated environment; however, it is unlikely to be met for biological applications. Efforts such as incorporating heterogeneous biological data in network reconstruction have been witnessed to improve the accuracy of the networks <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>.</p>
         <p>As pointed out in <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>, large scale networks, such as the Internet and the scientific collaboration network, show the scale-free property, i.e., the connections or edges in the networks follow the power law distribution. Many biological networks, including transcription regulatory networks, fall into this category. So far there is little research that has explicitly explored this important property to facilitate the learning of genetic networks from gene expression data. One recent study imposed the scale-free constraint on structure in network inference based on the S-system model <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. They investigated the performance with a simulated small scale time-course data. On the other hand, different mechanisms have been employed to explain the formation of the scale-free property in large scale networks other than biological networks. Most of the suggested models relate to Preferential Attachment <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. In contrast to modeling network growing, a model with fixed number of nodes and links was proposed recently <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. By applying local rewiring moves, the network can reach equilibrium states which have the power law degree distribution. Different mechanisms also were proposed to explain specific properties of different types of networks, such as genetic regulatory networks and the World Wide Web <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>.</p>
         <p>In this study, we proposed a network reconstruction algorithm that takes into account the scale-free network topology based on a modification of the <it>Symmetric-N </it>algorithm originally developed in <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. The <it>Symmetric-N </it>algorithm was used to construct co-expressed gene networks which showed scale-free topology. It was also recently incorporated as a major component in their Nearest Neighbor Network algorithm for clustering expression data for generating functionally coherent clusters <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. Both our modified and the original algorithms were evaluated on simulated data sets and a 102-gene set of microarray gene expression data from a study of the <it>Saccharomyces cerevisiae </it>yeast cell cycle <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Compared with the original algorithm, the proposed algorithm demonstrated promising capability in recovering the underlying network structure. The results of our algorithm were further compared with a previous study based on the BN approaches <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Our algorithm performed much better on the simulated data when the sample size is small compared with the number of variables, as is most of the currently available microarray expression data.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Proposed algorithm</p>
            </st>
            <p>Our algorithm is a modification of the algorithm for network construction proposed in <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. The algorithm in <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> is based on the concept of <it>N</it>-nearest-neighbor and consists of two steps. This algorithm (we name it <it>Symmetric-N</it>) is presented in the 'Methods' section. In the first step, for each node in the network, all other nodes are sorted according to the magnitude of correlations of gene expression in descending order. These nodes are considered as potential neighbors. In the second step, each pair of nodes is investigated. If they are both in each other's <it>N </it>nearest neighbors, a connection between them is made. Otherwise, they are not connected. Here <it>N </it>is a prescribed number for the size of neighbors.</p>
            <p>By using the <it>Symmetric-N </it>algorithm, Agrawal <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> constructed co-expressed gene networks from several published gene expression data sets and found that the gene networks had small-world characteristics and became scale-free when <it>N </it>was above certain threshold. It was shown that this algorithm was able to uncover the scale-free topology, however, no analysis was provided on biological relevance of the co-expressed networks in the study. The major characteristic of a scale-free network is that a few nodes with much higher degrees of connections act as the core of the network and other nodes with much fewer connections act as the periphery of the network. In biological networks such as genetic regulatory networks, the transcription factors (TFs) are more likely to regulate multiple target genes and therefore have more connections compared to those non-TFs. On the other hand, the non-TF genes are only regulated by a few TFs. These observations suggest that the sizes of neighbors for the core and periphery nodes should generally not be equal. This phenomenon motivated a modification of the algorithm <it>Symmetric-N </it>so that the unequal neighbor sizes of the core and periphery nodes can benefit the network construction. In step 2 of the <it>Symmetric-N </it>algorithm, instead of using the same <it>N </it>neighbors for all the nodes, a larger number <it>N</it><sub><it>C </it></sub>is assigned to a core node and a smaller number <it>N</it><sub><it>P </it></sub>is assigned to a periphery node. If a periphery node is within the <it>N</it><sub><it>C </it></sub>nearest neighbors of a core node and the core node is within the <it>N</it><sub><it>P </it></sub>nearest neighbors of the periphery node, then a connection is made between them. Since the ranges of potential neighbors are different for these two types of nodes, the proposed algorithm is named <it>Asymmetric-N</it>. Details of the algorithm are presented in the 'Methods' section.</p>
         </sec>
         <sec>
            <st>
               <p>Computation study</p>
            </st>
            <p>The original algorithm <it>Symmetric-N </it>and our modified algorithm <it>Asymmetric-N </it>were evaluated with both simulated gene expression data and microarray gene expression data related to yeast cell cycle. Details on the microarray data, the construction of the simulated networks, and their node degree distributions are presented in the 'Methods' section. Two simulated datasets were derived from a 100-node network and a 20-node network, respectively. The underlying scale-free network for the 100-node network has 10 core nodes and 90 periphery nodes. The directions of edges are more likely to be from core nodes to periphery nodes. The 20-node network was constructed in a similar way. The criteria used to evaluate the performance of the algorithms on the simulated data include <it>recall</it>, <it>precision </it>and <it>F-Score</it>. <it>Recall </it>is defined as the ratio of the number of true edges found in the reconstructed network to the number of total edges in the underlying network. <it>Precision </it>is defined as the ratio of the number of true edges found in the reconstructed network to the number of total edges found in the reconstructed network. <it>F-Score </it>is defined as 2*<it>recall</it>*<it>precision</it>/(<it>recall </it>+ <it>precision</it>).</p>
            <p>The microarray data include 102 gene expression temporal profiles observed over 18 time points derived from the yeast cell cycle gene expression data <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. For this study, the 'true' interactions were derived from the database of Pathway Studio <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> by submitting the list of genes and querying for instances of published interactions between these genes limited to interaction types 'expression' and 'regulation'. One hundred seventy one published interactions were found for this 102-gene set. It should be noted that this is not a so-called 'golden standard' set for a true evaluation of the learning outcome. We report the percentage of the published edges out of the total edges in the reconstructed network, as the criteria used for the simulated datasets would be inappropriate for this microarray dataset because of the unknown or incomplete 'true' network. For the examination of biological relevance of the predicted edges, we report the percentage of edges whose nodes (genes) share a common Gene Ontology (GO) Biological Process (BP) annotation from the Saccharomyces Genome Database (SGD) GO Slim mapper <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. Generally, two genes or gene products with a common GO BP annotation are considered likely to interact with each other.</p>
            <p>In addition, <it>&#947; </it>in <it>P</it>(<it>k</it>) ~ <it>k</it><sup><it>&#947; </it></sup>of the node degree distribution in the constructed network and the fitness of the distribution, measured by the Coefficient of Determination (<it>R</it><sup>2</sup>), were used for the evaluation of the network structure. The parameters <it>&#947; </it>and <it>R</it><sup>2 </sup>were computed with the <it>fit</it>() function in Matlab (see the 'Methods' section for more details of the <it>fit</it>() function). Both <it>F-score </it>and <it>R</it><sup>2 </sup>range between 0 and 1. For a good recovery of the network, <it>F-Score </it>is expected to be high; <it>&#947; </it>is expected to be close to the <it>&#947; </it>of the underlying network; and <it>R</it><sup>2 </sup>is expected to be high. For the 100-node network, <it>&#947; </it>= -1.22 and <it>R</it><sup>2 </sup>= 0.96 for mixed-degree distribution; for the 102-gene network formed with the published interactions from the Pathway Studio, <it>&#947; </it>= -1.22 and <it>R</it><sup>2 </sup>= 0.93 for mixed-degree distribution.</p>
         </sec>
         <sec>
            <st>
               <p>Experiment with simulated data</p>
            </st>
            <p>For each underlying network, 10 different sets of gene expression profiles were generated for a fixed number of samples (time points) and results obtained from the algorithms were averaged.</p>
            <sec>
               <st>
                  <p>The 100-node network</p>
               </st>
               <p>Figure <figr fid="F1">1</figr> shows the results obtained from the <it>Symmetric-N </it>algorithm on the 100-node network. In panel (a), when the number of samples <it>S </it>is fixed at 25, as the number of possible neighbors <it>N </it>increases, <it>recall </it>increases, <it>precision </it>decreases and the <it>F-Score </it>first increases (up to <it>N </it>= 9) and then decreases (from <it>N </it>= 9 to <it>N </it>= 10). This is because that as <it>N </it>increases, pairs of nodes become more likely to be in each other's neighborhood and thus become more likely to be included. This leads to more true edges in the reconstructed network at the cost of including more edges and decreasing <it>precision</it>. Similarly, as <it>N </it>becomes larger, <it>&#947; </it>tends to drastically deviate from -1.22, the <it>&#947; </it>of the underlying network, in panel (b); the reconstructed network becomes less scale-free. Here we chose <it>&#947; </it>from the mixed-degree distribution since the reconstructed network is directionless.</p>
               <fig id="F1">
                  <title>
                     <p>Figure 1</p>
                  </title>
                  <caption>
                     <p>Results for the <it>Symmetric-N </it>algorithm with the 100-node simulated network</p>
                  </caption>
                  <text>
                     <p>Results for the <it>Symmetric-N </it>algorithm with the 100-node simulated network. Panels (a) and (b) show the results when sample size <it>S </it>is fixed (<it>S </it>= 25) while the number of neighbors <it>N </it>is varying. Panels (c) and (d) show the results when <it>N </it>is fixed (<it>N </it>= 5) while <it>S </it>is varying. The upper panels (a) and (c) show the results for <it>Recall</it>, <it>Precision </it>and <it>F-Score</it>. The lower panels (b) and (d) show the results for <it>&#947; </it>and <it>R</it><sup>2</sup>. The parameter pair &lt;<it>&#947;</it>, <it>R</it><sup>2</sup>> for the underlying network structure are &lt;-1.27, 0.96> for in-degree distribution, &lt;-1.61, 0.97> for out-degree distribution, and &lt;-1.22, 0.92> for mixed-degree distribution, respectively.</p>
                  </text>
                  <graphic file="1471-2105-9-75-1"/>
               </fig>
               <p>In panel (c), while the number of neighbors is fixed (<it>N </it>= 5 in this example), increasing the number of samples will generally improve both <it>recall </it>and <it>precision</it>, therefore also <it>F-Score</it>. This result is expected since more observations usually lessen the 'curse of dimensionality', and agrees with the previously published results <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. The parameter <it>&#947; </it>in panel (d) is still far from the true value (<it>&#947; </it>= -1.22) as the number of samples increases.</p>
               <p>Figure <figr fid="F2">2</figr> presents the results when applying the <it>Asymmetric-N </it>algorithm to the 100-node network. Different from the <it>Symmetric-N</it>, the number of neighbors of the core nodes and the periphery nodes were set unequal. Note that there could be different combinations of <it>N</it><sub><it>C </it></sub>and <it>N</it><sub><it>P</it></sub>. The values reported in Figure <figr fid="F2">2</figr> are the ones that achieved the best results according to <it>F-score </it>and <it>&#947;</it>. However, the behavior of the algorithms is similar regardless of the choice of values for <it>N</it><sub><it>C </it></sub>and <it>N</it><sub><it>P</it></sub>.</p>
               <fig id="F2">
                  <title>
                     <p>Figure 2</p>
                  </title>
                  <caption>
                     <p>Results for the <it>Asymmetric-N </it>algorithm with the 100-node simulated network</p>
                  </caption>
                  <text>
                     <p>Results for the <it>Asymmetric-N </it>algorithm with the 100-node simulated network. Panels (a) and (b) show the results when <it>S </it>and <it>N</it><sub><it>P </it></sub>are fixed (<it>S </it>= 25, <it>N</it><sub><it>P </it></sub>= 2) while <it>N</it><sub><it>C </it></sub>is varying. Panels (c) and (d) show the results when <it>S </it>and <it>N</it><sub><it>C </it></sub>are fixed (<it>S </it>= 25, <it>N</it><sub><it>C </it></sub>= 91) while <it>N</it><sub><it>P </it></sub>is varying. Panels (e) and (f) shows the results when <it>N</it><sub><it>C </it></sub>and <it>N</it><sub><it>P </it></sub>are fixed (<it>N</it><sub><it>C </it></sub>= 91, <it>N</it><sub><it>P </it></sub>= 2) while <it>S </it>is varying.</p>
                  </text>
                  <graphic file="1471-2105-9-75-2"/>
               </fig>
               <p>In panel (a), we see the trends of <it>recall</it>, <it>precision </it>and <it>F-Score </it>when fixing the number of samples (<it>S </it>= 25) and the number of neighbors for the periphery nodes (<it>N</it><sub><it>P </it></sub>= 2) while varying the number of neighbors for the core nodes (<it>N</it><sub><it>C</it></sub>). All the three measurements increase as <it>N</it><sub><it>C </it></sub>increases, which implies that the inclusion of more neighbors for the core nodes generally improves the performance of the algorithm. Similarly, increasing the number of neighbors for the core nodes makes <it>&#947; </it>move toward -1.22 as observed in panel (b). Better results with larger <it>N</it><sub><it>C </it></sub>for core nodes are consistent with the fact that TFs usually regulate a large number of genes.</p>
               <p>In panel (c), <it>S </it>and <it>N</it><sub><it>C </it></sub>are fixed at 25 and 91 respectively, and <it>N</it><sub><it>P </it></sub>varies. The trends of the three curves show some different patterns compared with those in panel (a): <it>recall </it>increases while <it>precision </it>decreases and <it>F-Score </it>decreases very gently, which means that more false edges are included than true edges when increasing the number of neighbors <it>N</it><sub><it>P</it></sub>. Similarly, the structure of the network becomes drastically different from the underlying structure as the number of neighbors for the periphery nodes increases (<it>&#947; </it>deviated from -1.22 when <it>Nc </it>> 2 in panel (d)). Therefore, this implies periphery nodes should have very few neighbors. This phenomenon is consistent with the fact that non-TF nodes are usually regulated by a few TFs. Similarly, as observed for the <it>Symmetric-N </it>algorithm, when fixing <it>N</it><sub><it>C </it></sub>and <it>N</it><sub><it>P</it></sub>, the increase of <it>S </it>improves the performance of the algorithm for all the three criteria (panel (e)) and the structure of the reconstructed network becomes closer to that of the underlying network (panel (f)).</p>
               <p>The performance of <it>Symmetric-N </it>and <it>Asymmetric-N </it>can be compared by examining Figures <figr fid="F1">1</figr> and <figr fid="F2">2</figr>. When the number of samples is fixed (<it>S </it>= 25) while numbers of neighbors vary, comparing results in panel (a) of Figure <figr fid="F1">1</figr> with those in panels (a) and (c) of Figure <figr fid="F2">2</figr>, the <it>Asymmetric-N </it>algorithm performs much better than the <it>Symmetric-N </it>algorithm in terms of <it>F-Score</it>, when the number of neighbors for the core nodes is large and the number of neighbors for periphery nodes is small. It is also true for <it>&#947; </it>by comparing panel (b) of Figure <figr fid="F1">1</figr> with panels (b) and (d) of Figure <figr fid="F2">2</figr>. The same phenomenon is observed when numbers of neighbors are fixed while number of samples changes (comparing panel (c) in Figure <figr fid="F1">1</figr> with panel (e) in Figure <figr fid="F2">2</figr>, panel (d) in Figure <figr fid="F1">1</figr> with panel (f) in Figure <figr fid="F2">2</figr>, respectively). The reason is that in the <it>Symmetric-N </it>algorithm all the nodes are treated equally while in <it>Asymmetric-N </it>algorithm different types of nodes (core and periphery) are distinguished, which reflects biological expectations more closely. Thus the improved performance is expected. In summary, <it>Asymmetric-N </it>algorithm outperforms significantly the <it>Symmetric-N </it>algorithm proposed in <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>The 20-node network</p>
               </st>
               <p>We compared the proposed algorithm with some other methods currently used for the reconstruction of transcription regulatory networks. The experiments of Yu <it>et al</it>. <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> was selected because our simulated profiles were generated following their procedure, though our networks possess the scale-free property while no structure was assumed in theirs. They applied the BN method to 10 simulated small networks each with 20 nodes, with the number of samples ranging from 25 to 5,000. A <it>recall-imprecision </it>curve was used to show the performance when the number of samples increases (<it>imprecision </it>= 1 - <it>precision</it>). Here, a <it>recall-imprecision </it>curve for the <it>Asymmetric-N </it>algorithm is drawn for a 20-node network (Figure <figr fid="F3">3</figr>). The largest number of samples is 1,000 in our study. To better appreciate the performance, the <it>precision </it>curve (1 - <it>imprecision</it>) is shown as well.</p>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p><it>Recall, precision and imprecision </it>curves obtained with the <it>Asymmetric-N </it>algorithm for the 20-node simulated network when <it>N</it><sub><it>C </it></sub>and <it>N</it><sub><it>P </it></sub>are fixed (<it>N</it><sub><it>C </it></sub>= 17, <it>N</it><sub><it>P </it></sub>= 1) while <it>S </it>is varying</p>
                  </caption>
                  <text>
                     <p><it>Recall, precision and imprecision </it>curves obtained with the <it>Asymmetric-N </it>algorithm for the 20-node simulated network when <it>N</it><sub><it>C </it></sub>and <it>N</it><sub><it>P </it></sub>are fixed (<it>N</it><sub><it>C </it></sub>= 17, <it>N</it><sub><it>P </it></sub>= 1) while <it>S </it>is varying. The <it>imprecision is defined as </it>1 - <it>precision</it>.</p>
                  </text>
                  <graphic file="1471-2105-9-75-3"/>
               </fig>
               <p>It is not surprising that <it>recall </it>increases with the number of samples. <it>Imprecision</it>, however, increases first and then decreases. It is not clear why this happens and needs further investigation. Fixing at the sample size of <it>S </it>= 25, <it>F-Score </it>is 0.23, which is better than 0.16 (this number is inferred from Figure 4 in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>) obtained from the BN method <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. At larger sample sizes such as 500 and 1,000, the BN approach performs much better. This is reasonable because that the BN method is statistically rigorous and it benefits when more samples are available. However, when only limited samples are available, as is the case in most of the currently available microarray data, our approach may perform better.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Experiment with the yeast cell cycle microarray data</p>
            </st>
            <p>The results obtained from our <it>Asymmetric-N </it>algorithm with different choices of correlation matrices are summarized in Table <tblr tid="T1">1</tblr>. Results of several combinations of <it>N</it><sub><it>C </it></sub>and <it>N</it><sub><it>P </it></sub>are illustrated. The <it>&#947; </it>and <it>R</it><sup>2 </sup>values, e.g., <it>&#947; </it>= -0.96 and <it>R</it><sup>2 </sup>= 0.79, deviate far from their counterparts for the underlying network with <it>&#947; </it>= -1.22 and <it>R</it><sup>2 </sup>= 0.93. This is in contrast to the results that the structural parameters of the reconstructed network are close to their counterparts of the underlying network with the simulation data. The main reason for the inconsistency is that the underlying network in this real dataset is incomplete. The <it>&#947; </it>and <it>R</it><sup>2 </sup>values for these two networks, namely, the network formed with published interactions and the reconstructed network, might both deviate from those of the real gene interaction network at work in the yeast cell cycle, for which our understanding is still incomplete.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Results of <it>Asymmetric-N </it>on the 102-gene dataset</p>
               </caption>
               <tblbdy cols="9">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>N<sub>C</sub></b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>N<sub>P</sub></b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>#Edges</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>#Published</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>%Published</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>%GO BP</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>&#947;</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>R</it>
                              <sup>2</sup>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>P-P PCC: no time lag</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>101</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>142</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>14.08</p>
                     </c>
                     <c ca="center">
                        <p>38.03</p>
                     </c>
                     <c ca="center">
                        <p>-0.69</p>
                     </c>
                     <c ca="center">
                        <p>0.45</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>P-P PCC: one time lag</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>101</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>185</p>
                     </c>
                     <c ca="center">
                        <p>31</p>
                     </c>
                     <c ca="center">
                        <p>16.76</p>
                     </c>
                     <c ca="center">
                        <p>30.81</p>
                     </c>
                     <c ca="center">
                        <p>-0.96</p>
                     </c>
                     <c ca="center">
                        <p>0.79</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>S-S PCC: no time lag</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>91</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>60</p>
                     </c>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>28.33</p>
                     </c>
                     <c ca="center">
                        <p>45.83</p>
                     </c>
                     <c ca="center">
                        <p>-1.27</p>
                     </c>
                     <c ca="center">
                        <p>0.65</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>S-S PCC: one time lag</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>91</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>155</p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                     <c ca="center">
                        <p>18.71</p>
                     </c>
                     <c ca="center">
                        <p>27.74</p>
                     </c>
                     <c ca="center">
                        <p>-0.89</p>
                     </c>
                     <c ca="center">
                        <p>0.70</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>P-P PCC means point to point (total 18 points) Pearson correlation coefficient between two time series profiles.</p>
                  <p>S-S PCC means segment to segment (total 17 segments) Pearson correlation coefficient and the segment (say <it>i</it>) value is +1 (-1) if the value at point <it>i </it>is less (greater) than that at point <it>i </it>+ 1 [53].</p>
                  <p>Time lag means when aligning the two gene profiles, one of them needs to be shifted relative to the other.</p>
                  <p>#Edges means the total interactions reconstructed;</p>
                  <p>#Published means the reconstructed interactions that were previously published;</p>
                  <p>%Published is the percentage of the published interactions among all the reconstructed interactions;</p>
                  <p>%GO BP means the percentage of the reconstructed interactions whose genes or gene products pair share a common Gene Ontology (GO) Biological Process (BP) annotation from the SGD GO Slim mapper [46];</p>
                  <p><it>&#947; </it>and <it>R</it><sup>2 </sup>are the power in <it>P</it>(<it>k</it>) ~ <it>k</it><sup><it>&#947; </it></sup>and coefficient of determination returned by the <it>fit</it>() function, respectively (see 'Results' &#8211; 'Computation Study' section for more details).</p>
               </tblfn>
            </tbl>
            <p>As the gold standard or 'true' network is unknown or largely incomplete for this real microarray expression dataset, using criteria such as <it>recall </it>and <it>precision </it>to evaluate the performance of the reconstruction algorithms is inappropriate and likely to be misleading. There is an emerging tendency recently to take biological context into consideration when dealing with functional genomic data <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>. By incorporating biological context information into the data integration process and the network recovery procedure, Myers <it>et al</it>. <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> demonstrated that the utilization of such an important source yielded dramatic benefit comparing with their earlier work which only used prior knowledge of gene function but did not particularly exploit biological context. In general, most experiments are designed with the goal of investigating a particular biological process in mind <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. Consequently, it is both necessary and important to inspect the related biological process information when checking the validity of the predicted interactions in a network, especially for situations where gold standard is not available or incomplete. In this study, for the biological relevance of the predicted edges, we report the percentage of edges whose nodes or genes share a common GO BP annotation from the SGD GO Slim mapper <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. In general, the probability for two genes or gene products to interact with each other is high if they belong to the same biological process.</p>
            <p>For the interactions found by our algorithm, when using the point-to-point Pearson correlation coefficient (P-P PCC) between two time series of gene profiles of 18 time points as the value in the correlation matrix with <it>N</it><sub><it>C </it></sub>= 101 and <it>N</it><sub><it>P </it></sub>= 4, 38% of which are found to share the same GO BP annotation when no time lag is used; while 31% are found to share the same GO BP annotation when one time lag is used with <it>N</it><sub><it>C </it></sub>= 101 and <it>N</it><sub><it>P </it></sub>= 16. When using the segment-to-segment Pearson correlation coefficient (S-S PCC) over 17 segments, the results are 28% with <it>N</it><sub><it>C </it></sub>= 91 and <it>N</it><sub><it>P </it></sub>= 11 and 46% with <it>N</it><sub><it>C </it></sub>= 91 and <it>N</it><sub><it>P </it></sub>= 1 for with and without time lag, respectively (see Table <tblr tid="T1">1</tblr>). This percentage (46%) is comparable to the result (45%) on the same dataset in <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> by using the PCC. Of all the interactions in the network constructed in <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, 3.5% are published interactions in comparison with those (14.08% &#8211; 28.33%) in the current study.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>In our proposed algorithm, it is required to specify whether a node is a core node or a periphery node. In case of reconstruction of genetic regulatory network, it is not hard to identify transcription regulators from biological knowledge, therefore the core nodes. Consequently, the core and the periphery nodes can be always specified for a set of genes whose networks are to be reconstructed by the proposed algorithm.</p>
         <p>We have also mentioned that in our current study the edges in the recovered networks are directionless, i.e., interaction between a pair of nodes is indicated without specifying which node is the source of influence. When more accurate information is needed, the directions of the edges have to be considered. The direction between core and periphery nodes can be always assigned as from the core node to the periphery node since transcription factors always regulate target genes. Several other possible ways to assign the directions for the connections between core and core nodes or periphery and periphery nodes can be considered:</p>
         <p>a) compare the rank of node <it>i </it>with respect to node <it>j </it>and the rank of node <it>j </it>with respect to node <it>i</it>. Assign the direction of the connection as from the higher ranked node to the lower ranked node. Generally, regulators tend to have more connections and targets tend to have fewer connections. Thus the rank of a regulator with respect to a target tends to be high while the rank of a target with respect to a regulator tends to be low. When there is a tie, a random direction is assigned.</p>
         <p>b) for the time-lagged computation, always assign the direction of the connection as from the node without time lag to the node with time lag. This is in accordance with the fact that the expression level of a regulator changes before it can influence its target.</p>
         <p>At the same time, we are seeking even better and more efficient ways to improve this method such as specifying neighbor size for each node according to biological knowledge. It is also noted that although the proposed algorithm demonstrated improved performance over the previous one for simulated networks with underlying scale-free property, our algorithm does not directly use any information on the node degree distribution. Therefore, we expect that this algorithm can be applied to the construction of biological networks that are not random.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>A modification of the current algorithm for the scale-free network construction has been proposed and evaluated with two different simulated gene expression datasets and one microarray gene expression dataset. The proposed algorithm performs much better than the original one in recovering the underlying true networks. Compared with previously published experiments using Bayesian Network approaches, our algorithm shows its advantages when the number of samples is small relative to the number of genes, as is the case for most actual biological microarray experiments. The proposed algorithm is expected to be used in reconstruction of biological networks that have underlying scale-free topologies. Besides, as the original algorithm was recently successfully used in gene expression data clustering analysis <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, our improved algorithm hopefully can be incorporated into such clustering algorithm frameworks to derive better clustering results.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Datasets</p>
            </st>
            <sec>
               <st>
                  <p>Simulated gene expression data</p>
               </st>
               <p>The underlying scale-free network is a 100-node network constructed by selecting initially 10 core nodes in the network. The connections are made between pairs of these 10 nodes with a pre-specified probability. Either direction for the connection is equally likely. Thus an initial small random network is formed. Then the remaining 90 periphery nodes are added into the network. The nodes to be connected in the existing network with the new coming node are selected preferentially, that is, nodes with higher degree of connectivity will be more likely to be chosen to link to the newly added node. The directions of new connections are more likely (by setting a pre-specified probability) to be from core nodes to periphery nodes. Due to the randomness of the procedure a node might not be connected to any other node in the final network. In the 100-node network, 79 nodes form a large connected component and the others are isolated from this main subnetwork, the number of edges is 182, and the <it>&#947; </it>in the node distribution function (<it>P</it>(<it>k</it>) ~ <it>k</it><sup><it>&#947;</it></sup>) is approximately -1.27 for in-degree, -1.61 for out-degree, and -1.22 for mixed-degree with the Coefficient of Determination <it>R</it><sup>2 </sup>about 0.96, 0.97 and 0.92, respectively. The network thus can be considered as scale-free. Here, <it>&#947; </it>and <it>R</it><sup>2 </sup>are computed with the <it>fit</it>() function in Matlab. The 100-node simulated network and its degree distributions are illustrated in Figure <figr fid="F4">4</figr>. The 20-node simulated network was constructed in a similar fashion.</p>
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>The 100-node simulated network and its node degree distributions</p>
                  </caption>
                  <text>
                     <p>The 100-node simulated network and its node degree distributions. Core nodes are the 10 nodes that form the initial network. Periphery nodes are the remaining nodes that are (preferentially) attached (see 'Methods' &#8211; 'Dataset' section for more details).</p>
                  </text>
                  <graphic file="1471-2105-9-75-4"/>
               </fig>
               <p>With this fixed network topology, the simulated gene profiles are generated following a two-step procedure described in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. First, values at each time step are updated by a simple stochastic process:</p>
               <p>
                  <display-formula><it>Y</it><sub><it>t</it>+1 </sub>= <it>Y</it><sub><it>t </it></sub>+ <it>A</it>(<it>Y</it><sub><it>t </it></sub>- <it>T</it>) + <it>E</it></display-formula>
               </p>
               <p>where <it>Y</it><sub><it>t </it></sub>is a vector representing the expression levels of all genes at time <it>t</it>, the matrix <it>A </it>represents the regulatory interactions in the simulated network, the vector <it>T </it>represents constitutive expression values for each gene, and the vector <it>E </it>models the intrinsic biological noise. Second, expression levels are restricted by a floor and ceiling function to range from 0 to 100 (arbitrary units). Expression levels are initialized randomly with values uniformly sampled from this range <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. By calculating the Pearson correlation coefficients between pairs of these profiles, the correlation matrix is derived. Since the correlation coefficients will be considered in the proposed method, the actual magnitude of the gene expression chosen in the simulated profiles is not essential.</p>
            </sec>
            <sec>
               <st>
                  <p>Microarray gene expression data</p>
               </st>
               <p>The time course profiles for a set of 102 genes are selected from the widely used yeast, <it>Saccharomyces cerevisiae</it>, cell cycle microarray data <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. These microarray experiments were designed to create a comprehensive list of yeast genes whose transcription levels were expressed periodically within the cell cycle. The gene expressions of cell cycle synchronized yeast cultures were collected over 18 time points taken in 7-minute intervals. This time series covers more than two complete cycles of cell division. The 102-gene set includes 9 known transcription regulators and their possible regulation targets <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. It is highly enriched for known interacting genes involved in the <it>Saccharomyces </it>cell cycle. The true edges of the underlying network were provided by the database of Pathway Studio <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, which is based on information derived from PubMed abstracts using natural language search algorithms. If there is confirmative report that gene A and gene B interact with each other, a true edge is then assigned between the pair of genes. For this 102-gene regulatory network, <it>&#947; </it>for in-degree is -0.979 with <it>R</it><sup>2 </sup>= 0.9, <it>&#947; </it>for out-degree is -0.948 with <it>R</it><sup>2 </sup>= 0.44, and <it>&#947; </it>for mixed-degree is -1.22 with <it>R</it><sup>2 </sup>= 0.93. It appears that the distribution for the mixed-degree fits better with the power law distribution. The network and its degree distributions are shown in Figure <figr fid="F5">5</figr>.</p>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p>The 102-gene network and its node degree distributions</p>
                  </caption>
                  <text>
                     <p>The 102-gene network and its node degree distributions. Core nodes are the 9 transcription factors. Periphery nodes are the remaining non-transcription factors. The edges are obtained from Pathway Studio [45] (see 'Methods' &#8211; 'Dataset' section for more details).</p>
                  </text>
                  <graphic file="1471-2105-9-75-5"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Algorithm <it>Symmetric-N</it></p>
               </st>
               <p>This algorithm was proposed in <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. It is presented here for the sake of completeness.</p>
               <p><it>ConstructedNet </it>= <it>Symmetric-N</it>(<it>NumNodes</it>, <it>N</it>, <it>CorrelationMatrix</it>)</p>
               <p>Step 1: for <it>i </it>= 1 to <it>NumNodes</it></p>
               <p>&#160;&#160;&#160;<it>SortedNeighbor </it>[<it>i</it>, 1:<it>NumNodes </it>- 1] = <it>mySort</it>(<it>i, CorrelationMatrix</it>);</p>
               <p>Step 2: for <it>i </it>= 2 to <it>NumNodes</it></p>
               <p>&#160;&#160;&#160;for <it>j </it>= 1 to <it>i </it>- 1</p>
               <p>&#160;&#160;&#160;&#160;&#160;&#160;if (<it>j </it>is in <it>SortedNeighbor </it>[<it>i</it>, 1:<it>N</it>] and <it>i </it>is in <it>SortedNeighbor </it>[<it>j</it>, 1:<it>N</it>])</p>
               <p>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;<it>ConstructedNet </it>[<it>i, j</it>] = <it>ConstructedNet </it>[<it>j, i</it>] = 1;</p>
               <p>&#160;&#160;&#160;&#160;&#160;&#160;otherwise</p>
               <p>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;<it>ConstructedNet </it>[<it>i, j</it>] = <it>ConstructedNet </it>[<it>j, i</it>] = 0;</p>
               <p>Here <it>NumNodes </it>represents the total number of nodes in the network; <it>N </it>the pre-specified number of neighbors; and <it>CorrelationMatrix </it>the pre-computed absolute values of the correlation coefficients for all pairs of nodes. The function <it>mySort</it>() returns the other nodes in the sorted order in terms of their 'closeness' or correlation with the selected node.</p>
            </sec>
            <sec>
               <st>
                  <p>Algorithm <it>Asymmetric-N</it></p>
               </st>
               <p><it>ConstructedNet </it>= <it>Asymmetric-N</it>(<it>NumNodes, N</it><sub><it>C</it></sub>, <it>N</it><sub><it>P</it></sub>, <it>CorrelationMatrix</it>)</p>
               <p>Step 1: for <it>i </it>= 1 to <it>NumNodes</it></p>
               <p>&#160;&#160;&#160;<it>SortedNeighbor </it>[<it>i</it>, 1:<it>NumNodes </it>- 1] = <it>mySort</it>(<it>i, CorrelationMatrix</it>);</p>
               <p>&#160;&#160;&#160;if (<it>i </it>is a core node) <it>N</it><sub><it>i </it></sub>= <it>N</it><sub><it>C</it></sub>; otherwise <it>N</it><sub><it>i </it></sub>= <it>N</it><sub><it>P</it></sub>;</p>
               <p>Step 2: for <it>i </it>= 2 to <it>NumNodes</it></p>
               <p>&#160;&#160;&#160;for <it>j </it>= 1 to <it>i </it>- 1</p>
               <p>&#160;&#160;&#160;if (<it>j </it>is in <it>SortedNeighbor </it>[<it>i</it>, 1:<it>N</it><sub><it>i</it></sub>] and <it>i </it>is in <it>SortedNeighbor </it>[<it>j</it>, 1:<it>N</it><sub><it>j</it></sub>])</p>
               <p>&#160;&#160;&#160;&#160;&#160;&#160;<it>ConstructedNet </it>[<it>i, j</it>] = <it>ConstructedNet </it>[<it>j, i</it>] = 1;</p>
               <p>&#160;&#160;&#160;Otherwise</p>
               <p>&#160;&#160;&#160;&#160;&#160;&#160;<it>ConstructedNet </it>[<it>i, j</it>] = <it>ConstructedNet </it>[<it>j, i</it>] = 0;</p>
            </sec>
            <sec>
               <st>
                  <p><it>fit</it>() function in Matlab</p>
               </st>
               <p><it>fit</it>() function <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> fits data to model, especially for (non-linear) curve fitting. It was used to fit the data points (dots in Figures <figr fid="F4">4</figr> and <figr fid="F5">5</figr>) to some power law distributed model (<it>P</it>(<it>k</it>) ~ <it>k</it><sup><it>&#947;</it></sup>). The returns of the function include <it>&#947; </it>and <it>R</it><sup>2 </sup>for the best fit it finds. We used <it>fit</it>(xdata, ydata, 'power1') in which 'power1' is defined as <it>y </it>= <it>a</it>*<it>x</it><sup><it>b</it></sup>. More details on the function can be found in Additional files <supplr sid="S1">1</supplr>.</p>
               <suppl id="S1">
                  <title>
                     <p>Additional file 1</p>
                  </title>
                  <text>
                     <p>Matlab <it>fit</it>() function. The file provides detail information on the usage and the algorithms used for this function.</p>
                  </text>
                  <file name="1471-2105-9-75-S1.doc">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>The main framework was formed by GC and YD. GC implemented the algorithm. PL and EA participated in the computation. YD supervised overall project. All authors have read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Dr. Min Zou and Dr. Jing Yu for kindly sharing their source code and data <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B33">33</abbr></abbrgrp>.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Reveal, a general reverse engineering algorithm for inference of genetic network architectures</p>
            </title>
            <aug>
               <au>
                  <snm>Liang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fuhrman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Somogyi</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Pac Symp Biocomput</source>
            <pubdate>1998</pubdate>
            <fpage>18</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9697168</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Modeling gene expression data using dynamic Bayesian networks</p>
            </title>
            <aug>
               <au>
                  <snm>Murphy</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Mian</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Technical report</source>
            <publisher>University of California at Berkeley. Berkeley, CA </publisher>
            <pubdate>1999</pubdate>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Identification of genetic networks from a small number of gene expression patterns under the Boolean network model</p>
            </title>
            <aug>
               <au>
                  <snm>Akutsu</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Miyano</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kuhara</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Pac Symp Biocomput</source>
            <pubdate>1999</pubdate>
            <fpage>17</fpage>
            <lpage>28</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10380182</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Genetic network inference: from co-expression clustering to reverse engineering</p>
            </title>
            <aug>
               <au>
                  <snm>D'Haeseleer</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Somogyi</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>8</issue>
            <fpage>707</fpage>
            <lpage>726</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11099257</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Using Bayesian networks to analyze expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Friedman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Linial</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nachman</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pe'er</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2000</pubdate>
            <volume>7</volume>
            <fpage>601</fpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11108481</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Discovery of regulatory interactions through perturbation: inference and experimental design</p>
            </title>
            <aug>
               <au>
                  <snm>Ideker</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Thorsson</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Karp</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Pac Symp Biocomput</source>
            <pubdate>2000</pubdate>
            <fpage>305</fpage>
            <lpage>316</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10902179</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Inferring subnetworks from perturbed expression profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Pe'er</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Regev</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Elidan</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Friedman</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17 Suppl 1</volume>
            <fpage>S215</fpage>
            <lpage>24</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11473012</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Estimation of genetic networks and functional structures between genes by using Bayesian networks and nonparametric regression</p>
            </title>
            <aug>
               <au>
                  <snm>Imoto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Miyano</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Pac Symp Biocomput</source>
            <pubdate>2002</pubdate>
            <fpage>175</fpage>
            <lpage>186</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11928473</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Advances to Bayesian network inference for generating causal networks from observational biological data</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>PP</fnm>
               </au>
               <au>
                  <snm>Hartemink</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Jarvis</snm>
                  <fnm>ED</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>18</issue>
            <fpage>3594</fpage>
            <lpage>3603</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15284094</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Informative structure priors: joint learning of dynamic regulatory networks from multiple types of data</p>
            </title>
            <aug>
               <au>
                  <snm>Bernard</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hartemink</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Pac Symp Biocomput</source>
            <pubdate>2005</pubdate>
            <fpage>459</fpage>
            <lpage>470</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15759651</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Inferring gene regulatory networks from time-ordered gene expression data of Bacillus subtilis using differential equations</p>
            </title>
            <aug>
               <au>
                  <snm>de Hoon</snm>
                  <fnm>MJL</fnm>
               </au>
               <au>
                  <snm>Imoto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kobayashi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ogasawara</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Miyano</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Pac Symp Biocomput</source>
            <pubdate>2003</pubdate>
            <volume>8</volume>
            <fpage>17&#8211;28</fpage>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Modeling gene expression with differential equations</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>HL</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Pacific Symposium on Biocomputing</source>
            <pubdate>1999</pubdate>
            <volume>4</volume>
            <fpage>29</fpage>
            <lpage>40</lpage>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Inference of S-system models of genetic networks using a cooperative coevolutionary algorithm</p>
            </title>
            <aug>
               <au>
                  <snm>Kimura</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ide</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kashihara</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kano</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hatakeyama</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Masui</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nakagawa</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Yokoyama</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kuramitsu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Konagaya</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>7</issue>
            <fpage>1154</fpage>
            <lpage>1163</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15514004</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Chemogenomic profiling on a genome-wide scale using reverse-engineered gene networks</p>
            </title>
            <aug>
               <au>
                  <snm>di Bernardo</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Gardner</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Chobot</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Eastwood</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Wojtovich</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Elliott</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Schaus</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Nat Biotech</source>
            <pubdate>2005</pubdate>
            <volume>23</volume>
            <issue>3</issue>
            <fpage>377</fpage>
         </bibl>
         <bibl id="B15">
            <title>
               <p>A stochastic differential equation model for quantifying transcriptional regulatory network in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>TY</fnm>
               </au>
               <au>
                  <snm>Tseng</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>CYF</fnm>
               </au>
               <au>
                  <snm>Kao</snm>
                  <fnm>CY</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>12</issue>
            <fpage>2883</fpage>
            <lpage>2890</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15802287</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Reverse engineering gene networks using singular value decomposition and robust regression</p>
            </title>
            <aug>
               <au>
                  <snm>Yeung</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Tegner</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>6163</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">122920</pubid>
                  <pubid idtype="pmpid" link="fulltext">11983907</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>A systematic approach to reconstructing transcription networks in Saccharomycescerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <issue>26</issue>
            <fpage>16893</fpage>
            <lpage>16898</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">139240</pubid>
                  <pubid idtype="pmpid" link="fulltext">12482955</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>A Gene-Coexpression Network for Global Discovery of Conserved Genetic Modules</p>
            </title>
            <aug>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Koller</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <issue>5643</issue>
            <fpage>249</fpage>
            <lpage>255</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12934013</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Shapira</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Regev</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pe'er</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Koller</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Friedman</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2003</pubdate>
            <volume>34</volume>
            <issue>2</issue>
            <fpage>166</fpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12740579</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Network component analysis: Reconstruction of regulatory signals in biological systems</p>
            </title>
            <aug>
               <au>
                  <snm>Liao</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Boscolo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>YL</fnm>
               </au>
               <au>
                  <snm>Tran</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Sabatti</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Roychowdhury</snm>
                  <fnm>VP</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <issue>26</issue>
            <fpage>15522</fpage>
            <lpage>15527</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid>14673099</pubid>
                  <pubid idtype="pmpid" link="fulltext">14673099</pubid>
                  <pubid idtype="pmcid">307600</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Inferring genetic networks and identifying compound mode of action via expression profiling</p>
            </title>
            <aug>
               <au>
                  <snm>Gardner</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>di Bernardo</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lorenz</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>301</volume>
            <fpage>102</fpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12843395</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>A causal inference approach for constructing transcriptional regulatory networks</p>
            </title>
            <aug>
               <au>
                  <snm>Xing</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>van der Laan</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>21</issue>
            <fpage>4007</fpage>
            <lpage>4013</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16131521</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>A Statistical Method for Constructing Transcriptional Regulatory Networks Using Gene Expression and Sequence Data</p>
            </title>
            <aug>
               <au>
                  <snm>Xing</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>van der Laan</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Journal of Computational Biology</source>
            <pubdate>2005</pubdate>
            <volume>12</volume>
            <issue>2</issue>
            <fpage>229</fpage>
            <lpage>246</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15767778</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Inference of transcriptional regulatory network by two-stage constrained space factor analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>KC</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>21</issue>
            <fpage>4033</fpage>
            <lpage>4038</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16144806</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Reconstructing gene regulatory networks from time-series microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Tseng</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>SC</fnm>
               </au>
            </aug>
            <source>Physica A: Statistical and Theoretical Physics</source>
            <pubdate>2005</pubdate>
            <volume>350</volume>
            <issue>1</issue>
            <fpage>63</fpage>
         </bibl>
         <bibl id="B26">
            <title>
               <p>A probabilistic dynamical model for quantitative inference of the regulatory mechanism of transcription</p>
            </title>
            <aug>
               <au>
                  <snm>Sanguinetti</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Rattray</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>ND</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>14</issue>
            <fpage>1753</fpage>
            <lpage>1759</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16632490</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Faith</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Hayete</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Thaden</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Mogno</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Wierzbowski</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cottarel</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kasif</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Gardner</snm>
                  <fnm>TS</fnm>
               </au>
            </aug>
            <source>PLoS Biology</source>
            <pubdate>2007</pubdate>
            <volume>5</volume>
            <issue>1</issue>
            <fpage>e8</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1764438</pubid>
                  <pubid idtype="pmpid" link="fulltext">17214507</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Modeling and simulation of genetic regulatory systems: a literature review</p>
            </title>
            <aug>
               <au>
                  <snm>de Jong</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2002</pubdate>
            <volume>9</volume>
            <issue>1</issue>
            <fpage>67</fpage>
            <lpage>103</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11911796</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Combining location and expression data for principled discovery of genetic regulatory network models</p>
            </title>
            <aug>
               <au>
                  <snm>Hartemink</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Gifford</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Jaakkola</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Pac Symp Biocomput</source>
            <pubdate>2002</pubdate>
            <fpage>437</fpage>
            <lpage>449</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11928497</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Estimating gene networks from gene expression data by combining Bayesian network model with promoter element detection</p>
            </title>
            <aug>
               <au>
                  <snm>Tamada</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bannai</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Imoto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tashiro</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kuhara</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Miyano</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19 Suppl 2</volume>
            <fpage>II227</fpage>
            <lpage>II236</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14534194</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Physical Network Models</p>
            </title>
            <aug>
               <au>
                  <snm>Yeang</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Jaakkola</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Journal of Computational Biology</source>
            <pubdate>2004</pubdate>
            <volume>11</volume>
            <issue>2-3</issue>
            <fpage>243</fpage>
            <lpage>262</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15285891</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Using prior knowledge to improve genetic network reconstruction from microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Le Phillip</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bahl</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Unga</snm>
                  <fnm>LH</fnm>
               </au>
            </aug>
            <source>In Silico Biology</source>
            <pubdate>2004</pubdate>
            <volume>4</volume>
            <fpage>335</fpage>
            <lpage>353</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid>15724284</pubid>
                  <pubid idtype="pmpid" link="fulltext">15724284</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Zou</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Conzen</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>1</issue>
            <fpage>71</fpage>
            <lpage>79</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15308537</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Modularized learning of genetic interaction networks from biological annotations and mRNA expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>PH</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>2739</fpage>
            <lpage>2747</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid>15797909</pubid>
                  <pubid idtype="pmpid" link="fulltext">15797909</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge</p>
            </title>
            <aug>
               <au>
                  <snm>Geier</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Timmer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fleck</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>BMC Systems Biology</source>
            <pubdate>2007</pubdate>
            <volume>1</volume>
            <issue>1</issue>
            <fpage>11</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1839889</pubid>
                  <pubid idtype="pmpid" link="fulltext">17408501</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Statistical mechanics of complex networks</p>
            </title>
            <aug>
               <au>
                  <snm>Albert</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Barab&#225;si</snm>
                  <fnm>AL</fnm>
               </au>
            </aug>
            <source>Reviews of Modern Physics</source>
            <publisher> American Physical Society</publisher>
            <pubdate>2002</pubdate>
            <volume>74</volume>
            <issue>1</issue>
            <fpage>47</fpage>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The topology of the transcription regulatory network in the yeast, Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Farkas</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Jeong</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Vicsek</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Barabasi</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
            </aug>
            <source>Physica A</source>
            <pubdate>2003</pubdate>
            <volume>318</volume>
            <issue>3-4</issue>
            <fpage>601</fpage>
            <lpage>612</lpage>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Network biology: understanding the cell's functional organization</p>
            </title>
            <aug>
               <au>
                  <snm>Barabasi</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
            </aug>
            <source>Nature Reviews Genetics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>2</issue>
            <fpage>101</fpage>
            <lpage>113</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14735121</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Inference of scale-free networks from gene expression time series</p>
            </title>
            <aug>
               <au>
                  <snm>Daisuke</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Horton</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>J Bioinform Comput Biol.</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <fpage>503</fpage>
            <lpage>514</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid>16819798</pubid>
                  <pubid idtype="pmpid" link="fulltext">16819798</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Scale-free networks from a Hamiltonian dynamics</p>
            </title>
            <aug>
               <au>
                  <snm>Baiesi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Manna</snm>
                  <fnm>SS</fnm>
               </au>
            </aug>
            <source>Physical Review E</source>
            <publisher> American Physical Society</publisher>
            <pubdate>2003</pubdate>
            <volume>68</volume>
            <issue>4</issue>
            <fpage>47103</fpage>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Copying nodes versus editing links: the source of the difference between genetic regulatory networks and the WWW</p>
            </title>
            <aug>
               <au>
                  <snm>Louzoun</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Muchnik</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Solomon</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>5</issue>
            <fpage>581</fpage>
            <lpage>588</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16403796</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Extreme Self-Organization in Networks Constructed from Gene Expression Data</p>
            </title>
            <aug>
               <au>
                  <snm>Agrawal</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Physical Review Letters</source>
            <publisher> American Physical Society</publisher>
            <pubdate>2002</pubdate>
            <volume>89</volume>
            <issue>26</issue>
            <fpage>268702</fpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12484863</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Nearest Neighbor Networks: clustering expression data based on gene neighborhoods</p>
            </title>
            <aug>
               <au>
                  <snm>Huttenhower</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Flamholz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Landis</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sahi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Olszewski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hibbs</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Siemers</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Troyanskaya</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Coller</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <issue>1</issue>
            <fpage>250</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1941745</pubid>
                  <pubid idtype="pmpid" link="fulltext">17626636</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization</p>
            </title>
            <aug>
               <au>
                  <snm>Spellman</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>MQ</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>VR</fnm>
               </au>
               <au>
                  <snm>Anders</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Futcher</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Mol Biol Cell</source>
            <pubdate>1998</pubdate>
            <volume>9</volume>
            <issue>12</issue>
            <fpage>3273</fpage>
            <lpage>3297</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">25624</pubid>
                  <pubid idtype="pmpid" link="fulltext">9843569</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Pathway studio--the analysis and navigation of molecular networks</p>
            </title>
            <aug>
               <au>
                  <snm>Nikitin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Egorov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Daraselia</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Mazo</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>16</issue>
            <fpage>2155</fpage>
            <lpage>2157</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14594725</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>GO Slim Mapper</p>
            </title>
            <url>http://db.yeastgenome.org/cgi-bin/GO/goTermMapper</url>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Discovery of biological networks from diverse functional genomic data</p>
            </title>
            <aug>
               <au>
                  <snm>Myers</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Robson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wible</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hibbs</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chiriac</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Theesfeld</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Troyanskaya</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>13</issue>
            <fpage>R114</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1414113</pubid>
                  <pubid idtype="pmpid" link="fulltext">16420673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Finding function: evaluation methods for functional genomic data</p>
            </title>
            <aug>
               <au>
                  <snm>Myers</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hibbs</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Huttenhower</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Troyanskaya</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>1</issue>
            <fpage>187</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1560386</pubid>
                  <pubid idtype="pmpid" link="fulltext">16869964</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Context-sensitive data integration and prediction of biological networks</p>
            </title>
            <aug>
               <au>
                  <snm>Myers</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Troyanskaya</snm>
                  <fnm>OG</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <issue>17</issue>
            <fpage>2322</fpage>
            <lpage>2330</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17599939</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Probabilistic Discovery of Overlapping Cellular Processes and Their Regulation</p>
            </title>
            <aug>
               <au>
                  <snm>Battle</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Koller</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Journal of Computational Biology</source>
            <pubdate>2005</pubdate>
            <volume>12</volume>
            <issue>7</issue>
            <fpage>909</fpage>
            <lpage>927</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16201912</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>A statistical method to incorporate biological knowledge for generating testable novel gene regulatory interactions from microarray experiments </p>
            </title>
            <aug>
               <au>
                  <snm>Larsen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Almasri</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>317</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2082045</pubid>
                  <pubid idtype="pmpid" link="fulltext">17727721</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Matlab fit() function  </p>
            </title>
            <url>http://www.mathworks.com/access/helpdesk/help/toolbox/curvefit/fit.html</url>
         </bibl>
         <bibl id="B53">
            <title>
               <p>In search of functional association from time-series microarray data based on the change trend and level of gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>He</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>AP</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>1</issue>
            <fpage>69</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1435774</pubid>
                  <pubid idtype="pmpid" link="fulltext">16478547</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
