<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1752-0509-3-67</ui>
   <ji>1752-0509</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>BowTieBuilder: modeling signal transduction pathways</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Supper</snm>
               <fnm>Jochen</fnm>
               <insr iid="I1"/>
               <email>jochen@supper.de</email>
            </au>
            <au id="A2">
               <snm>Spangenberg</snm>
               <fnm>Luc&#237;a</fnm>
               <insr iid="I1"/>
               <email>spangenb@informatik.uni-tuebingen.de</email>
            </au>
            <au id="A3">
               <snm>Planatscher</snm>
               <fnm>Hannes</fnm>
               <insr iid="I1"/>
               <email>hannes.planatscher@uni-tuebingen.de</email>
            </au>
            <au id="A4">
               <snm>Dr&#228;ger</snm>
               <fnm>Andreas</fnm>
               <insr iid="I1"/>
               <email>andreas.draeger@uni-tuebingen.de</email>
            </au>
            <au id="A5">
               <snm>Schr&#246;der</snm>
               <fnm>Adrian</fnm>
               <insr iid="I1"/>
               <email>adrian.schroeder@uni-tuebingen.de</email>
            </au>
            <au id="A6">
               <snm>Zell</snm>
               <fnm>Andreas</fnm>
               <insr iid="I1"/>
               <email>andreas.zell@uni-tuebingen.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Center for Bioinformatics T&#252;bingen (ZBIT), University of T&#252;bingen, Sand 1, 72076 T&#252;bingen, Germany</p>
            </ins>
         </insg>
         <source>BMC Systems Biology</source>
         <issn>1752-0509</issn>
         <pubdate>2009</pubdate>
         <volume>3</volume>
         <issue>1</issue>
         <fpage>67</fpage>
         <url>http://www.biomedcentral.com/1752-0509/3/67</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19566957</pubid>
               <pubid idtype="doi">10.1186/1752-0509-3-67</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>28</day>
               <month>11</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>30</day>
               <month>6</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>30</day>
               <month>6</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Supper et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Sensory proteins react to changing environmental conditions by transducing signals into the cell. These signals are integrated into core proteins that activate downstream target proteins such as transcription factors (TFs). This structure is referred to as a bow tie, and allows cells to respond appropriately to complex environmental conditions. Understanding this cellular processing of information, from sensory proteins (e.g., cell-surface proteins) to target proteins (e.g., TFs) is important, yet for many processes the signaling pathways remain unknown.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Here, we present BowTieBuilder for inferring signal transduction pathways from multiple source and target proteins. Given protein-protein interaction (PPI) data signaling pathways are assembled without knowledge of the intermediate signaling proteins while maximizing the overall probability of the pathway. To assess the inference quality, BowTieBuilder and three alternative heuristics are applied to several pathways, and the resulting pathways are compared to reference pathways taken from KEGG. In addition, BowTieBuilder is used to infer a signaling pathway of the innate immune response in humans and a signaling pathway that potentially regulates an underlying gene regulatory network.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>We show that BowTieBuilder, given multiple source and/or target proteins, infers pathways with satisfactory recall and precision rates and detects the core proteins of each pathway.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Most signal transduction events are initialized by cell-surface proteins that respond to specific environmental stimuli. When activated these proteins emanate a signaling cascade which involves a series of (de)-phosphorylation events. In many cases such signaling events transduce the signal to transcription factors (TFs), which in turn regulate the expression level of downstream genes. Understanding this cellular processing of information, from the source proteins (e.g., cell-surface proteins) to the target proteins (e.g., TFs), is important when generating comprehensive models of regulatory networks. For several biological processes the signaling pathway has been derived experimentally <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. However, a large number of complex signaling pathways are yet to be discovered. To unravel these, computational inference methods are a valuable tool.</p>
         <p>The basis for the computational inference of novel signaling pathways are protein-protein interaction (PPI) datasets. These datasets are derived from biological studies on individual PPIs, but recently also by large-scale genomic, proteomic, and bioinformatic analyses. The yeast-two hybrid method, for instance, was a major driving force in this development <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. These technological advances in measuring and predicting PPIs have fueled numerous databases <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>.</p>
         <p>Based on such PPI datasets several methods have been developed for inferring signal transduction pathways <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. Some of these methods combine PPI with gene expression datasets <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B13">13</abbr></abbrgrp>, improving the overall performance. Here, the dataset provided by the STRING database is utilized <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. STRING already integrates PPI information from various sources (e.g., coexpression, the literature, and genomic context) and provides confidence scores for each reported PPI.</p>
         <p>When inferring signaling pathways some assumptions regarding their structure have to be made. Many previous approaches have inferred pathways by connecting pairs of proteins (e.g., one membrane protein and one TF) <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. In recent works on the structural organization of cellular regulation, however, it has been reported that many biological networks are structured like bow ties <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. Such bow tie structures contain multiple source and target proteins and, in most cases, internal proteins that process the transduced signals (Figure <figr fid="F1">1</figr>).</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Assumed structure of signaling pathways</p>
            </caption>
            <text>
               <p><b>Assumed structure of signaling pathways</b>. This figure depicts a signal transduction pathway. The source proteins are cell-surface proteins that transduce the signal to intermediate (cytosolic) proteins. These in turn transduce the signal to the target proteins (TFs), that regulate the transcription of downstream genes. A dotted line between two proteins indicates a PPI.</p>
            </text>
            <graphic file="1752-0509-3-67-1"/>
         </fig>
         <p>In this work we present BowTieBuilder, which aims at integrating multiple source proteins (e.g., membrane proteins) and target proteins (e.g., TFs) into one signaling pathway. As input, BowTieBuilder requires a set of source and/or target proteins. Given this input, BowTieBuilder searches for the most probable pathway that connects the input and output proteins. Thereby, core proteins are favored implicitly through the objective function. For every inferred signaling pathway, the core proteins are determined and their bow tie score is calculated, this value indicates whether the pathway is bow tie structured. These core proteins constitute gateways that integrate all information and are, therefore, often the key regulators in these signaling pathways. In contrast to metabolic networks where the core often forms a large cluster with interconnected nodes, bow ties in signaling networks are reported to have fewer nodes with sparse interconnections <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> &#8211; if they exist at all. Accordingly, The BowTieBuilder does not require a bow tie structure, so that pathways without a bow tie structure can also be inferred and analyzed.</p>
         <p>To validate this method various sets of source and target proteins from yeast (<it>Saccharomyces cerevisiae</it>) and human (<it>Homo sapiens</it>) are inferred and compared against signaling pathways from KEGG (Kyoto Encyclopedia of Genes and Genomes) <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. To compare BowTieBuilder to other heuristics, three additional inference methods are described and applied to the same pathways. After validating the results against KEGG signaling pathways, two signaling pathways with no related KEGG pathway are inferred. One pathway involved in the innate immune system of human and another pathway that connected signal transduction and gene regulatory networks, which was inferred in a separate study <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Protein-protein interaction (PPI) data</p>
            </st>
            <p>The PPI dataset is represented as a weighted directed graph <it>G </it>= (<it>V</it>, <it>E</it>, <b>w</b>), where nodes (<it>V</it>) represent proteins, edges (<it>E</it>) PPIs, and the scores (<b>w</b>) the confidence in each interaction. The scores (edge weight <b>w</b>) range from 0, indicating no interaction, to 1, indicating an interaction with high confidence.</p>
            <p>The PPI dataset used in this work is obtained from the STRING database (version 7.1) <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. This dataset contains computationally and experimentally derived PPIs, including interactions from other databases (e.g., MINT <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, BioGRID <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, DIP <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, and Reactome <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>), microarray experiments, high-throughput experiments, and a mined literature corpus. Furthermore, PPIs are transferred between orthologous pairs of proteins over different organisms. All of these datasets are combined and for each PPI a confidence score is calculated. This way the information from multiple sources is combined into a single score that expresses the overall confidence in each PPI. This score is derived by calculating the joint membership of proteins with PPI in KEGG pathways <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Problem complexity and formalization</p>
            </st>
            <p>The problem posed here is similar to the problem of finding Steiner trees in graphs <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, or more specifically, vertex-weighted Steiner trees. In this problem formalization, a weighted graph <it>G </it>= (<it>V</it>, <it>E</it>, <b>w</b>) and a non-emtpy set of terminals <it>T </it>&#8838; <it>V </it>is given, with <b>w </b>&#8712; &#8477;<sup>+</sup>. The optimal Steiner tree is defined as the connected subgraph <it>G' </it>= (<it>V'</it>, <it>E'</it>, <b>w'</b>) with <it>G' </it>&#8838; <it>G</it>, for which the summed weight <b>w</b><sub>sum</sub>(<it>E'</it>) = &#8721;<sub><b>e</b>&#8712;<it>E' </it></sub><b>w</b><sub><it>e </it></sub>is minimal, and <it>T </it>&#8838; <it>V' </it>holds. The Steiner tree problem on graphs was shown to be <inline-formula><graphic file="1752-0509-3-67-i1.gif"/></inline-formula>-complete <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> and, thus, is in most cases solved with heuristics. One of these heuristics is Prim's algorithm <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>, which iteratively extends the subgraph <it>G</it>' by adding the vertex with the smallest distance until all nodes in <it>T </it>are connected in <it>G</it>'. A more recent heuristic presented by Melhorn <it>et al</it>. <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> proceeds by first calculating the minimal distance between all nodes in <it>T</it>, and then assembling the minimal Steiner tree by iteratively connecting the nodes with the smallest distance to each other.</p>
            <p>Here, the aim is to select a subgraph of <it>G' </it>&#8838; <it>G </it>that connects a set of source proteins <it>S </it>to a set of target proteins <it>T</it>. Given a graph <it>G </it>= (<it>V</it>, <it>E</it>, <b>w</b>) with <it>w </it>&#8712; [0, 1] and a disjoint source <it>S </it>&#8838; <it>V </it>and target set <it>T </it>&#8838; <it>V </it>(<it>S </it>&#8745; <it>T </it>= &#8709;), the aim is to find the optimal subgraph <it>G' </it>&#8838; <it>G </it>such that for every <it>s </it>&#8712; <it>S </it>and for every <it>t </it>&#8712; <it>T </it>at least one path <it>P</it>(<it>s</it>, <it>t</it>) exists in <it>G'</it>, whenever such a path exists in <it>G</it>. If either the source or the target set is empty, the problem formalization of Steiner trees is applied. Then the aim is to connect all nodes that are given either in <it>S </it>or in <it>T </it>(<it>S </it>&#8745; <it>T</it>) to each other.</p>
         </sec>
         <sec>
            <st>
               <p>Objective function for the pathways</p>
            </st>
            <p>For any given pathway, the overall confidence is calculated by multiplying the individual confidence values of the utilized edges:</p>
            <p>
               <display-formula id="M1">
                  <graphic file="1752-0509-3-67-i2.gif"/>
               </display-formula>
            </p>
            <p>This objective is based on the assumption that the edge scores reflect independent confidence values, and implies that the resulting score gives the overall confidence in the pathway &#8211; that all contained edges are true biological interactions.</p>
         </sec>
         <sec>
            <st>
               <p>Inferring signal transduction pathways</p>
            </st>
            <sec>
               <st>
                  <p>Finding optimal paths between two proteins</p>
               </st>
               <p>Although the problem of finding the optimal pathway is <inline-formula><graphic file="1752-0509-3-67-i1.gif"/></inline-formula>-complete, some special instances exist that are solvable in polynomial time. If, for instance, the source set <it>S </it>and target set <it>T </it>both contain one node, the problem reduces to finding the highest scoring path between them. This problem can be solved by applying Dijkstra's algorithm <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Given two nodes, this algorithm finds the highest scoring path with a runtime complexity of <inline-formula><graphic file="1752-0509-3-67-i3.gif"/></inline-formula>((|<it>E</it>| + |<it>V</it>|) log |<it>V</it>|), where |<it>V</it>| gives the number of proteins and |<it>E</it>| the number of PPIs. For PPI networks, it can be assumed that most proteins are not connected to each other |<it>E</it>| &#8810; |<it>V</it>|<sup>2</sup>; therefore, Dijkstra's algorithm is implemented using adjacency lists, and thus the runtime is reduced to <inline-formula><graphic file="1752-0509-3-67-i3.gif"/></inline-formula>(|<it>V</it>| log |<it>V</it>| + <it>E</it>). The scores between all nodes, obtained by Dijkstra's algorithm, will be stored in a distance matrix <it>D</it><sub>|<it>S</it>|&#215;|<it>T</it>| </sub>with |<it>S</it>| rows and |<it>T</it>| columns and the respective paths will be referred to by <it>P</it><sup><it>D</it></sup>(<it>s</it>, <it>t</it>).</p>
            </sec>
            <sec>
               <st>
                  <p>BowTieBuilder</p>
               </st>
               <p>When multiple source and target proteins are provided, we employ a greedy approach, referred to as BowTieBuilder, to construct the signaling pathway <it>P</it>. In the first step, BowTieBuilder initializes the signaling pathway <it>P </it>= (<it>V </it>= <it>S </it>&#8745; <it>T</it>, <it>E </it>= &#8709;, <b>w </b>= &#8709;) by including the source <it>S </it>and target <it>T </it>nodes, and flagging these nodes as 'not visited'. In the second step, the distance matrix <it>D</it><sub>|<it>S</it>|&#215;|<it>T</it>| </sub>is constructed by determining the maximal scoring (Equation 1) paths between the nodes in <it>S </it>and the nodes in <it>T </it>with Dijkstra's algorithm, where the distance is set to &#8734; if no path exists. This preprocessing is similar to the heuristic presented by Melhorn <it>et al</it>. <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> for finding Steiner trees. In the next stage of the inference, the highest scoring path <it>P</it><sup><it>D</it></sup>(<it>s</it>, <it>t</it>) in <it>D </it>that connects a 'not visited' node to a 'visited' node is added. If no such path exists the two 'not visited' nodes with the highest scoring path <it>P</it><sup><it>D</it></sup>(<it>s</it>, <it>t</it>) in <it>D </it>are connected to each other and, likewise, the path <it>P</it><sup><it>D</it></sup>(<it>s</it>, <it>t</it>) is added to <it>P</it>. Subsequently, the nodes in that path are flagged as 'visited' and <it>D </it>is updated to include all distances to the nodes in <it>P</it><sup><it>D</it></sup>(<it>s</it>, <it>t</it>). This step is reiterated, in each stage integrating 'not visited' source and target nodes. The method terminates when all nodes in <it>S </it>&#8745;<it>T </it>are flagged as 'visited', or, if for the remaining nodes, no path to any other node in <it>S </it>&#8745; <it>T </it>exists. Then the final signaling pathway <it>P </it>is returned. If either <it>S </it>or <it>T </it>is an empty set, <it>D </it>is initialized such that it contains all distances between any node in the input set (<it>D</it><sub>|<it>S</it>&#8745;<it>T</it>|&#215;|<it>S</it>&#8745;<it>T</it>|</sub>). Despite this change in the initialization of <it>D</it>, the algorithm proceeds in the same manner and finally returns the signaling pathway <it>P </it>which connects all nodes to each other. The structure of the BowTieBuilder algorithm is given in the following:</p>
               <p indent="1">1. Initialize the pathway <it>P </it>with all nodes <it>S </it>&#8745; <it>T</it>, and flag all nodes in <it>S </it>&#8745; <it>T </it>as 'not visited'.</p>
               <p indent="1">2. Calculate the distance matrix <it>D</it><sub>|<it>S</it>|&#215;|<it>T</it>| </sub>between the nodes in <it>S </it>and <it>T </it>with Dijkstra's algorithm.</p>
               <p indent="1">3. Select the shortest path in <it>D </it>that connects a 'not visited' and a 'visited' node in <it>P</it>, or, if no such path exists, a 'not visited' node in <it>S </it>to a 'not visited' node in <it>T</it>.</p>
               <p indent="1">4. Add the nodes and edges of the selected path to <it>P </it>and flag all nodes in the pathway as 'visited'.</p>
               <p indent="1">5. Update <it>D </it>to include all distances to the nodes in <it>P</it><sup><it>D</it></sup>(<it>s</it>, <it>t</it>).</p>
               <p indent="1">6. Repeat the steps 2&#8211;5 until every node in <it>S </it>is connected to some node in <it>T</it>, and vice versa if such a path exists in <it>G</it>.</p>
               <p indent="1">7. Export final pathway <it>P</it>.</p>
               <p>As an optional parameter, the maximum path length <it>l </it>is introduced, since very long paths can increase the introduction of false positive PPIs. This is accomplished by setting the length of a path with more than <it>l </it>edges to &#8734;.</p>
            </sec>
            <sec>
               <st>
                  <p>Additional inference methods</p>
               </st>
               <p>When applying heuristics, it is advisable to compare different approaches to each other to analyze their properties. For this purpose, we implemented three alternative inference methods: <it>all interactions</it>, <it>shortest paths</it>, and <it>all shortest paths</it>.</p>
               <p><b><it>all interactions</it></b>: In this modification, the standard BowTieBuilder is applied and the resulting pathway <it>P </it>is obtained. Then, all PPIs (edges) between any two nodes in <it>P </it>are added whenever they are contained in <it>G</it>.</p>
               <p><b><it>shortest paths</it></b>: In this inference method, every node in the source set <it>S </it>is connected to the target set <it>T </it>through the maximal scoring path, and vice versa. In this case the pathway <it>P </it>can be directly derived from paths corresponding to the maximal scores in matrix <it>D</it>. More specifically, for each row and column, the path corresponding to the maximal entry in <it>D </it>is added to <it>P</it>.</p>
               <p><b><it>all shortest paths</it></b>: In this inference method, for every pair of source (<it>S</it>) and target (<it>T</it>) proteins the highest scoring path <it>P</it><sup><it>D</it></sup>(<it>s</it>, <it>t</it>) is added to <it>P</it>. Thus, every source and target node is directly connected if a corresponding path exists in <it>G</it>.</p>
            </sec>
            <sec>
               <st>
                  <p>Output</p>
               </st>
               <p>Inferred signal transduction pathways are exported in the formats GML (Graph Markup Language), XGML, and GraphViz, and visualized with the graph viewer yED <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> or by Cytoscape <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Validation</p>
            </st>
            <p>To validate the correctness of the inferred pathways we compute the recall and precision rates with respect to a specified reference pathway. These rates can be calculated with respect to PPIs or proteins. The recall rate is defined as the fraction of PPIs/proteins in the reference pathway that are inferred (Equation 2) and the precision rate is defined as the fraction of inferred PPIs/proteins that are contained in the reference pathway (Equation 3).</p>
            <p>
               <display-formula id="M2">
                  <graphic file="1752-0509-3-67-i4.gif"/>
               </display-formula>
            </p>
            <p>
               <display-formula id="M3">
                  <graphic file="1752-0509-3-67-i5.gif"/>
               </display-formula>
            </p>
            <p>The topological validation is only performed for pathways that are provided by KEGG. Another possibility for testing the plausibility of inferred pathways &#8211; without the need for validation pathways &#8211; is to test if the inferred pathway can be associated with a certain biological process. To perform such an analysis, we map the proteins contained in each pathway to their 'biological process', defined by the Gene Ontology (GO) <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. The tool Term Finder <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> is used for this purpose, which calculates a <it>p</it>-value for each biological process using the hypergeometric distribution.</p>
            <p>A direct validation against other methods for automatically inferring signal transduction pathways is omitted, because most of these algorithms are validated through pathways with one source and one target protein <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B13">13</abbr></abbrgrp>. The recall and precision rates obtained by the different methods can, however, give a rough estimate of the relative performance.</p>
         </sec>
         <sec>
            <st>
               <p>Source and target proteins</p>
            </st>
            <p>BowTieBuilder is applied to several sets of source and target proteins. In principle, any type of source or target protein can be processed by BowTieBuilder; in this work, however, if not stated otherwise, the source proteins are membrane-bound proteins and the target proteins are TFs.</p>
            <p>To infer signaling pathways for different biological processes, we collect several sets of membrane-bound proteins and TFs. To infer signaling pathways that control the yeast cell cycle, we collect membrane-TF sets for the yeast cell cycle phases G1 and S from the respective KEGG pathway (KEGG identifier: sce04111). For the analysis of the yeast MAPK pathway, the membrane and TF sets are obtained from the KEGG MAPK pathway (KEGG identifier: hsa04010). In addition, the human membrane and TF sets of the Erb pathway are collected from KEGG (KEGG identifier: hsa04012), and the human membrane and TF sets related to the TLR-mediated innate immune pathway are collected from a publication of Kitano <it>et al</it>. <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
            <p>To combine signal transduction pathways with gene regulatory networks, all TFs that were inferred as regulators in a previous study <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> are used here as the target list. In this study, TFs were inferred to have a regulatory effect from two gene expression datasets <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp> and known <it>cis</it>-regulatory elements. In addition to these TFs a list of membrane proteins was collected from the Yeast Membrane Protein Library (YMPL). Based on these TFs and membrane proteins, a signaling pathway is inferred that potentially explains the higher-level regulation of these TFs in the respective gene regulatory network. All source and target proteins are provided in Additional File <supplr sid="S1">1</supplr>.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>All target, source, and inferred proteins of the signaling pathways</b>. All source, target, and inferred proteins are provided in an Excel file, along with their significant GO-terms.</p>
               </text>
               <file name="1752-0509-3-67-S1.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Bow tie score</p>
            </st>
            <p>As mentioned earlier, BowTieBuilder favors signaling pathways that are structured like a bow tie, but it does not demand such a structure. Thus, it is of interest to quantify to what extent signaling pathways follow the bow tie structure and, in addition, to determine the core proteins. For this purpose, we provide a bow tie score (<it>b</it>(<it>p</it>) &#8712; [0, 1]) that determines how 'central' a protein <it>p </it>is. This score is also used to determine the bow tie score of the complete pathway. This score is related to the 'betweenness' measure, in which the number of shortest paths that include the core protein determines the centrality.</p>
            <p>To calculate this score, the possible number of connecting paths between the source <it>S </it>and target <it>T </it>proteins is first determined, which is simply the number of source proteins multiplied with the number of target proteins |<it>S</it>|&#183;|<it>T</it>|. Then the number of source and target proteins that can be connected by a path containing <it>p </it>is calculated. This is given by the number of target proteins from which <it>p </it>can be reached (|<it>T</it><sub><it>p</it></sub>|) multiplied by the number of source proteins that can be reached from <it>p </it>(|S<sub><it>p</it></sub>|). Thereby, every edge can only be traversed in one direction, since the signaling pathway is a directed graph that is traversed from the source to the target proteins. The corresponding bow tie score for any protein <it>p </it>reads:</p>
            <p>
               <display-formula id="M4">
                  <graphic file="1752-0509-3-67-i6.gif"/>
               </display-formula>
            </p>
            <p>To determine the core elements of any signaling pathway, <it>b</it>(<it>p</it>) is calculated for every intermediate protein <it>p</it>. Given these scores, the core component is defined by the set of proteins with the maximal <it>b</it>(<it>p</it>) score. This also gives the overall score of the signaling pathway. In some cases it is helpful to distill the subnetwork that constitutes the bow tie structure by removing all paths that do not pass through the core component. We refer to such signaling pathways as 'core bow tie'.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Validation with KEGG signal transduction pathways</p>
            </st>
            <p>To evaluate all heuristics, they are applied to the G1-phase cell cycle, S-phase cell cycle, MAPK pathways of yeast, and the human Erb pathway. The resulting recall and precision rates are provided in Table <tblr tid="T1">1</tblr>. In comparison to other heuristics, BowTieBuilder has the highest average precision with respect to proteins and PPIs. This could be expected since BowTieBuilder aims at finding the minimal pathway <it>P</it>, whereas the other methods add additional PPIs or proteins to the pathway. The <it>shortest paths </it>heuristic has the highest protein recall rate, whereas the <it>all interactions </it>heuristic has the highest PPI recall rate.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Comparison of different heuristics. </p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>BowTieBuilder</p>
                     </c>
                     <c ca="center">
                        <p>shortest paths</p>
                     </c>
                     <c ca="center">
                        <p>all shortest paths</p>
                     </c>
                     <c ca="center">
                        <p>all interactions</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>G1-Phase</p>
                     </c>
                     <c ca="left">
                        <p>precision (PPI)</p>
                     </c>
                     <c ca="center">
                        <p>77%</p>
                     </c>
                     <c ca="center">
                        <p>77%</p>
                     </c>
                     <c ca="center">
                        <p>77%</p>
                     </c>
                     <c ca="center">
                        <p>48%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>recall (PPI)</p>
                     </c>
                     <c ca="center">
                        <p>77%</p>
                     </c>
                     <c ca="center">
                        <p>77%</p>
                     </c>
                     <c ca="center">
                        <p>77%</p>
                     </c>
                     <c ca="center">
                        <p>77%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>precision (protein)</p>
                     </c>
                     <c ca="center">
                        <p>67%</p>
                     </c>
                     <c ca="center">
                        <p>67%</p>
                     </c>
                     <c ca="center">
                        <p>67%</p>
                     </c>
                     <c ca="center">
                        <p>67%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>recall (protein)</p>
                     </c>
                     <c ca="center">
                        <p>67%</p>
                     </c>
                     <c ca="center">
                        <p>67%</p>
                     </c>
                     <c ca="center">
                        <p>67%</p>
                     </c>
                     <c ca="center">
                        <p>67%</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>S-Phase</p>
                     </c>
                     <c ca="left">
                        <p>precision (PPI)</p>
                     </c>
                     <c ca="center">
                        <p>73%</p>
                     </c>
                     <c ca="center">
                        <p>70%</p>
                     </c>
                     <c ca="center">
                        <p>59%</p>
                     </c>
                     <c ca="center">
                        <p>60%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>recall (PPI)</p>
                     </c>
                     <c ca="center">
                        <p>55%</p>
                     </c>
                     <c ca="center">
                        <p>60%</p>
                     </c>
                     <c ca="center">
                        <p>50%</p>
                     </c>
                     <c ca="center">
                        <p>56%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>precision (protein)</p>
                     </c>
                     <c ca="center">
                        <p>86%</p>
                     </c>
                     <c ca="center">
                        <p>86%</p>
                     </c>
                     <c ca="center">
                        <p>86%</p>
                     </c>
                     <c ca="center">
                        <p>86%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>recall (protein)</p>
                     </c>
                     <c ca="center">
                        <p>86%</p>
                     </c>
                     <c ca="center">
                        <p>86%</p>
                     </c>
                     <c ca="center">
                        <p>86%</p>
                     </c>
                     <c ca="center">
                        <p>86%</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MAPK</p>
                     </c>
                     <c ca="left">
                        <p>precision (PPI)</p>
                     </c>
                     <c ca="center">
                        <p>46%</p>
                     </c>
                     <c ca="center">
                        <p>46%</p>
                     </c>
                     <c ca="center">
                        <p>37%</p>
                     </c>
                     <c ca="center">
                        <p>31%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>recall (PPI)</p>
                     </c>
                     <c ca="center">
                        <p>37%</p>
                     </c>
                     <c ca="center">
                        <p>43%</p>
                     </c>
                     <c ca="center">
                        <p>45%</p>
                     </c>
                     <c ca="center">
                        <p>41%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>precision (protein)</p>
                     </c>
                     <c ca="center">
                        <p>79%</p>
                     </c>
                     <c ca="center">
                        <p>80%</p>
                     </c>
                     <c ca="center">
                        <p>69%</p>
                     </c>
                     <c ca="center">
                        <p>79%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>recall (protein)</p>
                     </c>
                     <c ca="center">
                        <p>74%</p>
                     </c>
                     <c ca="center">
                        <p>78%</p>
                     </c>
                     <c ca="center">
                        <p>80%</p>
                     </c>
                     <c ca="center">
                        <p>74%</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Erb</p>
                     </c>
                     <c ca="left">
                        <p>precision (PPI)</p>
                     </c>
                     <c ca="center">
                        <p>55%</p>
                     </c>
                     <c ca="center">
                        <p>40%</p>
                     </c>
                     <c ca="center">
                        <p>9%</p>
                     </c>
                     <c ca="center">
                        <p>NA</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>recall (PPI)</p>
                     </c>
                     <c ca="center">
                        <p>28%</p>
                     </c>
                     <c ca="center">
                        <p>25%</p>
                     </c>
                     <c ca="center">
                        <p>15%</p>
                     </c>
                     <c ca="center">
                        <p>NA</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>precision (protein)</p>
                     </c>
                     <c ca="center">
                        <p>79%</p>
                     </c>
                     <c ca="center">
                        <p>72%</p>
                     </c>
                     <c ca="center">
                        <p>44%</p>
                     </c>
                     <c ca="center">
                        <p>79%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>recall (protein)</p>
                     </c>
                     <c ca="center">
                        <p>56%</p>
                     </c>
                     <c ca="center">
                        <p>76%</p>
                     </c>
                     <c ca="center">
                        <p>73%</p>
                     </c>
                     <c ca="center">
                        <p>56%</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>average</p>
                     </c>
                     <c ca="left">
                        <p>precision (PPI)</p>
                     </c>
                     <c ca="center">
                        <p>63%</p>
                     </c>
                     <c ca="center">
                        <p>58%</p>
                     </c>
                     <c ca="center">
                        <p>46%</p>
                     </c>
                     <c ca="center">
                        <p>46%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>recall (PPI)</p>
                     </c>
                     <c ca="center">
                        <p>49%</p>
                     </c>
                     <c ca="center">
                        <p>51%</p>
                     </c>
                     <c ca="center">
                        <p>47%</p>
                     </c>
                     <c ca="center">
                        <p>58%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>precision (protein)</p>
                     </c>
                     <c ca="center">
                        <p>78%</p>
                     </c>
                     <c ca="center">
                        <p>76%</p>
                     </c>
                     <c ca="center">
                        <p>67%</p>
                     </c>
                     <c ca="center">
                        <p>78%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>recall (protein)</p>
                     </c>
                     <c ca="center">
                        <p>71%</p>
                     </c>
                     <c ca="center">
                        <p>77%</p>
                     </c>
                     <c ca="center">
                        <p>77%</p>
                     </c>
                     <c ca="center">
                        <p>71%</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Statistical evaluation of signal transduction pathways inferred with BowTieBuilder and alternative heuristics. The inferred signal transduction pathways are mapped against the reference signal transduction pathways from KEGG. The precision and recall rates are calculated with respect to the PPIs and proteins.</p>
               </tblfn>
            </tbl>
            <p>Depending on the type of validation (protein or PPI), the performance of some heuristics varies strongly. The <it>all interactions </it>heuristic, for instance, has high precision when inferring proteins, although the precision for inferring edges is significantly lower in comparison to BowTieBuilder.</p>
            <p>In summary, BowTieBuilder has the highest average precision but the lowest average recall. The <it>all interactions </it>heuristic, on the other hand, has the lowest precision and highest recall rate. Thus, the average precision decreases and the average recall increases in the following order: BowTieBuilder, <it>shortest paths</it>, <it>all shortest paths</it>, and <it>all interactions</it>. Several of the inferred pathways are provided in Additional File <supplr sid="S2">2</supplr>.</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>GraphML-formatted files of several signaling pathways</b>. Several signaling pathways in GraphML format. These files can be viewed and edited with yEd (from yWorks).</p>
               </text>
               <file name="1752-0509-3-67-S2.zip">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Yeast cell cycle pathways</p>
            </st>
            <p>For the inferred G1-phase cell cycle pathway, the PPI precision rates range from 48% to 77%, whereas the PPI recall is 77% in all cases (Table <tblr tid="T1">1</tblr>). The most significant biological process for the inferred proteins is 'cell cycle' (<it>p</it>-value: 4.00&#183;10<sup>-5</sup>). Four of six proteins from the KEGG pathway are contained in all inferred pathways (Figure <figr fid="F2">2</figr>), thus the protein recall and precision rates are 67%. The two proteins (Sic1 and Clb5) that constitute alternative paths through the signaling pathways in KEGG are not considered by any inference method. In all cases the core protein is Cdc28, however, its bow tie score ranges from 0.20 (KEGG pathway) to 1.00 ('<it>all interactions</it>'). Cdc28 is reported to be the central coordinator of the major events of the yeast cell division cycle <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>G1-phase signaling pathways inferred with different heuristics</p>
               </caption>
               <text>
                  <p><b>G1-phase signaling pathways inferred with different heuristics</b>. The membrane-bound proteins are depicted at the top, the TFs are depicted at the bottom, and the inferred proteins in between (except SWI5 which is an inferred TF). Proteins that occur in the KEGG pathway but are not inferred by any heuristic are depicted in gray. The core protein is CDC28 in all cases &#8211; its bow tie score is provided in the respective box. All recall and precision values are provided in Table 1.</p>
               </text>
               <graphic file="1752-0509-3-67-2"/>
            </fig>
            <p>In the case of the S-phase pathways, Cdc28 is also contained in all core components, except in the BowTieBuilder pathway which is not bow tie structured. The KEGG core component containing Cdc28 and Cdc6 with a bow tie score of 0.75 is also found in the <it>'all shortest paths' </it>pathway (see Figure <figr fid="F3">3</figr>). The inferred S-phase pathways lack only the protein Clb5 in all cases. This protein binds to Cdc28 and Sic1, and thereby introduces a cycle into the signaling pathway. Hence, this protein is not considered by approaches searching for minimal graphs. Accordingly, the protein recall and precision rates are 86% in all cases, whereas the PPI recall and precision rates range from 50% to 73%. The proteins inferred in the case of the S-phase cell cycle show a significant enrichment for 'G1/S transition of mitotic cell cycle' (<it>p</it>-value: 9.78&#183;10<sup>-13</sup>).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>S-phase signaling pathways inferred with different heuristics</p>
               </caption>
               <text>
                  <p><b>S-phase signaling pathways inferred with different heuristics</b>. The membrane-bound proteins are depicted at the top, the TFs are at the bottom and the inferred proteins in the middle. Proteins that occur in the KEGG pathway but not inferred by any heuristic are depicted in gray. The varying core proteins are indicated through gray boxes, which provide their bow tie score. All recall and precision values are provided in Table 1.</p>
               </text>
               <graphic file="1752-0509-3-67-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Yeast MAPK pathway</p>
            </st>
            <p>The MAPK pathway inferred with BowTieBuilder contains several 'shortcuts' with respect to the original pathway in KEGG (Figure <figr fid="F4">4</figr>). For instance, the inferred pathway connects FAR1 directly to Cdc24 (STRING score: 0.99) and FUS3 directly to STE11 (STRING score: 0.99), whereas in KEGG they are connected through intermediate proteins. These 'shortcuts' are high-confidence PPIs in STRING and experimentally verified, thus allowing inference of shorter pathways than those given in KEGG. At this point it is unclear which connections are actually utilized in the cell, or even if this utilization depends on the specific environmental conditions. Overall, the PPI recall and precision rates are rather low, whereas the protein recall and precision rates are up to 79% (in the case of BowTieBuilder and the <it>all interactions </it>heuristic). The MAPK pathway responds to different external stimuli, such as pheromones and osmolarity. In accordance with these known MAPK stimuli, the GO processes 'osmotic stress' (<it>p</it>-value: 2.57&#183;10<sup>-12</sup>) and 'response to pheromone' (<it>p</it>-value: 1.36&#183;10<sup>-11</sup>) are the most significant. The core proteins that are contained in the KEGG pathway are: Ste20, Ste11, Ste7 and Fus3. Together with Ste5 these proteins form a scaffolding complex. In the inferred networks this complex is not present because of the '|shortcut' from Ste11 to Fus3. Nonetheless, Fus3, the endpoint of this scaffolding complex, is the core protein in both inferred pathways with a bow tie score similar to the KEGG pathway.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>MAKP signaling pathway of yeast</p>
               </caption>
               <text>
                  <p><b>MAKP signaling pathway of yeast</b>. Depicted is the KEGG pathway and the pathway inferred by BowTieBuilder. The membrane-bound proteins are drawn at the top, and the TFs are drawn at the bottom. The inferred proteins are depicted in between. Proteins that do not overlap between the KEGG and BowTieBuilder pathways are depicted in gray. The core proteins are embedded in a gray box, where FUS3 is contained in all core structures. The recall and precision values are given in Table 1.</p>
               </text>
               <graphic file="1752-0509-3-67-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Human Erb pathway</p>
            </st>
            <p>The inferred human Erb pathway is mapped to the GO term 'erb signaling pathway' (<it>p</it>-value: 2.51&#183;10<sup>-30</sup>) as the most significant biological process. Thereby, several structures also found in the KEGG pathway could be observed (Figure <figr fid="F5">5</figr>), however, with rather low recall and precision rates (Table <tblr tid="T1">1</tblr>). Several proteins are skipped by 'shortcut' PPI as already observed for the MAPK pathway. For the inferred Erb pathway no bow tie structure could be found.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Erb-associated signaling pathways</p>
               </caption>
               <text>
                  <p><b>Erb-associated signaling pathways</b>. Erb signal transduction pathway inferred with BowTieBuilder. Membrane-bound proteins are depicted at the top, TFs are depicted at the bottom, and the inferred proteins are depicted in between. The core proteins are embedded in a gray box, in which their bow tie score is provided. For the inferred Erb pathway, however, no core element could be determined. The recall and precision values are given in Table 1.</p>
               </text>
               <graphic file="1752-0509-3-67-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>TLR-mediated innate immune pathway</p>
            </st>
            <p>The TLR-mediated innate immune system of humans is known to have a bow tie architecture in which eleven TLRs respond to a wide variety of pathogens, capturing so-called pathogen-associated molecular patterns. MyD88 is responsible for the activation of TLR-mediated responses. For this pathway no applicable validation pathway is available in KEGG. Nonetheless, the inference of this pathway revealed the general structure of the TLR-mediated innate immune pathway (Figure <figr fid="F6">6</figr>). Furthermore, a clear bow tie structure can be observed with MyD88 and Fadd as core elements, which is also reported in the publication of Oda <it>et al</it>. <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>TLR-mediated innate immune signaling pathway</p>
               </caption>
               <text>
                  <p><b>TLR-mediated innate immune signaling pathway</b>. TLR-mediated innate immune-associated signaling pathway inferred by BowTieBuilder. Membrane-bound TLR-proteins are depicted at the top, TFs are depicted at the bottom and the inferred proteins are depicted in-between. The proteins MyD88 and FADD are both in the core module and considered to be essential to this signaling pathway.</p>
               </text>
               <graphic file="1752-0509-3-67-6"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Integrating signal transduction pathways and gene regulatory networks</p>
            </st>
            <p>For a set of TFs obtained by inferring a gene regulatory network <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, BowTieBuilder was applied to infer the corresponding signaling pathway. This inference leds to several distinct signaling pathways, and the one with the highest bow tie score is depicted in Figure <figr fid="F7">7</figr>. The core component of this pathway contains several proteins, including the exocytic complex, that are related to excocytosis. These core proteins connect various membrane-bound proteins and TFs, and can be divided into different subpathways that are related to different biological processes (Figure <figr fid="F7">7</figr>).</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Signaling pathway inferred from a gene regulatory network</p>
               </caption>
               <text>
                  <p><b>Signaling pathway inferred from a gene regulatory network</b>. This figure depicts a bow tie inferred from membrane-bound proteins and TFs that were predicted to be active in a gene regulatory network <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The proteins in the bottom are the TFs, the proteins at the top are the membrane-bound proteins and the remaining proteins constitute the inferred signaling pathway. Different modules of this pathway are associated with different biological processes, where the core module is associated with exocytosis &#8211; a process through which cells direct secretory vesicles out of the cell.</p>
               </text>
               <graphic file="1752-0509-3-67-7"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>In this work, we have presented several heuristics that allow inferring signal transduction pathways when several source and/or target proteins are given. The resulting pathways provide the researcher with a interconnected signaling pathway, which unravels the core proteins that integrate and transduce signals from multiple source to multiple target proteins.</p>
         <p>Most current methods for the automated reconstruction of signal transduction pathways infer linear pathways and incorporate gene expression data into their scoring function <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. BowTieBuilder allows inferring signal transduction pathways from an arbitrary number of source and target proteins.</p>
         <p>Furthermore, the scoring function of BowTieBuilder is based solely on the PPI dataset from STRING and the associated confidence values. Hence, we build upon the integration of PPI information by STRING. The inferred signaling pathways had satisfactory recall and precision rates for most signaling pathways; for some, however, there is room for improvement. Two main sources of error could be observed. The first source of error was that some PPIs allow a 'shortcut' from the source to the target protein, in comparison to the reference pathway. This was, for instance, the case for several MAPK pathways, where these 'shortcuts' were even interactions with a high confidence level. Another source of error arises from the cyclic patterns that are neglected by inference methods when maximizing the pathway score.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>In conclusion, when keeping the potential pitfalls of such inference methods in mind, the signaling pathways obtained can be of great help in understanding and constructing regulatory networks. BowTieBuider is capable of uncovering core proteins that integrate multiple source proteins and transduce these signals to TFs. This could be observed for the TLR-mediated innate immune pathway, where MyD88 and Fadd constitute the core proteins that function as a hub for all possible signaling pathways. Furthermore, Cdc28 was inferred as a core protein in both cell cycle related pathways, which is confirmed in the literature <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>.</p>
         <p>In other cases, such as the Erb pathway, no clear bow tie structure could be uncovered. Furthermore, proteins that were core proteins in certain pathways (e.g., Cdc28 in both cell cycle pathways) had very low bow tie scores in other pathways. Thus, which proteins constitute the core of a signal transduction pathway seems to be dynamic and depends on the context of target and source proteins.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>JS wrote the manuscript and conceived this work. LS and JS developed the methods and algorithms. LS implemented the algorithms in Java&#8482;. AD, AS, HP, and AZ were involved in the study design and coordination. All authors read and approved the final manuscript. None of the authors have any competing financial or other interests in relation to this work.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was funded by the German Federal Ministry of Education and Research (BMBF) in the project 'National Genome Research Network' (NGFNplus) under grant number 01GS08134 and 'HepatoSys' under grant number 0313080 L.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Mammalian MAP kinase signalling cascades</p>
            </title>
            <aug>
               <au>
                  <snm>Chang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Karin</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>410</volume>
            <issue>6824</issue>
            <fpage>37</fpage>
            <lpage>40</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35065000</pubid>
                  <pubid idtype="pmpid" link="fulltext">11242034</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Missing pieces in the NF-kappaB puzzle</p>
            </title>
            <aug>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Karin</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2002</pubdate>
            <volume>109</volume>
            <issue>Suppl</issue>
            <fpage>S81</fpage>
            <lpage>S96</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(02)00703-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">11983155</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Uetz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Giot</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Cagney</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mansfield</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Judson</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Knight</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Lockshon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Narayan</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Srinivasan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pochart</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Qureshi-Emili</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Godwin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Conover</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kalbfleisch</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Vijayadamodar</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fields</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rothberg</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <issue>6770</issue>
            <fpage>623</fpage>
            <lpage>627</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35001009</pubid>
                  <pubid idtype="pmpid" link="fulltext">10688190</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>A network of protein-protein interactions in yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Schwikowski</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Uetz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Fields</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2000</pubdate>
            <volume>18</volume>
            <issue>12</issue>
            <fpage>1257</fpage>
            <lpage>1261</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/82360</pubid>
                  <pubid idtype="pmpid" link="fulltext">11101803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>A comprehensive two-hybrid analysis to explore the yeast protein interactome</p>
            </title>
            <aug>
               <au>
                  <snm>Ito</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Chiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ozawa</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yoshida</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sakaki</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <issue>8</issue>
            <fpage>4569</fpage>
            <lpage>4574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">31875</pubid>
                  <pubid idtype="pmpid" link="fulltext">11283351</pubid>
                  <pubid idtype="doi">10.1073/pnas.061034498</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>IntAct: an open source molecular interaction database</p>
            </title>
            <aug>
               <au>
                  <snm>Hermjakob</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Montecchi-Palazzi</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lewington</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mudali</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kerrien</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Orchard</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Vingron</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Roechert</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Roepstorff</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Valencia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Margalit</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Armstrong</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cesareni</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sherman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <issue>32 Database</issue>
            <fpage>D452</fpage>
            <lpage>D455</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308786</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681455</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh052</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The Database of Interacting Proteins: 2004 update</p>
            </title>
            <aug>
               <au>
                  <snm>Salwinski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Pettit</snm>
                  <fnm>FK</fnm>
               </au>
               <au>
                  <snm>Bowie</snm>
                  <fnm>JU</fnm>
               </au>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <issue>32 Database</issue>
            <fpage>D449</fpage>
            <lpage>D451</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308820</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681454</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh086</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The MIPS mammalian protein-protein interaction database</p>
            </title>
            <aug>
               <au>
                  <snm>Pagel</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kovac</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Oesterheld</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brauner</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dunger-Kaltenbach</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Frishman</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Montrone</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mark</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>St&#252;mpflen</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Mewes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Ruepp</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Frishman</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>6</issue>
            <fpage>832</fpage>
            <lpage>834</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti115</pubid>
                  <pubid idtype="pmpid" link="fulltext">15531608</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>STRING 7-recent developments in the integration and prediction of protein interactions</p>
            </title>
            <aug>
               <au>
                  <snm>von Mering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chaffron</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kr&#252;ger</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <issue>35 Database</issue>
            <fpage>D358</fpage>
            <lpage>D362</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1669762</pubid>
                  <pubid idtype="pmpid" link="fulltext">17098935</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl825</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Automated modelling of signal transduction networks</p>
            </title>
            <aug>
               <au>
                  <snm>Steffen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Petti</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Aach</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>D'haeseleer</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>34</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">137599</pubid>
                  <pubid idtype="pmpid" link="fulltext">12413400</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-3-34</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Efficient algorithms for detecting signaling pathways in protein interaction networks</p>
            </title>
            <aug>
               <au>
                  <snm>Scott</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Karp</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Sharan</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2006</pubdate>
            <volume>13</volume>
            <issue>2</issue>
            <fpage>133</fpage>
            <lpage>144</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/cmb.2006.13.133</pubid>
                  <pubid idtype="pmpid" link="fulltext">16597231</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>QPath: a method for querying pathways in a protein-protein interaction network</p>
            </title>
            <aug>
               <au>
                  <snm>Shlomi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Segal</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ruppin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sharan</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>199</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1458361</pubid>
                  <pubid idtype="pmpid" link="fulltext">16606460</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-199</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>PathFinder: mining signal transduction pathway segments from protein-protein interaction networks</p>
            </title>
            <aug>
               <au>
                  <snm>Bebek</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>335</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2100073</pubid>
                  <pubid idtype="pmpid" link="fulltext">17854489</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-8-335</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>A computational approach for ordering signal transduction pathway components from genomics and proteomics Data</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>158</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">526379</pubid>
                  <pubid idtype="pmpid" link="fulltext">15504238</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-158</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Bow ties, metabolism and disease</p>
            </title>
            <aug>
               <au>
                  <snm>Csete</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Trends Biotechnol</source>
            <pubdate>2004</pubdate>
            <volume>22</volume>
            <issue>9</issue>
            <fpage>446</fpage>
            <lpage>450</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tibtech.2004.07.007</pubid>
                  <pubid idtype="pmpid" link="fulltext">15331224</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Hierarchical modularity of nested bow-ties in metabolic networks</p>
            </title>
            <aug>
               <au>
                  <snm>Zhao</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Luo</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Cao</snm>
                  <fnm>ZW</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>YX</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>386</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1560398</pubid>
                  <pubid idtype="pmpid" link="fulltext">16916470</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-386</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>A comprehensive map of the toll-like receptor signaling network</p>
            </title>
            <aug>
               <au>
                  <snm>Oda</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kitano</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Mol Syst Biol</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>2006.0015</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1681489</pubid>
                  <pubid idtype="pmpid" link="fulltext">16738560</pubid>
                  <pubid idtype="doi">10.1038/msb4100057</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Robustness trade-offs and host-microbial symbiosis in the immune system</p>
            </title>
            <aug>
               <au>
                  <snm>Kitano</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Oda</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Mol Syst Biol</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>2006.0022</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1681473</pubid>
                  <pubid idtype="pmpid" link="fulltext">16738567</pubid>
                  <pubid idtype="doi">10.1038/msb4100039</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>G-protein coupled receptor signaling architecture of mammalian immune cells</p>
            </title>
            <aug>
               <au>
                  <snm>Polouliakh</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Nock</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kitano</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2009</pubdate>
            <volume>4</volume>
            <fpage>e4189</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2615211</pubid>
                  <pubid idtype="pmpid" link="fulltext">19142232</pubid>
                  <pubid idtype="doi">10.1371/journal.pone.0004189</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>From genomics to chemical genomics: new developments in KEGG</p>
            </title>
            <aug>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Aoki-Kinoshita</snm>
                  <fnm>KF</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Katayama</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Araki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hirakawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D354</fpage>
            <lpage>D357</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347464</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381885</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Modeling gene regulation and spatial organization of sequence based motifs</p>
            </title>
            <aug>
               <au>
                  <snm>Supper</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>aufm Kampe</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wanke</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Berendzen</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Harter</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Bonneau</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Zell</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>8th IEEE international conference on BioInformatics and BioEngineering (BIBE)</source>
            <pubdate>2008</pubdate>
         </bibl>
         <bibl id="B22">
            <title>
               <p>STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene</p>
            </title>
            <aug>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <issue>18</issue>
            <fpage>3442</fpage>
            <lpage>3444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">110752</pubid>
                  <pubid idtype="pmpid" link="fulltext">10982861</pubid>
                  <pubid idtype="doi">10.1093/nar/28.18.3442</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>STRING: a database of predicted functional associations between proteins</p>
            </title>
            <aug>
               <au>
                  <snm>von Mering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jaeggi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>258</fpage>
            <lpage>261</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165481</pubid>
                  <pubid idtype="pmpid" link="fulltext">12519996</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg034</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>STRING: known and predicted protein-protein associations, integrated and transferred across organisms</p>
            </title>
            <aug>
               <au>
                  <snm>von Mering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hooper</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Krupp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Foglierini</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jouffre</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <issue>33 Database</issue>
            <fpage>D433</fpage>
            <lpage>D437</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">539959</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608232</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>MINT: the Molecular INTeraction database</p>
            </title>
            <aug>
               <au>
                  <snm>Chatr-aryamontri</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ceol</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Palazzi</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Nardelli</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Castagnoli</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Cesareni</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <issue>35 Database</issue>
            <fpage>D572</fpage>
            <lpage>D574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1751541</pubid>
                  <pubid idtype="pmpid" link="fulltext">17135203</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl950</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>BioGRID: a general repository for interaction datasets</p>
            </title>
            <aug>
               <au>
                  <snm>Stark</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Breitkreutz</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Reguly</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Boucher</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Breitkreutz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tyers</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D535</fpage>
            <lpage>D539</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347471</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381927</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj109</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>DIP: the database of interacting proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Xenarios</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Rice</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Salwinski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Baron</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>289</fpage>
            <lpage>291</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102387</pubid>
                  <pubid idtype="pmpid" link="fulltext">10592249</pubid>
                  <pubid idtype="doi">10.1093/nar/28.1.289</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Reactome: a knowledge base of biologic pathways and processes</p>
            </title>
            <aug>
               <au>
                  <snm>Vastrik</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>D'Eustachio</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Joshi-Tope</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gopinath</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Croft</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>de Bono</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gillespie</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jassal</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matthews</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Stein</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <issue>3</issue>
            <fpage>R39</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1868929</pubid>
                  <pubid idtype="pmpid" link="fulltext">17367534</pubid>
                  <pubid idtype="doi">10.1186/gb-2007-8-3-r39</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>STRING 8-a global view on proteins and their functional interactions in 630 organisms</p>
            </title>
            <aug>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stark</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chaffron</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Creevey</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Julien</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Simonovic</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>von Mering</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The Steiner Problem in Graphs: Topological Methods of Solution</p>
            </title>
            <aug>
               <au>
                  <snm>Panyukov</snm>
                  <fnm>AV</fnm>
               </au>
            </aug>
            <source>Autom Remote Control</source>
            <pubdate>2004</pubdate>
            <volume>65</volume>
            <issue>3</issue>
            <fpage>439</fpage>
            <lpage>448</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1023/B:AURC.0000019376.31168.20</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <aug>
               <au>
                  <snm>Garey</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>Computers and Intractability; A Guide to the Theory of NP-Completeness</source>
            <publisher>New York, NY, USA: W. H. Freeman &amp; Co</publisher>
            <pubdate>1990</pubdate>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Shortest connection networks and some generalizations</p>
            </title>
            <aug>
               <au>
                  <snm>Prim</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>Bell System Technology Journal</source>
            <pubdate>1957</pubdate>
            <volume>36</volume>
            <fpage>1389</fpage>
            <lpage>1401</lpage>
         </bibl>
         <bibl id="B33">
            <title>
               <p>A faster approximation algorithm for the Steiner problem in graphs</p>
            </title>
            <aug>
               <au>
                  <snm>Mehlhorn</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Inf Process Lett</source>
            <pubdate>1988</pubdate>
            <volume>27</volume>
            <issue>3</issue>
            <fpage>125</fpage>
            <lpage>128</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/0020-0190(88)90066-X</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>A note on two problems in connexion with graphs</p>
            </title>
            <aug>
               <au>
                  <snm>Dijkstra</snm>
                  <fnm>EW</fnm>
               </au>
            </aug>
            <source>Numerische Mathematik</source>
            <pubdate>1959</pubdate>
            <volume>1</volume>
            <fpage>269</fpage>
            <lpage>271</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/BF01386390</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>yEd &#8211; Java graph editor</p>
            </title>
            <aug>
               <au>
                  <cnm>yWorks</cnm>
               </au>
            </aug>
            <url>http://www.yworks.com</url>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Cytoscape: a software environment for integrated models of biomolecular interaction networks</p>
            </title>
            <aug>
               <au>
                  <snm>Shannon</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Markiel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ozier</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Baliga</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Ramage</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Amin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Schwikowski</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>11</issue>
            <fpage>2498</fpage>
            <lpage>2504</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403769</pubid>
                  <pubid idtype="pmpid" link="fulltext">14597658</pubid>
                  <pubid idtype="doi">10.1101/gr.1239303</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Integration of biological networks and gene expression data using Cytoscape</p>
            </title>
            <aug>
               <au>
                  <snm>Cline</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Smoot</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cerami</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kuchinsky</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Landys</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Workman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Christmas</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Avila-Campilo</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Creech</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gross</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hanspers</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Isserlin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kelley</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Killcoyne</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lotia</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Maere</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ono</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Pavlovic</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Pico</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Vailaya</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>PL</fnm>
               </au>
               <au>
                  <snm>Adler</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Conklin</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Hood</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kuiper</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schmulevich</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Schwikowski</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Warner</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bader</snm>
                  <fnm>GD</fnm>
               </au>
            </aug>
            <source>Nat Protoc</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <issue>10</issue>
            <fpage>2366</fpage>
            <lpage>2382</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nprot.2007.324</pubid>
                  <pubid idtype="pmpid" link="fulltext">17947979</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium</p>
            </title>
            <aug>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Issel-Tarver</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kasarskis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matese</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Ringwald</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>25</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/75556</pubid>
                  <pubid idtype="pmpid" link="fulltext">10802651</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>GO::TermFinder-open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes</p>
            </title>
            <aug>
               <au>
                  <snm>Boyle</snm>
                  <fnm>EI</fnm>
               </au>
               <au>
                  <snm>Weng</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gollub</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>18</issue>
            <fpage>3710</fpage>
            <lpage>3715</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth456</pubid>
                  <pubid idtype="pmpid" link="fulltext">15297299</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization</p>
            </title>
            <aug>
               <au>
                  <snm>Spellman</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>MQ</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>VR</fnm>
               </au>
               <au>
                  <snm>Anders</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Futcher</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Mol Biol Cell</source>
            <pubdate>1998</pubdate>
            <volume>9</volume>
            <issue>12</issue>
            <fpage>3273</fpage>
            <lpage>3297</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">25624</pubid>
                  <pubid idtype="pmpid" link="fulltext">9843569</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Genomic expression programs in the response of yeast cells to environmental changes</p>
            </title>
            <aug>
               <au>
                  <snm>Gasch</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Spellman</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Kao</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Carmel-Harel</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Storz</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
            </aug>
            <source>Mol Biol Cell</source>
            <pubdate>2000</pubdate>
            <volume>11</volume>
            <issue>12</issue>
            <fpage>4241</fpage>
            <lpage>4257</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">15070</pubid>
                  <pubid idtype="pmpid" link="fulltext">11102521</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Regulation of Cdc28 cyclin-dependent protein kinase activity during the cell cycle of the yeast Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Mendenhall</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Hodge</snm>
                  <fnm>AE</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>1998</pubdate>
            <volume>62</volume>
            <issue>4</issue>
            <fpage>1191</fpage>
            <lpage>1243</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">98944</pubid>
                  <pubid idtype="pmpid" link="fulltext">9841670</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
