<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-10-S11-S5</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Proceedings</dochead>
      <bibl>
         <title>
            <p>HPD: an online integrated human pathway database enabling systems biology studies</p>
         </title>
         <aug>
            <au id="A1" ce="yes"><snm>Chowbina</snm><mi>R</mi><fnm>Sudhir</fnm><insr iid="I1"/><insr iid="I2"/><email>schowbin@iupui.edu</email></au>
            <au id="A2" ce="yes"><snm>Wu</snm><fnm>Xiaogang</fnm><insr iid="I1"/><insr iid="I2"/><email>wu33@iupui.edu</email></au>
            <au id="A3"><snm>Zhang</snm><fnm>Fan</fnm><insr iid="I1"/><insr iid="I2"/><email>fanzhan@iupui.edu</email></au>
            <au id="A4"><snm>Li</snm><mi>M</mi><fnm>Peter</fnm><insr iid="I1"/><email>pmli@iupui.edu</email></au>
            <au id="A5"><snm>Pandey</snm><fnm>Ragini</fnm><insr iid="I2"/><email>rapande@iupui.edu</email></au>
            <au id="A6"><snm>Kasamsetty</snm><mi>N</mi><fnm>Harini</fnm><insr iid="I1"/><email>hkasamse@iupui.edu</email></au>
            <au ca="yes" id="A7" ce="yes"><snm>Chen</snm><mi>Y</mi><fnm>Jake</fnm><insr iid="I1"/><insr iid="I2"/><insr iid="I3"/><email>jakechen@iupui.edu</email></au>
         </aug>
         <insg>
            <ins id="I1"><p>Indiana University School of Informatics, Indianapolis, IN 46202, USA</p></ins>
            <ins id="I2"><p>Indiana Center for Systems Biology and Personalized Medicine, Indiana University &#8211; Purdue University, Indianapolis, IN 46202, USA</p></ins>
            <ins id="I3"><p>Department of Computer and Information Science, Purdue University School of Science, Indianapolis, IN 46202, USA</p></ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <supplement>
            <title>
               <p>Proceedings of the Sixth Annual MCBIOS Conference. Transformational Bioinformatics: Delivering Value from Genomes</p>
            </title>
            <editor>Jonathan D Wren (Senior Editor), Yuriy Gusev, Raphael D Isokpehi, Dan Berleant, Ulisses Braga-Neto, Dawn Wilkins and Susan Bridges</editor>
            <note>Proceedings</note>
            <url>http://www.biomedcentral.com/content/pdf/1471-2105-10-S11-info.pdf</url>
         </supplement>
         <conference>
            <title>
               <p>Sixth Annual MCBIOS Conference. Transformational Bioinformatics: Delivering Value from Genomes</p>
            </title>
            <location>Starkville, MS, USA</location>
            <date-range>20&#8211;21 February 2009</date-range>
            <url>http://www.mcbios.org/</url>
         </conference>
         <issn>1471-2105</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>Suppl 11</issue>
         <fpage>S5</fpage>
         <url>http://www.biomedcentral.com/1471-2105/10/S11/S5</url>
         <xrefbib><pubidlist><pubid idtype="pmpid">19811689</pubid><pubid idtype="doi">10.1186/1471-2105-10-S11-S5</pubid></pubidlist></xrefbib>
      </bibl>
      <history><pub><date><day>8</day><month>10</month><year>2009</year></date></pub></history>
      <cpyrt><year>2009</year><collab>Chowbina et al; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Pathway-oriented experimental and computational studies have led to a significant accumulation of biological knowledge concerning three major types of biological pathway events: molecular signaling events, gene regulation events, and metabolic reaction events. A pathway consists of a series of molecular pathway events that link molecular entities such as proteins, genes, and metabolites. There are approximately 300 biological pathway resources as of April 2009 according to the Pathguide database; however, these pathway databases generally have poor coverage or poor quality, and are difficult to integrate, due to syntactic-level and semantic-level data incompatibilities.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We developed the Human Pathway Database (HPD) by integrating heterogeneous human pathway data that are either curated at the NCI Pathway Interaction Database (PID), Reactome, BioCarta, KEGG or indexed from the Protein Lounge Web sites. Integration of pathway data at syntactic, semantic, and schematic levels was based on a unified pathway data model and data warehousing-based integration techniques. HPD provides a comprehensive online view that connects human proteins, genes, RNA transcripts, enzymes, signaling events, metabolic reaction events, and gene regulatory events. At the time of this writing HPD includes 999 human pathways and more than 59,341 human molecular entities. The HPD software provides both a user-friendly Web interface for online use and a robust relational database backend for advanced pathway querying. This pathway tool enables users to 1) search for human pathways from different resources by simply entering genes/proteins involved in pathways or words appearing in pathway names, 2) analyze pathway-protein association, 3) study pathway-pathway similarity, and 4) build integrated pathway networks. We demonstrated the usage and characteristics of the new HPD through three breast cancer case studies.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>HPD <url>http://bio.informatics.iupui.edu/HPD</url> is a new resource for searching, managing, and studying human biological pathways. Users of HPD can search against large collections of human biological pathways, compare related pathways and their molecular entity compositions, and build high-quality, expanded-scope disease pathway models. The current HPD software can help users address a wide range of pathway-related questions in human disease biology studies.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The study of biological pathways has become a central topic in molecular systems biology <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. While the precise definition of "biological pathway" is still debatable, most researchers regard a biological pathway as a series of inter-connected cellular events among biomolecular entities. A biological pathway can be activated by extracellular stimuli and lead to persistent changes of the biochemical state of cells. There are three major types of molecular pathway events (or, <it>events </it>for brevity) that define biological pathways:</p>
         <p indent="1">&#8226; <b>Signal transduction events</b>. Common in <it>signalling pathways </it>(e.g., Wnt signaling pathway <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>), these events define the interactions among molecular entities during signal transduction cascades, i.e., how external stimuli such as molecules in the cellular environment are transduced into intracellular molecular signals that are relayed among different cellular organelles. Examples of signal transduction events in signalling pathways are protein-protein interactions, protein post-translational modifications, protein translocations, and protein complex formations/dissociations.</p>
         <p indent="1">&#8226; <b>Enzymatic reaction events</b>. Common in <it>metabolic pathways </it>(e.g., glycolysis pathway), these events define chemical reactions that metabolites (as either substrates or products) and catalytic enzymes are involved in. Examples of enzymatic reaction events are catabolic reactions (breaking down of larger molecules to produce energy) and anabolic reactions (synthesis of cellular components from smaller molecules).</p>
         <p indent="1">&#8226; <b>Genetic regulation events</b>. Common in <it>genetic regulatory pathways </it>(e.g., usually abbreviated as <it>regulatory pathways</it>), these events define the dependent relationships between regulatory entities, e.g., a transcription factor that binds to specific DNA binding motifs, and target entities, and a gene whose transcription is being regulated by a transcription factor. In addition to gene regulation events, regulatory pathways may also include sRNA and sRNA target gene regulation.</p>
         <p>Collecting and modeling biological pathways are critical for interpreting "Omics" data <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. For example, pathway knowledge has been used to identify new functional modules from gene expression profiles <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp> and relate gene mutations to one another in polygenic diseases such as breast cancer <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The development of biological pathways can also help build disease biology models, from which new hypotheses of targeted drugs and robust biomarkers may be developed. For example, molecular entities in FGFR1/PI3K/AKT signaling pathways, the Akt/PKB pathway, the Met pathway, and the Wnt signaling pathway have all been extensively investigated as potential cancer drug targets <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Novel drug discovery strategies to screen small molecules based on an entire pathway instead of particular protein targets can also be developed by designing global disease-related pathway inhibitors <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Pathway studies have also shown promise in molecular diagnostic applications, e.g., identifying efficacy and toxicity biomarkers <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, and building new multi-marker panels to improve prediction of disease prognosis and development of treatment plans <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Ongoing efforts to represent, develop, and apply pathway models will be crucial for future genome medicine and personalized medicine applications <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <p>While there are approximately 300 biological pathway-related online resources reported by Pathguide <url>http://www.pathguide.org/today</url>, these resources have been developed with variable degrees of data coverage, quality, and utility <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Examples of high-quality biological pathway database resources are: SPAD <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, CST <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, STKE <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and COPE <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> for signaling pathways; TRANSFAC <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> for regulatory pathways; and KEGG <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, WIT <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, ExPASy <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, UM-BBD <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> and HumanCyc <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> for metabolic pathways. In addition, new databases such as HPRD <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, HAPPI <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, and STRING <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> have been developed to provide available high-throughput protein-protein interaction data to help fill gaps in rapidly growing molecular signaling pathway data. Recent efforts to expand biological pathway coverage beyond a single pathway event type have also been reported, e.g., NCI-PID <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, Reactome <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, BioCarta <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, Pathway Commons <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>, Panther <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, Protein Lounge <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> and WikiPathways <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. However, by comparing the coverage of high-quality protein-protein interactions from the HAPPI database <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> with annotated human pathways documented from the Reactome database, for example, it is not difficult to conclude that current coverage of known human biological pathway events is 1&#8211;2 orders of magnitude smaller than the theoretical maximum that can be defined by all known reliable human protein-protein interactions. Therefore, many pathway biology studies begin by expanding biological pathway data coverage and building high-quality integrative pathway models.</p>
         <p>The most reliable approach to expanding human pathway data coverage without sacrificing data quality continues to be database integration. While there are several computational techniques that can help predict metabolic pathways <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, regulatory pathways <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>, and signaling pathways <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, they all have limited applicability and are thus beyond the scope of this work. However, integrating biological pathway from different data sources has been challenging, due to the heterogeneity in pathway data formats, representation schemes, and retrieval methods. For example, at the <it>syntactic </it>level, while many pathway databases such as the NCI-PID <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, Reactome <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, and KEGG <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> provide both molecular component and molecular interaction data as XML documents, Protein Lounge <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> and BioCarta <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> provide pathway details (including molecular entities and pathway events) only in TXT file and embedded pathway diagrams. Pathway ontology standards such as PSI-MI <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> or BioPAX <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> or GPML <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> can help resolve syntactic level data heterogeneity; however, these standards are relatively new and are available only in a few recent systems such as cPATH <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, NCI-PID <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, Reactome <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> and WikiPathways <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. At the <it>semantic </it>level, incompatible pathway names, event representations, and molecular entity identifiers also poses challenges in querying pathway information across pathway data sources, particularly those with complementary information. Pathway names from different pathway data sources for the same pathway often differ slightly and therefore are poor choices as identifiers. Identifying pathways directly using pathway molecular entities can also be problematic, because the ensemble of molecular entities referring to the same pathway may vary among different annotation sources. Pathway molecular entities may be referred to with any public sequence identifier, which includes RefSeq ID, HGNC symbol, GenBank accession, SwissProt ID, UniProt name, KEGG ID, or IPI number. Furthermore, different databases may choose to provide available pathway information at different levels of molecular detail, e.g., with protein post-translational modification status, protein complex association status, or cellular location information. In summary, pathway data incompatibility at both the syntactic and semantic levels has inhibited the growth of high-quality integrative pathway data sources.</p>
         <p>In this work, we describe the development of a new online integrated pathway database resource, the Human Pathway Database (HPD). HPD is an ongoing pathway data warehousing project, in which we integrate all three types of human pathway data and compile additional detailed information on pathway genes, proteins, metabolites, protein complexes, and pathway events. The concept of developing an organism-specific integrated pathway database resource is not unique, e.g., MAtDB <abbrgrp><abbr bid="B44">44</abbr></abbrgrp> for managing all biological pathways for Arabidopsis and FlyMine <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> for managing both functional genomics and pathway data for <it>Drosophila</it>. Applying semantic-level data integration techniques, we collect, represent, and manage human-specific pathway data in HPD based on information from NCI-PID, Protein Lounge, KEGG, BioCarta, and Reactome databases. HPD provides a comprehensive view of current human biological pathway data, which consists of a total of 999 pathways and 59,341 molecular entities. Online HPD users may search the database for all relevant pathway information related to query protein(s), identify all pathways involving a query protein(s), and examine details related to pathway components, molecular events, and related pathways. Using three case studies, we show how to take advantage of HPD online and backend database querying capabilities to manage, query, and compare different types of biological pathways for systems biology studies. HPD is freely available online at <url>http://bio.informatics.iupui.edu/HPD</url>.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Database content statistics</p>
            </st>
            <p>By integrating human biological pathway data from five major curated sources, we have developed HPD, a human pathway data warehouse. As of the current release, HPD contained a total of 999 human pathways that cover all three major types of pathway events. These pathways cover 59,341 molecular entities and 16,271 pathway events. As of April 2009, HPD contains the highest pathway data coverage among all human biological pathway databases publically available. Since HPD does not contain new pathways derived computationally, the quality of the database remains the same as that of each pathways curated from their respective source databases. A comparison of human pathways in HPD against several common human pathway data sources is shown in Table <tblr tid="T1">1</tblr>. Top 100 pathways, genes/proteins and compounds are listed in the additional file <supplr sid="S1">1</supplr>.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>This additional file lists top 100 pathways ranked by degree (number of neighbour pathways, with which similarity score &gt; 0); top 100 genes/proteins ranked by frequency, and top 100 compounds ranked by frequency. Here the frequency of a molecule entity (i.e. gene/protein or compound) also includes times appearing in same pathways.</p>
               </text>
               <file name="1471-2105-10-S11-S5-S1.doc">
   <p>Click here for file</p>
</file>
            </suppl>
            <tbl id="T1"><title><p>Table 1</p></title><caption><p>A comparison of human pathways in HPD against several common pathway data sources.</p></caption><tblbdy cols="9">
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <b>BioCarta</b>
               <sup>1</sup>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>HumanCyc</b>
               <sup>2</sup>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>KEGG</b>
               <sup>3</sup>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Reactome</b>
               <sup>4</sup>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>PID</b>
               <sup>5</sup>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Protein Lounge</b>
               <sup>6</sup>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>HPD</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="9">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Quality</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Scope of Content</b>
            </p>
         </c>
         <c ca="left">
            <p>Metabolic and signaling pathways</p>
         </c>
         <c ca="left">
            <p>Metabolic Pathways</p>
         </c>
         <c ca="left">
            <p>Metabolic, Regulatory, signaling, disease and drug pathways</p>
         </c>
         <c ca="left">
            <p>Metabolic, signaling and regulatory pathways</p>
         </c>
         <c ca="left">
            <p>Signaling and regulatory pathways</p>
         </c>
         <c ca="left">
            <p>Metabolic, signaling and regulatory pathways</p>
         </c>
         <c ca="left">
            <p>Metabolic, signaling and regulatory pathways</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="8">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <b>Curation Type</b>
            </p>
         </c>
         <c ca="left">
            <p>Manual</p>
         </c>
         <c ca="left">
            <p>Manual and Computational Prediction</p>
         </c>
         <c ca="left">
            <p>Manual</p>
         </c>
         <c ca="left">
            <p>Manual</p>
         </c>
         <c ca="left">
            <p>Manual</p>
         </c>
         <c ca="left">
            <p>Manual</p>
         </c>
         <c ca="left">
            <p>Integrated from Manually curated database</p>
         </c>
      </r>
      <r>
         <c cspan="9">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Coverage</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Human Pathways</b>
            </p>
         </c>
         <c ca="left">
            <p>354</p>
         </c>
         <c ca="left">
            <p>327</p>
         </c>
         <c ca="left">
            <p>205</p>
         </c>
         <c ca="left">
            <p>960</p>
         </c>
         <c ca="left">
            <p>87</p>
         </c>
         <c ca="left">
            <p>427</p>
         </c>
         <c ca="left">
            <p>999</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="8">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <b>Events</b>
            </p>
         </c>
         <c ca="left">
            <p>> 3000</p>
         </c>
         <c ca="left">
            <p>> 1500</p>
         </c>
         <c ca="left">
            <p>4149</p>
         </c>
         <c ca="left">
            <p>3203</p>
         </c>
         <c ca="left">
            <p>5232</p>
         </c>
         <c ca="left">
            <p>NA</p>
         </c>
         <c ca="left">
            <p>16271</p>
         </c>
      </r>
      <r>
         <c cspan="9">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Functionality</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Related Pathways</b>
            </p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="8">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <b>Multiple Protein Search</b>
            </p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="8">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <b>Pathway-Protein Association Table</b>
            </p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="8">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <b>Pathway-Pathway Similarity Network</b>
            </p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><sup>1 </sup>Data from <url>http://www.biocarta.com</url> (as of April 2009)</p>
      <p><sup>2</sup>Data from <url>http://humancyc.org/</url> (as of October 2008)</p>
      <p><sup>3</sup>Data from <url>http://www.genome.jp/kegg/</url> (as of March 2009)</p>
      <p><sup>4</sup>Data from <url>http://www.reactome.org</url> (as of April 2009)</p>
      <p><sup>5</sup>Data from <url>http://pid.nci.nih.gov/</url> (as of April 2009)</p>
      <p><sup>6</sup>Data from <url>http://www.proteinlounge.com</url> (Commercial Version, as of April 2009)</p>
      <p>** The count of HPD pathways has 39 pathways more than reactome since HPD utilizes 2004 Reactome (version 22 release). The next version would include the recent version from Reactome</p>
   </tblfn></tbl>
         </sec>
         <sec>
            <st>
               <p>Scale distributions of integrated HPD pathways</p>
            </st>
            <p>Pathway scale can reflect the integrality of information needed for a biological topic. Here, we define pathway scale as the number of entities (nodes, including gene, protein, complex and metabolite) or events (edges, including interaction, reaction and regulation) involved in a pathway. We performed a statistical analysis on Pathway Scale Distribution (PSD) in the whole HPD, shown in Figure <figr fid="F1">1</figr>, from which we can see that the PSD defined by entity in Figure <figr fid="F1">1a</figr> is almost the same as the PSD defined by event in Figure <figr fid="F1">1b</figr>. This result indicates that the ratio of entity (node) number and event (edge) number in HPD pathways is almost fixed, which implies that the quality of HPD is consistent. We can also find that the PSD defined by gene/protein in Figure <figr fid="F1">1a</figr> is much closer to the PSD defined by entity or event than the PSD defined by metabolite, interaction, reaction and regulation, which suggest that using gene/protein number can represent pathway scale more precisely. This is the most important evidence not only for the implemental definition of pathway scale, but also for the definition of pathway-pathway similarity, both of which can be defined by the number of the Uniprot IDs mapped from genes or proteins in a pathway.</p>
            <fig id="F1"><title><p>Figure 1</p></title><caption><p>Pathway scale distributions for HPD molecular entity data and molecular event data</p></caption><text>
   <p><b>Pathway scale distributions for HPD molecular entity data and molecular event data</b>. A pathway scale refers to the number of molecular entities or molecular events involved in a given pathway. The frequency on the y-axis refers to the count of all pathways falling in the category of a particular pathway scale size on the x-axis. (a) Distributions of pathway scale by counting molecular entities. (b) Distributions of pathway scale by counting molecular events.</p>
</text><graphic file="1471-2105-10-S11-S5-1"/></fig>
            <p>We can also notice that, since entities in a pathway here also include protein complexes, each of which will only count as one entity, the PSD defined by that entity is a little bit lower than the PSD defined by gene/protein in Figure <figr fid="F1">1a</figr>. Both of the results in Figure <figr fid="F1">1a</figr> and <figr fid="F2">2b</figr> suggest that the integration process of HPD is successful by considering pathway scales, but either small pathways or large pathways may still be under-represented in the whole HPD.</p>
            <fig id="F2"><title><p>Figure 2</p></title><caption><p>An overview core functionality of the online HPD software</p></caption><text>
   <p><b>An overview core functionality of the online HPD software</b>. (a) The HPD home page showing a query protein input box (supports multiple protein search using either gene names or UniProt identifiers). (b) A Web page containing the list of pathways retrieved as a result of a query protein input. Links to obtain pathway-pathway similarity matrix and pathway-protein association matrix and to download query-retrieved pathways were also shown on the Web page. (c) An advanced search page in which users may search pathways using gene name(s), with any term that appears within pathway names or with a list of one or more HPD Pathway IDs. This search retrieves a page similar to in (b). (d) A hyperlinked Web page showing pathway detailed information on molecular entities (Proteins, Complexes and Compounds) within the query pathway. (e) A hyperlinked Web page showing detailed pathway information on pathways similar to the query pathway. (f) A hyperlinked Web page showing detailed pathway information on molecular events (Interaction/Reaction/Regulation) within the query pathway. (g) A hyperlinked Web page showing pathway image link and reference articles.</p>
</text><graphic file="1471-2105-10-S11-S5-2"/></fig>
         </sec>
         <sec>
            <st>
               <p>General online features</p>
            </st>
            <p>In Figure <figr fid="F2">2</figr>, we show the user interfaces of the Web-based online version of HPD. It supports both standard and customized user search options that allow them to specify a list of genes/proteins or keywords as the query input. Upon executing the queries, HPD can retrieve a list of related human pathways in an HTML table, with which users can further explore pathway details by clicking the hyperlink on a pathway ID in the table. In the pathway detail HTML table that pops up, all listings of molecular entities, events, related pathways, and reference resources of a specific pathway are shown. Users can also directly interact with advanced HPD features; by selecting the pathway-protein association matrix applet (See Figure <figr fid="F3">3</figr> for an example) and the pathway-pathway similarity matrix applet (See Figure <figr fid="F4">4</figr> as an example) Comprehensive hyperlinks were built so that users can search for new pathways based on visual analysis performed on the applets. User queried pathway data stored in HPD can also be downloaded as flat files without restriction to Academic users.</p>
            <fig id="F3"><title><p>Figure 3</p></title><caption><p>A BRCA1-retrieved HPD pathway-protein association matrix</p></caption><text>
   <p><b>A BRCA1-retrieved HPD pathway-protein association matrix</b>. The matrix shows few HPD pathways involving the query protein BRCA1 (on the y-axis) against pathway molecular components (on the x-axis). A red cell in the matrix indicate that the molecular entity is present in the pathway, whereas a white cell in the matrix indicate that the molecular entity is absent from the pathway. Only few HPD pathways are shown. These BRCA1-retrieved pathways are sorted by their shared protein counts among all pairwise pathway comparisons in these pathways.</p>
</text><graphic file="1471-2105-10-S11-S5-3"/></fig>
            <fig id="F4"><title><p>Figure 4</p></title><caption><p>A BRCA1-retrieved HPD pathway-pathway similarity matrix</p></caption><text>
   <p><b>A BRCA1-retrieved HPD pathway-pathway similarity matrix</b>. This is an interactive heat map containing similarity scores among the pathways involving query protein(s) or gene name(s). The tooltip shows the two pathway IDs and names corresponding to the particular cell pointed to along with their similarity score. A right click context menu shows the links (HPD_644 and HPD_340) to a Web page containing pathway information (as shown in Figure 2d). The "Compare HPD_644 against HPD_340" option will redirect to a page with a pathway-protein matrix showing proteins shared by these two pathways. The legend above the map indicates the range of similarity score (0 to 1).</p>
</text><graphic file="1471-2105-10-S11-S5-4"/></fig>
         </sec>
         <sec>
            <st>
               <p>Case studies</p>
            </st>
            <p>To demonstrate the capabilities of HPD, we show three case studies of increasing complexity and biological significance to demonstrate how HPD could be used to solve real-world biological pathway problems.</p>
         </sec>
         <sec>
            <st>
               <p>Case study 1: searching for biological pathways and their components based on a single query protein</p>
            </st>
            <p>Using the standard query box provided at the HPD home page, we can search HPD for all biological pathways involving <it>BRCA1_HUMAN </it>(a major protein involved with breast cancer susceptibility). HPD returns a list of the top 20 BRCA1-related pathways, which are ordered by decreasing number of proteins that each pathway shares among all pathway pairs from retrieved pathways. The better the rank a retrieved pathway has, the more related it should be to both the query protein BRCA1 and all BRCA1-relevant pathways. In this list, highly-ranked pathways such as "Molecular Mechanisms of Cancer", "P53 Signaling", "DNA Repair Mechanism", and "BRCA1 pathway" are all well characterized signaling pathways in breast cancer. All pathways are hyperlinked to their own detailed pathway information pages, which include molecular entities (proteins, complexes and metabolites), related pathways, events, and external pathway images and reference articles. (See Figure <figr fid="F2">2</figr> for details).</p>
            <p>The Web page with the list of pathways related to BRCA1 also contains links to download data. Four types of data, pathway list, pathway-protein association matrix, and pathway-pathway similarity scores are downloadable as flat files.</p>
            <p>Note that the pathway-protein association matrix contains proteins that are involved in the top 20 pathways retrieved based on the single protein query, sorted according to their descending maximal pathway involvement by activity count. BRCA1 related proteins are retrieved by pathway, with each of the proteins covered by at least two of the 20 pathways. A close examination reveals that many breast cancer susceptibility genes including <it>BRCA1, BRCA2, P53, PCNA </it><abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, <it>FOXA1 </it><abbrgrp><abbr bid="B47">47</abbr></abbrgrp> and <it>STK6 </it><abbrgrp><abbr bid="B48">48</abbr></abbrgrp> from recent individual studies and breast cancer biomarker genes such as <it>ERBB2, FGFR2, M3K1, and PTEN </it><abbrgrp><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr></abbrgrp>, have all been found in this list.</p>
            <p>Particularly noteworthy is the Applet in the HPD Web page that shows all the query-related biological pathways with involved proteins in a heat map. In Figure <figr fid="F3">3</figr>, <it>BRCA1 </it>related pathways and involved proteins are sorted and used as two separate dimensions of the matrix. Mousing over a color-filled cell invokes an applet tooltip message, which shows the pathway and protein names.</p>
            <p>HPD users can also visualize the pathway-pathway similarity matrix (Figure <figr fid="F4">4</figr>) which shows the similarity score among the <it>BRCA1 </it>related pathways. The pathway-pathway similarity matrix allows users to visualize a cluster of similar pathway pairs as a 2-D interactive heat map. This heat map allows users to right click on any cell (shown in Figure 4) to compare pathway pair on the heat map (future versions will include multiple pathway selection) by looking at the pathway-protein association matrix. This facilitates better understanding for deriving novel pathways most similar to <it>BRCA1 </it>related pathways.</p>
         </sec>
         <sec>
            <st>
               <p>Case study 2: developing pathway-pathway similarity networks from heterogeneous data sources</p>
            </st>
            <p>Using the advanced HPD search function online, a user can specify multiple proteins as the query input to obtain a list of most relevant pathways related to the query protein set. For example, if the user enters "BRCA1_HUMAN, FOXA1_HUMAN, STK6_HUMAN" as query inputs, a significant number of pathways (Table <tblr tid="T2">2</tblr>) related to any of the query protein inputs will be returned. To ensure retrieved pathways are relevant to the query protein inputs and to avoid overly restrictive filtering of related pathways (e.g., requiring all pathways retrieved to contain all proteins in the input query would be too restrictive), we can use the concept of pathway similarity (see Methods section for details) and apply a minimal pathway similarity threshold {<it>S</it><sub><it>i</it>, <it>j </it></sub>&#8805; 0.2, and |<it>P</it><sub><it>i </it></sub>&#8745; <it>P</it><sub><it>j</it></sub>| &gt; 2}, <it>i </it>= 1...<it>N</it>, <it>j </it>= 1...<it>N</it>. The threshold indicates at least 20% minimal shared molecular entities with no fewer than 2 shared entities between two pathways. After applying this filter, 25 pathways and 39 pathway pairs are retrieved.</p>
            <tbl id="T2"><title><p>Table 2</p></title><caption><p>A list of HPD pathways retrieved by the query BRCA1.</p></caption><tblbdy cols="4">
      <r>
         <c ca="left">
            <p>
               <b>Pathway_ID</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Pathway_Name</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Source_Name</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Shared_Proteins</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_786</b>
            </p>
         </c>
         <c ca="left">
            <p>Molecular Mechanisms of Cancer</p>
         </c>
         <c ca="left">
            <p>Protein Lounge</p>
         </c>
         <c ca="left">
            <p>264</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_953</b>
            </p>
         </c>
         <c ca="left">
            <p>JAK/STAT Pathway</p>
         </c>
         <c ca="left">
            <p>Protein Lounge</p>
         </c>
         <c ca="left">
            <p>239</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_1182</b>
            </p>
         </c>
         <c ca="left">
            <p>p53 Signaling</p>
         </c>
         <c ca="left">
            <p>Protein Lounge</p>
         </c>
         <c ca="left">
            <p>76</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_1567</b>
            </p>
         </c>
         <c ca="left">
            <p>DNA Repair Mechanism</p>
         </c>
         <c ca="left">
            <p>Protein Lounge</p>
         </c>
         <c ca="left">
            <p>69</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_336</b>
            </p>
         </c>
         <c ca="left">
            <p>Chks in Checkpoint Regulation</p>
         </c>
         <c ca="left">
            <p>Protein Lounge</p>
         </c>
         <c ca="left">
            <p>52</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_596</b>
            </p>
         </c>
         <c ca="left">
            <p>BRCA1 Pathway</p>
         </c>
         <c ca="left">
            <p>Protein Lounge</p>
         </c>
         <c ca="left">
            <p>44</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_1058</b>
            </p>
         </c>
         <c ca="left">
            <p>Aurora A signaling</p>
         </c>
         <c ca="left">
            <p>NCI-Nature Curated</p>
         </c>
         <c ca="left">
            <p>43</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_1070</b>
            </p>
         </c>
         <c ca="left">
            <p>BARD1 signaling events</p>
         </c>
         <c ca="left">
            <p>NCI-Nature Curated</p>
         </c>
         <c ca="left">
            <p>40</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_788</b>
            </p>
         </c>
         <c ca="left">
            <p>ATM Pathway</p>
         </c>
         <c ca="left">
            <p>Protein Lounge</p>
         </c>
         <c ca="left">
            <p>33</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_644</b>
            </p>
         </c>
         <c ca="left">
            <p>role of brca1 brca2 and atr in cancer susceptibility</p>
         </c>
         <c ca="left">
            <p>BioCarta</p>
         </c>
         <c ca="left">
            <p>27</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_988</b>
            </p>
         </c>
         <c ca="left">
            <p>Ubiquitin mediated proteolysis</p>
         </c>
         <c ca="left">
            <p>KEGG</p>
         </c>
         <c ca="left">
            <p>25</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_12</b>
            </p>
         </c>
         <c ca="left">
            <p>cell cycle: g2/m checkpoint</p>
         </c>
         <c ca="left">
            <p>BioCarta</p>
         </c>
         <c ca="left">
            <p>23</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_628</b>
            </p>
         </c>
         <c ca="left">
            <p>atm signaling pathway</p>
         </c>
         <c ca="left">
            <p>BioCarta</p>
         </c>
         <c ca="left">
            <p>20</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_340</b>
            </p>
         </c>
         <c ca="left">
            <p>GADD45 Pathway</p>
         </c>
         <c ca="left">
            <p>Protein Lounge</p>
         </c>
         <c ca="left">
            <p>19</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_752</b>
            </p>
         </c>
         <c ca="left">
            <p>Fanconi's Anaemia Pathway</p>
         </c>
         <c ca="left">
            <p>Protein Lounge</p>
         </c>
         <c ca="left">
            <p>17</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_548</b>
            </p>
         </c>
         <c ca="left">
            <p>DNA Damage Induced 14-3-3Sigma Signaling</p>
         </c>
         <c ca="left">
            <p>Protein Lounge</p>
         </c>
         <c ca="left">
            <p>14</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_212</b>
            </p>
         </c>
         <c ca="left">
            <p>brca1 dependent ub ligase activity</p>
         </c>
         <c ca="left">
            <p>BioCarta</p>
         </c>
         <c ca="left">
            <p>11</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_1261</b>
            </p>
         </c>
         <c ca="left">
            <p>FOXA1 transcription factor network</p>
         </c>
         <c ca="left">
            <p>NCI-Nature Curated</p>
         </c>
         <c ca="left">
            <p>10</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_707</b>
            </p>
         </c>
         <c ca="left">
            <p>Recruitment of repair and signaling proteins to double-strand breaks</p>
         </c>
         <c ca="left">
            <p>Reactome</p>
         </c>
         <c ca="left">
            <p>9</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>HPD_276</b>
            </p>
         </c>
         <c ca="left">
            <p>ATM mediated phosphorylation of repair proteins</p>
         </c>
         <c ca="left">
            <p>Reactome</p>
         </c>
         <c ca="left">
            <p>5</p>
         </c>
      </r>
   </tblbdy></tbl>
            <p>In Figure <figr fid="F5">5</figr>, we show a visual display of the pathway-pathway similarity network, using pathway similarity scores retrieved from HPD using <it>ProteoLens </it><abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. In order to generate a comprehensive perspective of breast cancer pathways seeded with the three initial query proteins, all five types of data sources have been used. This observation strengthens the claim for the necessity of integrating pathways from heterogeneous sources. HPD pathways in this case study provide a good meta-model that connects our fragmented pathway knowledge together in pathway-pathway similarity networks. This global perspective, supported by integration of otherwise incompatible pathways from different sources, enhances the chance of exposing novel insights in the search for disease drug targets and biomarkers.</p>
            <fig id="F5"><title><p>Figure 5</p></title><caption><p>A breast cancer-specific pathway-pathway similarity network</p></caption><text>
   <p><b>A breast cancer-specific pathway-pathway similarity network</b>. In this pathway-pathway similarity network, 25 HPD pathways derived from different sources are shown. The color and shapes in the diagram were drawn to indicate original HPD pathway data sources, based on the shape/color legend shown in the upper left corner. The subnetwork pathways and host pathways are also indicated with directed cyan edges. Edges are labeled with the number (in red) of molecular entities shared by the connected pathways. The count of molecular entity overlap between each pair of related pathways is labeled as red-colored numbers on the edge. Only pathway pairs with a similarity score and overlap above the threshold {<it>S</it><sub><it>i</it>, <it>j </it></sub>&#8805; 0.2, AND |<it>P</it><sub><it>i </it></sub>&#8745; <it>P</it><sub><it>j</it></sub>| >2} are shown.</p>
</text><graphic file="1471-2105-10-S11-S5-5"/></fig>
            <p>In Figure <figr fid="F6">6</figr>, we show a comparison of using the "Multiple Protein Search" feature among three databases: HPD, KEGG, and Panther. Three gene names BRCA1, FOXA1, and AURKA were used to build a common query gene set. The KEGG Genes database was manually searched and only one KEGG Pathway was found, using the "Search object in Pathways" functionality of KEGG (actual corresponding KEGG gene ID entered: hsa:672, hsa:3169, and hsa:6790). Panther had a "Batch ID search" which accepted the three gene symbols and retrieved only four unique pathways. HPD not only retrieved more pathways (n = 25), but also supported multiple identifier types as inputs, e.g., UniProt names.</p>
            <fig id="F6"><title><p>Figure 6</p></title><caption><p>A comparison of HPD "Multiple Protein Search" feature with those in Panther and KEGG</p></caption><text>
   <p><b>A comparison of HPD "Multiple Protein Search" feature with those in Panther and KEGG</b>. The example shows that different pathway databases retrieved different numbers of pathways. Three genes (AURKA, BRCA1, and FOXA1) were used to build this example. The KEGG database returned only one pathway in the result. Panther returned four pathways. HPD returned 25 pathways. The difference is primarily due to the use of "similar pathways" concept to allow retrieval of pathways that matched only partial gene/protein list.</p>
</text><graphic file="1471-2105-10-S11-S5-6"/></fig>
         </sec>
         <sec>
            <st>
               <p>Case study 3: developing integrated pathway models from heterogeneous sources</p>
            </st>
            <p>While pathway-pathway similarity networks are useful for generating global perspectives on the relationships between pathways, the next case study demonstrates how to connect different types of biological pathways within HPD to form integrated pathway networks. Since pathway data managed at HPD is integrated at the schematic level, "deep integration" and "deep integrative analysis" are possible. We will use two breast cancer-related proteins, BRCA1_HUMAN and FOXA1_HUMAN, as an example. According to the HPD data model (See additional file <supplr sid="S2">2</supplr> for details), the table <it>Connect_mol_updated </it>contains mappings among pathways, interactions, and molecules. To search for all related pathways containing the above two proteins within the HPD data warehouse, we can execute the following SQL query:</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>This additional file describes the pathway entity-relationship (ER) data model for HPD pathway integrations in detail.</p>
               </text>
               <file name="1471-2105-10-S11-S5-S2.xls">
   <p>Click here for file</p>
</file>
            </suppl>
            <p indent="1">
               <it>SELECT pathway_name,mol_in, Mol_In_updated, name_in, Mol_out,</it>
            </p>
            <p indent="1">
               <it>Mol_Out_updated, name_out, interaction_type, SYS_CONNECT_BY_PATH(Mol_In, '/') "Path"</it>
            </p>
            <p indent="1">
               <it>FROM connect_mol_updated</it>
            </p>
            <p indent="1">
               <it>START WITH name_in = 'BRCA1_HUMAN'</it>
            </p>
            <p indent="1">
               <it>CONNECT BY nocycle PRIOR Mol_Out_updated=Mol_In_updated</it>
            </p>
            <p indent="1">
               <it>and level &lt; 3</it>
            </p>
            <p indent="1">
               <it>INTERSECT</it>
            </p>
            <p indent="1">
               <it>SELECT pathway_name,mol_in, Mol_In_updated, name_in, Mol_out,</it>
            </p>
            <p indent="1">
               <it>Mol_Out_updated, name_out, interaction_type, SYS_CONNECT_BY_PATH(Mol_In, '/') "Path"</it>
            </p>
            <p indent="1">
               <it>FROM connect_mol_updated</it>
            </p>
            <p indent="1">
               <it>START WITH name_in = 'FOXA1_HUMAN'</it>
            </p>
            <p indent="1">
               <it>CONNECT BY nocycle PRIOR Mol_Out_updated=Mol_In_updated</it>
            </p>
            <p indent="1">
               <it>and level &lt; 3;</it>
            </p>
            <p>We organize the results and present our final pathway analysis results in Figure <figr fid="F7">7</figr>, which shows many relationships not found in individual fragmented biological pathways separately. The <it>FOXA1 </it>transcription factor network contains 9-cis-Retinoic acid which regulates <it>FOXA1 </it>(Hepatocyte nuclear factor 3-alpha) <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>; it also contains <it>BRCA1 </it>(Breast cancer type 1 susceptibility protein) and <it>CYP2C18 </it>(Cytochrome P450 2C18), which is positively regulated by <it>FOXA1 </it><abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. Arachidonic acid metabolism from KEGG PATHWAY involves arachidonic acid, which can be catalyzed by <it>CYP2C18 </it><abbrgrp><abbr bid="B54">54</abbr></abbrgrp> to produce 14,15-epoxy-5,8,11-eicosatrienoic acid. This intermediate product can be further catalyzed by <it>EPHX2 </it>(Epoxide hydrolase 2) to produce 14,15-dihydroxyeicosatrienoic acid <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. In BioCarta, ATM signaling pathway involves <it>BRCA1</it>, which positively regulates <it>RAD51 </it>which regulates DNA Repair <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. The "Presynaptic phase of homologous DNA pairing and the strand exchange" pathway of Reactome contains <it>BRCA2</it>, which binds with <it>RAD51 </it>to form the <it>RAD51:BRCA2 </it>complex <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. Human protein-protein interactions data could also be retrieved and show that <it>EPHX2 </it>interacts with <it>NSDHL </it>(Sterol-4-alpha-carboxylate 3-dehydrogenase, decarboxylating). Phosphatidylinositol-3,4,5-triphosphate (PIP3), a lipid molecule generated by the action of phosphoinositide-3-kinase (<it>PI3K</it>), can be induced by a variety of stimuli. PIP3 is thought to be the major physiological substrate for <it>PTEN</it>, a phosphatase that can dephosphorylate many phosphatidyl inositides, which has been implicated in tumorigenesis <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. Activation of protein kinase B <it>(PKB)/Akt </it>contribute to resistance to antiproliferative signals and breast cancer progression in part by impairing the nuclear import and action of <it>p27 (CDKN) </it><abbrgrp><abbr bid="B59">59</abbr></abbrgrp>.</p>
            <fig id="F7"><title><p>Figure 7</p></title><caption><p>An integrated pathway model involving FOXA1 in breast cancer</p></caption><text>
   <p><b>An integrated pathway model involving FOXA1 in breast cancer</b>. The figure shows how information from different pathway database sources are readily integrated, queried, and analyzed together in HPD for FOXA1-related breast cancer signaling studies.</p>
</text><graphic file="1471-2105-10-S11-S5-7"/></fig>
            <p>The integrated pathway model based on HPD pathways can be used as an investigative tool for disease diagnostic and therapeutic applications. For example, 9-cis-Retinoic acid is recognized as a possible breast cancer biomarker <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> and <it>FOXA1 </it>has gained increasing attention as a possible breast cancer therapeutic target <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. The <it>BRCA2-RAD51 </it>interaction is essential for DNA repairs and has also been suggested as a novel target for anti-breast cancer drugs <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. In addition to breast cancer, links between breast cancer and other diseases can be studied. For example, increased risk of hereditary prostate cancer is known to be a result of polymorphism in the <it>CDKN1B (p27) </it>gene <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. Epoxide hydrolase 2 has been characterized as a key mediator molecule in hypertensive, cardiovascular, inflammatory, pulmonary, and diabetic-related diseases <abbrgrp><abbr bid="B64">64</abbr><abbr bid="B65">65</abbr><abbr bid="B66">66</abbr></abbrgrp>. CHILD syndrome, an X-linked dominant trait with lethality for male embryos, can also be traced to mutations in <it>NSDHL</it>, a gene playing crucial roles in the cholesterol biosynthetic pathway <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>.</p>
            <p>Through this case study, we have shown the significance of integrating pathway information from different types and data sources. The interconnected network analysis offers researchers a rare opportunity to gain global perspectives on events previously perceived in isolation. This "deep integrative analysis" opportunity cannot be readily obtained by using multiple online pathway databases. For example, NCI Nature Curated Pathway Interaction Database has a 'Connected Molecules' functionality, which may only be used to find molecular connections within the same pathway data source. In all, the convenience of building new integrative pathway models with the new HPD may greatly facilitate new drug development and biomarker discovery.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We developed HPD as an integrated pathway database system to manage, query, and analyze human biological pathways. HPD integrates all three types of biological pathways from five heterogeneous pathway database sources at syntactic, semantic, and schematic levels, primarily based on data warehousing techniques driven by a unified pathway data model. Pathway molecules, interactions, chemical reactions, and similar pathways can be searched, displayed, and downloaded from a unified online user interface. The current HPD software can help users address a wide range of pathway-related questions in human disease biology studies.</p>
         <p>While the human Reactome is still far from complete, an integrative pathway database such as HPD has the capability to help researchers establish a global perspective necessary for understanding molecular mechanisms and develop biomedical applications. We will further expand the database to include pathways from HumanCyc <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, Wikipathways <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, NetPath <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>, Panther <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> and TRANSFAC <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. We also plan to integrate protein-protein interaction data from HAPPI <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> with the aim of discovering novel pathways when combined with HPD. Additional functions will also be provided such as pathway reconstruction where users can select pathways and derive a reconstructed pathway expanded with protein-protein interaction data. With ongoing efforts, HPD can become a useful resource, linking proteins, genes, RNAs, signaling reactions, and gene regulatory events for systems biology applications.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Pathway data sources</p>
            </st>
            <p>We show an overview of the data integration process in Figure <figr fid="F8">8</figr>. Pathway data in HPD were collected or indexed from five different sources, i.e., NCI-Nature Curated data <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, BioCarta <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, Protein Lounge <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, Reactome <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> and KEGG <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The NCI-Nature Curated, Reactome and BioCarta data sets were all downloaded from Nature pathway interaction database Website and kept updated as of April 2009 release of the production HPD Website. In particular, the NCI-Nature Curated pathways are curated by Nature Publishing Group editors based on known biomolecular interactions and key cellular processes of signaling/regulatory pathways. The Reactome database was downloaded in December 2007 (Release version 22). Pathway molecules from both NCI-Nature and Reactome were identified by their UniProt identifiers and annotated with post-translational modification information. Pathway molecules from BioCarta were identified by Entrez Gene IDs without post-translational modification annotations. In all three data sets, each pathway was represented as a series of events, each of which consists of molecules in one of the following four roles: input molecule, output molecule, agent, and inhibitor. Content from the Protein Lounge was indexed by a web crawler accessing a publicly available Web site. The crawled content was verified with that provided as a site license to the authors and other users at Indiana University Simon Cancer Center. Since Protein Lounge is a commercial database that contains curated signaling, transduction, and metabolic pathways, we chose to index instead of integrate the full content into the data warehouse, which only indexed pathway-involving protein IDs and references to pathway diagram drawings. Original pathway molecules derived from Protein Lounge were identified by RefSeq ID or GI accession numbers. KEGG contains all known metabolic pathways and a small number of regulatory pathways and transport mechanisms. The KEGG PATHWAY database contains graphical representations of pathways and lists of enzymes and reactions within the pathways. All specific pathway maps and overviews were manually drawn and contained links to additional information on pathway compounds, enzymes and genes. Pathway molecules from KEGG are identified by E.C. Numbers which are mapped to KEGG Gene IDs and then to UniProt IDs. The total count of initial pathways, proteins, compounds, protein complexes, and pathway interactions are shown in Figure <figr fid="F8">8</figr>.</p>
            <fig id="F8"><title><p>Figure 8</p></title><caption><p>An overview of pathway data integration process</p></caption><text>
   <p><b>An overview of pathway data integration process</b>. The figure shows the whole process of pathway data integration and the basic statistics of pathway data sources.</p>
</text><graphic file="1471-2105-10-S11-S5-8"/></fig>
         </sec>
         <sec>
            <st>
               <p>Pathway data integration</p>
            </st>
            <p>We developed a model-driven approach for syntactic, semantic, and schematic level integrations of heterogeneous pathway data. Since pathway data were collected in a variety of formats, Python XML/HTML data parsers were developed to convert them into a common tab-delimited textual format to ensure syntactic level data compatibility. The semantic compatibility of the data was enforced by cleaning up data attributes and data values to keep them consistent, using a standard data extraction, transformation, and loading (ETL) process characteristics of data warehousing-based data integration approaches. All pre-processed data were parsed, cleaned, and loaded into data warehouse staging tables before reaching their final database table destinations. To maintain schematic data compatibilities, we model relationships among different pathway concepts using an entity-relationship (ER) data model (for more details on the data model, please refer to the documentation on the HPD Website and additional file <supplr sid="S2">2</supplr>). We further mapped all the involved proteins or genes to their UniProt Name Identifiers <abbrgrp><abbr bid="B69">69</abbr></abbrgrp> and metabolites to their KEGG compound IDs before loading the HPD pathway data into data warehouse tables defined by the ER data model. All HPD molecular entities, events, and pathways were assigned unique HPD-specific identifiers.</p>
         </sec>
         <sec>
            <st>
               <p>Online HPD software design</p>
            </st>
            <p>The HPD database was developed as a data warehouse application. The online version of HPD is a standard 3-tier Web application, which consists of an Oracle 10 g database at the backend database server layer, Apache/PHP server scripts at the middleware application Web server layer, and CSS-driven Web pages presented at the browser.</p>
         </sec>
         <sec>
            <st>
               <p>Pathway similarity measure</p>
            </st>
            <p>The pathway similarity measure can be defined as the extent of overlaps, e.g., common number of genes/proteins, shared between two different pathways. We define a pathway-pathway similarity score <it>S</it><sub><it>i</it>, <it>j </it></sub>based on equation (2) <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>. Both overlap and similarity score values can be downloaded from HPD Website.</p>
            <p>
               <display-formula id="M2">
                  <graphic file="1471-2105-10-S11-S5-i1.gif"/>
               </display-formula>
            </p>
            <p>Here, <it>N </it>denotes total number of pathways. <it>P</it><sub><it>i </it></sub>and <it>P</it><sub><it>j </it></sub>denote two different pathways, while |<it>P</it><sub><it>i</it></sub>| and |<it>P</it><sub><it>j</it></sub>| are the numbers of molecules that can be mapped to UniProt ID respectively in these two pathways. Their intersection <it>P</it><sub><it>i </it></sub>&#8745; <it>P</it><sub><it>j </it></sub>denotes a common set of molecules that can be mapped to the same UniProt ID, while their union <it>P</it><sub><it>i </it></sub>&#8746; <it>P</it><sub><it>j </it></sub>is calculated as |<it>P</it><sub><it>i</it></sub>| + |<it>P</it><sub><it>j</it></sub>| - |<it>P</it><sub><it>i </it></sub>&#8745; <it>P</it><sub><it>j</it></sub>|. Here <it>&#945; </it>is a weight coefficient among [0, 1], and we currently use <it>&#945; </it>= 0.8 to count varying degree of contributions from calculations based both on the <it>overlap </it>(left item <it>S</it><sub><it>L</it></sub>) and the <it>cover </it>(right item <it>S</it><sub><it>R</it></sub>).</p>
            <p>We can also make special considerations for <it>subnetwork relationship </it>(defined by the Nature Pathway Interaction database at <url>http://pid.nci.nih.gov/</url>. For subnetwork relationship, we define <it>S</it><sub><it>i</it>, <it>j </it></sub>= 1.01, if pathway <it>P</it><sub><it>i </it></sub>has a subnetwork as <it>P</it><sub><it>j</it></sub>, and <it>S</it><sub><it>i</it>, <it>j </it></sub>= -1.01 if pathway <it>P</it><sub><it>i </it></sub>is a subnetwork of <it>P</it><sub><it>j</it></sub>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>JYC conceived the initial work, designed the method for the database construction, and drafted the manuscript. SRC and HNK implemented the design, and developed the database from integrated data sets. SRC, FZ and PML implemented the Web-based database interface. XW performed HPD pathway analysis to generate the first two case studies and statistical analysis. RP and SRC together implemented data warehousing strategies, provided ID mapping tables, and performed data processing, extraction, transformation, and loading. All authors are involved in the revisions of the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The HPD database was developed with research funding from Department of Defense (DOD) Breast Cancer Research Program (BCRP) Concept Award (W81XWH-08-1-0623) to Dr. Jake Chen. We thank Stephanie Burks and Joseph Rinkovsky from the University Information Technology and Services (UITS) at Indiana University for providing generous support in Oracle 10 g database administration and configuring the Web server for the project. We especially thank David Michael Grobe from UITS at Indiana University for thoroughly proofreading the manuscript and provided helpful comments for this project.</p>
            <p>This article has been published as part of <it>BMC Bioinformatics </it>Volume 10 Supplement 11, 2009: Proceedings of the Sixth Annual MCBIOS Conference. Transformational Bioinformatics: Delivering Value from Genomes. The full contents of the supplement are available online at <url>http://www.biomedcentral.com/1471-2105/10?issue=S11</url>.</p>
         </sec>
      </ack>
      <refgrp><bibl id="B1"><title><p>Pathway information for systems biology</p></title><aug><au><snm>Cary</snm><fnm>MP</fnm></au><au><snm>Bader</snm><fnm>GD</fnm></au><au><snm>Sander</snm><fnm>C</fnm></au></aug><source>FEBS Lett</source><pubdate>2005</pubdate><volume>579</volume><issue>8</issue><fpage>1815</fpage><lpage>1820</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.febslet.2005.02.005</pubid><pubid idtype="pmpid" link="fulltext">15763557</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>The Wnt signaling pathway in development and disease</p></title><aug><au><snm>Logan</snm><fnm>CY</fnm></au><au><snm>Nusse</snm><fnm>R</fnm></au></aug><source>Annu Rev Cell Dev Biol</source><pubdate>2004</pubdate><volume>20</volume><fpage>781</fpage><lpage>810</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.cellbio.20.010403.113126</pubid><pubid idtype="pmpid" link="fulltext">15473860</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Bioinformatics applications for pathway analysis of microarray data</p></title><aug><au><snm>Werner</snm><fnm>T</fnm></au></aug><source>Curr Opin Biotechnol</source><pubdate>2008</pubdate><volume>19</volume><issue>1</issue><fpage>50</fpage><lpage>54</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.copbio.2007.11.005</pubid><pubid idtype="pmpid" link="fulltext">18207385</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Pathway analysis reveals functional convergence of gene expression profiles in breast cancer</p></title><aug><au><snm>Shen</snm><fnm>R</fnm></au><au><snm>Chinnaiyan</snm><fnm>AM</fnm></au><au><snm>Ghosh</snm><fnm>D</fnm></au></aug><source>BMC Med Genomics</source><pubdate>2008</pubdate><volume>1</volume><fpage>28</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1755-8794-1-28</pubid><pubid idtype="pmcid">2447843</pubid><pubid idtype="pmpid" link="fulltext">18588682</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Profiling of estrogen up- and down-regulated gene expression in human breast cancer cells: Insights into gene networks and pathways underlying estrogenic control of proliferation and cell phenotype</p></title><aug><au><snm>Frasor</snm><fnm>J</fnm></au><au><snm>Danes</snm><fnm>JM</fnm></au><au><snm>Komm</snm><fnm>B</fnm></au><au><snm>Chang</snm><fnm>KCN</fnm></au><au><snm>Lyttle</snm><fnm>CR</fnm></au><au><snm>Katzenellenbogen</snm><fnm>BS</fnm></au></aug><source>Endocrinology</source><pubdate>2003</pubdate><volume>144</volume><issue>10</issue><fpage>4562</fpage><lpage>4574</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1210/en.2003-0567</pubid><pubid idtype="pmpid" link="fulltext">12959972</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Functional classification analysis of somatically mutated genes in human breast and colorectal cancers</p></title><aug><au><snm>Chittenden</snm><fnm>TW</fnm></au><au><snm>Howe</snm><fnm>EA</fnm></au><au><snm>Culhane</snm><fnm>AC</fnm></au><au><snm>Sultana</snm><fnm>R</fnm></au><au><snm>Taylor</snm><fnm>JM</fnm></au><au><snm>Holmes</snm><fnm>C</fnm></au><au><snm>Quackenbush</snm><fnm>J</fnm></au></aug><source>Genomics</source><pubdate>2008</pubdate><volume>91</volume><issue>6</issue><fpage>508</fpage><lpage>511</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ygeno.2008.03.002</pubid><pubid idtype="pmcid">2492759</pubid><pubid idtype="pmpid" link="fulltext">18434084</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>FGFR1/PI3K/AKT signaling pathway is a novel target for antiangiogenic effects of the cancer drug fumagillin (TNP-470)</p></title><aug><au><snm>Chen</snm><fnm>GJ</fnm></au><au><snm>Weylie</snm><fnm>B</fnm></au><au><snm>Hu</snm><fnm>C</fnm></au><au><snm>Zhu</snm><fnm>J</fnm></au><au><snm>Forough</snm><fnm>R</fnm></au></aug><source>J Cell Biochem</source><pubdate>2007</pubdate><volume>101</volume><issue>6</issue><fpage>1492</fpage><lpage>1504</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/jcb.21265</pubid><pubid idtype="pmpid" link="fulltext">17295210</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>The Akt/PKB pathway: molecular target for cancer drug discovery</p></title><aug><au><snm>Cheng</snm><fnm>JQ</fnm></au><au><snm>Lindsley</snm><fnm>CW</fnm></au><au><snm>Cheng</snm><fnm>GZ</fnm></au><au><snm>Yang</snm><fnm>H</fnm></au><au><snm>Nicosia</snm><fnm>SV</fnm></au></aug><source>Oncogene</source><pubdate>2005</pubdate><volume>24</volume><issue>50</issue><fpage>7482</fpage><lpage>7492</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.onc.1209088</pubid><pubid idtype="pmpid" link="fulltext">16288295</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>The Met pathway: master switch and drug target in cancer progression</p></title><aug><au><snm>Mazzone</snm><fnm>M</fnm></au><au><snm>Comoglio</snm><fnm>PM</fnm></au></aug><source>FASEB J</source><pubdate>2006</pubdate><volume>20</volume><issue>10</issue><fpage>1611</fpage><lpage>1621</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1096/fj.06-5947rev</pubid><pubid idtype="pmpid" link="fulltext">16873884</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>The Wnt/beta-catenin signaling pathway as a target in drug discovery</p></title><aug><au><snm>Takahashi-Yanaga</snm><fnm>F</fnm></au><au><snm>Sasaguri</snm><fnm>T</fnm></au></aug><source>J Pharmacol Sci</source><pubdate>2007</pubdate><volume>104</volume><issue>4</issue><fpage>293</fpage><lpage>302</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1254/jphs.CR0070024</pubid><pubid idtype="pmpid" link="fulltext">17721040</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Target-oriented and diversity-oriented organic synthesis in drug discovery</p></title><aug><au><snm>Schreiber</snm><fnm>SL</fnm></au></aug><source>Science</source><pubdate>2000</pubdate><volume>287</volume><issue>5460</issue><fpage>1964</fpage><lpage>1969</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.287.5460.1964</pubid><pubid idtype="pmpid" link="fulltext">10720315</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Metabolomics in pharmaceutical research and development: Metabolites, mechanisms and pathways</p></title><aug><au><snm>Xu</snm><fnm>EY</fnm></au><au><snm>Schaefer</snm><fnm>WH</fnm></au><au><snm>Xu</snm><fnm>QW</fnm></au></aug><source>Current Opinion in Drug Discovery &amp; Development</source><pubdate>2009</pubdate><volume>12</volume><issue>1</issue><fpage>40</fpage><lpage>52</lpage><xrefbib><pubid idtype="pmpid">19152212</pubid></xrefbib></bibl><bibl id="B13"><title><p>Survival-signaling pathway as a promising target for cancer chemotherapy</p></title><aug><au><snm>Fujita</snm><fnm>N</fnm></au><au><snm>Tsuruo</snm><fnm>T</fnm></au></aug><source>Cancer chemotherapy and pharmacology</source><pubdate>2003</pubdate><volume>52</volume><issue>Suppl 1</issue><fpage>S24</fpage><lpage>28</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00280-003-0591-2</pubid><pubid idtype="pmpid" link="fulltext">12819931</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Genomic strategies for personalized cancer therapy</p></title><aug><au><snm>Garman</snm><fnm>KS</fnm></au><au><snm>Nevins</snm><fnm>JR</fnm></au><au><snm>Potti</snm><fnm>A</fnm></au></aug><source>Hum Mol Genet</source><pubdate>2007</pubdate><volume>16</volume><issue>Spec No 2</issue><fpage>R226</fpage><lpage>232</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/hmg/ddm184</pubid><pubid idtype="pmpid" link="fulltext">17911165</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Genomic medicine and the future of health care</p></title><aug><au><snm>Sander</snm><fnm>C</fnm></au></aug><source>Science</source><pubdate>2000</pubdate><volume>287</volume><issue>5460</issue><fpage>1977</fpage><lpage>1978</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.287.5460.1977</pubid><pubid idtype="pmpid" link="fulltext">10755952</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>An integrated database SPAD (Signaling PAthway Database) for signal transduction and genetic information</p></title><aug><au><snm>Tateishi HS</snm><fnm>Naoko</fnm></au><au><snm>Kuhara</snm><fnm>Satoru</fnm></au><au><snm>Takagi</snm><fnm>Toshihisa</fnm></au><au><snm>Kanehisa</snm><fnm>Minoru</fnm></au></aug><source>Genome Informatics</source><pubdate>1995</pubdate><volume>6</volume><fpage>160</fpage><lpage>161</lpage></bibl><bibl id="B17"><title><p>CST &#8211; Cell Signaling Technology Pathway Database</p></title><url>http://www.cellsignal.com/</url></bibl><bibl id="B18"><title><p>STKE &#8211; Signal Transduction Knowledge Environment</p></title><url>http://www.stke.org/</url></bibl><bibl id="B19"><title><p>COPE &#8211; Cytokines and Cells Online Pathfinder Encyclopedia</p></title><url>http://www.copewithcytokines.de/</url></bibl><bibl id="B20"><title><p>TRANSFAC: an integrated system for gene expression regulation</p></title><aug><au><snm>Wingender</snm><fnm>E</fnm></au><au><snm>Chen</snm><fnm>X</fnm></au><au><snm>Hehl</snm><fnm>R</fnm></au><au><snm>Karas</snm><fnm>H</fnm></au><au><snm>Liebich</snm><fnm>I</fnm></au><au><snm>Matys</snm><fnm>V</fnm></au><au><snm>Meinhardt</snm><fnm>T</fnm></au><au><snm>Pruss</snm><fnm>M</fnm></au><au><snm>Reuter</snm><fnm>I</fnm></au><au><snm>Schacherer</snm><fnm>F</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>2000</pubdate><volume>28</volume><issue>1</issue><fpage>316</fpage><lpage>319</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/28.1.316</pubid><pubid idtype="pmcid">102445</pubid><pubid idtype="pmpid" link="fulltext">10592259</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>KEGG: Kyoto Encyclopedia of Genes and Genomes</p></title><aug><au><snm>Kanehisa</snm><fnm>M</fnm></au><au><snm>Goto</snm><fnm>S</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>2000</pubdate><volume>28</volume><issue>1</issue><fpage>27</fpage><lpage>30</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/28.1.27</pubid><pubid idtype="pmcid">102409</pubid><pubid idtype="pmpid" link="fulltext">10592173</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>WIT: integrated system for high-throughput genome sequence analysis and metabolic reconstruction</p></title><aug><au><snm>Overbeek</snm><fnm>R</fnm></au><au><snm>Larsen</snm><fnm>N</fnm></au><au><snm>Pusch</snm><fnm>GD</fnm></au><au><snm>D'Souza</snm><fnm>M</fnm></au><au><snm>Selkov</snm><fnm>E</fnm><suf>Jr</suf></au><au><snm>Kyrpides</snm><fnm>N</fnm></au><au><snm>Fonstein</snm><fnm>M</fnm></au><au><snm>Maltsev</snm><fnm>N</fnm></au><au><snm>Selkov</snm><fnm>E</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2000</pubdate><volume>28</volume><issue>1</issue><fpage>123</fpage><lpage>125</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/28.1.123</pubid><pubid idtype="pmcid">102471</pubid><pubid idtype="pmpid" link="fulltext">10592199</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>ExPASy &#8211; Biochemical Pathways</p></title><url>http://www.expasy.ch/cgi-bin/search-biochem-index</url></bibl><bibl id="B24"><title><p>The University of Minnesota Biocatalysis/Biodegradation Database: microorganisms, genomics and prediction</p></title><aug><au><snm>Ellis</snm><fnm>LBM</fnm></au><au><snm>Hershberger</snm><fnm>CD</fnm></au><au><snm>Wackett</snm><fnm>LP</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>2000</pubdate><volume>28</volume><issue>1</issue><fpage>377</fpage><lpage>379</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/28.1.377</pubid><pubid idtype="pmcid">102401</pubid><pubid idtype="pmpid" link="fulltext">10592280</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Computational prediction of human metabolic pathways from the complete human genome</p></title><aug><au><snm>Romero</snm><fnm>P</fnm></au><au><snm>Wagg</snm><fnm>J</fnm></au><au><snm>Green</snm><fnm>ML</fnm></au><au><snm>Kaiser</snm><fnm>D</fnm></au><au><snm>Krummenacker</snm><fnm>M</fnm></au><au><snm>Karp</snm><fnm>PD</fnm></au></aug><source>Genome Biology</source><pubdate>2005</pubdate><volume>6</volume><issue>1</issue><fpage>R2</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2004-6-1-r2</pubid><pubid idtype="pmcid">549063</pubid><pubid idtype="pmpid" link="fulltext">15642094</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Human Protein Reference Database: Building a biological platform for systems biology</p></title><aug><au><snm>Peri</snm><fnm>S</fnm></au><au><snm>Navarro</snm><fnm>JD</fnm></au><au><snm>Amanchy</snm><fnm>R</fnm></au><au><snm>Kristiansen</snm><fnm>T</fnm></au><au><snm>Jonnalagadda</snm><fnm>J</fnm></au><au><snm>Vineeth</snm><fnm>S</fnm></au><au><snm>Niranjan</snm><fnm>V</fnm></au><au><snm>Muthusamy</snm><fnm>B</fnm></au><au><snm>Gandhi</snm><fnm>TKB</fnm></au><au><snm>Gronborg</snm><fnm>M</fnm></au><etal/></aug><source>American Journal of Human Genetics</source><pubdate>2003</pubdate><volume>73</volume><issue>5</issue><fpage>429</fpage><lpage>429</lpage></bibl><bibl id="B27"><title><p>HAPPI: an Online Database of Comprehensive Human Annotated and Predicted Protein Interactions</p></title><aug><au><snm>Chen</snm><fnm>JYS</fnm></au><etal/></aug><source>BMC Genomics</source><pubdate>2009</pubdate><volume>10</volume><issue>Suppl 1</issue><fpage>S16</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-10-S1-S16</pubid><pubid idtype="pmcid">2709259</pubid><pubid idtype="pmpid" link="fulltext">19594875</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>STRING: known and predicted protein-protein associations, integrated and transferred across organisms</p></title><aug><au><snm>von Mering</snm><fnm>C</fnm></au><au><snm>Jensen</snm><fnm>LJ</fnm></au><au><snm>Snel</snm><fnm>B</fnm></au><au><snm>Hooper</snm><fnm>SD</fnm></au><au><snm>Krupp</snm><fnm>M</fnm></au><au><snm>Foglierini</snm><fnm>M</fnm></au><au><snm>Jouffre</snm><fnm>N</fnm></au><au><snm>Huynen</snm><fnm>MA</fnm></au><au><snm>Bork</snm><fnm>P</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2005</pubdate><issue>33 Database</issue><fpage>D433</fpage><lpage>437</lpage><xrefbib><pubidlist><pubid idtype="pmcid">539959</pubid><pubid idtype="pmpid" link="fulltext">15608232</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>PID: the Pathway Interaction Database</p></title><aug><au><snm>Schaefer</snm><fnm>CF</fnm></au><au><snm>Anthony</snm><fnm>K</fnm></au><au><snm>Krupa</snm><fnm>S</fnm></au><au><snm>Buchoff</snm><fnm>J</fnm></au><au><snm>Day</snm><fnm>M</fnm></au><au><snm>Hannay</snm><fnm>T</fnm></au><au><snm>Buetow</snm><fnm>KH</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>2009</pubdate><volume>37</volume><fpage>D674</fpage><lpage>D679</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn653</pubid><pubid idtype="pmcid">2686461</pubid><pubid idtype="pmpid" link="fulltext">18832364</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Reactome knowledgebase of human biological pathways and processes</p></title><aug><au><snm>Matthews</snm><fnm>L</fnm></au><au><snm>Gopinath</snm><fnm>G</fnm></au><au><snm>Gillespie</snm><fnm>M</fnm></au><au><snm>Caudy</snm><fnm>M</fnm></au><au><snm>Croft</snm><fnm>D</fnm></au><au><snm>de Bono</snm><fnm>B</fnm></au><au><snm>Garapati</snm><fnm>P</fnm></au><au><snm>Hemish</snm><fnm>J</fnm></au><au><snm>Hermjakob</snm><fnm>H</fnm></au><au><snm>Jassal</snm><fnm>B</fnm></au><etal/></aug><source>Nucleic Acids Research</source><pubdate>2009</pubdate><volume>37</volume><fpage>D619</fpage><lpage>D622</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn863</pubid><pubid idtype="pmcid">2686536</pubid><pubid idtype="pmpid" link="fulltext">18981052</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>BioCarta</p></title><url>http://www.biocarta.com/index.asp</url></bibl><bibl id="B32"><title><p>Pathway Commons</p></title><url>http://www.pathwaycommons.org/pc/home.do</url></bibl><bibl id="B33"><title><p>PANTHER: a library of protein families and subfamilies indexed by function</p></title><aug><au><snm>Thomas</snm><fnm>PD</fnm></au><au><snm>Campbell</snm><fnm>MJ</fnm></au><au><snm>Kejariwal</snm><fnm>A</fnm></au><au><snm>Mi</snm><fnm>H</fnm></au><au><snm>Karlak</snm><fnm>B</fnm></au><au><snm>Daverman</snm><fnm>R</fnm></au><au><snm>Diemer</snm><fnm>K</fnm></au><au><snm>Muruganujan</snm><fnm>A</fnm></au><au><snm>Narechania</snm><fnm>A</fnm></au></aug><source>Genome Res</source><pubdate>2003</pubdate><volume>13</volume><issue>9</issue><fpage>2129</fpage><lpage>2141</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.772403</pubid><pubid idtype="pmcid">403709</pubid><pubid idtype="pmpid" link="fulltext">12952881</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Protein Lounge</p></title><url>http://www.proteinlounge.com/</url></bibl><bibl id="B35"><title><p>WikiPathways: Pathway editing for the people</p></title><aug><au><snm>Pico</snm><fnm>AR</fnm></au><au><snm>Kelder</snm><fnm>T</fnm></au><au><snm>van Iersel</snm><fnm>MP</fnm></au><au><snm>Hanspers</snm><fnm>K</fnm></au><au><snm>Conklin</snm><fnm>BR</fnm></au><au><snm>Evelo</snm><fnm>C</fnm></au></aug><source>Plos Biology</source><pubdate>2008</pubdate><volume>6</volume><issue>7</issue><fpage>1403</fpage><lpage>1407</lpage><xrefbib><pubid idtype="doi">10.1371/journal.pbio.0060184</pubid></xrefbib></bibl><bibl id="B36"><title><p>Computational prediction of human metabolic pathways from the complete human genome</p></title><aug><au><snm>Romero</snm><fnm>P</fnm></au><au><snm>Wagg</snm><fnm>J</fnm></au><au><snm>Green</snm><fnm>ML</fnm></au><au><snm>Kaiser</snm><fnm>D</fnm></au><au><snm>Krummenacker</snm><fnm>M</fnm></au><au><snm>Karp</snm><fnm>PD</fnm></au></aug><source>Genome Biol</source><pubdate>2005</pubdate><volume>6</volume><issue>1</issue><fpage>R2</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2004-6-1-r2</pubid><pubid idtype="pmcid">549063</pubid><pubid idtype="pmpid" link="fulltext">15642094</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>Prediction of regulatory pathways using mRNA expression and protein interaction data: application to identification of galactose regulatory pathway</p></title><aug><au><snm>Darvish</snm><fnm>A</fnm></au><au><snm>Najarian</snm><fnm>K</fnm></au></aug><source>Biosystems</source><pubdate>2006</pubdate><volume>83</volume><issue>2&#8211;3</issue><fpage>125</fpage><lpage>135</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.biosystems.2005.06.013</pubid><pubid idtype="pmpid" link="fulltext">16384635</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Using functional and organizational information to improve genome-wide computational prediction of transcription units on pathway-genome databases</p></title><aug><au><snm>Romero</snm><fnm>PR</fnm></au><au><snm>Karp</snm><fnm>PD</fnm></au></aug><source>Bioinformatics</source><pubdate>2004</pubdate><volume>20</volume><issue>5</issue><fpage>709</fpage><lpage>717</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btg471</pubid><pubid idtype="pmpid" link="fulltext">14751985</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>Predicting pathway membership via domain signatures</p></title><aug><au><snm>Frohlich</snm><fnm>H</fnm></au><au><snm>Fellmann</snm><fnm>M</fnm></au><au><snm>Sultmann</snm><fnm>H</fnm></au><au><snm>Poustka</snm><fnm>A</fnm></au><au><snm>Beissbarth</snm><fnm>T</fnm></au></aug><source>Bioinformatics</source><pubdate>2008</pubdate><volume>24</volume><issue>19</issue><fpage>2137</fpage><lpage>2142</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btn403</pubid><pubid idtype="pmcid">2553439</pubid><pubid idtype="pmpid" link="fulltext">18676972</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>IntAct: an open source molecular interaction database</p></title><aug><au><snm>Hermjakob</snm><fnm>H</fnm></au><au><snm>Montecchi-Palazzi</snm><fnm>L</fnm></au><au><snm>Lewington</snm><fnm>C</fnm></au><au><snm>Mudali</snm><fnm>S</fnm></au><au><snm>Kerrien</snm><fnm>S</fnm></au><au><snm>Orchard</snm><fnm>S</fnm></au><au><snm>Vingron</snm><fnm>M</fnm></au><au><snm>Roechert</snm><fnm>B</fnm></au><au><snm>Roepstorff</snm><fnm>P</fnm></au><au><snm>Valencia</snm><fnm>A</fnm></au><etal/></aug><source>Nucleic Acids Research</source><pubdate>2004</pubdate><volume>32</volume><fpage>D452</fpage><lpage>D455</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkh052</pubid><pubid idtype="pmcid">308786</pubid><pubid idtype="pmpid" link="fulltext">14681455</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>PAX of mind for pathway researchers</p></title><aug><au><snm>Luciano</snm><fnm>JS</fnm></au></aug><source>Drug Discovery Today</source><pubdate>2005</pubdate><volume>10</volume><issue>13</issue><fpage>937</fpage><lpage>942</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1359-6446(05)03501-4</pubid><pubid idtype="pmpid" link="fulltext">15993813</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Presenting and exploring biological pathways with PathVisio</p></title><aug><au><snm>van Iersel</snm><fnm>MP</fnm></au><au><snm>Kelder</snm><fnm>T</fnm></au><au><snm>Pico</snm><fnm>AR</fnm></au><au><snm>Hanspers</snm><fnm>K</fnm></au><au><snm>Coort</snm><fnm>S</fnm></au><au><snm>Conklin</snm><fnm>BR</fnm></au><au><snm>Evelo</snm><fnm>C</fnm></au></aug><source>Bmc Bioinformatics</source><pubdate>2008</pubdate><volume>9</volume><fpage>399</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-9-399</pubid><pubid idtype="pmcid">2569944</pubid><pubid idtype="pmpid" link="fulltext">18817533</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>cPath: open source software for collecting, storing, and querying biological pathways</p></title><aug><au><snm>Cerami</snm><fnm>EG</fnm></au><au><snm>Bader</snm><fnm>GD</fnm></au><au><snm>Gross</snm><fnm>BE</fnm></au><au><snm>Sander</snm><fnm>C</fnm></au></aug><source>Bmc Bioinformatics</source><pubdate>2006</pubdate><volume>7</volume><fpage>497</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-7-497</pubid><pubid idtype="pmcid">1660554</pubid><pubid idtype="pmpid" link="fulltext">17101041</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>MIPS Arabidopsis thaliana Database (MAtDB): an integrated biological knowledge resource for plant genomics</p></title><aug><au><snm>Schoof</snm><fnm>H</fnm></au><au><snm>Ernst</snm><fnm>R</fnm></au><au><snm>Nazarov</snm><fnm>V</fnm></au><au><snm>Pfeifer</snm><fnm>L</fnm></au><au><snm>Mewes</snm><fnm>HW</fnm></au><au><snm>Mayer</snm><fnm>KF</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2004</pubdate><issue>32 Database</issue><fpage>D373</fpage><lpage>376</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkh068</pubid><pubid idtype="pmcid">308802</pubid><pubid idtype="pmpid" link="fulltext">14681437</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>FlyMine: an integrated database for Drosophila and Anopheles genomics</p></title><aug><au><snm>Lyne</snm><fnm>R</fnm></au><au><snm>Smith</snm><fnm>R</fnm></au><au><snm>Rutherford</snm><fnm>K</fnm></au><au><snm>Wakeling</snm><fnm>M</fnm></au><au><snm>Varley</snm><fnm>A</fnm></au><au><snm>Guillier</snm><fnm>F</fnm></au><au><snm>Janssens</snm><fnm>H</fnm></au><au><snm>Ji</snm><fnm>W</fnm></au><au><snm>McLaren</snm><fnm>P</fnm></au><au><snm>North</snm><fnm>P</fnm></au><etal/></aug><source>Genome Biol</source><pubdate>2007</pubdate><volume>8</volume><issue>7</issue><fpage>R129</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2007-8-7-r129</pubid><pubid idtype="pmcid">2323218</pubid><pubid idtype="pmpid" link="fulltext">17615057</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>The genetics and genomics of cancer</p></title><aug><au><snm>Balmain</snm><fnm>A</fnm></au><au><snm>Gray</snm><fnm>J</fnm></au><au><snm>Ponder</snm><fnm>B</fnm></au></aug><source>Nature Genetics</source><pubdate>2003</pubdate><volume>33</volume><issue>3 s</issue><fpage>238</fpage><lpage>244</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1107</pubid><pubid idtype="pmpid" link="fulltext">12610533</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>FOXA1 in breast cancer</p></title><aug><au><snm>Nakshatri</snm><fnm>H</fnm></au><au><snm>Badve</snm><fnm>S</fnm></au></aug><source>Expert reviews in molecular medicine</source><pubdate>2009</pubdate><volume>11</volume><fpage>e8</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1017/S1462399409001008</pubid><pubid idtype="pmpid" link="fulltext">19261198</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>Meta-analysis of gene expression profiles in breast cancer: toward a unified understanding of breast cancer subtyping and prognosis signatures</p></title><aug><au><snm>Wirapati</snm><fnm>P</fnm></au><au><snm>Sotiriou</snm><fnm>C</fnm></au><au><snm>Kunkel</snm><fnm>S</fnm></au><au><snm>Farmer</snm><fnm>P</fnm></au><au><snm>Pradervand</snm><fnm>S</fnm></au><au><snm>Haibe-Kains</snm><fnm>B</fnm></au><au><snm>Desmedt</snm><fnm>C</fnm></au><au><snm>Ignatiadis</snm><fnm>M</fnm></au><au><snm>Sengstag</snm><fnm>T</fnm></au><au><snm>Sch&#252;tz</snm><fnm>F</fnm></au><etal/></aug><source>Breast Cancer Research: BCR</source><pubdate>2008</pubdate><volume>10</volume><issue>4</issue><fpage>R65</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/bcr2124</pubid><pubid idtype="pmcid">2575538</pubid><pubid idtype="pmpid" link="fulltext">18662380</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Genome-wide association study identifies novel breast cancer susceptibility loci</p></title><aug><au><snm>Easton</snm><fnm>DF</fnm></au><au><snm>Pooley</snm><fnm>KA</fnm></au><au><snm>Dunning</snm><fnm>AM</fnm></au><au><snm>Pharoah</snm><fnm>PD</fnm></au><au><snm>Thompson</snm><fnm>D</fnm></au><au><snm>Ballinger</snm><fnm>DG</fnm></au><au><snm>Struewing</snm><fnm>JP</fnm></au><au><snm>Morrison</snm><fnm>J</fnm></au><au><snm>Field</snm><fnm>H</fnm></au><au><snm>Luben</snm><fnm>R</fnm></au></aug><source>Nature</source><pubdate>2007</pubdate><volume>447</volume><issue>7148</issue><fpage>1087</fpage><lpage>1095</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature05887</pubid><pubid idtype="pmcid">2714974</pubid><pubid idtype="pmpid" link="fulltext">17529967</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Genome-wide association study provides evidence for a breast cancer risk locus at 6q22. 33</p></title><aug><au><snm>Gold</snm><fnm>B</fnm></au><au><snm>Kirchhoff</snm><fnm>T</fnm></au><au><snm>Stefanov</snm><fnm>S</fnm></au><au><snm>Lautenberger</snm><fnm>J</fnm></au><au><snm>Viale</snm><fnm>A</fnm></au><au><snm>Garber</snm><fnm>J</fnm></au><au><snm>Friedman</snm><fnm>E</fnm></au><au><snm>Narod</snm><fnm>S</fnm></au><au><snm>Olshen</snm><fnm>AB</fnm></au><au><snm>Gregersen</snm><fnm>P</fnm></au></aug><source>Proc Natl Acad Sci U S A</source><pubdate>2008</pubdate><volume>105</volume><issue>11</issue><fpage>4340</fpage><lpage>4345</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0800441105</pubid><pubid idtype="pmcid">2393811</pubid><pubid idtype="pmpid" link="fulltext">18326623</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>ProteoLens: a visual analytic tool for multi-scale database-driven biological network data mining</p></title><aug><au><snm>Huan</snm><fnm>T</fnm></au><au><snm>Sivachenko</snm><fnm>A</fnm></au><au><snm>Harrison</snm><fnm>S</fnm></au><au><snm>Chen</snm><fnm>JY</fnm></au></aug><source>BMC bioinformatics</source><pubdate>2008</pubdate><volume>9</volume><issue>Suppl 9</issue><fpage>S5</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-9-S9-S5</pubid><pubid idtype="pmcid">2537576</pubid><pubid idtype="pmpid" link="fulltext">18793469</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>Foxa (HNF3) up-regulates vitronectin expression during retinoic acid-induced differentiation in mouse neuroblastoma Neuro2a cells</p></title><aug><au><snm>Shimizu</snm><fnm>S</fnm></au><au><snm>Kondo</snm><fnm>M</fnm></au><au><snm>Miyamoto</snm><fnm>Y</fnm></au><au><snm>Hayashi</snm><fnm>M</fnm></au></aug><source>Cell Struct Funct</source><pubdate>2002</pubdate><volume>27</volume><issue>4</issue><fpage>181</fpage><lpage>188</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1247/csf.27.181</pubid><pubid idtype="pmpid" link="fulltext">12441652</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>BRCA1 and FOXA1 proteins coregulate the expression of the cell cycle-dependent kinase inhibitor p27(Kip1)</p></title><aug><au><snm>Williamson</snm><fnm>EA</fnm></au><au><snm>Wolf</snm><fnm>I</fnm></au><au><snm>O'Kelly</snm><fnm>J</fnm></au><au><snm>Bose</snm><fnm>S</fnm></au><au><snm>Tanosaki</snm><fnm>S</fnm></au><au><snm>Koeffler</snm><fnm>HP</fnm></au></aug><source>Oncogene</source><pubdate>2006</pubdate><volume>25</volume><issue>9</issue><fpage>1391</fpage><lpage>1399</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.onc.1209170</pubid><pubid idtype="pmpid" link="fulltext">16331276</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>Role of cytochrome P450-dependent arachidonic acid metabolites in liver physiology and pathophysiology</p></title><aug><au><snm>Sacerdoti</snm><fnm>D</fnm></au><au><snm>Gatta</snm><fnm>A</fnm></au><au><snm>McGiff</snm><fnm>JC</fnm></au></aug><source>Prostaglandins &amp; Other Lipid Mediators</source><pubdate>2003</pubdate><volume>72</volume><issue>1&#8211;2</issue><fpage>51</fpage><lpage>71</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1098-8823(03)00077-7</pubid><pubid idtype="pmpid">14626496</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>Epoxyeicosatrienoic acids (EETs): metabolism and biochemical function</p></title><aug><au><snm>Spector</snm><fnm>AA</fnm></au><au><snm>Fang</snm><fnm>X</fnm></au><au><snm>Snyder</snm><fnm>GD</fnm></au><au><snm>Weintraub</snm><fnm>NL</fnm></au></aug><source>Progress in Lipid Research</source><pubdate>2004</pubdate><volume>43</volume><issue>1</issue><fpage>55</fpage><lpage>90</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0163-7827(03)00049-3</pubid><pubid idtype="pmpid" link="fulltext">14636671</pubid></pubidlist></xrefbib></bibl><bibl id="B56"><title><p>BRCA1 regulates RAD51 function in response to DNA damage and suppresses spontaneous sister chromatid replication slippage: Implications for sister chromatid cohesion, genome stability, and carcinogenesis</p></title><aug><au><snm>Cousineau</snm><fnm>I</fnm></au><au><snm>Abaji</snm><fnm>C</fnm></au><au><snm>Belmaaza</snm><fnm>A</fnm></au></aug><source>Cancer Research</source><pubdate>2005</pubdate><volume>65</volume><issue>24</issue><fpage>11384</fpage><lpage>11391</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1158/0008-5472.CAN-05-2156</pubid><pubid idtype="pmpid" link="fulltext">16357146</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>BRCA2-dependent and independent formation of RAD51 nuclear foci</p></title><aug><au><snm>Tarsounas</snm><fnm>M</fnm></au><au><snm>Davies</snm><fnm>D</fnm></au><au><snm>West</snm><fnm>SC</fnm></au></aug><source>Oncogene</source><pubdate>2003</pubdate><volume>22</volume><issue>8</issue><fpage>1115</fpage><lpage>1123</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.onc.1206263</pubid><pubid idtype="pmpid" link="fulltext">12606939</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>The role of phosphatidylinositol 3'-kinase and its downstream signals in erbB-2-mediated transformation</p></title><aug><au><snm>Ignatoski</snm><fnm>KMW</fnm></au><au><snm>Livant</snm><fnm>DL</fnm></au><au><snm>Markwart</snm><fnm>S</fnm></au><au><snm>Grewal</snm><fnm>NK</fnm></au><au><snm>Ethier</snm><fnm>SP</fnm></au></aug><source>Molecular Cancer Research</source><pubdate>2003</pubdate><volume>1</volume><issue>7</issue><fpage>551</fpage><lpage>560</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">12754302</pubid></xrefbib></bibl><bibl id="B59"><title><p>PKB/Akt phosphorylates p27, impairs nuclear import of p27 and opposes p27-mediated G1 arrest</p></title><aug><au><snm>Liang</snm><fnm>J</fnm></au><au><snm>Zubovitz</snm><fnm>J</fnm></au><au><snm>Petrocelli</snm><fnm>T</fnm></au><au><snm>Kotchetkov</snm><fnm>R</fnm></au><au><snm>Connor</snm><fnm>MK</fnm></au><au><snm>Han</snm><fnm>K</fnm></au><au><snm>Lee</snm><fnm>JH</fnm></au><au><snm>Ciarallo</snm><fnm>S</fnm></au><au><snm>Catzavelos</snm><fnm>C</fnm></au><au><snm>Beniston</snm><fnm>R</fnm></au><etal/></aug><source>Nature Medicine</source><pubdate>2002</pubdate><volume>8</volume><issue>10</issue><fpage>1153</fpage><lpage>1160</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nm761</pubid><pubid idtype="pmpid" link="fulltext">12244302</pubid></pubidlist></xrefbib></bibl><bibl id="B60"><title><p>9-Cis Retinoic Acid Inhibits Growth of Breast-Cancer Cells and down-Regulates Estrogen-Receptor Rna and Protein</p></title><aug><au><snm>Rubin</snm><fnm>M</fnm></au><au><snm>Fenig</snm><fnm>E</fnm></au><au><snm>Rosenauer</snm><fnm>A</fnm></au><au><snm>Menendezbotet</snm><fnm>C</fnm></au><au><snm>Achkar</snm><fnm>C</fnm></au><au><snm>Bentel</snm><fnm>JM</fnm></au><au><snm>Yahalom</snm><fnm>J</fnm></au><au><snm>Mendelsohn</snm><fnm>J</fnm></au><au><snm>Miller</snm><fnm>WH</fnm></au></aug><source>Cancer Research</source><pubdate>1994</pubdate><volume>54</volume><issue>24</issue><fpage>6549</fpage><lpage>6556</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">7987855</pubid></xrefbib></bibl><bibl id="B61"><title><p>FOXA1 as a therapeutic target for breast cancer</p></title><aug><au><snm>Nakshatri</snm><fnm>H</fnm></au><au><snm>Badve</snm><fnm>S</fnm></au></aug><source>Expert Opinion on Therapeutic Targets</source><pubdate>2007</pubdate><volume>11</volume><issue>4</issue><fpage>507</fpage><lpage>514</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1517/14728222.11.4.507</pubid><pubid idtype="pmpid" link="fulltext">17373880</pubid></pubidlist></xrefbib></bibl><bibl id="B62"><title><p>Exploring the role of BRCA1, BRCA2 and RAD51 as biomarkers for breast cancer</p></title><aug><au><snm>Ziogas</snm><fnm>D</fnm></au><au><snm>Liakakos</snm><fnm>T</fnm></au><au><snm>Lykoudis</snm><fnm>E</fnm></au><au><snm>Fatourou</snm><fnm>E</fnm></au><au><snm>Roukos</snm><fnm>DH</fnm></au></aug><source>Radiother Oncol</source><pubdate>2009</pubdate><volume>90</volume><issue>1</issue><fpage>161</fpage><lpage>162</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.radonc.2008.02.020</pubid><pubid idtype="pmpid" link="fulltext">18343512</pubid></pubidlist></xrefbib></bibl><bibl id="B63"><title><p>A polymorphism in the CDKN1B gene is associated with increased risk of hereditary prostate cancer</p></title><aug><au><snm>Chang</snm><fnm>BL</fnm></au><au><snm>Zheng</snm><fnm>SL</fnm></au><au><snm>Isaacs</snm><fnm>SD</fnm></au><au><snm>Wiley</snm><fnm>KE</fnm></au><au><snm>Turner</snm><fnm>A</fnm></au><au><snm>Li</snm><fnm>G</fnm></au><au><snm>Walsh</snm><fnm>PC</fnm></au><au><snm>Meyers</snm><fnm>DA</fnm></au><au><snm>Isaacs</snm><fnm>WB</fnm></au><au><snm>Xu</snm><fnm>J</fnm></au></aug><source>Cancer Res</source><pubdate>2004</pubdate><volume>64</volume><issue>6</issue><fpage>1997</fpage><lpage>1999</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1158/0008-5472.CAN-03-2340</pubid><pubid idtype="pmpid" link="fulltext">15026335</pubid></pubidlist></xrefbib></bibl><bibl id="B64"><title><p>Soluble epoxide hydrolase inhibition reveals novel biological functions of epoxyeicosatrienoic acids (EETs)</p></title><aug><au><snm>Inceoglu</snm><fnm>B</fnm></au><au><snm>Schmelzer</snm><fnm>KR</fnm></au><au><snm>Morisseau</snm><fnm>C</fnm></au><au><snm>Jinks</snm><fnm>SL</fnm></au><au><snm>Hammock</snm><fnm>BD</fnm></au></aug><source>Prostaglandins &amp; Other Lipid Mediators</source><pubdate>2007</pubdate><volume>82</volume><issue>1&#8211;4</issue><fpage>42</fpage><lpage>49</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.prostaglandins.2006.05.004</pubid><pubid idtype="pmcid">1904338</pubid><pubid idtype="pmpid" link="fulltext">17164131</pubid></pubidlist></xrefbib></bibl><bibl id="B65"><title><p>Targeted disruption of soluble epoxide hydrolase reveals a role in blood pressure regulation</p></title><aug><au><snm>Sinal</snm><fnm>CJ</fnm></au><au><snm>Miyata</snm><fnm>M</fnm></au><au><snm>Tohkin</snm><fnm>M</fnm></au><au><snm>Nagata</snm><fnm>K</fnm></au><au><snm>Bend</snm><fnm>JR</fnm></au><au><snm>Gonzalez</snm><fnm>FJ</fnm></au></aug><source>Journal of Biological Chemistry</source><pubdate>2000</pubdate><volume>275</volume><issue>51</issue><fpage>40504</fpage><lpage>40510</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M008106200</pubid><pubid idtype="pmpid" link="fulltext">11001943</pubid></pubidlist></xrefbib></bibl><bibl id="B66"><title><p>Soluble epoxide hydrolase regulates hydrolysis of vasoactive epoxyeicosatrienoic acids</p></title><aug><au><snm>Yu</snm><fnm>ZG</fnm></au><au><snm>Xu</snm><fnm>FY</fnm></au><au><snm>Huse</snm><fnm>LM</fnm></au><au><snm>Morisseau</snm><fnm>C</fnm></au><au><snm>Draper</snm><fnm>AJ</fnm></au><au><snm>Newman</snm><fnm>JW</fnm></au><au><snm>Parker</snm><fnm>C</fnm></au><au><snm>Graham</snm><fnm>L</fnm></au><au><snm>Engler</snm><fnm>MM</fnm></au><au><snm>Hammock</snm><fnm>BD</fnm></au><etal/></aug><source>Circulation Research</source><pubdate>2000</pubdate><volume>87</volume><issue>11</issue><fpage>992</fpage><lpage>998</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11090543</pubid></xrefbib></bibl><bibl id="B67"><title><p>CHILD syndrome in 3 generations &#8211; The importance of mild or minimal skin lesions</p></title><aug><au><snm>Bittar</snm><fnm>M</fnm></au><au><snm>Happle</snm><fnm>R</fnm></au><au><snm>Grzeschik</snm><fnm>KH</fnm></au><au><snm>Leveleki</snm><fnm>L</fnm></au><au><snm>Hertl</snm><fnm>M</fnm></au><au><snm>Bornholdt</snm><fnm>D</fnm></au><au><snm>Konig</snm><fnm>A</fnm></au></aug><source>Archives of Dermatology</source><pubdate>2006</pubdate><volume>142</volume><issue>3</issue><fpage>348</fpage><lpage>351</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1001/archderm.142.3.348</pubid><pubid idtype="pmpid" link="fulltext">16549711</pubid></pubidlist></xrefbib></bibl><bibl id="B68"><title><p>NetPath &#8211; Signal Transduction Pathways</p></title><url>http://www.netpath.org/</url></bibl><bibl id="B69"><title><p>The Universal Protein Resource (UniProt): an expanding universe of protein information</p></title><aug><au><snm>Wu</snm><fnm>CH</fnm></au><au><snm>Apweiler</snm><fnm>R</fnm></au><au><snm>Bairoch</snm><fnm>A</fnm></au><au><snm>Natale</snm><fnm>DA</fnm></au><au><snm>Barker</snm><fnm>WC</fnm></au><au><snm>Boeckmann</snm><fnm>B</fnm></au><au><snm>Ferro</snm><fnm>S</fnm></au><au><snm>Gasteiger</snm><fnm>E</fnm></au><au><snm>Huang</snm><fnm>H</fnm></au><au><snm>Lopez</snm><fnm>R</fnm></au><etal/></aug><source>Nucleic Acids Res</source><pubdate>2006</pubdate><issue>34 Database</issue><fpage>D187</fpage><lpage>191</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkj161</pubid><pubid idtype="pmcid">1347523</pubid><pubid idtype="pmpid" link="fulltext">16381842</pubid></pubidlist></xrefbib></bibl><bibl id="B70"><title><p>Characterizing Mergeability of Human Molecular Pathways</p></title><aug><au><snm>Wu</snm><fnm>X</fnm></au><au><snm>Chowbina</snm><fnm>SR</fnm></au><au><snm>Li</snm><fnm>PM</fnm></au><au><snm>Pandey</snm><fnm>R</fnm></au><au><snm>Kasamsetty</snm><fnm>HN</fnm></au><au><snm>Chen</snm><fnm>JY</fnm></au></aug><inpress/></bibl></refgrp>
   </bm>
</art>
