<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-6-154</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p>Tools enabling the elucidation of molecular pathways active in human disease: Application to Hepatitis C virus infection</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Reiss</snm>
               <mi>J</mi>
               <fnm>David</fnm>
               <insr iid="I1"/>
               <email>dreiss@systemsbiology.org</email>
            </au>
            <au id="A2">
               <snm>Avila-Campillo</snm>
               <fnm>Iliana</fnm>
               <insr iid="I1"/>
               <email>iavila@systemsbiology.org</email>
            </au>
            <au id="A3">
               <snm>Thorsson</snm>
               <fnm>Vesteinn</fnm>
               <insr iid="I1"/>
               <email>thorsson@systemsbiology.org</email>
            </au>
            <au id="A4">
               <snm>Schwikowski</snm>
               <fnm>Benno</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>benno@pasteur.fr</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Galitski</snm>
               <fnm>Timothy</fnm>
               <insr iid="I1"/>
               <email>tgalitski@systemsbiology.org</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Institute for Systems Biology, 1441 N. 34<sup>th </sup>Street, Seattle, WA 98103, USA</p>
            </ins>
            <ins id="I2">
               <p>Institut Pasteur, 25&#8211;28 Rue du Dr. Roux, 75724 Paris CEDEX 15, France</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2005</pubdate>
         <volume>6</volume>
         <issue>1</issue>
         <fpage>154</fpage>
         <url>http://www.biomedcentral.com/1471-2105/6/154</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">15967031</pubid>
               <pubid idtype="doi">10.1186/1471-2105-6-154</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>25</day>
               <month>2</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>20</day>
               <month>6</month>
               <year>2005</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>20</day>
               <month>6</month>
               <year>2005</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2005</year>
         <collab>Reiss et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The extraction of biological knowledge from genome-scale data sets requires its analysis in the context of additional biological information. The importance of integrating experimental data sets with molecular interaction networks has been recognized and applied to the study of model organisms, but its systematic application to the study of human disease has lagged behind due to the lack of tools for performing such integration.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We have developed techniques and software tools for simplifying and streamlining the process of integration of diverse experimental data types in molecular networks, as well as for the analysis of these networks. We applied these techniques to extract, from genomic expression data from Hepatitis C virus-infected liver tissue, potentially useful hypotheses related to the onset of this disease. Our integration of the expression data with large-scale molecular interaction networks and subsequent analyses identified molecular pathways that appear to be induced or repressed in the response to Hepatitis C viral infection.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The methods and tools we have implemented allow for the efficient dynamic integration and analysis of diverse data in a major human disease system. This integrated data set in turn enabled simple analyses to yield hypotheses related to the response to Hepatitis C viral infection.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>DNA microarrays have been applied with much success to study genomic patterns of gene expression across many organisms. It has become widely acknowledged that to extract hypotheses from these data, there are advantages to the integration of orthogonal sources of information, notably, molecular-interaction data <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Hypotheses derived from genomic-expression data typically involve pathways of metabolic and molecular information flow, and complex cellular processes and structures, formed by multiple interacting molecules. However, commonly these molecular interactions are gleaned <it>ad hoc </it>from the literature.</p>
         <p>In model organisms such as <it>Saccharomyces cerevisiae</it>, integrative systems-biology approaches to genomic-expression analysis have developed and employed sophisticated methods for the computational extraction of biological knowledge. Examples include: biological module identification and abstraction <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>; discovery of regulatory networks <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>; and identification of active pathways in networks <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. A hallmark of these advanced methods is the integration of diverse genome-scale data sets, in particular, the combination of genomic-expression data and molecular-interaction data. Another common characteristic of these methods is the use of graphs (vertices and edges, or nodes and links) to represent such integrated data. Graphical methods are highly intuitive. Also, the formalism of the graph facilitates the development and application of graph algorithms and machine-learning techniques to extract information.</p>
         <p>In studies of human disease, a limited repertoire of computational techniques, including ANOVA, hierarchical clustering, and discriminant analysis, has been applied to extract information from genomic-expression data derived from human tissues. Until recently, a critical barrier has been a lack of large-scale machine-readable sources of high-quality human molecular interaction data. Using a combination of artificial-intelligence methods and expert human curation, several efforts have made substantial progress in amassing, from the literature, databases with large numbers (greater than 14000) of human molecular interactions. These include the Human Protein Reference Database (HPRD) <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>, the Biomolecular Interaction Network Database (BIND) <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>, the Database of Interacting Proteins (DIP) <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>, and the Transcription Factor Database (Transfac) <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Thus, the bottleneck has now shifted to the efficient integration of these data to enable the application of advanced network-based analysis and modelling methods. For this work, we have implemented solutions to this bottleneck and applied them to a set of genomic-expression data derived from biopsies of human liver tissue infected with Hepatitis C Virus (HCV) <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. About 3% of all humans are infected with HCV <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, and currently no vaccine exists. Chronic viral hepatitis C results in liver fibrosis and cirrhosis in about 20% of those infected <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Liver transplant is often required.</p>
         <p>Specifically, we have developed two software tools, <it>InteractionFetcher </it>and <it>CytoTalk</it>, that function as plug-ins for <it>Cytoscape</it>, an open-source, platform-independent environment for the visualization and analysis of biological networks <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. <it>InteractionFetcher </it>and <it>CytoTalk </it>simplify the integration and analysis of interaction data (and other data types) with genomic-expression data. To demonstrate their utility, we applied them to generate and analyze a large network of human molecular-interaction pathways that are putatively active during the infection of human liver tissue with HCV.</p>
      </sec>
      <sec>
         <st>
            <p>Implementation</p>
         </st>
         <sec>
            <st>
               <p><it>InteractionFetcher</it>, a <it>Cytoscape </it>plug-in</p>
            </st>
            <p><it>InteractionFetcher </it>dynamically retrieves remote biological information for selected nodes in the current network within <it>Cytoscape</it>. The plug-in requests biological data via the XML-RPC protocol <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> from a remote server, which retrieves the requested information from an SQL database and passes it back to the plug-in. The plug-in then adds the retrieved information to the current network as additional nodes, edges, and/or attributes. Currently implemented data types include: protein/gene synonyms, orthologs, sequences (gene/protein/upstream), and interactions/associations. Some of this information can be obtained via integrated queries. For example, retrieved gene/protein synonym information may be used to increase the number of molecular interactions that are found. Currently-available interaction-data sets include HPRD <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>, BIND <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>, DIP <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>, and several other predicted interaction and co-expression data sets <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>. Many options are available, including the ability to do cross-species queries, using ortholog information from Homologene <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> among species including <it>H. sapiens</it>, <it>M. musculus</it>, <it>S. cerevisiae</it>, <it>C. elegans</it>, and <it>D. melanogaster</it>. For example, if two proteins in <it>H. sapiens </it>have not been observed to interact, but both of their orthologs in <it>S. cerevisiae </it>are known to interact, then an <it>inferred interaction </it>(also known as an interolog) can be added to the network. Moreover, the tool allows for easy viewing of the source database's web page or linked PubMed abstract(s) describing each fetched interaction. Because the source code for both the client and server of this plug-in are available, we hope that the capabilities of plug-ins such as these can be expanded by other researchers to include, for example, experimental data (such as mRNA expression levels), metabolic information, or functional annotations. <it>Cytoscape</it>, the <it>InteractionFetcher </it>and related plug-ins, plus all server-side software are open-source and may be obtained at our laboratory web site <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> or at the <it>Cytoscape </it>web site <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p><it>CytoTalk</it>, a <it>Cytoscape </it>plug-in</p>
            </st>
            <p><it>CytoTalk </it>enables a <it>Cytoscape </it>user to dynamically interact with and manipulate the current network in a <it>Cytoscape </it>window from an external process. This plug-in runs an internal XML-RPC <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> server that enables the currently-displayed network and its various attributes to be manipulated from an external client that is XML-RPC-capable. Example clients may include Perl and Python scripts, scripts written in the <it>R </it>statistical language <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, UNIX shell scripts, C or C++ programs, or Java processes. It moreover expands the developmental possibilities of <it>Cytoscape </it>plug-in developers by allowing other plug-ins to be written in these languages. The external process may be run on the same machine as <it>Cytoscape</it>, or anywhere else on an accessible network. The open-source <it>CytoTalk </it>and <it>Cytoscape </it>software as well as example <it>CytoTalk </it>clients in <it>Perl</it>, <it>Python</it>, and <it>R </it>may be obtained at our laboratory web site <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> or at the <it>Cytoscape </it>web site <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Gene-expression data</p>
            </st>
            <p>For our study, we utilized expression data derived from 28 liver biopsies collected by <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> from 11 HCV-positive liver transplant patients, between 1 and 24 months post-transplant. Since roughly 50% of HCV+ liver transplant patients become re-infected during the two years after receiving their new livers <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, these biopsies provide a unique model for tracking the changes in gene expression during HCV infection <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. To compare gene-expression patterns in liver tissue before and after infection with HCV, <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> collected 28 post-transplant liver biopsies, plus pre-transplant control biopsies, from 11 HCV+ liver-transplant patients. Liver biopsies were obtained at intervals of 3 to 6 months, between 1 and 24 months post-transplant <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. These samples contain a mixture of cell types including hepatocytes, hepatic stellate cells, Kupffer cells (liver macrophages), in addition to various blood cells <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. mRNA expression ratios of about 7000 genes were measured relative to a common reference pool of pre-transplant biopsies. Using Rosetta <it>Resolver</it>(R) software <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, the data were normalized and transformed to log<sub>10 </sub>ratios, and p-values were computed for expression difference from the reference pool. The measurements showed a high degree of patient-to-patient variation. Most of the genes (5968) were significantly expressed (<it>p </it>&lt;10<sup>-7</sup>) in at least one of the 28 samples. The research of <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> involved genomic-expression data derived from human subjects.</p>
         </sec>
         <sec>
            <st>
               <p>Construction of the molecular-interaction scaffold</p>
            </st>
            <p>We sought to generate a network of molecular pathways that are active (either induced or repressed) in HCV-infected human liver cells. The effects of HCV infection are likely to be complex, and the presence of contaminating blood cells and mixtures of various cell types in the biopsy samples will add further complexity. In order to emphasize the network interface between viral molecules and human molecules, we initiated network construction with a small "seed" network of interactions among HCV-encoded molecules and between HCV-encoded and host-encoded proteins. Interaction data were curated from review articles (<abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>, and references therein). The seed network also included the JAK-STAT interferon-response pathway that is known to play a role in the response to HCV infection <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. This set comprised 106 interactions between 86 macromolecules (proteins and the viral RNA). The proteins were, when possible, cross-referenced to RefSeq protein identifiers <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. Figure <figr fid="F1">1</figr> shows the seed network visualized using <it>Cytoscape </it><abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. This network is available for exploration and analysis via <it>Cytoscape </it>at our laboratory web site <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Network of interactions among HCV-encoded molecules and host proteins</p>
               </caption>
               <text>
                  <p><b>Network of interactions among HCV-encoded molecules and host proteins.</b>Triangular nodes represent HCV-encoded molecules. Host molecules are square nodes. Edges represent molecular interactions of several types: black for protein-protein, yellow for protein-DNA, light-green for phosphorylations, red for activations, dark-green for repressions, purple for covalent interactions, brown for methylations. Sources: [14, 15 and references therein].</p>
               </text>
               <graphic file="1471-2105-6-154-1"/>
            </fig>
            <p>The seed network was expanded to a full "scaffold" network using the 5968 genes implicated by the genomic-expression data and large-scale molecular-interaction data sets in public databases by searching for interactions among the 5968 expressed genes and the molecules in the seed network. To automate the construction of the scaffold network, we implemented a <it>Cytoscape </it>plug-in, <it>InteractionFetcher</it>, for dynamic retrieval of molecular interactions and binding partners via the Internet. <it>InteractionFetcher </it>rapidly adds interactions among molecules of interest in a network. In addition, it may be used to iteratively expand a network through "in silico pull-down" of molecules that are currently not present in the network but are known to interact with molecules that are present. Using this plug-in, we were able to integrate as many as 15,000 interactions among the proteins implicated by the HCV expression-data set and seed-network proteins (among the available interaction data sets, which include HPRD <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>, BIND <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>, DIP <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>, PreBIND <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>, and several other predicted and co-expression data sets; see Methods). However, for this paper, we restricted our search to individually curated human-only protein interactions from HPRD and BIND, resulting in a scaffold network of 4,592 unique interactions among 1,950 molecules (Figure <figr fid="F2">2</figr>). This network is available for exploration and analysis with <it>Cytoscape </it>at our laboratory web site <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, which also allows for easy viewing of additional information provided by <it>InteractionFetcher</it>, such as each interaction's source database web page and PubMed abstract identifier(s).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Network of interactions among proteins implicated by genomic expression data</p>
               </caption>
               <text>
                  <p><b>Network of interactions among proteins implicated by genomic expression data. </b>Genes were implicated by expression profiling of HCV-infected liver biopsy data [13]. The network of interactions was assembled from external databases HPRD [6, 7] and BIND [8, 9], and automatically integrated using <it>InteractionFetcher</it>.</p>
               </text>
               <graphic file="1471-2105-6-154-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Computational analysis of integrated gene-expression data and molecular-interaction data</p>
            </st>
            <p>A useful method of integrated analysis of expression data within an interaction network is the <it>ActivePaths </it>algorithm <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. This method identifies contiguous pathways or subnetworks that are active (induced or repressed relative to randomly selected subnetworks) in subsets of the expression data. We applied this algorithm, which is available as a <it>Cytoscape </it>plug-in, to the scaffold network. Due to the high order (number of vertices) and size (number of edges) of the scaffold network, it was necessary to iteratively apply the algorithm, as suggested by the developers, to obtain increasingly smaller active subnetworks until they contained fewer than 100 nodes. The resulting four active subnetworks contained between 40 and 121 interactions. Because there were overlaps among these four highest-scoring active subnetworks, we combined them into a single fully connected active subnetwork.</p>
            <p>Additional analyses were performed by selecting scaffold subnetworks that are significantly active and/or co-regulated in temporal subsets of the microarray data. Because the scaffold network is not differentiated with regard to tissues, cell types, or cellular state, and the biopsy samples from which the expression data were derived likewise contain mixtures of cell types and other contaminants, the information in the active scaffold network does not, by itself, answer the questions we are addressing. To increase our chance of identifying pathways that might be modulated in response to HCV infection, we performed a differential analysis of the scaffold network, to identify subnetworks that become active more than eight months after transplant. This choice of cut-off was made to nearly-evenly divide the expression data into two halves (those from biopsies prior to, and after, eight months post-transplant), and by performing a differential analysis we can hope to subtract out some of the effects of the transplant and post-transplant immune response signals from those of HCV reinfection and progression.</p>
            <p>We used the <it>R </it>statistical environment <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> to perform this analysis. Because <it>R </it>is external to <it>Cytoscape</it>, we developed a plug-in, called <it>CytoTalk</it>, that enables a user of <it>R </it>(or a wide variety of other environments or languages; see Methods) to interactively query and modify <it>Cytoscape </it>networks, thereby greatly expanding the analytical capabilities available to users of <it>Cytoscape</it>. We used <it>R </it>with <it>CytoTalk </it>to select the proteins and interactions implicated by specific statistical queries on the expression data. This enabled us to extract a subnetwork of genes that were significantly induced or repressed with |log<sub>10</sub>(ratio)| > 0.4 in biopsies obtained more than 8 months after transplant. The proteins encoded by these genes form a "late-active" network. We similarly extracted an "early-active" network encoded by genes that were active in biopsies obtained earlier than 8 months after transplant. We compared these two networks and identified an "only-late-active" subnetwork that was not active prior to eight months, but was active afterward. The expectation is that this "only-late-active" subnetwork will contain pathways from the "late-active" network that are activated in response to Hepatitis C virus re-infection, while pathways from the "early-active" network that may contain pathways activated as a result of the transplant are removed.</p>
            <p>In Figure <figr fid="F3">3</figr>, we have integrated the seed network, the composite active-paths network, and the "only-late-active" network into one network. This network is available for exploration and analysis in <it>Cytoscape </it>at our laboratory web site <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Genes that were induced on average after 8 months following transplant are indicated with a red colour. Genes that were repressed are green. We have highlighted the nodes and edges of the composite active-paths subnetwork in "bold". The network in Figure <figr fid="F3">3</figr> is significantly over-represented with genes of several biological processes, as annotated by the Gene Ontology Consortium Database <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>; using the <it>BioDataServer </it>tool in <it>Cytoscape</it>, and computed in <it>R </it>via <it>CytoTalk</it>, using the Bonferroni-corrected hypergeometric distribution. Among these include blood coagulation (<it>p </it>= 10<sup>-11</sup>), immune response (<it>p </it>= 10<sup>-7</sup>), proteolysis and peptidolysis (<it>p </it>= 10<sup>-5</sup>), lipid transport (<it>p </it>= 10<sup>-3</sup>), and complement activation (<it>p </it>= 10<sup>-2</sup>). In addition, nearly the entire JAK-STAT interferon-response signalling pathway is activated in this network.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Composite network of molecular pathways active in HCV-infected liver tissue</p>
               </caption>
               <text>
                  <p><b>Composite network of molecular pathways active in HCV-infected liver tissue. </b>The network in Figure 1 was combined with active subnetworks from the network in Figure 2. The active subnetworks were identified by active-paths analysis ([5]; bold nodes and edges) and by identifying the subnetworks that changed most significantly in expression with time after transplant. Nodes (genes) colored red were induced in the expression data of biopsies from 8 months or more post-transplant; green nodes were repressed. Areas that contain differentially active pathways or subnetworks, as described in the text, are highlighted.</p>
               </text>
               <graphic file="1471-2105-6-154-3"/>
            </fig>
            <p>The visualization in Figure <figr fid="F3">3</figr> enables one to identify these pathways and see whether they are "turning on" (red) or "turning off" (green) in the expression data. For example, the blood coagulation pathway is active in the expression data, although (as is to be expected with large and complex pathways) not coherently induced or repressed. The interferon-response pathway and genes activated by ISGF are clearly induced, probably due in part to the immune response to viral infection and partly in response to standard treatment of HCV-positive patients with interferon-alpha. Also, genes encoding the Toll-like receptors TLR1 and 2, as well as the downstream signalling pathway connecting them, through MYD88, to the interferon-response pathway appear to be repressed. TLRs 1 and 2 are known viral detection receptors; it is known that TLR2 detects HCV <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. The interleukin receptor IL1R1, upstream of MYD88, is also repressed along with other IL receptors, whereas IL1A and B are induced. Additionally, we see that many apoptosis-related genes encoding TNF, TNF receptors, and TNF-signalling factors, are activated, whereas growth factors (IGF and connected pathways), and cell cycle and translation-related pathways (<it>e.g</it>. CDKN and connected pathways) are repressed. Ignoring the observed responses that are likely due to by-products of the biopsy process (<it>e.g</it>. the blood coagulation pathway), the active pathways observed are jointly consistent with a large-scale response of complex molecular pathways to viral infection: hepatic cell reproduction is repressed and programmed cell death is induced.</p>
            <p>Finally we note that a visual inspection of the network suggests that many of the proteins that bind directly to HCV-encoded molecules (<it>i.e</it>., are their first neighbours in the network) appear on average to be down-regulated relative to the rest of the network. Statistical analysis of the data supports this suggestion. As computed via <it>CytoTalk </it>and <it>R</it>, about 80% of the first neighbours of the viral RNA and proteins are down-regulated in the network of Figure <figr fid="F3">3</figr>, compared to 35% of the remaining genes in the network (<it>p </it>= 0.0029). This finding suggests two non-exclusive possibilities: genes encoding HCV-neighbour proteins are targets of host regulatory mechanisms counteracting viral replication; or they are targets of virus-encoded regulatory mechanisms that sabotage anti-viral defences.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The methods and software tools described here enable the efficient dynamic integrated analysis of diverse data in a major human-disease system. The results show the utility of integrating large-scale human molecular-interaction databases with genomic expression data. This approach is useful for the extraction of biological hypotheses, because it allows us to focus on groups of genes that are not only apparently active in the expression data, but are also functionally associated based on other data, such as molecular interactions. Thus, information that is not restricted to any one data type can be obtained. Moreover, our analyses suggest how various pathways act in concert, and serves as a large-scale window into the genomic response to HCV infection of liver cells. Because the tools and methods we have described are data-type-neutral, there is the prospect of further data integration for a more complete systems-biological approach to understanding viral infection and response mechanisms. The integration of additional, orthogonal sources of information such as detailed clinical data will enable quantitative associations of clinical variables with the activities of molecular pathways and processes.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p>&#8226; Project name: <it>InteractionFetcher</it>, <it>SynonymFetcher</it>, <it>HomologFetcher</it>, and <it>CytoTalk</it>: plug-ins for <it>Cytoscape</it></p>
         <p>&#8226; Project home page: <url>http://labs.systemsbiology.net/galitski/hepc/</url></p>
         <p>&#8226; Operating system(s): Platform independent</p>
         <p>&#8226; Programming language: Java</p>
         <p>&#8226; Other requirements: Java 1.4 or higher</p>
         <p>&#8226; License: GNU LGPL</p>
         <p>&#8226; Any restrictions to use by non-academics: license required for access to HPRD interactions (see <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>)</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>DJR: Development of <it>InteractionFetcher</it>, <it>CytoTalk </it>and associated server-side software and databases, construction of seed and scaffold network, analyses of active pathways, functional analyses, manuscript preparation. IA: Expression data processing, <it>ActivePaths </it>and functional analysis. VT: Construction of seed and scaffold networks, functional analysis and biological interpretation. BS: Project conception and planning. TG: Guidance, construction of seed network, manuscript preparation.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors would like to thank Maria Smith, Matt Fitzgibbon and Michael Katze for early access to the biopsy data. We would also like to gratefully acknowledge Paul Shannon and Rowan Christmas for their assistance with <it>Cytoscape</it>, Eric Deutsch for his support with data management and processing, and Akhilesh Pandey and Babylaksmi Muthusamy for their help in providing the HPRD interactions in a machine-readable format. We would also like to thank Wei Yan for helpful discussions. This project was funded by the National Institute on Drug Abuse, grant number 1P30DA01562501.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Molecular networks in model systems</p>
            </title>
            <aug>
               <au>
                  <snm>Galitski</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Annu Rev Genomics Hum Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>177</fpage>
            <lpage>87</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genom.5.061903.180053</pubid>
                  <pubid idtype="pmpid" link="fulltext">15485347</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Control of yeast filamentous-form growth by modules in an integrated molecular network</p>
            </title>
            <aug>
               <au>
                  <snm>Prinz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Avila-Campillo</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Aldridge</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Srinivasan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dimitrov</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Siegel</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Galitski</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>3</issue>
            <fpage>380</fpage>
            <lpage>90</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">353223</pubid>
                  <pubid idtype="pmpid" link="fulltext">14993204</pubid>
                  <pubid idtype="doi">10.1101/gr.2020604</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Computational discovery of gene modules and regulatory networks</p>
            </title>
            <aug>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gerber</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Rinaldi</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Yoo</snm>
                  <fnm>JY</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Fraenkel</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Jaakkola</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Gifford</snm>
                  <fnm>DK</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2003</pubdate>
            <volume>21</volume>
            <issue>11</issue>
            <fpage>1337</fpage>
            <lpage>42</lpage>
            <note>Epub 2003 Oct 12</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt890</pubid>
                  <pubid idtype="pmpid" link="fulltext">14555958</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Genome-wide discovery of transcriptional modules from DNA sequence and gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Yelensky</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Koller</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>Suppl 1</issue>
            <fpage>i273</fpage>
            <lpage>82</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg1038</pubid>
                  <pubid idtype="pmpid" link="fulltext">12855470</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Discovering regulatory and signalling circuits in molecular interaction networks</p>
            </title>
            <aug>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ozier</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Schwikowski</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Siegel</snm>
                  <fnm>AF</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <issue>Suppl 1</issue>
            <fpage>S233</fpage>
            <lpage>40</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12169552</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Human protein reference database as a discovery resource for proteomics</p>
            </title>
            <aug>
               <au>
                  <snm>Peri</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Navarro</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Kristiansen</snm>
                  <fnm>TZ</fnm>
               </au>
               <au>
                  <snm>Amanchy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Surendranath</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Muthusamy</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gandhi</snm>
                  <fnm>TK</fnm>
               </au>
               <au>
                  <snm>Chandrika</snm>
                  <fnm>KN</fnm>
               </au>
               <au>
                  <snm>Deshpande</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Suresh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rashmi</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Shanker</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Padma</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Niranjan</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Harsha</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Talreja</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Vrushabendra</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Ramya</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Yatish</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Joy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shivashankar</snm>
                  <fnm>HN</fnm>
               </au>
               <au>
                  <snm>Kavitha</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Menezes</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Choudhury</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Saravana</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chandran</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mohan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jonnalagadda</snm>
                  <fnm>CK</fnm>
               </au>
               <au>
                  <snm>Prasad</snm>
                  <fnm>CK</fnm>
               </au>
               <au>
                  <snm>Kumar-Sinha</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Deshpande</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Pandey</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <volume>32</volume>
            <issue>Database</issue>
            <fpage>D497</fpage>
            <lpage>501</lpage>
            <note>2004 Jan 1</note>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Human Protein Reference Database</p>
            </title>
            <url>http://www.hprd.org</url>
         </bibl>
         <bibl id="B8">
            <title>
               <p>BIND: the Biomolecular Interaction Network Database</p>
            </title>
            <aug>
               <au>
                  <snm>Bader</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Betel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hogue</snm>
                  <fnm>CW</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>248</fpage>
            <lpage>50</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165503</pubid>
                  <pubid idtype="pmpid" link="fulltext">12519993</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg056</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Biomolecular Interaction Network Database</p>
            </title>
            <url>http://bind.ca</url>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The Database of Interacting Proteins: 2004 update</p>
            </title>
            <aug>
               <au>
                  <snm>Salwinski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>smit</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Pettit</snm>
                  <fnm>FK</fnm>
               </au>
               <au>
                  <snm>Bowie</snm>
                  <fnm>JU</fnm>
               </au>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>NAR</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Database</issue>
            <fpage>D449</fpage>
            <lpage>51</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">14681454</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh086</pubid>
                  <pubid idtype="pmcid">308820</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Database of Interacting Proteins</p>
            </title>
            <url>http://dip.doe-mbi.ucla.edu</url>
         </bibl>
         <bibl id="B12">
            <title>
               <p>TRANSPATH: an integrated database on signal transduction and a tool for array analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Krull</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Voss</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Choi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Pistor</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Potapov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wingender</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>97</fpage>
            <lpage>100</lpage>
            <note>2003 Jan 1</note>
            <xrefbib>
               <pubid idtype="doi">10.1093/nar/gkg089</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Yue</snm>
                  <fnm>ZN</fnm>
               </au>
               <au>
                  <snm>Do</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Netski</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Boix</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bruix</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Carithers</snm>
                  <fnm>RL</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Katze</snm>
                  <fnm>MG</fnm>
               </au>
            </aug>
            <inpress/>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Epidemiology of hepatitis C: geographic differences and temporal trends</p>
            </title>
            <aug>
               <au>
                  <snm>Wasley</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Alter</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Semin Liver Dis</source>
            <pubdate>2000</pubdate>
            <volume>20</volume>
            <issue>1</issue>
            <fpage>1</fpage>
            <lpage>16</lpage>
            <note>Review</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1055/s-2000-9506</pubid>
                  <pubid idtype="pmpid">10895428</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Recovery, persistence, and sequelae in hepatitis C virus infection: a perspective on long-term outcome</p>
            </title>
            <aug>
               <au>
                  <snm>Alter</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Seeff</snm>
                  <fnm>LB</fnm>
               </au>
            </aug>
            <source>Semin Liver Dis</source>
            <pubdate>2000</pubdate>
            <volume>20</volume>
            <issue>1</issue>
            <fpage>17</fpage>
            <lpage>35</lpage>
            <note>Review</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1055/s-2000-9505</pubid>
                  <pubid idtype="pmpid">10895429</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Cytoscape: a software environment for integrated models of biomolecular interaction networks</p>
            </title>
            <aug>
               <au>
                  <snm>Shannon</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Markiel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ozier</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Baliga</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Ramage</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Amin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Schwikowski</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Shannon</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Markiel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ozier</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Baliga</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Ramage</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Amin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Schwikowski</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>11</issue>
            <fpage>2498</fpage>
            <lpage>504</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403769</pubid>
                  <pubid idtype="pmpid" link="fulltext">14597658</pubid>
                  <pubid idtype="doi">10.1101/gr.1239303</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Cytoscape</p>
            </title>
            <url>http://cytoscape.org</url>
         </bibl>
         <bibl id="B18">
            <title>
               <p>XML-RPC Specification</p>
            </title>
            <url>http://www.XML-RPC.com</url>
         </bibl>
         <bibl id="B19">
            <title>
               <p>A first-draft human protein-interaction map</p>
            </title>
            <aug>
               <au>
                  <snm>Lehner</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>9</issue>
            <fpage>R63</fpage>
            <note>Epub 2004 Aug 13</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">522870</pubid>
                  <pubid idtype="pmpid" link="fulltext">15345047</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-9-r63</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>A gene-coexpression network for global discovery of conserved genetic modules</p>
            </title>
            <aug>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Koller</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Science</source>
            <volume>302</volume>
            <issue>5643</issue>
            <fpage>249</fpage>
            <lpage>55</lpage>
            <note>2003 Oct 10, Epub 2003 Aug 21</note>
            <xrefbib>
               <pubid idtype="doi">10.1126/science.1087447</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Coexpression analysis of human genes across many microarray data sets</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>HK</fnm>
               </au>
               <au>
                  <snm>Hsu</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Sajdak</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Qin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pavlidis</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>6</issue>
            <fpage>1085</fpage>
            <lpage>94</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419787</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173114</pubid>
                  <pubid idtype="doi">10.1101/gr.1910904</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Database resources of the National Center for Biotechnology Information</p>
            </title>
            <aug>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Benson</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Canese</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>DiCuccio</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Edgar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Federhen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Helmberg</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Kenton</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Khovayko</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Maglott</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Ostell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pontius</snm>
                  <fnm>JU</fnm>
               </au>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Schuler</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Schriml</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Sequeira</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sherry</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Sirotkin</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Starchenko</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Suzek</snm>
                  <fnm>TO</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yaschenko</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <volume>33</volume>
            <issue>Database</issue>
            <fpage>D39</fpage>
            <lpage>45</lpage>
            <note>2005 Jan 1</note>
            <xrefbib>
               <pubid idtype="doi">10.1093/nar/gki062</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <url>http://labs.systemsbiology.net/galitski/hepc</url>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The R Project for Statistical Computing</p>
            </title>
            <url>http://www.r-project.org</url>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Natural history of recurrent hepatitis C</p>
            </title>
            <aug>
               <au>
                  <snm>Berenguer</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Liver Transpl</source>
            <pubdate>2002</pubdate>
            <volume>8</volume>
            <issue>Suppl 1</issue>
            <fpage>S14</fpage>
            <lpage>S18</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1053/jlts.2002.35781</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Statistical combining of cell expression profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Stoughton</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>US Patent 6351712</source>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Interaction between hepatitis C virus proteins and host cell factors</p>
            </title>
            <aug>
               <au>
                  <snm>Tellinghuisen</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Rice</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2002</pubdate>
            <volume>5</volume>
            <issue>4</issue>
            <fpage>419</fpage>
            <lpage>27</lpage>
            <note>Review</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1369-5274(02)00341-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">12160863</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Hepatitis C virus &#8211; cell interactions and their role in pathogenesis</p>
            </title>
            <aug>
               <au>
                  <snm>Polyak</snm>
                  <fnm>SJ</fnm>
               </au>
            </aug>
            <source>Clin Liver Dis</source>
            <pubdate>2003</pubdate>
            <volume>7</volume>
            <issue>1</issue>
            <fpage>67</fpage>
            <lpage>88</lpage>
            <note>Review</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1089-3261(02)00075-2</pubid>
                  <pubid idtype="pmpid">12691459</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Antiviral actions of interferons</p>
            </title>
            <aug>
               <au>
                  <snm>Samuel</snm>
                  <fnm>CE</fnm>
               </au>
            </aug>
            <source>Clin Microbiol Rev</source>
            <pubdate>2001</pubdate>
            <volume>14</volume>
            <fpage>778</fpage>
            <lpage>809</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">89003</pubid>
                  <pubid idtype="pmpid" link="fulltext">11585785</pubid>
                  <pubid idtype="doi">10.1128/CMR.14.4.778-809.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Entrez Gene: gene-centered information at NCBI</p>
            </title>
            <aug>
               <au>
                  <snm>Maglott</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ostell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <volume>33</volume>
            <issue>Database</issue>
            <fpage>D54</fpage>
            <lpage>8</lpage>
            <note>2005 Jan 1</note>
            <xrefbib>
               <pubid idtype="doi">10.1093/nar/gki031</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Entrez Gene</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=gene</url>
         </bibl>
         <bibl id="B32">
            <title>
               <p>PreBIND and Textomy &#8211; mining the biomedical literature for protein-protein interactions using a support vector machine</p>
            </title>
            <aug>
               <au>
                  <snm>Donaldson</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>de Bruijn</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wolting</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lay</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Tuekam</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Baskin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bader</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Michalickova</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Pawson</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hogue</snm>
                  <fnm>CW</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <volume>4</volume>
            <issue>1</issue>
            <fpage>11</fpage>
            <note>2003 Mar 27</note>
            <xrefbib>
               <pubid idtype="doi">10.1186/1471-2105-4-11</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>PreBIND</p>
            </title>
            <url>http://bind.ca</url>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium</p>
            </title>
            <aug>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Issel-Tarver</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kasarskis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matese</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Ringwald</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <issue>1</issue>
            <fpage>25</fpage>
            <lpage>9</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/75556</pubid>
                  <pubid idtype="pmpid" link="fulltext">10802651</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Gene Ontology Consortium</p>
            </title>
            <url>http://www.geneontology.org</url>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Hepatitis C core and nonstructural 3 proteins trigger toll-like receptor 2-mediated pathways and inflammatory activation</p>
            </title>
            <aug>
               <au>
                  <snm>Dolganiuc</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Oak</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kodys</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Golenbock</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Finberg</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Kurt-Jones</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Szabo</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Gastroenterology</source>
            <pubdate>2004</pubdate>
            <volume>127</volume>
            <issue>5</issue>
            <fpage>1513</fpage>
            <lpage>24</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15521019</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
