<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2148-3-11</ui>
   <ji>1471-2148</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>A simple dependence between protein evolution rate and the number of protein-protein interactions</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Fraser</snm>
               <mi>B</mi>
               <fnm>Hunter</fnm>
               <insr iid="I1"/>
               <email>hunter@ocf.berkeley.edu</email>
            </au>
            <au id="A2">
               <snm>Wall</snm>
               <mi>P</mi>
               <fnm>Dennis</fnm>
               <insr iid="I2"/>
               <email>dpwall@stanford.edu</email>
            </au>
            <au id="A3">
               <snm>Hirsh</snm>
               <mi>E</mi>
               <fnm>Aaron</fnm>
               <insr iid="I2"/>
               <email>aehirsh@stanford.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA</p>
            </ins>
            <ins id="I2">
               <p>Center for Computational Genetics and Biological Modeling, Department of Biological Sciences, Stanford University, Stanford, CA, 94305, USA</p>
            </ins>
         </insg>
         <source>BMC Evolutionary Biology</source>
         <issn>1471-2148</issn>
         <pubdate>2003</pubdate>
         <volume>3</volume>
         <issue>1</issue>
         <fpage>11</fpage>
         <url>http://www.biomedcentral.com/1471-2148/3/11</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">12769820</pubid>
               <pubid idtype="doi">10.1186/1471-2148-3-11</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>27</day>
               <month>3</month>
               <year>2003</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>23</day>
               <month>5</month>
               <year>2003</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>23</day>
               <month>5</month>
               <year>2003</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2003</year>
         <collab>Fraser et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>It has been shown for an evolutionarily distant genomic comparison that the number of protein-protein interactions a protein has correlates negatively with their rates of evolution. However, the generality of this observation has recently been challenged. Here we examine the problem using protein-protein interaction data from the yeast <it>Saccharomyces cerevisiae </it>and genome sequences from two other yeast species.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>In contrast to a previous study that used an incomplete set of protein-protein interactions, we observed a highly significant correlation between number of interactions and evolutionary distance to either <it>Candida albicans </it>or <it>Schizosaccharomyces pombe</it>. This study differs from the previous one in that it includes all known protein interactions from <it>S. cerevisiae</it>, and a larger set of protein evolutionary rates. In both evolutionary comparisons, a simple monotonic relationship was found across the entire range of the number of protein-protein interactions. In agreement with our earlier findings, this relationship cannot be explained by the fact that proteins with many interactions tend to be important to yeast. The generality of these correlations in other kingdoms of life unfortunately cannot be addressed at this time, due to the incompleteness of protein-protein interaction data from organisms other than <it>S. cerevisiae</it>.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>Protein-protein interactions tend to slow the rate at which proteins evolve. This may be due to structural constraints that must be met to maintain interactions, but more work is needed to definitively establish the mechanism(s) behind the correlations we have observed.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Introduction</p>
         </st>
         <p>What factors determine the rates at which different proteins evolve is a fundamental question in molecular evolution. With the advent of functional genomics, this question can now be addressed on a genome-wide scale. Different determinants of evolutionary rate revealed by analysis of functional genomic data include protein dispensability <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, transcript level <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, and number of protein-protein interactors <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
         <p>Recently, Jordan <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> suggested that the correlation between a protein's evolutionary rate and its number of protein interactions arises only because a few, highly interactive proteins evolve more slowly than all other proteins. In our original analysis, a distant genomic comparison of <it>S. cerevisiae </it>with <it>C. elegans </it>was used to find approximate evolutionary rates of putatively orthologous genes shared by these two species. One would expect that comparisons of more closely related species would increase the strength of the relationship, since more orthologs can be found and evolutionary rates can be estimated with greater precision. Surprisingly, when Jordan <it>et al</it>. compared orthologs between <it>S. cerevisiae </it>and another yeast, <it>S. pombe</it>, they found only an extremely weak relationship between number of protein interactions and evolutionary rate. Furthermore, they found that when proteins were binned by their number of interactions, only the bin containing the most highly interactive proteins showed any reduction in evolutionary rate. They concluded from this that there is no general correlation between number of protein interactions and evolutionary rate, and that the reduction of evolutionary rate observed in the most highly connected proteins may be an indirect effect of the relationship between protein dispensability and rate of evolution <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>Here we show that the absence of a general correlation between protein interactions and evolutionary rate in the analysis of Jordan <it>et al. </it>can be attributed to an incomplete dataset. Our analysis differs from that of Jordan <it>et al. </it>in two basic ways. First, Jordan et al. used only protein-protein interactions from the MIPS database <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, which consists of individually reported interactions combined with data from the high-throughput screen of Uetz <it>et al</it>. <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. While the MIPS database contains many high-confidence interactions, it is very small when compared to the total number of interactions known from all high-throughput screens. Second, Jordan <it>et al</it>. identified orthologs by taking reciprocal best BLAST hits, a method that leads to an incomplete list, because the top BLAST hit is often not the most closely related protein <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. We used a method based on maximum likelihood estimation of evolutionary distances that results in a more complete list <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <p>Using our more complete lists of both protein-protein interactions and orthologs, we show here that the correlation we originally reported in the <it>C. elegans </it>&#8211; <it>S. cerevisiae </it>comparison is indeed even stronger when more closely related genome sequences are compared. We use orthologs between <it>S. cerevisiae </it>and <it>S. pombe, </it>as well as the more closely related yeast, <it>C. albicans</it>, to probe this relationship in greater detail than we did in our previous study. We find a simple monotonic relationship between number of protein interactions and evolutionary rate, and we find that this relationship applies to proteins with few interactions, as well as to those with many.</p>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <sec>
            <st>
               <p>Protein-protein interactions and evolutionary rates in yeast</p>
            </st>
            <p>We compiled a list of <it>S. cerevisiae </it>protein-protein interactions from every major high-throughput study published to date <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>, as well as individually reported interactions from the MIPS database <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. The final non-redundant set consists of all interactions used in our previous study <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, and contains 13,925 interactions involving 3575 proteins. This is a more comprehensive data set than that analyzed by Jordan <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, which contained fewer than 2500 interactions once duplicate interactions were removed [I.K. Jordan, pers comm].</p>
            <p>Using the genome sequences of <it>C. albicans </it>and <it>S. pombe </it>for comparison with <it>S. cerevisiae</it>, we identified putative orthologs using a maximum likelihood-based approach <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, which identified 3727 orthologs between <it>S. cerevisiae </it>and <it>C. albicans</it>, and 2988 orthologs between <it>S. cerevisiae </it>and <it>S. pombe</it>. All data will be made available upon request.</p>
            <p>Taking the intersections of our interaction and ortholog data sets, we plotted the number of protein-protein interactions vs. evolutionary rate for all genes for which we had both types of data. For all genes in the <it>S. pombe</it>&#8211;<it>S. cerevisiae </it>comparison, we found a highly significant relationship (Figure <figr fid="F1">1a</figr>; <it>n </it>= 2119, Spearman Rank <it>r </it>= -0.24, <it>P </it>= 5.8 &#215; 10<sup>-30</sup>). This correlation is stronger than the rank correlation that we reported in our original study <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, and is over 27 orders of magnitude more statistically significant, due to both the increased strength and the far greater number of genes involved. Thus our expectation of a more significant correlation from a closer genomic comparison is borne out by the data.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>The relationship between number of protein-protein interactions and evolutionary rate between <it>S. cerevisiae </it>and <it>S. pombe</it></p>
               </caption>
               <text>
                  <p><b>The relationship between number of protein-protein interactions and evolutionary rate between <it>S. cerevisiae </it>and <it>S. pombe</it>. </b>(a) The relationship between number of protein-protein interactions and evolutionary rate for all 2119 orthologs with protein interaction data. Several outliers are not shown but were included in the analysis. (b) Average evolutionary rates of genes binned by their number of protein-protein interactions.</p>
               </text>
               <graphic file="1471-2148-3-11-1"/>
            </fig>
            <p>A conclusion of Jordan <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> was that only the proteins with the most interactors showed any reduction in evolutionary rate-i.e., the relationship between interactions and evolutionary rate was confined to those proteins with the most interactors. As shown in Figure <figr fid="F1">1b</figr>, when a more complete set of interactions and orthologs is used, the relationship can be seen to extend over the entire range of number of interactions. It takes the form of a simple monotonic relationship. This supports the idea that regardless of how many protein-protein interactions a protein participates in, each interaction affects the protein's rate of evolution.</p>
            <p>This same analysis can be repeated using an <it>S. cerevisiae</it>&#8211;<it>C. albicans </it>genomic comparison, and the same set of <it>S. cerevisiae </it>protein-protein interactions. When we perform this analysis, the results are even stronger than for the <it>S. pombe </it>comparison. As shown in Figure <figr fid="F2">2a</figr>, a significant correlation is found (<it>n </it>= 2496, Spearman Rank <it>r </it>= -0.25, <it>P </it>= 5.2 &#215; 10<sup>-38</sup>). Separating the data into bins by their number of interactors also shows the same relationship as for the <it>S. pombe </it>comparison, with a clearly monotonic relationship observable over the entire range of protein interactions per protein (Figure <figr fid="F2">2b</figr>).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>The relationship between number of protein-protein interactions and evolutionary rates between <it>S. cerevisiae </it>and <it>C. albicans</it></p>
               </caption>
               <text>
                  <p><b>The relationship between number of protein-protein interactions and evolutionary rates between <it>S. cerevisiae </it>and <it>C. albicans</it>. </b>(a) The relationship between number of protein-protein interactions and evolutionary rate for all 2496 orthologs with protein interaction data. Several outliers are not shown but were included in the analysis. (b) Average evolutionary rates of genes binned by their number of protein-protein interactions.</p>
               </text>
               <graphic file="1471-2148-3-11-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>What is the source of the difference between the two studies?</p>
            </st>
            <p>Our finding of a strong correlation where Jordan <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> did not find one raises the question of what causes the difference. There are two possibilities: our lists of protein-protein interactions, or our lists of orthologs and the associated evolutionary rates. To answer this question, we first tested the correlation between our list of protein interactions and Jordan <it>et al</it>.'s list of orthologs and evolutionary rates. We observed a significant correlation between the two (Spearman Rank <it>r </it>= -0.22, <it>P </it>= 8.5 &#215; 10<sup>-24 </sup>Figure <figr fid="F3">3a</figr>), only slightly weaker than our correlation in Figure <figr fid="F1">1a</figr>. Next we plotted Jordan <it>et al</it>.'s protein interaction data against our list of evolutionary rates. We found no significant correlation between the two data sets (Spearman Rank <it>r </it>= -0.01, <it>P </it>= 0.79; Figure <figr fid="F3">3b</figr>). This demonstrates that the difference in our findings was due to the difference in our protein interaction lists, and not in our list of orthologs or evolutionary rates, and it underscores the importance of using datasets that are as complete as possible in this type of analysis.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Testing the different lists of protein-protein interactions and evolutionary rates from the two studies</p>
               </caption>
               <text>
                  <p><b>Testing the different lists of protein-protein interactions and evolutionary rates from the two studies. </b>(a) A significant correlation is found when using evolutionary rates of orthologs from Jordan <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> with our list of protein-protein interactions. Several outliers are not shown but were included in the analysis. (b) No correlation is seen when using our evolutionary rates of orthologs with Jordan <it>et al</it>.'s list of protein-protein interactions.</p>
               </text>
               <graphic file="1471-2148-3-11-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Is it an indirect correlation?</p>
            </st>
            <p>Jordan <it>et al</it>. speculate that the reduction in evolutionary rate of the most highly connected proteins could be due to their greater likelihood of being essential for viability of the cell <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B12">12</abbr></abbrgrp>. However in our original analysis we showed that the effect on cell fitness when a gene is deleted cannot explain the correlation between number of protein interactions and evolutionary rate <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. In order to investigate this question for these less distant genomic comparisons, we repeated the analysis from our original study. We used Kendall's Partial Tau <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, a metric of partial correlation that allows one to quantify the magnitude of a correlation between two variables when a third, potentially related variable is statistically held constant. For example, in Figure <figr fid="F4">4a</figr>, a diagram is shown in which the arrows connecting the three variables represent the relationships among them. We used Kendall's Partial Tau to assign a <it>P</it>-value (by 10<sup>5 </sup>randomization tests of the data) to each arrow, representing the probability that the arrow represents a correlation that is significantly different from zero when the third variable is statistically controlled. When we use this method to analyze the number of protein-protein interactions, evolutionary rate, and fitness effect of each gene, we find that fitness effect cannot explain the relationship between number of protein-protein interactions and evolutionary rate for either of our genomic comparisons (Figure <figr fid="F4">4b,4c</figr>), consistent with our original study <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Diagram of correlations between number of protein-protein interactions, evolutionary rates, and fitness effects</p>
               </caption>
               <text>
                  <p><b>Diagram of correlations between number of protein-protein interactions, evolutionary rates, and fitness effects </b>(a) Each arrow represents the correlation between the two variables it connects. Whether or not the correlation is statistically significant by Kendall's Partial Tau is shown by the <it>P</it>-values next to each arrow in (b) and (c). (b) The significance of each correlation for the <it>S. cerevisiae-S. pombe </it>comparison. Note that the arrow connecting number of protein-protein interactions and evolutionary rates is highly significant, with none of the 10<sup>5 </sup>randomizations of the data having a stronger correlation. (c) The significance of each correlation for the <it>S. cerevisiae</it>&#8211;<it>C. albicans </it>comparison. Note that the arrow connecting number of protein-protein interactions and evolutionary rates is highly significant, with none of the 10<sup>5 </sup>randomizations of the data having a stronger correlation.</p>
               </text>
               <graphic file="1471-2148-3-11-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Protein-protein interactions and evolutionary rates in bacteria</p>
            </st>
            <p>Jordan <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> also note that they cannot detect a correlation between number of protein-protein interactions and evolutionary rate in <it>Helicobacter pylori</it>. Based on this observation, they conclude that the relationship between interactions and evolutionary rate does not apply to bacteria. However, substantial caution must be exercised in interpreting results that are based on a single protein interaction study <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Indeed, when using either one of the first two published high-throughput yeast protein interaction data sets <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B9">9</abbr></abbrgrp> alone, it is not possible to find a significant correlation between the number of interactors and evolutionary rate; it is only through a compilation of several data sets that a significant relationship emerges for yeast. Until this is possible for <it>H. pylori</it>, we should be reluctant to conclude whether or not such a relationship exists in this organism.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>We have shown that the previously reported relationship between protein-protein interactions and evolutionary rates of proteins is even stronger when comparing different yeast species than it is when comparing yeast with <it>C. elegans</it>. The fact that the relationship can be detected at all with a genomic comparison of species separated by approximately 1 billion years of evolution (<it>S. cerevisiae </it>and <it>C. elegans</it>), as well as with the comparisons of the more closely related species presented here, underscores the robustness of the relationship. That the correlation cannot be detected when using a smaller set of protein-protein interactions, as in the study by Jordan <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, demonstrates the importance of using data that are as complete as possible when correlating diverse genomic data. Since no such complete data set is available for any organism other than <it>S. cerevisiae</it>, it is not yet possible to judge whether the relationship applies to prokaryotes as well as eukaryotes.</p>
         <p>It was correctly noted by Jordan <it>et al</it>. that the correlation we previously observed explains only about 6% of the variance in evolutionary rates <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>; the correlations presented here are only slightly stronger. However, when one considers the various and unavoidable sources of noise in the analysis (e.g., identifying orthologs, aligning orthologs, estimating evolutionary distances, and perhaps most importantly, false positives and negatives in the protein-protein interaction data), as well as confounding biological factors (e.g., the fact that protein-protein interactions will not be invariable between the species whose genomes are compared, so interactions recently evolved in the <it>S. cerevisiae </it>lineage will not show a significant effect on evolutionary rate), it seems surprising that the correlations are as strong as they are. In view of the sources of noise presently unavoidable in evolutionary analysis of functional genomic data, the fraction of variance in evolutionary rate that is explained by any one functional paramater &#8211; such as protein interactions, dispensability, or expression &#8211; cannot yet be taken as an accurate estimate of the relative importance of that factor's role in determining the rate of evolution. It will be interesting to see how much the strength of the correlations examined here increases, and whether the relationships take informative functional forms, as more high-quality protein-protein interaction data sets and genome sequences are published.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' Constributions</p>
         </st>
         <p>HBF performed the correlational and statistical analyses, and wrote the manuscript. DPW found putative orthologs and evolutionary rates, and edited the manscript. AEH edited the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>HBF is supported by an NSF predoctoral fellowship. DPW is supported by an NSF postdoctoral fellowship in bioinformatics. AEH and DPW are supported by NIH grants GM 28016 and 28424 to Marcus W. Feldman.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Protein dispensability and rate of evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Hirsh</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>HB</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>411</volume>
            <fpage>1046</fpage>
            <lpage>1049</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35082561</pubid>
                  <pubid idtype="pmpid" link="fulltext">11429604</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Highly expressed genes in yeast evolve slowly</p>
            </title>
            <aug>
               <au>
                  <snm>Pal</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Papp</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2001</pubdate>
            <volume>158</volume>
            <fpage>927</fpage>
            <lpage>931</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11430355</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Evolutionary rate in the protein interaction network</p>
            </title>
            <aug>
               <au>
                  <snm>Fraser</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Hirsh</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Steinmetz</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Scharfe</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Feldman</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <fpage>750</fpage>
            <lpage>752</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1068696</pubid>
                  <pubid idtype="pmpid" link="fulltext">11976460</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>No simple dependence between protein evolution rate and the number of protein-protein interactions: only the most prolific interactors tend to evolve slowly</p>
            </title>
            <aug>
               <au>
                  <snm>Jordan</snm>
                  <fnm>IK</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>BMC Evolutionary Biology</source>
            <pubdate>2003</pubdate>
            <volume>3</volume>
            <fpage>1</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">140311</pubid>
                  <pubid idtype="pmpid" link="fulltext">12515583</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-3-1</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>MIPS: a database for genomes and protein sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Mewes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Frishman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Guldener</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Mannhaupt</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mayer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Mokrejs</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Morgenstern</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Munsterkotter</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rudd</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Weil</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>31</fpage>
            <lpage>34</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99165</pubid>
                  <pubid idtype="pmpid" link="fulltext">11752246</pubid>
                  <pubid idtype="doi">10.1093/nar/30.1.31</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>A comprehensive analysis of protein-protein interactions in <it>Saccharomyces cerevisiae </it></p>
            </title>
            <aug>
               <au>
                  <snm>Uetz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Giot</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Cagney</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mansfield</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Judson</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Knight</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Lockshon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Narayan</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Srinivasan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pochart</snm>
                  <fnm>P</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <fpage>623</fpage>
            <lpage>627</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35001009</pubid>
                  <pubid idtype="pmpid" link="fulltext">10688190</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The closest BLAST hit is often not the nearest neighbor</p>
            </title>
            <aug>
               <au>
                  <snm>Koski</snm>
                  <fnm>LB</fnm>
               </au>
               <au>
                  <snm>Golding</snm>
                  <fnm>GB</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Evolution</source>
            <pubdate>2001</pubdate>
            <volume>52</volume>
            <fpage>540</fpage>
            <lpage>542</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11443357</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>An improved method for detecting putative orthologs</p>
            </title>
            <aug>
               <au>
                  <snm>Wall</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Hirsh</snm>
                  <fnm>AE</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <inpress/>
         </bibl>
         <bibl id="B9">
            <title>
               <p>A comprehensive two-hybrid analysis to explore the yeast protein interactome</p>
            </title>
            <aug>
               <au>
                  <snm>Ito</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Chiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ozawa</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yoshida</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sakaki</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>4569</fpage>
            <lpage>4574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">31875</pubid>
                  <pubid idtype="pmpid" link="fulltext">11283351</pubid>
                  <pubid idtype="doi">10.1073/pnas.061034498</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Functional organization of the yeast proteome by systematic analysis of protein complexes</p>
            </title>
            <aug>
               <au>
                  <snm>Gavin</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Bosche</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Krause</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Grandi</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Marzioch</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bauer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schultz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rick</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Michon</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Cruciat</snm>
                  <fnm>CM</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>415</volume>
            <fpage>141</fpage>
            <lpage>147</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/415141a</pubid>
                  <pubid idtype="pmpid" link="fulltext">11805826</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Systematic identification of protein complexes in <it>Saccharomyces cerevisiae </it>by mass spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Ho</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gruhler</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Heilbut</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bader</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Millar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bennett</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Boutilier</snm>
                  <fnm>K</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>415</volume>
            <fpage>180</fpage>
            <lpage>183</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/415180a</pubid>
                  <pubid idtype="pmpid" link="fulltext">11805837</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Lethality and centrality in protein networks</p>
            </title>
            <aug>
               <au>
                  <snm>Jeong</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Mason</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Barabasi</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>411</volume>
            <fpage>41</fpage>
            <lpage>42</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35075138</pubid>
                  <pubid idtype="pmpid" link="fulltext">11333967</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Nonparametric Measures of Association</p>
            </title>
            <aug>
               <au>
                  <snm>Gibbons</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Newbury Park, UK: Sage</source>
            <pubdate>1993</pubdate>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The protein-protein interaction map of <it>Helicobacter pylori </it></p>
            </title>
            <aug>
               <au>
                  <snm>Rain</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Selig</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>De Reuse</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Battaglia</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Reverdy</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lenzen</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Petel</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Wojcik</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schachter</snm>
                  <fnm>V</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <fpage>211</fpage>
            <lpage>215</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35051615</pubid>
                  <pubid idtype="pmpid" link="fulltext">11196647</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
