<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2007-8-5-r94</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Modular organization in the reductive evolution of protein-protein interaction networks</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Tamames</snm>
               <fnm>Javier</fnm>
               <insr iid="I1"/>
               <email>javier.tamames@uv.es</email>
            </au>
            <au id="A2">
               <snm>Moya</snm>
               <fnm>Andr&#233;s</fnm>
               <insr iid="I1"/>
               <email>andres.moya@uv.es</email>
            </au>
            <au id="A3">
               <snm>Valencia</snm>
               <fnm>Alfonso</fnm>
               <insr iid="I2"/>
               <email>avalencia@cnio.es</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Instituto Cavanilles de Biodiversidad y Biolog&#237;a Evolutiva, Universitat de Val&#232;ncia, 46071 Valencia, Spain</p>
            </ins>
            <ins id="I2">
               <p>Structural and Computational Biology Programme, Spanish National Cancer Research Centre (CNIO), 28029 Madrid, Spain</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>5</issue>
         <fpage>R94</fpage>
         <url>http://genomebiology.com/2007/8/5/R94</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17532860</pubid>
               <pubid idtype="doi">10.1186/gb-2007-8-5-r94</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>28</day>
               <month>7</month>
               <year>2006</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>30</day>
               <month>1</month>
               <year>2007</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>28</day>
               <month>5</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>28</day>
               <month>05</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Tamames et al; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Protein interaction network evolution</p>
      </shorttitle>
      <shortabs>
         <p>Analysis of the reduction in genome size of <it>Buchnera aphidicola </it>from its common ancestor <it>E. coli </it>shows that the organization of networks into modules is the property that seems to be directly related with the evolutionary process of genome reduction.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The variation in the sizes of the genomes of distinct life forms remains somewhat puzzling. The organization of proteins into domains and the different mechanisms that regulate gene expression are two factors that potentially increase the capacity of genomes to create more complex systems. High-throughput protein interaction data now make it possible to examine the additional complexity generated by the way that protein interactions are organized.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We have studied the reduction in genome size of <it>Buchnera </it>compared to its close relative <it>Escherichia coli</it>. In this well defined evolutionary scenario, we found that among all the properties of the protein interaction networks, it is the organization of networks into modules that seems to be directly related to the evolutionary process of genome reduction.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>In <it>Buchnera</it>, the apparently non-random reduction of the modular structure of the networks and the retention of essential characteristics of the interaction network indicate that the roles of proteins within the interaction network are important in the reductive process.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Bacterial endosymbionts of insects, such as <it>Buchnera aphidicola </it><abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>, <it>Blochmannia floridanus </it><abbrgrp><abbr bid="B3">3</abbr></abbrgrp> and <it>Wigglesworthia glossinidia </it><abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, are paradigms of reductive evolution. These bacteria live in a stable and isolated environment, the bacteriocyte of insects, where the host provides most of their nutritional requirements. As a consequence, the genomes of these bacteria have undergone a process of reduction, losing around 90% of their ancestral genes. These endosymbionts also fail to acquire new genes due to their incapacity to incorporate DNA via lateral gene transfer and their isolated environment. Nevertheless, although their genomes represent a subset of the genome of their ancestors, these gamma-proteobacteria remain closely related to <it>Escherichia coli </it>(98% of the genes in <it>Buchnera </it>have clear orthologues in <it>E. coli</it>). Accordingly, the process of genome shrinkage that these species have undergone has been well documented in terms of the evolution of the corresponding protein families <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>.</p>
         <p>Recent research indicates that the capacity of an organism for adaptation depends not only on the properties of its individual molecular components, but also on the structure and organization of its underlying network of molecular interactions. Indeed, it was recently proposed that the modular organization of the network of interactions is necessary to adapt to changing environments <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. In such a modular system, the compartmentalization of a set of interactions that are both closely interconnected and remain weakly connected to other components in the artificial environment increases. Accordingly, the organization into so-called modules is favored by constant changes in environmental conditions, highlighting the direct causal relationship between such changes and the increase in network modularity. Nevertheless, this proposal awaits a direct assessment in a real biological system.</p>
         <p>Studies on the organization and properties of protein networks have flourished recently thanks to data from high-throughput experiments, for example, two-hybrid screens, pull-down experiments and ChIP-on-chip studies <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Despite limitations in terms of the extent and quality of the datasets, the results produced have been fundamental in enabling the first studies of network structure to be carried out <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B11">11</abbr></abbrgrp>. Such studies have involved the comparison of networks from different origins <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and the construction of the first models of network behavior and evolution <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>.</p>
         <p>Taking advantage of the two recently published high-throughput protein interaction maps of <it>E. coli </it><abbrgrp><abbr bid="B9">9</abbr><abbr bid="B15">15</abbr></abbrgrp>, we have performed a study in which we focused on the reductive evolution of the <it>Buchnera </it>genome. The comparison between the <it>E. coli </it>and <it>Buchnera </it>interaction networks was based on the assumed low rate of protein interaction turnover <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> and the weak probability that new interactions would be generated in the restricted conditions in which <it>Buchnera </it>lives. Accordingly, it can be assumed that when proteins are conserved between <it>E. coli </it>and <it>Buchnera</it>, the protein interactions are also likely to be maintained <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Therefore, the direct relationship between the genomes, the clear conservation of proteins and the probable similarity of their interactions provides a perfect scenario to assess the consequences of adaptation to a stable and nutrient-rich environment.</p>
         <p><it>E. coli </it>is a free-living bacteria known to be capable of adapting to very different environments <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. In contrast, <it>Buchnera </it>is an endosymbiotic bacteria living in a very stable medium. As a result, we would expect the <it>E. coli </it>network to be more modular than that of <it>Buchnera</it>. Hence, reductive evolution might be responsible not only for decreasing the gene repertoire of <it>Buchnera</it>, but also for reducing its network modularity. This hypothesis can be tested by comparing the organization of the protein-protein interaction networks of these two species.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Modular structure of the <it>E. coli </it>network</p>
            </st>
            <p>Modules are set of components (proteins) with a clear imbalance in favor of internal versus external connections. Therefore, the modularity of a network can be quantified by comparing the number of connections within and between modules. Consequently, the main problem when defining modules is the search for the optimal division of the network that maximizes the ratio between intra- and inter-module connectivities. Several algorithms have been proposed to carry out the task of decomposing networks into their modular components <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. We have used two recently proposed algorithms <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp> that have been shown to produce optimal decomposition of biological networks. Since both algorithms are based on different approaches, and two different maps of protein-protein interactions of <it>E. coli </it>are available <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B15">15</abbr></abbrgrp>, the validity of the conclusions is relatively independent of the method and the data source. It is important to realize that the values of the modularity coefficients have to be normalized/corrected with respect to the modularity expected in equivalent random networks of the same connectivity, thereby eliminating the effect that the pattern of connections in the network could have on the calculation of its modularity (see Materials and methods).</p>
            <p>The results of analyzing the structure of the <it>E. coli </it>network show that it is most modular at any level, irrespective of the clustering methods used (see Table S3 in Additional data file 1 for descriptions and results obtained using other clustering approaches for determining modularity). The optimal decompositions render between 10 and 15 modules (Table <tblr tid="T1">1</tblr>), most of them significant from a functional point of view (see Materials and methods). Some of the modules are quite homogeneous and contain easily discernible functions, that is, protein synthesis (including ribosomal proteins), transcription (RNA polymerase), cell division, DNA synthesis (DNA polymerase), or DNA maintenance, corresponding well to the empirical analysis of the original dataset established by Butland <it>et al</it>. <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. These modules account for more than half of the modularity in the network (Table S1 in Additional data file 1). Other modules contribute less to the global modularity and are composed of proteins with more diverse functions. The overall structure of the network indicates the existence of a central core that is clearly organized into modules of protein interactions, while many other functions or activities associated with this core display less modular structure.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Values of modularity for <it>E. coli </it>and <it>Buchnera </it>networks</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Dataset</p>
                     </c>
                     <c ca="center">
                        <p>Modules and validation</p>
                     </c>
                     <c ca="center">
                        <p>Q<sub>real</sub></p>
                     </c>
                     <c ca="center">
                        <p>Q<sub>rand</sub></p>
                     </c>
                     <c ca="center">
                        <p>Q<sub>norm </sub>(Q<sub>real </sub>- Q<sub>rand</sub>)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Newman algorithm</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>E. coli</it>, Butland dataset</p>
                     </c>
                     <c ca="center">
                        <p>12 (5/10)</p>
                     </c>
                     <c ca="center">
                        <p>0.346</p>
                     </c>
                     <c ca="center">
                        <p>0.244</p>
                     </c>
                     <c ca="center">
                        <p>0.102</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>Buchnera</it>, Butland dataset</p>
                     </c>
                     <c ca="center">
                        <p>7 (3/7)</p>
                     </c>
                     <c ca="center">
                        <p>0.259</p>
                     </c>
                     <c ca="center">
                        <p>0.232</p>
                     </c>
                     <c ca="center">
                        <p>0.027</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>Buchnera </it>constrained, Butland dataset</p>
                     </c>
                     <c ca="center">
                        <p>7 (2/6)</p>
                     </c>
                     <c ca="center">
                        <p>0.182</p>
                     </c>
                     <c ca="center">
                        <p>0.168</p>
                     </c>
                     <c ca="center">
                        <p>0.014</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>E. coli</it>, Arifuzzaman dataset</p>
                     </c>
                     <c ca="center">
                        <p>15 (8/13)</p>
                     </c>
                     <c ca="center">
                        <p>0.409</p>
                     </c>
                     <c ca="center">
                        <p>0.329</p>
                     </c>
                     <c ca="center">
                        <p>0.080</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>Buchnera</it>, Arifuzzaman dataset</p>
                     </c>
                     <c ca="center">
                        <p>10 (4/9)</p>
                     </c>
                     <c ca="center">
                        <p>0.460</p>
                     </c>
                     <c ca="center">
                        <p>0.423</p>
                     </c>
                     <c ca="center">
                        <p>0.037</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>Buchnera </it>constrained, Arifuzzaman dataset</p>
                     </c>
                     <c ca="center">
                        <p>12 (4/10)</p>
                     </c>
                     <c ca="center">
                        <p>0.274</p>
                     </c>
                     <c ca="center">
                        <p>0.265</p>
                     </c>
                     <c ca="center">
                        <p>0.009</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>E. coli</it>, STRING</p>
                     </c>
                     <c ca="center">
                        <p>33 (32/32)</p>
                     </c>
                     <c ca="center">
                        <p>0.670</p>
                     </c>
                     <c ca="center">
                        <p>0.209</p>
                     </c>
                     <c ca="center">
                        <p>0.461</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>Buchnera</it>, STRING</p>
                     </c>
                     <c ca="center">
                        <p>12 (11/11)</p>
                     </c>
                     <c ca="center">
                        <p>0.581</p>
                     </c>
                     <c ca="center">
                        <p>0.272</p>
                     </c>
                     <c ca="center">
                        <p>0.309</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>Buchnera </it>constrained, STRING</p>
                     </c>
                     <c ca="center">
                        <p>14 (11/11)</p>
                     </c>
                     <c ca="center">
                        <p>0.493</p>
                     </c>
                     <c ca="center">
                        <p>0.210</p>
                     </c>
                     <c ca="center">
                        <p>0.283</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Guimer&#225; algorithm</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>E. coli</it>, Butland dataset</p>
                     </c>
                     <c ca="center">
                        <p>10 (7/10)</p>
                     </c>
                     <c ca="center">
                        <p>0.357</p>
                     </c>
                     <c ca="center">
                        <p>0.248</p>
                     </c>
                     <c ca="center">
                        <p>0.109</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>Buchnera</it>, Butland dataset</p>
                     </c>
                     <c ca="center">
                        <p>6 (3/5)</p>
                     </c>
                     <c ca="center">
                        <p>0.263</p>
                     </c>
                     <c ca="center">
                        <p>0.237</p>
                     </c>
                     <c ca="center">
                        <p>0.026</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>Buchnera </it>constrained, Butland dataset</p>
                     </c>
                     <c ca="center">
                        <p>8 (2/7)</p>
                     </c>
                     <c ca="center">
                        <p>0.192</p>
                     </c>
                     <c ca="center">
                        <p>0.179</p>
                     </c>
                     <c ca="center">
                        <p>0.013</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>E. coli</it>, Arifuzzaman dataset</p>
                     </c>
                     <c ca="center">
                        <p>12 (6/11)</p>
                     </c>
                     <c ca="center">
                        <p>0.413</p>
                     </c>
                     <c ca="center">
                        <p>0.332</p>
                     </c>
                     <c ca="center">
                        <p>0.081</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>Buchnera</it>, Arifuzzaman dataset</p>
                     </c>
                     <c ca="center">
                        <p>8 (4/8)</p>
                     </c>
                     <c ca="center">
                        <p>0.461</p>
                     </c>
                     <c ca="center">
                        <p>0.432</p>
                     </c>
                     <c ca="center">
                        <p>0.029</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>Buchnera </it>constrained, Arifuzzaman dataset</p>
                     </c>
                     <c ca="center">
                        <p>11 (2/8)</p>
                     </c>
                     <c ca="center">
                        <p>0.266</p>
                     </c>
                     <c ca="center">
                        <p>0.242</p>
                     </c>
                     <c ca="center">
                        <p>0.024</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>E. coli</it>, STRING</p>
                     </c>
                     <c ca="center">
                        <p>19 (17/17)</p>
                     </c>
                     <c ca="center">
                        <p>0.669</p>
                     </c>
                     <c ca="center">
                        <p>0.211</p>
                     </c>
                     <c ca="center">
                        <p>0.458</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>Buchnera</it>, STRING</p>
                     </c>
                     <c ca="center">
                        <p>11(10/10)</p>
                     </c>
                     <c ca="center">
                        <p>0.566</p>
                     </c>
                     <c ca="center">
                        <p>0.277</p>
                     </c>
                     <c ca="center">
                        <p>0.289</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>Buchnera </it>constrained, STRING</p>
                     </c>
                     <c ca="center">
                        <p>9 (7/7)</p>
                     </c>
                     <c ca="center">
                        <p>0.489</p>
                     </c>
                     <c ca="center">
                        <p>0.231</p>
                     </c>
                     <c ca="center">
                        <p>0.258</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Modularity is calculated using different algorithms as described in the text for the <it>E. coli </it>and <it>Buchnera </it>networks. The module validation is indicated between parentheses after the number of modules for each network and this provides information on the number of modules that are statistically significant with regards to the STRING data (see text for details). For instance, 5/10 means that five out of ten modules are significant in terms of STRING interactions. The number of modules validated is sometimes different to the total number of modules, since some modules are too small to be statistically assessed. When using STRING-derived networks, all modules can be validated since the same information was used to construct the network. The table also shows the modularity coefficient (Q) for real and randomized networks, and the normalized modularity coefficient, resulting from the subtraction of the modularity coefficients for real and random modules.</p>
               </tblfn>
            </tbl>
            <p>The potential <it>Buchnera </it>protein interaction network was obtained by maintaining the connections between the orthologous proteins in <it>E. coli</it>. The modular decomposition of the resulting network shows that the <it>Buchnera </it>network was always significantly less modular than that of <it>E. coli </it>(Table <tblr tid="T1">1</tblr>). The decrease in the modularity coefficient implies that the network obtained for <it>Buchnera </it>is much harder to separate into isolated components than that of <it>E. coli</it>. Therefore, we concluded that the process of reducing the genome size (reductive evolution) creates a less compartmentalized network with a smaller degree of modularity.</p>
            <p>An alternative approach is to study the process of module reduction maintaining the modular structure obtained for <it>E. coli </it>but deleting the proteins that do not have orthologues in <it>Buchnera</it>. In this way, the reduction of the modules originally defined in <it>E. coli </it>can be assessed. We found that the ensuing 'constrained' decomposition of the <it>Buchnera </it>network is also less modular than that of <it>E. coli</it>. Indeed, the modularity observed is similar to that observed when the <it>Buchnera </it>network was decomposed independently (Table <tblr tid="T1">1</tblr>). Furthermore, with the exception of the module containing ribosomal proteins, the modules in the 'constrained' network are significantly smaller than those in E. <it>coli</it>. The deletion involves between 70% and 91% of the nodes and, interestingly, the set of conserved nodes often consists of those involved in the connection between modules (Figure <figr fid="F1">1</figr>).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>View of three modules of the <it>E. coli </it>network</p>
               </caption>
               <text>
                  <p>View of three modules of the <it>E. coli </it>network. The blue module corresponds to cell division and chaperones. The red module is related to RNA polymerase and the green module involves DNA metabolism. The size of the nodes indicates their absolute degree or number of connections. Conserved nodes in <it>Buchnera </it>are shown in darker colors, while conserved connections are shown in thick black lines. Connector hubs are completely conserved, whereas non-hub connectors are deleted in some instances.</p>
               </text>
               <graphic file="gb-2007-8-5-r94-1"/>
            </fig>
            <p>Nevertheless, the coefficients are low in all cases. In <it>E. coli</it>, they are around 0.1, indicating little modularity (high modularity is achieved when the coefficient reaches values around 0.3). The coefficients are close to zero in all <it>Buchnera </it>networks, indicating that modularity has been almost completely lost in these networks.</p>
         </sec>
         <sec>
            <st>
               <p>The role of the nodes in the reduction of the modular structure of the network</p>
            </st>
            <p>The connections between modules in the <it>E. coli </it>network are dominated by non-hub connectors, that is, nodes with an average number of links within their module but that are well connected to other modules <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. These nodes account for more than 80% of the connections between modules. The remaining connections are made by connector hubs with strong links both within and between modules but that are, in turn, weakly connected between themselves (examples of connector hubs are peptidyl-prolyl cis/trans isomerase <it>tig </it>and pyruvate dehydrogenase <it>aceE</it>). This is characteristic of a feature known as dissortativity <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, which has been documented in several other biological networks<abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. There is extensive communication between modules in the <it>E. coli </it>network and this is mainly based on the links provided by non-hub connectors.</p>
            <p>In the constrained reduced <it>Buchnera </it>network, it is apparent that the number of peripheral nodes has diminished. While there was less than average loss of non-hub connectors, connector hubs were almost completely preserved (Figure <figr fid="F2">2</figr>). Therefore, connector hubs appear to create a highly preserved backbone of interactions. This emphasizes the crucial importance of connector hubs in maintaining the integrity of the protein network, in contrast to the findings from studies of metabolic networks <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Density map of the role of the nodes in the <it>E. coli </it>network that are conserved or deleted in <it>Buchnera</it>, according to the procedure described in [23]</p>
               </caption>
               <text>
                  <p>Density map of the role of the nodes in the <it>E. coli </it>network that are conserved or deleted in <it>Buchnera</it>, according to the procedure described in [23]. The degree of participation measures the connection of a given node with the nodes from modules other than its own. The within-module degree measures the connection of the node with other nodes within its own module. Peripheral nodes show both low participation and low within-module degree. Non-hub connectors participate significantly and with a low degree of within-module connections, while connector hubs have both high participation and high degree of within-module connections [23]. Connector hubs and non-hub connectors are mainly conserved in the <it>Buchnera </it>network, while the deletion of nodes mainly affects peripheral nodes. The measures are calculated as in [23], based on the modular division of the <it>E. coli </it>network obtained from the Butland dataset. The scale refers to the number of nodes in each position.</p>
               </text>
               <graphic file="gb-2007-8-5-r94-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>The reduction of network modularity and of the overall properties of the network</p>
            </st>
            <p>Reduction of modularity affects certain topological aspects of the network. For simplicity, we restrict our analysis to the results for the Butland dataset, since the results for the Arifuzzaman <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> dataset are very similar. The analysis of connectivity shows that the <it>E. coli </it>and <it>Buchnera </it>networks follow a power-law distribution with exponents (<it>&#947;</it>) of 2.25 for <it>E. coli </it>and 2.03 for <it>Buchnera</it>. The smaller exponent in <it>Buchnera </it>indicates that hubs are more prevalent in the network, since they are in contact with a larger proportion of nodes. This highlights the relevance of connector hubs, which produce a more compact network in <it>Buchnera</it>, as reflected by the average number of links per node (6.07 link per node in <it>Buchnera </it>versus 4.16 in <it>E. coli</it>) and the smaller diameter of the <it>Buchnera </it>network (2.821 versus 3.607 for <it>E. coli</it>). Both networks are almost completely connected, which means that there are very few nodes in islands not linked to the main component. In both networks, isolated nodes constitute just 2% of the total number of nodes. Additionally, the length of the paths crossing the network remains unaltered, and only 60 of a possible 37,408 paths were longer in <it>Buchnera </it>than in <it>E. coli</it>, with a difference of just one node. Therefore, rather than fragmenting the network, the removal of nodes and links in the <it>Buchnera </it>network maintains the global topology of the network, preserving the main interaction backbone. The preferential deletion of connections between peripheral nodes that lie outside of the core of the network creates an apparent enrichment of densely connected motifs in <it>Buchnera</it>, particularly when the relative proportions are considered (Table S1 in Additional data file 1).</p>
            <p>When nodes were randomly removed from the <it>E. coli </it>network until it reached a size equivalent to that of <it>Buchnera</it>, the organization of the network was completely lost. The resulting network is fragmented into a myriad of small components (islands), each with few isolated nodes. This is an important indication of how node deletion during reductive evolution has been accomplished in a controlled manner that preserves the network organization and the cross-talk between the remaining processes.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We compare the structure of two independent sets of experimentally derived interactions for <it>E. coli </it>with the deduced structure of interactions for the closely related <it>Buchnera </it>genome. Thus, the reductive evolution followed by <it>Buchnera</it>, whereby more than 90% of the ancestral genes have been lost, is correlated with the loss of modularity of the protein interaction network. Nevertheless, the rest of the characteristics of the network in <it>Buchnera </it>essentially remain unchanged. These observations provide an initial model to understand reductive evolution, adaptation to environments and network organization. As in previous analyses of network structure, it is clear that, in this early phase, the models will benefit greatly from additional information from other genomes, and from an overall improvement in the quality of the proteomic experiments. Nevertheless, even bearing these limitations in mind, it is possible to see how the reduced modularity in the <it>Buchnera </it>genome is caused by the partial deletion of nodes in regions that are connected to dense clusters of essential functions in the <it>E. coli </it>protein interaction network. This is demonstrated by measuring the modularity in the reduced network. In contrast to what would be expected if the preferentially deleted genes were those participating in a non-modular part of the <it>E. coli </it>network, the modularity decreased with respect to the <it>E. coli </it>network.</p>
         <p>The <it>E. coli </it>network is apparently composed of a modular core and a mostly non-modular peripheral region. This could imply that, at this level, modular structures are not determinant for the evolution of the network. Reduction of modularity is not achieved by the removal of entire modules (which could even produce an increase in the modularity coefficient), but rather by selective deletion of nodes in the modular parts of the network (Figure <figr fid="F3">3</figr>). In other words, the process of genome reduction apparently involves deleting peripheral regions of the network and the selective loss of proteins forming part of densely packed clusters that are separated into modules. However, it affects the proteins directly implicated in maintaining the connections between modules to a much smaller extent (Figure <figr fid="F2">2</figr>). The result is a very compact network with a smaller diameter, a conserved backbone and an increase in the proportion of densely connected motifs, as well as the preservation of characteristics such as path length and network topology. The way to maintain or increase modularity in reduced networks would be to remove connections between modules and, therefore, communication between processes, which could be highly deleterious. Our conclusion is that the loss of modularity in <it>Buchnera </it>networks seems to be mainly related to the conservation of the network backbone, rather than resulting from the loss of adaptability to environmental conditions.</p>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Deletion of interactions may produce reduced modularity</p>
            </caption>
            <text>
               <p>Deletion of interactions may produce reduced modularity. Three modules (red, yellow, blue) are shown, surrounded by a non-modular region. Even if the reduction is higher in peripheral nodes (non-modular region), modularity may decrease since the module structure is lost and only the backbone remains.</p>
            </text>
            <graphic file="gb-2007-8-5-r94-3"/>
         </fig>
         <p>These results might be important in the context of the evolutionary implications of network structure. It has been suggested that the organization of biological networks (interaction and control networks) is a direct product of the simple process of gene duplication and deletion, and that it is not directly subjected to natural selection <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. The apparently non-random reduction of the modular structure of the networks and the retention of essential characteristics of the interaction network indicate that the roles of proteins within the interaction network are important in the reductive process. Accordingly, the importance of the roles of the proteins must be taken into consideration when discussing the effect of the natural selection on the organization of protein networks.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <p>Protein-protein interaction data for <it>E. coli </it>were obtained as described in the original studies <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <p>The first study <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> is based on yeast-based tandem affinity purification (TAP) adapted to <it>E. coli</it>. In this procedure, 1,000 <it>E. coli </it>open reading frames were tagged (22% of the genome) and their interactions with other proteins within this set were determined. It was possible to determine 5,254 protein-protein interactions, involving 1,264 proteins (Butland dataset). To our knowledge, this was the first set of <it>E. coli </it>protein-protein interaction data determined by high-throughput procedures.</p>
         <p>The second study <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> was based on producing His-tagged bait proteins; after co-purifying the interacting bait and prey proteins on a Ni<sup>2+</sup>-NTA column, they were identified by mass spectrometry. There were 4,339 <it>E. coli </it>proteins tested, for which 11,511 interactions were determined. The authors provided a reliable set of 8,893 of these interactions, involving 2,821 proteins, which were reproducible in the original study (Arifuzzaman dataset). The reliable set was the one used by us in this study.</p>
         <p>While both datasets share 983 proteins, only 168 interactions are present in both sources, a situation similar to that observed in yeast <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>.</p>
         <p>For <it>E. coli </it>proteins, orthologues in <it>B. aphidicola </it>strain APS (RefSeq NC_002528) were identified by perfoming BLASTP homology searches. To correctly identify orthologues, both proteins must fulfill the following criteria: one is the best hit of the other (best bi-directional hits); the BLASTP E-value must be above 1e-15; and the alignment must span at least 80% of the residues in both proteins. Considering complete genomes, we were able to identify <it>E. coli </it>orthologues for 98% of <it>Buchnera </it>proteins, while around 90% of <it>E. coli </it>protein-coding genes have been deleted from the <it>Buchnera </it>genome (<it>E. coli </it>strain K-12 contains 4,243 genes; <it>Buchnera </it>has 564 genes). For the two sources of data (1,264 <it>E. coli </it>proteins in the Butland dataset and 2,821 in the Arifuzzaman dataset), we identified 278 and 260 orthologues in the proteome of <it>Buchnera</it>, respectively.</p>
         <p>The protein-protein interaction network in <it>Buchnera </it>was generated by mapping <it>E. coli </it>interactions between conserved proteins in <it>Buchnera</it>. The removal of nodes (proteins) implies the removal of all links attached to them. This creates a network of 1,638 interaction pairs for the Butland dataset and 549 for the Arifuzzaman dataset, implying that the latter is enriched in interactions between proteins that are not conserved in <it>Buchnera</it>.</p>
         <p>We also created a third network based on data from the STRING database <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. STRING contains known and inferred relationships between <it>E. coli </it>proteins derived using diverse methods. The version of STRING used in this work involves 3,868 proteins implicated in 33,733 relationships, and it does not include the data from the other two sources. Thus, it comprises an independent set of interactions that can be used to validate the modular decomposition of the networks.</p>
         <p>For the networks, the node degree was measured as the number of links for each node. Links were non-directional and corresponded to protein-protein interactions. Protein motifs were identified as described previously <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. The path length (<it>l</it>) between all pairs of nodes was calculated using a standard Dijstra algorithm.</p>
         <p>A module is defined as a part of the network with abundant connections between the nodes within it, and less connected to nodes outside the module. The ratio between these two measures (connections within the module and with other modules) defines the modularity coefficient <it>Q</it>. The modularity coefficient was calculated as the fraction of edges in the network that connect the nodes in a module minus the expected value of the same quantity in a network, with the same assignment of nodes in modules but with random connections between nodes <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>:</p>
         <p>
            <display-formula>
               <m:math name="gb-2007-8-5-r94-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>Q</m:mi>
                        <m:mo>=</m:mo>
                        <m:mstyle displaystyle="true">
                           <m:munderover>
                              <m:mo>&#8721;</m:mo>
                              <m:mrow>
                                 <m:mi>s</m:mi>
                                 <m:mo>=</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                              <m:mi>K</m:mi>
                           </m:munderover>
                           <m:mrow>
                              <m:mrow>
                                 <m:mo>[</m:mo>
                                 <m:mrow>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>l</m:mi>
                                             <m:mi>S</m:mi>
                                          </m:msub>
                                       </m:mrow>
                                       <m:mi>L</m:mi>
                                    </m:mfrac>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msup>
                                       <m:mrow>
                                          <m:mrow>
                                             <m:mo>(</m:mo>
                                             <m:mrow>
                                                <m:mfrac>
                                                   <m:mrow>
                                                      <m:msub>
                                                         <m:mi>d</m:mi>
                                                         <m:mi>s</m:mi>
                                                      </m:msub>
                                                   </m:mrow>
                                                   <m:mrow>
                                                      <m:mn>2</m:mn>
                                                      <m:mi>L</m:mi>
                                                   </m:mrow>
                                                </m:mfrac>
                                             </m:mrow>
                                             <m:mo>)</m:mo>
                                          </m:mrow>
                                       </m:mrow>
                                       <m:mn>2</m:mn>
                                    </m:msup>
                                 </m:mrow>
                                 <m:mo>]</m:mo>
                              </m:mrow>
                           </m:mrow>
                        </m:mstyle>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8tuQ8FMI8Gi=hEeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciGacaGaaeqabaqadeqadaaakeaacaWGrbGaeyypa0ZaaabCaeaadaWadaqaamaalaaabaGaamiBamaaBaaaleaacaWGtbaabeaaaOqaaiaadYeaaaGaeyOeI0YaaeWaaeaadaWcaaqaaiaadsgadaWgaaWcbaGaam4CaaqabaaakeaacaaIYaGaamitaaaaaiaawIcacaGLPaaadaahaaWcbeqaaiaaikdaaaaakiaawUfacaGLDbaaaSqaaiaadohacqGH9aqpcaaIXaaabaGaam4saaqdcqGHris5aaaa@46C3@</m:annotation>
                  </m:semantics>
               </m:math>
            </display-formula>
         </p>
         <p>where <it>K </it>is the number of modules, <it>L </it>is the number of edges in the network, <it>l</it><sub><it>s </it></sub>is the number of edges between nodes in modules, and <it>d</it><sub>5 </sub>is the sum of the degrees of the nodes in module <it>s</it>. Since modularity is possibly affected by the different size or connectivity of the networks, it is advisable to normalize this measure with respect to the modularity of random networks with the same connectivity. These random networks are generated by swapping the connections between pairs of nodes. For instance, if the real network contains the interactions A-B and C-D, the randomized network will contain A-D and B-C. In this way, the random network maintains node degrees and connectivity.</p>
         <p>Several algorithms have been proposed to extract modules from networks. To test the validity of our conclusions, we used two different methods to calculate modules and modularity coefficients. The algorithm of Guimer&#225; and Nunes-Amaral <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> is based on a simulated annealing procedure, and it has been successfully used to decompose metabolic networks. Newman's algorithm <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> is based on the spectral decomposition of the eigenvectors of a modularity matrix derived from the interactions between nodes. Both methods claim to obtain optimal decomposition of the networks, and the results using both algorithms are very similar (Table <tblr tid="T1">1</tblr>). Guimer&#225;'s algorithm achieves slightly higher modularities, while Newman's algorithm is considerably faster, especially when dealing with big networks. The analysis of the resulting modules shows that both decompositions are similar, with 70% of the interactions belonging to the same modules. The normalized modularity coefficients are very close, regardless of the algorithm or the data source used, indicating that they are robust and not influenced by such factors.</p>
         <p>Since we wanted to inspect the conservation of modularity when the network is reduced, the modularity of <it>Buchnera</it>'s networks was calculated either by generating a new modular decomposition for <it>Buchnera</it>, or using the same modular decomposition obtained for <it>E. coli </it>such that the modules were maintained while the nodes and interactions not present in <it>Buchnera </it>were removed. In this way, we are able to study the way in which original modules are reduced.</p>
         <p>To check the quality and functional relevance of modules, we used data from the STRING database <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. Modules with functional significance would be expected to be enriched in these interactions. Therefore, we calculated the total number of interactions per pair of proteins in STRING and, accordingly, the number of interactions per pair that would be expected within each of the modules in the network based on the size of the module. We consider that the module is validated if it is significantly enriched in STRING interactions (<it>p </it>value &lt; 0.1).</p>
         <p>The networks were plotted with the Cytoscape software <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. The evaluation of functions over-represented in each of the modules (using Gene Ontology <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> 'biological process' category) was performed using the BiNGO plug-in <abbrgrp><abbr bid="B30">30</abbr></abbrgrp></p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> includes supplementary tables: Table S1 lists the composition of the main modules in <it>E. coli</it>, for the modular decomposition of the Butland dataset using Guimer&#225;'s algorithm; Table S2 shows the different motifs with three or four nodes found in the real networks and randomized networks; Table S3 shows the results of the modular decomposition of the Butland dataset by means of a k-means clustering algorithm, as an additional confirmation of the validity of the results; Table S4 lists the main conserved hubs in <it>Buchnera</it>, and their functions in the Butland dataset. Additional data file <supplr sid="S2">2</supplr> shows the relationship between the connectivity of the nodes and their deletion in <it>Buchnera</it>'s network (Butland dataset), and the probability of the deletion of nodes as a function of the probable number of connections. Additional data file <supplr sid="S3">3</supplr> illustrates three examples of hub deletion in <it>Buchnera</it>.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Supplementary tables</p>
            </caption>
            <text>
               <p>Table S1 lists the composition of the main modules in <it>E. coli</it>, for the modular decomposition of the Butland dataset using Guimer&#225;'s algorithm. Table S2 shows the different motifs with three or four nodes found in the real networks and randomized networks. Table S3 shows the results of the modular decomposition of the Butland dataset by means of a k-means clustering algorithm, as an additional confirmation of the validity of the results. Table S4 lists the main conserved hubs in <it>Buchnera</it>, and their functions in the Butland dataset.</p>
            </text>
            <file name="gb-2007-8-5-r94-S1.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Relationship between the connectivity of the nodes and their deletion in <it>Buchnera</it>'s network (Butland dataset), and the probability of the deletion of nodes as a function of the probable number of connections</p>
            </caption>
            <text>
               <p>Relationship between the connectivity of the nodes and their deletion in <it>Buchnera</it>'s network (Butland dataset), and the probability of the deletion of nodes as a function of the probable number of connections.</p>
            </text>
            <file name="gb-2007-8-5-r94-S2.ppt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>Three examples of hub deletion in <it>Buchnera</it></p>
            </caption>
            <text>
               <p>Three examples of hub deletion in <it>Buchnera</it>.</p>
            </text>
            <file name="gb-2007-8-5-r94-S3.ppt">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>JT wishes to acknowledge Roger Guimer&#225; and Mark Newman. JT is the recipient of a contract from the FIS programme, ISCIII, Ministerio de Sanidad y Consumo (Spain). This work has been supported by grant BMC2003-00305 from Ministerio de Educaci&#243;n y Ciencia (Spain), to A.M., and EU grants DIAMONDS: LSHG-CT-2004-512143 and EMERGENCE, to A.V.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Reductive genome evolution in <it>Buchnera aphidicola</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>van Ham</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Kamerbeek</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Palacios</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rausell</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Abascal</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Bastolla</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Fernandez</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Jimenez</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Postigo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>FJ</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>581</fpage>
            <lpage>586</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">141039</pubid>
                  <pubid idtype="pmpid" link="fulltext">12522265</pubid>
                  <pubid idtype="doi">10.1073/pnas.0235981100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Genome sequence of the endocellular bacterial symbiont of aphids <it>Buchnera </it>sp. APS.</p>
            </title>
            <aug>
               <au>
                  <snm>Shigenobu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sakaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ishikawa</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>407</volume>
            <fpage>81</fpage>
            <lpage>86</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35024074</pubid>
                  <pubid idtype="pmpid" link="fulltext">10993077</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The genome sequence of <it>Blochmannia floridanus </it>: comparative analysis of reduced genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Gil</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Zientz</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Delmotte</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Gonzalez-Candelas</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Latorre</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rausell</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kamerbeek</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gadau</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Holldobler</snm>
                  <fnm>B</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>9388</fpage>
            <lpage>9393</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">170928</pubid>
                  <pubid idtype="pmpid" link="fulltext">12886019</pubid>
                  <pubid idtype="doi">10.1073/pnas.1533499100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Genome sequence of the endocellular obligate symbiont of tsetse flies, <it>Wigglesworthia glossinidia</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Akman</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yamashita</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Oshima</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Aksoy</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>32</volume>
            <fpage>402</fpage>
            <lpage>407</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng986</pubid>
                  <pubid idtype="pmpid" link="fulltext">12219091</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Spontaneous evolution of modularity and network motifs.</p>
            </title>
            <aug>
               <au>
                  <snm>Kashtan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Alon</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>13773</fpage>
            <lpage>13778</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1236541</pubid>
                  <pubid idtype="pmpid" link="fulltext">16174729</pubid>
                  <pubid idtype="doi">10.1073/pnas.0503610102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>A comprehensive two-hybrid analysis to explore the yeast protein interactome.</p>
            </title>
            <aug>
               <au>
                  <snm>Ito</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Chiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ozawa</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yoshida</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sakaki</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>4569</fpage>
            <lpage>4574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">31875</pubid>
                  <pubid idtype="pmpid" link="fulltext">11283351</pubid>
                  <pubid idtype="doi">10.1073/pnas.061034498</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Transcriptional regulatory networks in <it>Saccharomyces cerevisiae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Rinaldi</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Odom</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gerber</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Hannett</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Harbison</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>I</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>799</fpage>
            <lpage>804</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1075090</pubid>
                  <pubid idtype="pmpid" link="fulltext">12399584</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A protein interaction map of <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Giot</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bader</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Brouwer</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Chaudhuri</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kuang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hao</snm>
                  <fnm>YL</fnm>
               </au>
               <au>
                  <snm>Ooi</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Godwin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Vitols</snm>
                  <fnm>E</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <fpage>1727</fpage>
            <lpage>1736</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1090289</pubid>
                  <pubid idtype="pmpid" link="fulltext">14605208</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Interaction network containing conserved and essential protein complexes in <it>Escherichia coli</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Butland</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Peregrin-Alvarez</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Canadien</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Starostine</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Richards</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Beattie</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Krogan</snm>
                  <fnm>N</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>433</volume>
            <fpage>531</fpage>
            <lpage>537</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03239</pubid>
                  <pubid idtype="pmpid" link="fulltext">15690043</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Proteome survey reveals modularity of the yeast cell machinery.</p>
            </title>
            <aug>
               <au>
                  <snm>Gavin</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Aloy</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Grandi</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Krause</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Boesche</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Marzioch</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rau</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Bastuck</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dumpelfeld</snm>
                  <fnm>B</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>440</volume>
            <fpage>631</fpage>
            <lpage>636</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04532</pubid>
                  <pubid idtype="pmpid" link="fulltext">16429126</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Network biology: understanding the cell's functional organization.</p>
            </title>
            <aug>
               <au>
                  <snm>Barabasi</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>101</fpage>
            <lpage>113</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1272</pubid>
                  <pubid idtype="pmpid" link="fulltext">14735121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Network motifs: simple building blocks of complex networks.</p>
            </title>
            <aug>
               <au>
                  <snm>Milo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Shen-Orr</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Itzkovitz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kashtan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Chklovskii</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Alon</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>824</fpage>
            <lpage>827</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.298.5594.824</pubid>
                  <pubid idtype="pmpid" link="fulltext">12399590</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Topological and causal structure of the yeast transcriptional regulatory network.</p>
            </title>
            <aug>
               <au>
                  <snm>Guelzim</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bottani</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bourgine</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kepes</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <fpage>60</fpage>
            <lpage>63</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng873</pubid>
                  <pubid idtype="pmpid" link="fulltext">11967534</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Genomic analysis of regulatory network dynamics reveals large topological changes.</p>
            </title>
            <aug>
               <au>
                  <snm>Luscombe</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Babu</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Teichmann</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>431</volume>
            <fpage>308</fpage>
            <lpage>312</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02782</pubid>
                  <pubid idtype="pmpid" link="fulltext">15372033</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Large-scale identification of protein-protein interaction of <it>Escherichia coli </it>K-12.</p>
            </title>
            <aug>
               <au>
                  <snm>Arifuzzaman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Maeda</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nishikata</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Takita</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Saito</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ara</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nakahigashi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Hirai</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>686</fpage>
            <lpage>691</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1457052</pubid>
                  <pubid idtype="pmpid" link="fulltext">16606699</pubid>
                  <pubid idtype="doi">10.1101/gr.4527806</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>How the global structure of protein interaction networks evolves.</p>
            </title>
            <aug>
               <au>
                  <snm>Wagner</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Biol Sci</source>
            <pubdate>2003</pubdate>
            <volume>270</volume>
            <fpage>457</fpage>
            <lpage>466</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1691265</pubid>
                  <pubid idtype="pmpid" link="fulltext">12641899</pubid>
                  <pubid idtype="doi">10.1098/rspb.2002.2269</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Chance and necessity in the evolution of minimal metabolic networks.</p>
            </title>
            <aug>
               <au>
                  <snm>Pal</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Papp</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lercher</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Csermely</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>SG</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>440</volume>
            <fpage>667</fpage>
            <lpage>670</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04568</pubid>
                  <pubid idtype="pmpid" link="fulltext">16572170</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The complete genome sequence of <it>Escherichia coli </it>K-12.</p>
            </title>
            <aug>
               <au>
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Plunkett</snm>
                  <fnm>G</fnm>
                  <suf>3rd</suf>
               </au>
               <au>
                  <snm>Bloch</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Perna</snm>
                  <fnm>NT</fnm>
               </au>
               <au>
                  <snm>Burland</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Riley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Collado-Vides</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Glasner</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Rode</snm>
                  <fnm>CK</fnm>
               </au>
               <au>
                  <snm>Mayhew</snm>
                  <fnm>GF</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>1997</pubdate>
            <volume>277</volume>
            <fpage>1453</fpage>
            <lpage>1474</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.277.5331.1453</pubid>
                  <pubid idtype="pmpid" link="fulltext">9278503</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Complete genome sequence of enterohemorrhagic <it>Escherichia coli </it>O157:H7 and genomic comparison with a laboratory strain K-12.</p>
            </title>
            <aug>
               <au>
                  <snm>Hayashi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Makino</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ohnishi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kurokawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ishii</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yokoyama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Ohtsubo</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Nakayama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Murata</snm>
                  <fnm>T</fnm>
               </au>
               <etal/>
            </aug>
            <source>DNA Res</source>
            <pubdate>2001</pubdate>
            <volume>8</volume>
            <fpage>11</fpage>
            <lpage>22</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/dnares/8.1.11</pubid>
                  <pubid idtype="pmpid" link="fulltext">11258796</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Identification of genes subject to positive selection in uropathogenic strains of <it>Escherichia coli </it>: a comparative genomics approach.</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Hung</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Reigstad</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Magrini</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Sabo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Blasiar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bieri</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Ozersky</snm>
                  <fnm>P</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>5977</fpage>
            <lpage>5982</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1424661</pubid>
                  <pubid idtype="pmpid" link="fulltext">16585510</pubid>
                  <pubid idtype="doi">10.1073/pnas.0600938103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Modular organization of cellular networks.</p>
            </title>
            <aug>
               <au>
                  <snm>Rives</snm>
                  <fnm>AW</fnm>
               </au>
               <au>
                  <snm>Galitski</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>1128</fpage>
            <lpage>1133</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">298738</pubid>
                  <pubid idtype="pmpid" link="fulltext">12538875</pubid>
                  <pubid idtype="doi">10.1073/pnas.0237338100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Finding and evaluating community structure in networks.</p>
            </title>
            <aug>
               <au>
                  <snm>Newman</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Girvan</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Phys Rev E Stat Nonlin Soft Matter Phys</source>
            <pubdate>2004</pubdate>
            <volume>69</volume>
            <fpage>026113</fpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14995526</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Functional cartography of complex metabolic networks.</p>
            </title>
            <aug>
               <au>
                  <snm>Guimer&#225;</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nunes-Amaral</snm>
                  <fnm>LA</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>433</volume>
            <fpage>895</fpage>
            <lpage>900</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03288</pubid>
                  <pubid idtype="pmpid" link="fulltext">15729348</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Modularity and community structure in networks.</p>
            </title>
            <aug>
               <au>
                  <snm>Newman</snm>
                  <fnm>ME</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>8577</fpage>
            <lpage>8582</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1482622</pubid>
                  <pubid idtype="pmpid" link="fulltext">16723398</pubid>
                  <pubid idtype="doi">10.1073/pnas.0601602103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Is there a bias in proteome research?</p>
            </title>
            <aug>
               <au>
                  <snm>Mrowka</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Patzak</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Herzel</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>1971</fpage>
            <lpage>1973</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.206701</pubid>
                  <pubid idtype="pmpid" link="fulltext">11731485</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>STRING: known and predicted protein-protein associations, integrated and transferred across organisms.</p>
            </title>
            <aug>
               <au>
                  <snm>von Mering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hooper</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Krupp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Foglierini</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jouffre</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D433</fpage>
            <lpage>437</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">539959</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608232</pubid>
                  <pubid idtype="doi">10.1093/nar/gki005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>STRING Database</p>
            </title>
            <url>http://string.embl-heidelberg.de</url>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Cytoscape</p>
            </title>
            <url>http://www.cytoscape.org</url>
         </bibl>
         <bibl id="B29">
            <title>
               <p>The Gene Ontology (GO) database and informatics resource.</p>
            </title>
            <aug>
               <au>
                  <snm>Harris</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ireland</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lomax</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Foulger</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Eilbeck</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Mungall</snm>
                  <fnm>C</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>D258</fpage>
            <lpage>261</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308770</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681407</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh066</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks.</p>
            </title>
            <aug>
               <au>
                  <snm>Maere</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Heymans</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kuiper</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>3448</fpage>
            <lpage>3449</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti551</pubid>
                  <pubid idtype="pmpid" link="fulltext">15972284</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
