<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-7-488</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Evaluation of clustering algorithms for protein-protein interaction networks</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Broh&#233;e</snm>
               <fnm>Sylvain</fnm>
               <insr iid="I1"/>
               <email>sylvain@scmbb.ulb.ac.be</email>
            </au>
            <au id="A2">
               <snm>van Helden</snm>
               <fnm>Jacques</fnm>
               <insr iid="I1"/>
               <email>jvanheld@scmbb.ulb.ac.be</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Service de Conformation des Macromol&#233;cules Biologiques et de Bioinformatique. Universit&#233; Libre de Bruxelles, CP 263, Campus Plaine, Bd. du Triomphe, B-1050 Bruxelles, Belgium</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>488</fpage>
         <url>http://www.biomedcentral.com/1471-2105/7/488</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17087821</pubid>
               <pubid idtype="doi">10.1186/1471-2105-7-488</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>15</day>
               <month>3</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>06</day>
               <month>11</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>06</day>
               <month>11</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Broh&#233;e and van Helden; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Protein interactions are crucial components of all cellular processes. Recently, high-throughput methods have been developed to obtain a global description of the interactome (the whole network of protein interactions for a given organism). In 2002, the yeast interactome was estimated to contain up to 80,000 potential interactions. This estimate is based on the integration of data sets obtained by various methods (mass spectrometry, two-hybrid methods, genetic studies). High-throughput methods are known, however, to yield a non-negligible rate of false positives, and to miss a fraction of existing interactions.</p>
               <p>The interactome can be represented as a graph where nodes correspond with proteins and edges with pairwise interactions. In recent years clustering methods have been developed and applied in order to extract relevant modules from such graphs. These algorithms require the specification of parameters that may drastically affect the results. In this paper we present a comparative assessment of four algorithms: Markov Clustering (MCL), Restricted Neighborhood Search Clustering (RNSC), Super Paramagnetic Clustering (SPC), and Molecular Complex Detection (MCODE).</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>A test graph was built on the basis of 220 complexes annotated in the MIPS database. To evaluate the robustness to false positives and false negatives, we derived 41 altered graphs by randomly removing edges from or adding edges to the test graph in various proportions.</p>
               <p>Each clustering algorithm was applied to these graphs with various parameter settings, and the clusters were compared with the annotated complexes.</p>
               <p>We analyzed the sensitivity of the algorithms to the parameters and determined their optimal parameter values.</p>
               <p>We also evaluated their robustness to alterations of the test graph.</p>
               <p>We then applied the four algorithms to six graphs obtained from high-throughput experiments and compared the resulting clusters with the annotated complexes.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>This analysis shows that MCL is remarkably robust to graph alterations. In the tests of robustness, RNSC is more sensitive to edge deletion but less sensitive to the use of suboptimal parameter values. The other two algorithms are clearly weaker under most conditions.</p>
               <p>The analysis of high-throughput data supports the superiority of MCL for the extraction of complexes from interaction networks.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Protein-protein interactions (PPI) play major roles in the cell: transient protein interactions are often involved in post-translational control of protein activity; enzymatic complexes ensure substrate channeling which drastically increases fluxes through metabolic pathways; large protein complexes play essential roles in basal cellular mechanisms such as DNA packaging (histones), transcription (RNA polymerase), replication (DNA polymerase), translation (ribosome), protein degradation (proteasome) ...</p>
         <p>Various methods have been used to detect PPI. Co-immunoprecipitation, co-sedimentation, and two-hybrid systems have traditionally been used to characterize interactions at the level of a single protein complex. More recently, high-throughput methods have been developed for large-scale detection of pairwise interactions (two-hybrid systems, the split-ubiquitin method) <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp> or multi-protein complexes (TAP-TAG, HMS-PCI) <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>.</p>
         <p>In 2002, von Mering <it>et al</it>. estimated that data resulting from combined experimental and computational approaches provide clues in favor of approximately 80,000 PPI in the yeast <it>Saccharomyces cerevisiae </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Clearly, however, this information should be considered with caution, since all methods are known to yield a non-negligible amount of noise (false positives) and to miss a fraction of existing interactions (false negatives). The error rate depends strongly on the method, high-throughput and computational methods being less reliable than traditional methods <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
         <p>The network of interactions between proteins is generally represented as an interaction graph, where nodes represent proteins and edges represent pairwise interactions. Graph theory approaches have been applied to describe the topological properties of the network: distribution of node degree (number of incoming and outgoing edges per node), network diameter (average of the shortest distance between pairs of nodes), clustering coefficient (proportion of the potential edges between the neighbors of a node that are effectively observed in the graph). These analyses have led to the observation of some apparently recurrent properties of biological networks: power-law degree distribution, small world, high clustering coefficients, and modularity <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <p>Beyond these descriptive statistics, an important challenge for modern biology is to understand the relationship between the organization of a network and its function. In particular, it is essential to extract functional modules such as protein complexes <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> or regulatory pathways <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> from global interaction networks.</p>
         <p>To achieve this goal, several clustering methods have been applied to the protein interactome graph in order to detect highly connected subgraphs (e.g. <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>). These algorithms rely on very different approaches. Each of them requires specifying several parameters, some of which may drastically affect the results. To our knowledge, no systematic study has yet been performed to evaluate and compare these programs. It is thus very difficult for a biologist to estimate the reliability of hypotheses emerging from computer-based analyses of interaction networks.</p>
         <p>In this paper we present a systematic quantitative evaluation of the capability of four clustering methods for inferring protein complexes from a network of pairwise protein interactions. The four methods tested here are Markov Clustering (MCL <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>), Restricted Neighborhood Search Clustering (RNSC <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>), Molecular Complex Detection (MCODE <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>), and Super Paramagnetic Clustering (SPC <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>). For each program, we sample the parameter space and select optimal parameters. We evaluate the robustness of the programs to false positives and false negatives. The algorithms are then applied to six data sets from high-throughput experiments.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Algorithms</p>
            </st>
            <p>The four algorithms tested here rely on distinct approaches for extracting clusters from the graph (Table <tblr tid="T1">1</tblr>). We give hereafter a short conceptual description. More information can be found in the supplementary material [see <supplr sid="S1">Additional file 1</supplr>] and original publications.</p>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <text>
                  <p>Supplementary information about the algorithms</p>
               </text>
               <file name="1471-2105-7-488-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Main features of the graph clustering approaches presented in this study.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Restricted Neighborhood Search Clustering (RNSC)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Markov Clustering (MCL)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Molecular Complex Detection (MCODE)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Super-paramagnetic clustering (SPC)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Type</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Local search cost based</p>
                     </c>
                     <c ca="center">
                        <p>Flow simulation</p>
                     </c>
                     <c ca="center">
                        <p>Local neighbourhood density search</p>
                     </c>
                     <c ca="center">
                        <p>Hierarchical</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Allow multiple assignations</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Allow unassigned nodes</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Edge-weighted graphs supported</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>First application</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Protein complex prediction</p>
                     </c>
                     <c ca="center">
                        <p>Protein family detection</p>
                     </c>
                     <c ca="center">
                        <p>Protein complex detection</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Other applications</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>/</p>
                     </c>
                     <c ca="left">
                        <p>Identification of ortholog groups, protein complexes, peer-to-peer node clustering, image retrieval, Word Sense Discrimination, molecular pathway discovery, structural domains, ...</p>
                     </c>
                     <c ca="center">
                        <p>/</p>
                     </c>
                     <c ca="left">
                        <p>Image clustering, microarray data clustering, protein complexes detection, protein structure classification, identification of ortholog groups, ...</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Availability</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Upon request</p>
                     </c>
                     <c ca="center">
                        <p>
                           <url>http://micans.org/mcl/</url>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <url>ftp://ftp.blueprint.org/pub/BIND/README</url>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Upon request</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Developper</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>King AD</p>
                     </c>
                     <c ca="center">
                        <p>Van Dongen S</p>
                     </c>
                     <c ca="center">
                        <p>Bader GD and Hogue CWV</p>
                     </c>
                     <c ca="center">
                        <p>Blatt M, Wiseman S, Domany E</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>References</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>[21]</p>
                     </c>
                     <c ca="center">
                        <p>[35]</p>
                     </c>
                     <c ca="center">
                        <p>[19]</p>
                     </c>
                     <c ca="center">
                        <p>[18]</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>The Markov Cluster algorithm (MCL) <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp> simulates a flow on the graph by calculating successive powers of the associated adjacency matrix. At each iteration, an <it>inflation step </it>is applied to enhance the contrast between regions of strong or weak flow in the graph. The process converges towards a partition of the graph, with a set of high-flow regions (the clusters) separated by boundaries with no flow. The value of the <it>inflation parameter </it>strongly influences the number of clusters.</p>
            <p>The second algorithm, Restricted Neighborhood Search Clustering (RNSC) <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>), is a cost-based local search algorithm that explores the solution space to minimize a cost function, calculated according to the numbers of intra-cluster and inter-cluster edges. Starting from an initial random solution, RNSC iteratively moves a vertex from one cluster to another if this move reduces the general cost. When a (user-specified) number of moves has been reached without decreasing the cost function, the program ends up.</p>
            <p>The third algorithm, Super Paramagnetic Clustering (SPC) <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> is a hierarchical clustering algorithm inspired from an analogy with the physical properties of a ferromagnetic model subject to fluctuation at nonzero temperature. At first, SPC associates a <it>spin </it>with each node of the graph. Spins belonging to a highly connected region fluctuate in a correlated fashion and nodes with correlated spins are placed in the same cluster. When the temperature increases, the system becomes less stable and the clusters become smaller.</p>
            <p>The fourth method, Molecular Complex Detection (MCODE) <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, detects densely connected regions. First it assigns a weight to each vertex, corresponding to its local neighborhood density. Then, starting from the top-weighted vertex (seed vertex), it recursively moves outward, including in the cluster vertices whose weight is above a given threshold. This threshold corresponds to a user-defined percentage of the weight of the seed vertex.</p>
         </sec>
         <sec>
            <st>
               <p>Interaction graphs</p>
            </st>
            <p>From the collection of protein complexes annotated in the MIPS database <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, we constructed an interaction graph by instantiating a node for each protein, and linking by an edge any two proteins that belong to the same complex. This graph is hereafter referred to as the <it>test graph</it>. As depicted in Figure <figr fid="F1">1A</figr>, the structure of the original test graph is almost trivial: most complexes correspond to isolated components. In this test graph each complex is represented as a clique (each protein is connected to each other one). This generally does not reflect the actual complex structure, where each protein is linked to specific partners. Consequently, this original graph is of poor value for evaluating the performances of clustering algorithms on real data sets. This applies particularly to high-throughput data sets, which are generally fragmentary (missing interactions), and noisy (false interactions).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Graphical representation of interaction networks</p>
               </caption>
               <text>
                  <p><b>Graphical representation of interaction networks</b>. <b>(A) </b>Test graph built from the complexes annotated in the MIPS database (high-throughput data were excluded). <b>(B) </b>Altered graph <it>A</it><sub>100,40 </sub>with 100% of random edge addition (red) and 40% of random edge removal.</p>
               </text>
               <graphic file="1471-2105-7-488-1"/>
            </fig>
            <p>In order to evaluate the robustness of the algorithms to missing and false interactions, we generated 41 <it>altered graphs </it>from the original test graph, by combining addition and removal of edges in various proportions. We refer to altered graphs as <it>A</it><sub><it>add</it>,<it>del</it></sub>, where <it>add </it>and <it>del </it>indicate respectively the percentage of added and deleted edges (percentages with respect to the number of edges in the original test graph).</p>
            <p>Figure <figr fid="F1">1B</figr> shows an example of an altered graph <it>A</it><sub>100,40, </sub>with 100% edge addition and 40% edge removal. Another problem of evaluation is that a certain proportion of interacting proteins can be assigned to the same cluster by chance. In order to estimate the random expectation of correct grouping, we built a <it>random graph </it>by shuffling the edges between nodes of the test graph. With this type of randomization, each node preserves the same number of links as in the original graph.</p>
            <p>We also built 41 <it>altered random graphs </it>from the random graph, by randomly adding and removing random edges in the same proportions as for the original test graph.</p>
            <p>To each of these 84 graphs (test, altered test, random, altered random), we applied the four algorithms described above, with varying parameter values. As a second way to estimate the random expectation, each clustering result was also randomized so as to obtain a set of <it>permuted clusters </it>of the same sizes as those obtained from the test graph or altered graphs.</p>
         </sec>
         <sec>
            <st>
               <p>Parameter optimization</p>
            </st>
            <p>The quality of a clustering result was evaluated by comparing each cluster with each annotated complex. The <it>complex-wise sensitivity </it>(<it>Sn</it>) represents the coverage of a complex by its best-matching cluster (the maximal fraction of proteins in the complex found in a common cluster). Reciprocally, the <it>cluster-wise Positive Predictive Value </it>(<it>PPV</it>) measures how well a given cluster predicts its best-matching complex (see the chapter <it>Methods </it>for a detailed description of the matching statistics).</p>
            <p>To estimate the overall correspondence between a clustering result (a set of clusters) and the collection of annotated complexes, we computed the weighted means of all <it>PPV </it>values (averaged over all clusters) and <it>Sn </it>values (averaged over all complexes). The resulting statistics, <it>clustering-wise PPV </it>and <it>clustering-wise Sn</it>, provide complementary and somewhat contradictory information: when the number of clusters decreases, the <it>Sn </it>increases and, in the trivial case where all proteins are grouped in a single cluster, the calculated <it>Sn </it>reaches 1. Reciprocally, the <it>PPV </it>increases with the number of clusters, reaching 1 in the trivial case where each protein is assigned to one separate cluster. In order to integrate the two statistics, we computed a <it>geometrical accuracy </it>(<it>Acc</it>), defined as the geometrical mean of the averaged <it>Sn </it>and <it>PPV </it>values.</p>
            <p>Each algorithm has one or more parameters that influence properties such as number of clusters, cluster size, and cluster density (number of intra-cluster edges). For each algorithm we measured the impact of the main parameters on <it>Sn, PPV </it>and <it>Acc </it>and selected the combination of parameters giving maximal accuracy. This analysis revealed that some parameters have a drastic impact on accuracy, whereas others have a limited effect.</p>
            <p>Let us illustrate in more detail the procedure of parameter selection with the inflation parameter of the MCL algorithm. With the original test graph, interestingly, the effect of this parameter is barely detectable (Figure <figr fid="F2">2A</figr>). Yet this apparent robustness is an artifact due to the trivial structure of the graph. In the MIPS data set used as a reference, most proteins (73%) are members of a single complex, so that most complexes correspond to isolated components in the test graph (Figure <figr fid="F1">1A</figr>) on which the clustering is performed. Consequently, the clustering algorithm tends to define one cluster per connected component, irrespectively of the inflation parameter. Consistently with this interpretation, the number of clusters is almost constant whatever the inflation parameter value (Figure <figr fid="F2">2B</figr>, blue curve). In contrast, when the same algorithm is applied to a randomized graph, the number of clusters increases with the inflation parameter (Figure <figr fid="F2">2B</figr>, gray curve).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Impact of the inflation parameter on MCL clustering results</p>
               </caption>
               <text>
                  <p><b>Impact of the inflation parameter on MCL clustering results</b>. <b>(A) </b>Impact of the inflation parameter on the clustering-wise Sensitivity (<it>Sn</it>), Positive Predictive Value (<it>PPV</it>) and geometric accuracy (<it>Acc</it>). Each curve represents the value of one evaluation statistics (ordinate) as a function of the inflation parameter (abscissa). Color code: <it>blue </it>: <it>Sn</it>; <it>red </it>: <it>PPV</it>; <it>green </it>: <it>Acc</it>; <it>grey </it>: geometrical accuracy for the first random control (randomized graph); orange : geometrical accuracy for the second random control (permuted clusters). <b>(B) </b>Number of complexes predicted as a function of the inflation factor for the original test graph. Color code: <it>blue </it>: test graph; <it>red </it>: random graph. <b>(C) </b><it>Sn</it>, <it>PPV </it>and <it>Acc </it>scores obtained with a highly altered graph (<it>A</it><sub>100,40</sub>). <b>(D) </b>Number of complexes predicted as a function of the inflation factor for <it>A</it><sub>100,40</sub>.</p>
               </text>
               <graphic file="1471-2105-7-488-2"/>
            </fig>
            <p>The crucial impact of the inflation parameter becomes obvious when MCL is applied to highly altered graphs. For example, for the altered graph <it>A</it><sub>100,40 </sub>(Figure <figr fid="F2">2C</figr>), the increase in inflation causes a decrease in <it>Sn </it>(red curve) and an increase in <it>PPV </it>(blue curve). These effects are explained by the fact that the number of clusters increases with the inflation parameter (Figure <figr fid="F2">2D</figr>). The optimal tradeoff between <it>Sn </it>and <it>PPV </it>is obtained for an inflation value of 1.7, and yields an <it>Acc </it>of 66% (green curve).</p>
            <p>We performed the same analysis and selected the optimal parameter values for each one of the 42 graphs (test and altered), as summarized in Table <tblr tid="T2">2</tblr> for the MCL algorithm. Since the optimal parameter values depend on the level of alteration, we cannot view one value as systematically optimal. We chose as a general optimum the most frequent value in this table. This criterion ensures a good robustness to graph alteration (it covers the widest range of graph alterations).</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Optimal values for MCL inflation parameter for the test and altered graphs</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c ca="left">
                        <p>% removal\% addition</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>40</p>
                     </c>
                     <c ca="left">
                        <p>80</p>
                     </c>
                     <c ca="left">
                        <p>100</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>3.4</p>
                     </c>
                     <c ca="left">
                        <p>3.1</p>
                     </c>
                     <c ca="left">
                        <p>2.7</p>
                     </c>
                     <c ca="left">
                        <p>2.4</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>5.7</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>2.6</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>1.9</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>2.35</p>
                     </c>
                     <c ca="left">
                        <p>2.2</p>
                     </c>
                     <c ca="left">
                        <p>2.2</p>
                     </c>
                     <c ca="left">
                        <p>2.3</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>1.7</p>
                     </c>
                     <c ca="left">
                        <p>2.2</p>
                     </c>
                     <c ca="left">
                        <p>2.1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1.7</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>40</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1.9</p>
                     </c>
                     <c ca="left">
                        <p>1.7</p>
                     </c>
                     <c ca="left">
                        <p>1.7</p>
                     </c>
                     <c ca="left">
                        <p>1.7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>80</p>
                     </c>
                     <c ca="left">
                        <p>1.3</p>
                     </c>
                     <c ca="left">
                        <p>1.4</p>
                     </c>
                     <c ca="left">
                        <p>1.5</p>
                     </c>
                     <c ca="left">
                        <p>1.5</p>
                     </c>
                     <c ca="left">
                        <p>1.5</p>
                     </c>
                     <c ca="left">
                        <p>1.6</p>
                     </c>
                     <c ca="left">
                        <p>1.6</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Note that in the case of the inflation parameter, the most frequent value (1.8) is especially well suited for graphs with a high level of alteration, such as those resulting from high-throughput data. In addition, for the less altered graphs, the accuracy is generally more robust to fluctuations of the inflation (the extreme case of the unaltered test graph shown in Figure <figr fid="F2">2A,B</figr> is discussed above).</p>
            <p>For the RNSC algorithm, we tested the impact of 7 parameters on the quality of the clustering. This represents a total of 2,916 combinations of parameter values. Figure <figr fid="F3">3</figr> displays the <it>Sn </it>(abscissa) and <it>PPV </it>(ordinate) obtained with the same altered graph as in Figure <figr fid="F1">1B</figr> (<it>A</it><sub>100,40</sub>). Each dot corresponds to one particular combination of parameter values. This figure shows that the RNSC algorithm is remarkably robust to the choice of parameter values: all the results are grouped in a cloud, with an almost constant <it>PPV </it>(58%) and a restricted range of <it>Sn </it>(between 61% and 87%).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Impact of the RNSC parameters on the clustering of an altered graph <it>A</it><sub>100,40</sub></p>
               </caption>
               <text>
                  <p><b>Impact of the RNSC parameters on the clustering of an altered graph <it>A</it><sub>100,40</sub></b>. Each dot represents the clustering-wise <it>PPV </it>and <it>Sn </it>value for one combination of the seven tested parameters. Color code: <it>blue </it>: altered graph <it>A</it><sub>100,20 </sub>(100% random edge addition and 20% of random edge removal); <it>orange </it>: randomized graph <it>R</it><sub>100,40</sub>; <it>grey </it>: permuted clusters.</p>
               </text>
               <graphic file="1471-2105-7-488-3"/>
            </fig>
            <p>The same analysis was carried out for each parameter of each algorithm. The complete tables of optimal values for the 42 graphs using both Accuracy and Separation (see next section) are available as supplementary material [see <supplr sid="S2">Additional file 2</supplr> and <supplr sid="S3">3</supplr>]. Table <tblr tid="T3">3</tblr> synthesizes the optimal values obtained for the four tested algorithms. These optimal values were systematically used for the robustness analysis in the next section.</p>
            <suppl id="S2">
               <title>
                  <p>Additional File 2</p>
               </title>
               <text>
                  <p>Optimal accuracy parameter values</p>
               </text>
               <file name="1471-2105-7-488-S2.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional File 3</p>
               </title>
               <text>
                  <p>Optimal separation parameter values. These files and supplementary figures are also available on <url>http://rsat.scmbb.ulb.ac.be/~sylvain/clustering_evaluation</url>.</p>
               </text>
               <file name="1471-2105-7-488-S3.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Optimal parameters</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center">
                        <p>
                           <b>Algorithm</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Parameter</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Optimized for accuracy</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Optimized for separation</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>MCL</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Inflation</p>
                     </c>
                     <c ca="center">
                        <p>1.8</p>
                     </c>
                     <c ca="center">
                        <p>1.8</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>MCODE</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Depth</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Node score percentage</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Haircut</p>
                     </c>
                     <c ca="center">
                        <p>TRUE</p>
                     </c>
                     <c ca="center">
                        <p>TRUE</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Fluff</p>
                     </c>
                     <c ca="center">
                        <p>FALSE</p>
                     </c>
                     <c ca="center">
                        <p>FALSE</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Percentage for complex fluffing</p>
                     </c>
                     <c ca="center">
                        <p>0.2</p>
                     </c>
                     <c ca="center">
                        <p>0.9</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>RNSC</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Diversification frequency</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Shuffling diversification length</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Tabu length</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Tabu list tolerance</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Number of experiments</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Naive stopping tolerance</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Scaled stopping tolerance</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>SPC</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Number of nearest neighbours</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Temperature</p>
                     </c>
                     <c ca="center">
                        <p>0.132</p>
                     </c>
                     <c ca="center">
                        <p>0.116</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Robustness analysis</p>
            </st>
            <p>In this analysis, we chose fixed parameter values for each algorithm (Table <tblr tid="T3">3</tblr>) and analyzed the robustness of the different algorithms to various levels of graph alteration (edge removal and addition).</p>
            <p>Figure <figr fid="F4">4A</figr> displays the impact of edge addition on the geometric accuracy. Increasing proportions of edges (0%, 5%, 10%, 20%, 40%, 80% and 100%) were randomly added to the test graph. MCL and RNSC are barely affected by addition of up to 100% edges (blue and red curves, respectively). The performances of MCODE and SPC are reasonably good for low values of noise, but drop to 40% when the percentage of added edges increases to 100% (orange and green curves, respectively).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Robustness of the algorithms to random edge addition and removal</p>
               </caption>
               <text>
                  <p><b>Robustness of the algorithms to random edge addition and removal</b>. Each curve represents the value of accuracy (left panels) or separation (right panels). <b>(A-B) </b>edge addition to the test graph. <b>(C-D) </b>edges removal from the test graph. <b>(E-F) </b>Edge removal from an altered graph with 100% of randomly added edges. <b>(G-H) </b>Edge addition to an altered test graph with 40% of randomly removed edges. Color code: <it>blue </it>: MCL, <it>red </it>: RNSC, <it>orange </it>: MCODE, <it>green </it>: SPC. Dotted lines show the results obtained by permuting the clusters (negative control).</p>
               </text>
               <graphic file="1471-2105-7-488-4"/>
            </fig>
            <p>To estimate the random expectation, we performed for each clustering result a permutation test, by shuffling the proteins between clusters. The number of clusters and their respective sizes thus remained unchanged. The geometric accuracy of the permuted clusters is displayed with dotted lines in Figure <figr fid="F4">4A</figr>. For MCL, RNSC and MCODE, the accuracy is relatively stable (between 15% and 22%). For SPC, surprisingly, the accuracy of the permuted clusters progressively increases with the addition of edges, reaching 38% when more than 80% egdes are added. This value almost equals that obtained with the non-random altered graph <it>A</it><sub>100,0</sub>. This illustrates the importance of the permutation test: the test makes it possible to estimate the performance of an algorithm in terms of gains relative to the random expectation. We inspected the clustering result in more detail in order to understand why the program can yield high accuracy values even when clusters are permuted. This effect comes from the fact that, under the chosen conditions, SPC yields a huge cluster of 567 proteins, plus a multitude of very small clusters of 1 or 2 proteins. The effect of the huge cluster is to artificially increase the <it>Sn</it>, since a good fraction of each complex is covered by this cluster. Each of the very small clusters yields a high <it>PPV </it>: single-element clusters have by definition a <it>PPV </it>of 1, and 2-member clusters have a minimal <it>PPV </it>of 0.5. This particular distribution of cluster sizes thus creates an artefactual situation by reaching, for two separate reasons, reasonably high scores for both criteria (<it>Sn </it>and <it>PPV</it>).</p>
            <p>In order to circumvent this problem, we defined an additional statistic, which we call <it>separation</it>, as the product of the proportion of complex elements found in the cluster by the proportion of cluster elements found in the complex (see Methods for the formula). High separation values indicate a bidirectional correspondence between a cluster and a complex: a maximal value of 1 is reached when a cluster corresponds perfectly with a complex, i.e. when it comprises all of its proteins and nothing more.</p>
            <p>The <it>complex-wise separation </it>indicates how well a given complex is isolated from the other complexes. The maximal value for complex-wise separation is 1. The simplest way to obtain <m:math name="1471-2105-7-488-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>S</m:mi><m:mi>e</m:mi><m:msub><m:mi>p</m:mi><m:mrow><m:mi>c</m:mi><m:msub><m:mi>o</m:mi><m:mi>i</m:mi></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGtbWucqWGLbqzcqWGWbaCdaWgaaWcbaGaem4yamMaem4Ba82aaSbaaWqaaiabdMgaPbqabaaaleqaaaaa@350C@</m:annotation></m:semantics></m:math> = 1 is the perfect match, i.e. when all the proteins in the complex are contained in a single cluster, and this cluster does not contain any other protein (Table <tblr tid="T4">4</tblr>, cluster 1/complex 1). Yet the value of 1 can also be reached if the complex is split into two or more clusters, if each of these clusters contains only members of the complex (Table <tblr tid="T4">4</tblr>, complex 2 split into clusters 2 and 3). In other words, <m:math name="1471-2105-7-488-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>S</m:mi><m:mi>e</m:mi><m:msub><m:mi>p</m:mi><m:mrow><m:mi>c</m:mi><m:msub><m:mi>o</m:mi><m:mi>i</m:mi></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGtbWucqWGLbqzcqWGWbaCdaWgaaWcbaGaem4yamMaem4Ba82aaSbaaWqaaiabdMgaPbqabaaaleqaaaaa@350C@</m:annotation></m:semantics></m:math> = 1 indicates that the clustering algorithm separates this complex perfectly from all other complexes (although this complex may be split into several clusters).</p>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Schematic illustration of a contingency table, and the derived statistics</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c cspan="8" ca="center">
                        <p>
                           <b>Counts</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>T</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 4</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 5</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>sum</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>complex size</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 4</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>sum</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>47</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>cluster size</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8" ca="center">
                        <p>
                           <b>Positive Predictive Value (PPV)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PPV</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 4</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 5</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.78</p>
                     </c>
                     <c ca="center">
                        <p>0.38</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 4</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.22</p>
                     </c>
                     <c ca="center">
                        <p>0.62</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>cluster-wise PPV</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.78</p>
                     </c>
                     <c ca="center">
                        <p>0.62</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8" ca="center">
                        <p>
                           <b>Sensitivity</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Sn</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 4</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 5</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>complex-wise Sn</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.43</p>
                     </c>
                     <c ca="center">
                        <p>0.57</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0.57</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.70</p>
                     </c>
                     <c ca="center">
                        <p>0.15</p>
                     </c>
                     <c ca="left">
                        <p>0.70</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 4</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.50</p>
                     </c>
                     <c ca="center">
                        <p>0.62</p>
                     </c>
                     <c ca="left">
                        <p>0.62</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8" ca="center">
                        <p>
                           <b>Frequency per row</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>F</it>
                           <sub>
                              <it>row</it>
                           </sub>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 4</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 5</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.43</p>
                     </c>
                     <c ca="center">
                        <p>0.57</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.82</p>
                     </c>
                     <c ca="center">
                        <p>0.18</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 4</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.44</p>
                     </c>
                     <c ca="center">
                        <p>0.56</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8" ca="center">
                        <p>
                           <b>Separation</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>C</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 4</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>cluster 5</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>complex-wise separation</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.43</p>
                     </c>
                     <c ca="center">
                        <p>0.57</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.64</p>
                     </c>
                     <c ca="center">
                        <p>0.07</p>
                     </c>
                     <c ca="left">
                        <p>0.71</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>complex 4</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.10</p>
                     </c>
                     <c ca="center">
                        <p>0.35</p>
                     </c>
                     <c ca="left">
                        <p>0.45</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>cluster-wise separation</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.43</p>
                     </c>
                     <c ca="center">
                        <p>0.57</p>
                     </c>
                     <c ca="center">
                        <p>0.74</p>
                     </c>
                     <c ca="center">
                        <p>0.41</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Clustering-wise sensitivity 0.69</p>
                  <p>Clustering-wise PPV 0.85</p>
                  <p>Accuracy 0.77</p>
                  <p>Average cluster-wise separation 0.63</p>
                  <p>Average complex-wise separation 0.79</p>
                  <p>Clustering-wise separation 0.70</p>
               </tblfn>
            </tbl>
            <p>Similarly, we defined a <it>cluster-wise separation</it>, which indicates how well a given cluster isolates one or several complexes from the other clusters. The maximal value, <m:math name="1471-2105-7-488-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>S</m:mi><m:mi>e</m:mi><m:msub><m:mi>p</m:mi><m:mrow><m:mi>c</m:mi><m:msub><m:mi>l</m:mi><m:mi>j</m:mi></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGtbWucqWGLbqzcqWGWbaCdaWgaaWcbaGaem4yamMaemiBaW2aaSbaaWqaaiabdQgaQbqabaaaleqaaaaa@3508@</m:annotation></m:semantics></m:math> = 1, indicates that a cluster fully and exclusively comprises all the elements of one or several complexes, i.e. it contains all the proteins of the considered complex(es), and no other cluster contains any of these proteins.</p>
            <p>The <it>clustering-wise separation </it>statistic integrates separation values over all complexes and clusters, and indicates the general correspondence between a clustering result and the set of annotated complexes. Separation is particularly relevant to assessing clustering algorithms like MCODE, which permit assigning a protein to multiple clusters. Under some particular parameter combinations, this program tends to yield highly redundant clusters. Table <tblr tid="T5">5</tblr> shows a fragment of the contingency table indicating the number mutual intersections between the 607 clusters obtained from the unaltered test graph. For example, the 50 first rows/columns show a series of imbricated clusters, each resulting from the addition of one node to the preceding cluster. Such strongly overlapping clusters artificially increase the performance, since a set of clusters representing the same complex will be taken into account multiple times in the average <it>PPV</it>.</p>
            <tbl id="T5">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Mutually overlapping clusters obtained under some parameter conditions with MCODE</p>
               </caption>
               <tblbdy cols="16">
                  <r>
                     <c ca="left">
                        <p>cluster\cluster</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>3</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>49</p>
                     </c>
                     <c ca="right">
                        <p>50</p>
                     </c>
                     <c ca="right">
                        <p>51</p>
                     </c>
                     <c ca="right">
                        <p>52</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>102</p>
                     </c>
                     <c ca="right">
                        <p>103</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>607</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="16">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>81</p>
                     </c>
                     <c ca="right">
                        <p>80</p>
                     </c>
                     <c ca="right">
                        <p>79</p>
                     </c>
                     <c ca="right">
                        <p>78</p>
                     </c>
                     <c ca="right">
                        <p>77</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>47</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>80</p>
                     </c>
                     <c ca="right">
                        <p>80</p>
                     </c>
                     <c ca="right">
                        <p>79</p>
                     </c>
                     <c ca="right">
                        <p>78</p>
                     </c>
                     <c ca="right">
                        <p>77</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>47</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="right">
                        <p>79</p>
                     </c>
                     <c ca="right">
                        <p>79</p>
                     </c>
                     <c ca="right">
                        <p>79</p>
                     </c>
                     <c ca="right">
                        <p>78</p>
                     </c>
                     <c ca="right">
                        <p>77</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>47</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="right">
                        <p>78</p>
                     </c>
                     <c ca="right">
                        <p>78</p>
                     </c>
                     <c ca="right">
                        <p>78</p>
                     </c>
                     <c ca="right">
                        <p>78</p>
                     </c>
                     <c ca="right">
                        <p>77</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>47</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="right">
                        <p>77</p>
                     </c>
                     <c ca="right">
                        <p>77</p>
                     </c>
                     <c ca="right">
                        <p>77</p>
                     </c>
                     <c ca="right">
                        <p>77</p>
                     </c>
                     <c ca="right">
                        <p>77</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>47</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>49</p>
                     </c>
                     <c ca="right">
                        <p>47</p>
                     </c>
                     <c ca="right">
                        <p>47</p>
                     </c>
                     <c ca="right">
                        <p>47</p>
                     </c>
                     <c ca="right">
                        <p>47</p>
                     </c>
                     <c ca="right">
                        <p>47</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>47</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>50</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>51</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>52</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>46</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>102</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>103</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>607</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>...</p>
                     </c>
                     <c ca="right">
                        <p>3</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Cluster-wise separation penalizes this effect by using the marginal sums rather than the cluster size. Thus, if a method generates many redundant clusters, each one intersecting with a given complex, the marginal sum will increase drastically, and <it>Sep</it><sub><it>cl </it></sub>will be reduced accordingly. Note that the result of Table <tblr tid="T5">5</tblr> is not representative of all MCODE conditions: when appropriate parameters are chosen, the level of mutual overlap between clusters is reasonable.</p>
            <p>Figure <figr fid="F4">4B</figr> displays the impact of edge addition on clustering-wise separation. The general trends are similar to those revealed by the accuracy curves (Figure <figr fid="F4">4A</figr>), but the random expectation curves are now roughly horizontal for SPC as well as for the other algorithms. We defined a second set of parameters optimized for separation, in the same way as described above for accuracy. These separation-optimized parameters are displayed in Table <tblr tid="T3">3</tblr> and were used for all separation curves in this robustness analysis (right panels in Figure <figr fid="F4">4</figr>).</p>
            <p>In Figure <figr fid="F4">4C</figr> and <figr fid="F4">4D</figr>, increasing proportions (0%, 5%, 10%, 20%, 40%, and 80%) of edges are randomly removed from the test graph. The general trend is for RNSC and MCL to outperform the other two algorithms under most conditions. RNSC, however, shows a higher sensitivity to edge removal, and its performance strongly decreases when more than 40% of the edges are removed. SPC is quite robust to edge removal, but its performance remains lower than that of MCL under all conditions. Note that this removal experiment is not very indicative of algorithm capability under realistic conditions, because the partitioning of the test graph corresponds almost with complex composition (Figure <figr fid="F1">1A</figr>). Thus, when edges are simply removed, this partitioning is mostly maintained: given the high level of intra-complex connectivity, most complexes remain linked, and no new inter-complex link is created.</p>
            <p>In order to obtain a realistic estimate of algorithm robustness, we thus need to combine edge addition and removal. Figure <figr fid="F4">4E</figr> and <figr fid="F4">4F</figr> shows the robustness to edge removal, starting from a graph with 100% edge addition. The performances of all programs are of course reduced as compared to Figures <figr fid="F4">4C</figr> and <figr fid="F4">4D</figr>. In terms of accuracy (Figure <figr fid="F4">4E</figr>), RNSC and MCL show grossly similar behaviours: the accuracy shows a good robustness in the low range of removal percentages (0&#8211;40%) but strongly decreases at higher percentages (80%). Yet in terms of separation (Figure <figr fid="F4">4F</figr>), RNSC shows a better performance than MCL at low rates of removal. The separation values of all algorithms drop to their respective levels of the random expectation when 80% of the edges are removed. MCODE and SPC show generally low performance, and are drastically affected by the combination of addition and removal. The performance of SPC is similar to that obtained by selecting random clusters, in terms of both accuracy (Figure <figr fid="F4">4E</figr>) and separation (Figure <figr fid="F4">4F</figr>).</p>
            <p>Figures <figr fid="F4">4G</figr> and <figr fid="F4">4H</figr> show the effect of edge addition on graphs from which 40% of the edges had previously been removed. These curves confirm the trends observed in Figures <figr fid="F4">4A</figr> and <figr fid="F4">4B</figr>: MCL and RNSC are weakly affected by edge addition, but as little as 20% edge addition suffices to prevent SPC from identifying the complexes (Figure <figr fid="F4">4H</figr>). MCODE is relatively robust to edge addition, but shows a weaker performance than MCL and RNSC over the whole range of conditions.</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of data sets obtained in high-throughput experiments</p>
            </st>
            <p>In the previous chapters our evaluations were based on artificial graphs obtained by adding and removing various proportions of edges to a reference network (the MIPS complexes). The next step was to evaluate the capability of these algorithms to extract relevant information from high-throughput data sets. To this end, we downloaded from the GRID database <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> six data sets representing the network of protein interactions in the yeast <it>Saccharomyces cerevisiae</it>. Two of these data sets consist of pairs of interacting proteins detected by the two-hybrid technique published respectively by Uetz <it>et al</it>. <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and Ito <it>et al</it>. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. The four other data sets contain protein complexes characterized by mass spectrometry, published respectively by Gavin <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B6">6</abbr></abbrgrp>, Ho <it>et al</it>. <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, and Krogan <it>et al</it>. <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> (Table <tblr tid="T6">6</tblr>). For each of these data sets we built a graph with one node per protein, and one edge per interaction.</p>
            <tbl id="T6">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>Main features of the four large scale data sets and clustering performances of the algorithms when applied to them</p>
               </caption>
               <tblbdy cols="14">
                  <r>
                     <c ca="left">
                        <p>Dataset</p>
                     </c>
                     <c ca="left">
                        <p>Nb nodes</p>
                     </c>
                     <c ca="left">
                        <p>Nb edges</p>
                     </c>
                     <c ca="left">
                        <p>Mean degree</p>
                     </c>
                     <c ca="left">
                        <p>Mean clust coeff</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>MCL</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>MCODE</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>RNSC</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>SPC</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="14">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>real</p>
                     </c>
                     <c ca="center">
                        <p>permuted</p>
                     </c>
                     <c ca="center">
                        <p>real</p>
                     </c>
                     <c ca="center">
                        <p>permuted</p>
                     </c>
                     <c ca="center">
                        <p>real</p>
                     </c>
                     <c ca="center">
                        <p>permuted</p>
                     </c>
                     <c ca="center">
                        <p>real</p>
                     </c>
                     <c ca="center">
                        <p>permuted</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="14">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Uetz <it>et al</it>. [1]</p>
                     </c>
                     <c ca="left">
                        <p>926</p>
                     </c>
                     <c ca="left">
                        <p>865</p>
                     </c>
                     <c ca="left">
                        <p>1.175</p>
                     </c>
                     <c ca="left">
                        <p>0.018</p>
                     </c>
                     <c ca="center">
                        <p>Number of clusters</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>288</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>10</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>48</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>234</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Mean nb prot/cluster</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>3.22</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>11.2</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>1.91</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>3.96</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Median nb prot/cluster</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>3</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>4.5</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>2</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Largest cluster size</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>16</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>53</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>6</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>276</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Sn</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>57.3%</p>
                     </c>
                     <c ca="center">
                        <p>38.6%</p>
                     </c>
                     <c ca="center">
                        <p>84.3%</p>
                     </c>
                     <c ca="center">
                        <p>74.5%</p>
                     </c>
                     <c ca="center">
                        <p>49.4%</p>
                     </c>
                     <c ca="center">
                        <p>36.5%</p>
                     </c>
                     <c ca="center">
                        <p>65.5%</p>
                     </c>
                     <c ca="center">
                        <p>43.3%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>